Working Papers
30 Million Canvas Records Reveal Widespread Sequential Bias and System-induced Surname Initial Disparity in Grading
Jiaxin Pei*, Zhihan (Helen) Wang*, Jun Li (*Equal Contributions)
Major Revision at Management Science, SSRN Link
🏆 Best Student Paper Award at EAAMO
Media: Michigan News • Daily Mail • ABC News • Newsweek • Fox News
The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets
Shenzhe Zhu, Jiao Sun, Yi Nian, Tobin South, Alex Pentland, Jiaxin Pei
[Paper] [Code]
Published Papers
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals’ Subjective Text Perceptions
Matthias Orlikowski, Jiaxin Pei, Paul Röttger, Philipp Cimiano, David Jurgens, Dirk Hovy
ACL 2025
[Paper] [Code]
Sociodemographic Prompting is Not Yet an Effective Approach for Simulating Subjective Judgments with LLMs
†Huaman Sun, Jiaxin Pei, Minje Choi and David Jurgens (†Mentored student)
NAACL 2025
[Paper] [Code]
User-Driven Value Alignment: Understanding Users’ Perceptions and Strategies for Addressing Biased and Discriminatory Statements in AI Companions
Xianzhe Fan, Qing Xiao, Xuhui Zhou, Jiaxin Pei, Maarten Sap, Zhicong Lu, Hong Shen
CHI 2025
[Paper]
Who Reaps All the Superchats? A Large-Scale Analysis of Income Inequality in Virtual YouTuber Livestreaming
†Ruijing Zhao, †Brian Diep, Jiaxin Pei, Dongwook Yoon, David Jurgens, and Jian Zhu (†Mentored student)
CHI 2025
[Paper]
Modeling and Detecting Company Risks from News
Jiaxin Pei, Soumya Vadlamannati, Liang-Kang Huang, Daniel Preotiuc-Pietro, Xinyu Hua
NAACL 2024
[Paper] [Blog]
When “A Helpful Assistant” Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models
†Mingqian Zheng, Jiaxin Pei, Lajanugen Logeswaran, Moontae Lee and David Jurgens (†Mentored student)
EMNLP 2024 (Findings)
[Paper] [Code]
Media: The Markup
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Jiaxin Pei*, Minje Choi*, Sagar Kumar, Chang Shu and David Jurgens (*Equal Contributions)
EMNLP 2023
[Paper] [Dataset] [Code]
SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research
Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Leonardo Neves, Kiamehr Rezaee, Luis Espinosa-Anke, Jiaxin Pei, Jose Camacho-Collados
EMNLP 2023 (Findings)
[Paper] [Dataset]
When Do Annotator Demographics Matter? Measuring The Influence of Annotator Demographics with the POPQUORN Dataset
Jiaxin Pei and David Jurgens
LAW-XVII @ ACL 2023
[Paper] [Dataset] [Interface]
Media: Michigan News • Prolific • Digit News • Today Headline
Exploring Linguistic Style Matching in Online Communities: The Role of Social Context and Conversation Dynamics
Aparna Ananthasubramaniam*, Hong Chen*, Jason Yan*, Kenan Alkiek*, Jiaxin Pei*, Agrima Seth*, Lavinia Dunagan*, Minje Choi*, Benjamin Litterer* and David Jurgens* (*Equal Contributions)
SICon @ ACL 2023
🏆 Best Paper Award
[Paper] [Code]
SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis
Jiaxin Pei, Vítor Silva, Maarten Bos, Yozon Liu, Leonardo Neves, David Jurgens and Francesco Barbieri
SemEval 2023
[Website] [Paper]
🥔POTATO: The Portable Text Annotation Tool
Jiaxin Pei, Aparna Ananthasubramaniam, Xingyao Wang, Naitian Zhou, Jackson Sargent, Apostolos Dedeloudis and David Jurgens
EMNLP 2022 (Demo)
🏆 Best Demo Paper Award at HCOMP 2024
[Paper] [Code] [Website]
Modeling Information Change in Science Communication with Semantically Matched Paraphrases
Jiaxin Pei*, Dustin Wright*, David Jurgens and Isabelle Augenstein (*Equal Contributions)
EMNLP 2022
🏆 Honorable Mention Award at IC2S2 2023
[Paper] [Code] [Dataset] [PyPI] [Interface]
Measuring Sentence-level and Aspect-level Uncertainty in Science Communications
Jiaxin Pei and David Jurgens
EMNLP 2021
[Website] [Paper] [Data] [URLs] [Model] [PyPI] [Interface]
Media: Poynter • Michigan News • phys.org • ZME Science
Quantifying Intimacy in Language
Jiaxin Pei and David Jurgens
EMNLP 2020
[Paper] [Data] [Model] [PyPI] [Demo]
MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization
Jiaxin Pei*, Canwen Xu*, Hongtao Wu, Yiyu Liu and Chenliang Li (*Equal Contributions)
ACL 2020
[Paper] [Data] [Model]
Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders
Yu Duan, Canwen Xu, Jiaxin Pei, Jialong Han and Chenliang Li
ACL 2020
[Paper] [Model]
DLocRL: A Deep Learning Pipeline for Fine-Grained Location Recognition and Linking in Tweets
Canwen Xu, Jing Li, Xiangyang Luo, Jiaxin Pei, Chenliang Li, Donghong Ji
WWW 2019
[Paper]
S2SPMN: A Simple and Effective Framework for Response Generation with Relevant Information
Jiaxin Pei and Chenliang Li
EMNLP 2018
[Paper]