InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

Abstract

Role-playing agents (RPAs), powered by large language models, have emerged as a flourishing field of applications. However, a key challenge lies in assessing whether RPAs accurately reproduce the personas of target characters, that is, their character fidelity. Existing methods mainly focus on the knowledge and linguistic patterns of characters. This paper instead introduces a novel perspective: evaluating the personality fidelity of RPAs with psychological scales. To overcome the drawbacks of previous self-report assessments of RPAs, we propose InCharacter, namely Interviewing Character agents for personality tests. Experiments include various types of RPAs and LLMs, covering 32 distinct characters on 14 widely used psychological scales. The results validate the effectiveness of InCharacter in measuring RPA personalities. Using InCharacter, we then show that state-of-the-art RPAs exhibit personalities highly aligned with the human-perceived personalities of the characters, achieving an accuracy of up to 80.7%. Our demo, code, dataset, and results are publicly available.

Publication
ACL 2024

Xintao Wang
Ph.D. Candidate

My research interests focus on large language models and autonomous agents, especially their personas and personalization.