How Well Do Large Language Models Capture Human Personality?

Aanisha Bhattacharyya*, Yaman Kumar Singla*, Rajiv Ratn Shah, Changyou Chen, Jitendra Ajmera
* Equal contribution
Adobe Media and Data Science Research (MDSR), Adobe

Get in touch with us at behavior-in-the-wild@googlegroups.com

Motivation

Persona-based simulation with LLMs rests on several foundational assumptions that are rarely questioned but are central to how such systems are designed, interpreted, and deployed:

Research Question

Do these core assumptions actually hold in practice? Specifically:

Abstract

Large language models (LLMs) are increasingly used to simulate human populations via persona prompting, often under the assumptions that richer persona descriptions improve behavioral fidelity, similarly sized attribute combinations are equally simulatable, and persona definitions generalize across tasks. In this work, we formalize these assumptions and systematically evaluate them across multiple architectures, scales, and simulation settings. We identify a fundamental limitation we term persona manifold collapse, where increasingly expressive persona specifications lead to systematic contraction of representational and behavioral diversity. Across models, increasing persona complexity consistently reduces inter-persona separation in latent space and weakens behavioral differentiation in downstream simulation tasks. These effects persist across multiple analyses as richer personas fail to preserve human subgroup disagreement, performance varies across attribute combinations of similar size, and adding descriptive detail often degrades rather than improves simulation fidelity. Surprisingly, simple Age–Gender personas consistently outperform richly specified Ideal Customer Profiles (ICPs) across industries, achieving substantially higher downstream prediction accuracy. We find that collapse is not uniform across attributes. Certain combinations remain behaviorally stable and preserve stronger alignment with human responses, forming localized regions we term alignment bridges. Together, our results provide empirical and conceptual foundations for understanding the limits of persona-conditioned simulation, highlighting the need for representation-aware persona construction rather than increasing persona expressivity alone.

Key Results

Takeaway

The foundational assumptions of persona-based LLM simulation do not generally hold. Richer, more complex personas do not reliably yield better behavioral fidelity or diversity — in fact, they often cause the opposite. Effective persona design depends less on maximizing expressivity and more on identifying behaviorally stable attribute combinations that the model can reliably represent. The field should shift toward representation-aware persona construction, focusing on stable, meaningful attribute combinations rather than increasing narrative detail or attribute count.

BibTeX

@article{bhattacharyya2025llmpersonality,
  title={How Well Do Large Language Models Capture Human Personality?},
  author={Bhattacharyya, Aanisha and Singla, Yaman Kumar and Shah, Rajiv Ratn and Chen, Changyou and Ajmera, Jitendra},
  year={2025},
  url={https://www.researchgate.net/publication/405480733_How_Well_Do_Large_Language_Models_Capture_Human_Personality}
}