The Impossibility of Fair LLMs

Anthis, Jacy; Lum, Kristian; Ekstrand, Michael; Feller, Avi; Tan, Chenhao

Computer Science > Computation and Language

arXiv:2406.03198 (cs)

[Submitted on 28 May 2024 (v1), last revised 5 Jun 2025 (this version, v2)]

Title:The Impossibility of Fair LLMs

Authors:Jacy Anthis, Kristian Lum, Michael Ekstrand, Avi Feller, Chenhao Tan

View PDF HTML (experimental)

Abstract:The rise of general-purpose artificial intelligence (AI) systems, particularly large language models (LLMs), has raised pressing moral questions about how to reduce bias and ensure fairness at scale. Researchers have documented a sort of "bias" in the significant correlations between demographics (e.g., race, gender) in LLM prompts and responses, but it remains unclear how LLM fairness could be evaluated with more rigorous definitions, such as group fairness or fair representations. We analyze a variety of technical fairness frameworks and find inherent challenges in each that make the development of a fair LLM intractable. We show that each framework either does not logically extend to the general-purpose AI context or is infeasible in practice, primarily due to the large amounts of unstructured training data and the many potential combinations of human populations, use cases, and sensitive attributes. These inherent challenges would persist for general-purpose AI, including LLMs, even if empirical challenges, such as limited participatory input and limited measurement methods, were overcome. Nonetheless, fairness will remain an important type of model evaluation, and there are still promising research directions, particularly the development of standards for the responsibility of LLM developers, context-specific evaluations, and methods of iterative, participatory, and AI-assisted evaluation that could scale fairness across the diverse contexts of modern human-AI interaction.

Comments:	Published in ACL 2025
Subjects:	Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Applications (stat.AP); Machine Learning (stat.ML)
Cite as:	arXiv:2406.03198 [cs.CL]
	(or arXiv:2406.03198v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.03198

Submission history

From: Jacy Reese Anthis [view email]
[v1] Tue, 28 May 2024 04:36:15 UTC (83 KB)
[v2] Thu, 5 Jun 2025 14:35:42 UTC (134 KB)

Computer Science > Computation and Language

Title:The Impossibility of Fair LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:The Impossibility of Fair LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators