Forking Paths in Neural Text Generation

Bigelow, Eric; Holtzman, Ari; Tanaka, Hidenori; Ullman, Tomer

arXiv:2412.07961 (cs)

[Submitted on 10 Dec 2024]

Abstract: Estimating uncertainty in Large Language Models (LLMs) is important for properly evaluating LLMs and ensuring safety for users. However, prior approaches to uncertainty estimation focus on the final answer in generated text, ignoring intermediate steps that might dramatically impact the outcome. We hypothesize that there exist key forking tokens, such that re-sampling the system at those specific tokens, but not others, leads to very different outcomes. To test this empirically, we develop a novel approach to representing uncertainty dynamics across individual tokens of text generation, and applying statistical models to test our hypothesis. Our approach is highly flexible: it can be applied to any dataset and any LLM, without fine-tuning or accessing model weights. We use our method to analyze LLM responses on 7 different tasks across 4 domains, spanning a wide range of typical use cases. We find many examples of forking tokens, including surprising ones such as punctuation marks, suggesting that LLMs are often just a single token away from saying something very different.
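
The forking-token idea in the abstract can be illustrated with a toy sketch. This is not the authors' method: the abstract does not specify their statistical models, so the code below simply resumes sampling from each prefix of a response, estimates the distribution over final answers, and scores each token by how far that distribution shifts once the token is committed. All names here (`toy_sample_answer`, `outcome_distribution`, `forking_scores`) and the stand-in "model" are hypothetical, chosen only for illustration.

```python
import random
from collections import Counter


def toy_sample_answer(prefix, rng):
    """Hypothetical stand-in for an LLM: returns a final answer given a prefix.

    Assumption for illustration only: once the token "not" appears, the
    answer is always "B"; before that, it is a coin flip between "A" and "B".
    """
    if "not" in prefix:
        return "B"
    return rng.choice(["A", "B"])


def outcome_distribution(sample_answer, prefix, n_samples, rng):
    """Empirical distribution over final answers when sampling resumes from prefix."""
    counts = Counter(sample_answer(prefix, rng) for _ in range(n_samples))
    return {answer: count / n_samples for answer, count in counts.items()}


def total_variation(p, q):
    """Total variation distance between two discrete distributions (dicts)."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


def forking_scores(sample_answer, tokens, n_samples=200, seed=0):
    """Score token t by the shift in outcome distribution caused by committing to it."""
    rng = random.Random(seed)
    dists = [
        outcome_distribution(sample_answer, tokens[:t], n_samples, rng)
        for t in range(len(tokens) + 1)
    ]
    return [total_variation(dists[t], dists[t + 1]) for t in range(len(tokens))]


tokens = ["The", "answer", "is", "not", "A"]
scores = forking_scores(toy_sample_answer, tokens)
for token, score in zip(tokens, scores):
    print(f"{token!r}: {score:.2f}")
```

In this toy setup the largest score falls on "not", the one token after which the outcome is fixed; the other tokens show only sampling noise. With a real LLM, `toy_sample_answer` would be replaced by a call that continues generation from the prefix and extracts the final answer, which needs only sampling access and not model weights.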

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2412.07961 [cs.CL] (or arXiv:2412.07961v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2412.07961

Submission history

From: Eric Bigelow
[v1] Tue, 10 Dec 2024 22:57:57 UTC (5,340 KB)
