Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic

Xin Zheng; Jie Lou; Boxi Cao; Xueru Wen; Yuqiu Ji; Hongyu Lin; Yaojie Lu; Xianpei Han; Debing Zhang; Le Sun

doi:10.18653/v1/2025.findings-acl.89

Abstract

Self-critique has become a crucial mechanism for enhancing the reasoning performance of LLMs. However, current approaches mainly rely on basic prompts that elicit intuitive, instance-level feedback, which resembles System-1 processing and limits reasoning capability. Moreover, there has been little in-depth investigation of the relationship between an LLM’s ability to critique and its task-solving performance. To address these issues, we propose Critic-CoT, a novel framework that pushes LLMs toward System-2-like critic capability. Through a step-wise CoT reasoning paradigm and the automatic construction of weak-supervision data without human annotation, Critic-CoT enables LLMs to engage in slow, analytic self-critique and refinement, thereby improving their reasoning abilities. Experiments on GSM8K and MATH, together with out-of-domain evaluation, demonstrate that our enhanced model significantly boosts task-solving performance, either by filtering out invalid solutions or by iteratively refining them. Furthermore, we investigate the intrinsic correlation between critique and task-solving abilities within LLMs, finding that these abilities can mutually reinforce each other rather than conflict.
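The two inference-time uses of a critic named in the abstract, filtering out invalid solutions and iterative refinement, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `Solution` type, the `Critic` and `Refiner` signatures, the majority vote over surviving candidates, and the toy arithmetic critic are hypothetical stand-ins, not the paper's implementation, in which the critic and the refiner would be LLM calls.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Solution:
    steps: List[str]  # chain-of-thought steps, one string per step
    answer: str       # final answer of the solution


# Hypothetical interfaces: a critic returns the index of the first step it
# judges wrong (None if it accepts every step); a refiner rewrites a
# solution given the index of the flagged step.
Critic = Callable[[Solution], Optional[int]]
Refiner = Callable[[Solution, int], Solution]


def critic_filter_vote(candidates: List[Solution], critic: Critic) -> Optional[str]:
    """Drop candidates the critic flags, then majority-vote the survivors.

    The vote is an assumption of this sketch; the abstract only mentions
    filtering out invalid solutions.
    """
    kept = [s for s in candidates if critic(s) is None]
    pool = kept or candidates  # fall back to all candidates if none survive
    if not pool:
        return None
    return Counter(s.answer for s in pool).most_common(1)[0][0]


def iterative_refine(solution: Solution, critic: Critic, refiner: Refiner,
                     max_rounds: int = 3) -> Solution:
    """Alternate step-wise critique and refinement until the critic accepts."""
    for _ in range(max_rounds):
        bad_step = critic(solution)
        if bad_step is None:
            break
        solution = refiner(solution, bad_step)
    return solution


def toy_critic(sol: Solution) -> Optional[int]:
    """Toy critic: every step has the form 'a+b=c'; check each sum in order."""
    for i, step in enumerate(sol.steps):
        lhs, rhs = step.split("=")
        a, b = lhs.split("+")
        if int(a) + int(b) != int(rhs):
            return i
    return None


def toy_refiner(sol: Solution, i: int) -> Solution:
    """Toy refiner: recompute the flagged step and re-read the final answer."""
    a, b = sol.steps[i].split("=")[0].split("+")
    fixed = f"{a}+{b}={int(a) + int(b)}"
    steps = sol.steps[:i] + [fixed] + sol.steps[i + 1:]
    return Solution(steps, steps[-1].split("=")[1])
```

In this toy setting, a plain majority vote over two wrong candidates and one right candidate would pick the wrong answer, whereas filtering first leaves only the candidate that the critic accepts.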

Anthology ID: 2025.findings-acl.89
Volume: Findings of the Association for Computational Linguistics: ACL 2025
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 1768–1806
URL: https://aclanthology.org/2025.findings-acl.89/
DOI: 10.18653/v1/2025.findings-acl.89
Cite (ACL): Xin Zheng, Jie Lou, Boxi Cao, Xueru Wen, Yuqiu Ji, Hongyu Lin, Yaojie Lu, Xianpei Han, Debing Zhang, and Le Sun. 2025. Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic. In Findings of the Association for Computational Linguistics: ACL 2025, pages 1768–1806, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic (Zheng et al., Findings 2025)
PDF: https://aclanthology.org/2025.findings-acl.89.pdf
