Curriculum-style Data Augmentation for LLM-based Metaphor Detection

Jia, Kaidi; Wu, Yanxia; Liu, Ming; Li, Rongsheng

Computer Science > Computation and Language

arXiv:2412.02956v2 (cs)

[Submitted on 4 Dec 2024 (v1), last revised 2 Mar 2025 (this version, v2)]

Title:Curriculum-style Data Augmentation for LLM-based Metaphor Detection

Authors:Kaidi Jia, Yanxia Wu, Ming Liu, Rongsheng Li

View PDF HTML (experimental)

Abstract:Recently, utilizing large language models (LLMs) for metaphor detection has achieved promising results. However, these methods heavily rely on the capabilities of closed-source LLMs, which come with relatively high inference costs and latency. To address this, we propose a method for metaphor detection by fine-tuning open-source LLMs, effectively reducing inference costs and latency with a single inference step. Furthermore, metaphor detection suffers from a severe data scarcity problem, which hinders effective fine-tuning of LLMs. To tackle this, we introduce Curriculum-style Data Augmentation (CDA). Specifically, before fine-tuning, we evaluate the training data to identify correctly predicted instances for fine-tuning, while incorrectly predicted instances are used as seed data for data augmentation. This approach enables the model to quickly learn simpler knowledge and progressively acquire more complex knowledge, thereby improving performance incrementally. Experimental results demonstrate that our method achieves state-of-the-art performance across all baselines. Additionally, we provide detailed ablation studies to validate the effectiveness of CDA.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.02956 [cs.CL]
	(or arXiv:2412.02956v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.02956

Submission history

From: Kaidi Jia [view email]
[v1] Wed, 4 Dec 2024 02:05:21 UTC (96 KB)
[v2] Sun, 2 Mar 2025 09:35:28 UTC (98 KB)

Computer Science > Computation and Language

Title:Curriculum-style Data Augmentation for LLM-based Metaphor Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Curriculum-style Data Augmentation for LLM-based Metaphor Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators