ORANSight-2.0: Foundational LLMs for O-RAN

Gajjar, Pranshav; Shah, Vijay K.

Computer Science > Computation and Language

arXiv:2503.05200 (cs)

[Submitted on 7 Mar 2025 (v1), last revised 22 Jul 2025 (this version, v2)]

Title:ORANSight-2.0: Foundational LLMs for O-RAN

Authors:Pranshav Gajjar, Vijay K. Shah

View PDF

Abstract:Despite the transformative impact of Large Language Models (LLMs) across critical domains such as healthcare, customer service, and business marketing, their integration into Open Radio Access Networks (O-RAN) remains limited. This gap is primarily due to the absence of domain-specific foundational models, with existing solutions often relying on general-purpose LLMs that fail to address the unique challenges and technical intricacies of O-RAN. To bridge this gap, we introduce ORANSight-2.0 (O-RAN Insights), a pioneering initiative to develop specialized foundational LLMs tailored for O-RAN. Built on 18 models spanning five open-source LLM frameworks -- Mistral, Qwen, Llama, Phi, and Gemma -- ORANSight-2.0 fine-tunes models ranging from 1B to 70B parameters, significantly reducing reliance on proprietary, closed-source models while enhancing performance in O-RAN-specific tasks. At the core of ORANSight-2.0 is RANSTRUCT, a novel Retrieval-Augmented Generation (RAG)-based instruction-tuning framework that employs two LLM agents -- a Mistral-based Question Generator and a Qwen-based Answer Generator -- to create high-quality instruction-tuning datasets. The generated dataset is then used to fine-tune the 18 pre-trained open-source LLMs via QLoRA. To evaluate ORANSight-2.0, we introduce srsRANBench, a novel benchmark designed for code generation and codebase understanding in the context of srsRAN, a widely used 5G O-RAN stack.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
Cite as:	arXiv:2503.05200 [cs.CL]
	(or arXiv:2503.05200v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.05200

Submission history

From: Pranshav Gajjar [view email]
[v1] Fri, 7 Mar 2025 07:44:31 UTC (855 KB)
[v2] Tue, 22 Jul 2025 20:40:41 UTC (593 KB)

Computer Science > Computation and Language

Title:ORANSight-2.0: Foundational LLMs for O-RAN

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ORANSight-2.0: Foundational LLMs for O-RAN

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators