Du et al., 2025 - Google Patents

Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks

Du et al., 2025

Document ID: 6565906062487252426
Author: Du B; Du H; Niyato D; Li R
Publication year: 2025
Publication venue: IEEE Transactions on Mobile Computing

External Links

Cited by

Snippet

Task-oriented semantic communication has emerged as a fundamental approach for enhancing performance in various communication scenarios. While recent advances in Generative Artificial Intelligence (GenAI), such as Large Language Models (LLMs), have …

Continue reading at arxiv.org (PDF) (other versions)

238000004891 communication 0 title abstract description 19

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
Zhang et al.	2024	Generative AI-enabled vehicular networks: Fundamentals, framework, and case study
US12182507B2 (en)	2024-12-31	Text processing model training method, and text processing method and apparatus
US20170357720A1 (en)	2017-12-14	Joint heterogeneous language-vision embeddings for video tagging and search
CN112084331A (en)	2020-12-15	Text processing method, text processing device, model training method, model training device, computer equipment and storage medium
CN113254684B (en)	2021-10-29	Content aging determination method, related device, equipment and storage medium
CN116756574A (en)	2023-09-15	Training method, using method, device and equipment of multi-mode pre-training model
Du et al.	2025	Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks
Hu et al.	2024	Toward full-scene domain generalization in multi-agent collaborative bird’s eye view segmentation for connected and autonomous driving
CN112085120A (en)	2020-12-15	Multimedia data processing method and device, electronic equipment and storage medium
Liu et al.	2024	Cross-modal generative semantic communications for mobile AIGC: Joint semantic encoding and prompt engineering
Wang et al.	2025	Generative ai for autonomous driving: Frontiers and opportunities
Niu	2023	The effect of intelligent tour guide system based on attraction positioning and recommendation to improve the experience of tourists visiting scenic spots
CN119763019A (en)	2025-04-04	Information generation method based on multi-mode model and related equipment
Fourati et al.	2024	Xlm for autonomous driving systems: A comprehensive review
Zhang et al.	2025	Embodied AI-Enhanced Vehicular Networks: An Integrated Vision Language Models and Reinforcement Learning Method
Hou et al.	2022	Knowledge driven indoor object‐goal navigation aid for visually impaired people
Tian et al.	2024	Large (vision) language models for autonomous vehicles: Current trends and future directions
Han et al.	2024	Fostering college students’ mental well-being: the impact of social networking site utilization on emotion management and regulation
An et al.	2025	AI Flow: Perspectives, Scenarios, and Approaches
CN117372828A (en)	2024-01-09	Label generation method and device for multimedia information, storage medium and electronic equipment
Li et al.	2022	Without detection: Two‐step clustering features with local–global attention for image captioning
Ma et al.	2020	Multimodal data processing framework for smart city: A positional-attention based deep learning approach
Afif et al.	2022	Indoor objects detection system implementation using multi-graphic processing units
US20250299485A1 (en)	2025-09-25	Multi-object tracking using hierarchical graph neural networks
Wang	2025	Local pattern aware 3D video swin transformer with masked autoencoding for realtime augmented reality gesture interaction