Du et al., 2025 - Google Patents
Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle NetworksDu et al., 2025
View PDF- Document ID
- 6565906062487252426
- Author
- Du B
- Du H
- Niyato D
- Li R
- Publication year
- Publication venue
- IEEE Transactions on Mobile Computing
External Links
Snippet
Task-oriented semantic communication has emerged as a fundamental approach for enhancing performance in various communication scenarios. While recent advances in Generative Artificial Intelligence (GenAI), such as Large Language Models (LLMs), have …
- 238000004891 communication 0 title abstract description 19
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zhang et al. | Generative AI-enabled vehicular networks: Fundamentals, framework, and case study | |
| US12182507B2 (en) | Text processing model training method, and text processing method and apparatus | |
| US20170357720A1 (en) | Joint heterogeneous language-vision embeddings for video tagging and search | |
| CN112084331A (en) | Text processing method, text processing device, model training method, model training device, computer equipment and storage medium | |
| CN113254684B (en) | Content aging determination method, related device, equipment and storage medium | |
| CN116756574A (en) | Training method, using method, device and equipment of multi-mode pre-training model | |
| Du et al. | Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks | |
| Hu et al. | Toward full-scene domain generalization in multi-agent collaborative bird’s eye view segmentation for connected and autonomous driving | |
| CN112085120A (en) | Multimedia data processing method and device, electronic equipment and storage medium | |
| Liu et al. | Cross-modal generative semantic communications for mobile AIGC: Joint semantic encoding and prompt engineering | |
| Wang et al. | Generative ai for autonomous driving: Frontiers and opportunities | |
| Niu | The effect of intelligent tour guide system based on attraction positioning and recommendation to improve the experience of tourists visiting scenic spots | |
| CN119763019A (en) | Information generation method based on multi-mode model and related equipment | |
| Fourati et al. | Xlm for autonomous driving systems: A comprehensive review | |
| Zhang et al. | Embodied AI-Enhanced Vehicular Networks: An Integrated Vision Language Models and Reinforcement Learning Method | |
| Hou et al. | Knowledge driven indoor object‐goal navigation aid for visually impaired people | |
| Tian et al. | Large (vision) language models for autonomous vehicles: Current trends and future directions | |
| Han et al. | Fostering college students’ mental well-being: the impact of social networking site utilization on emotion management and regulation | |
| An et al. | AI Flow: Perspectives, Scenarios, and Approaches | |
| CN117372828A (en) | Label generation method and device for multimedia information, storage medium and electronic equipment | |
| Li et al. | Without detection: Two‐step clustering features with local–global attention for image captioning | |
| Ma et al. | Multimodal data processing framework for smart city: A positional-attention based deep learning approach | |
| Afif et al. | Indoor objects detection system implementation using multi-graphic processing units | |
| US20250299485A1 (en) | Multi-object tracking using hierarchical graph neural networks | |
| Wang | Local pattern aware 3D video swin transformer with masked autoencoding for realtime augmented reality gesture interaction |