Li et al., 2023 - Google Patents
Research on Audio Processing Method Based on 3D Technology (Li et al., 2023)
- Document ID
- 12730102622472592211
- Author
- Li K
- Tang Y
- Ouyang Y
- Publication year
- 2023
- Publication venue
- International Conference on Computational Finance and Business Analytics
Snippet
The human auditory system perceives not only subjective attributes of sound such as loudness, tone and timbre, but also its spatial attributes. The 3D sound effect is an acoustic concept characterized by a broad sound stage and a strong sense of …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3074—Audio data retrieval
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/155—Musical effects
- G10H2210/265—Acoustic effect simulation, i.e. volume, spatial, resonance or reverberation effects added to a musical sound, usually by appropriate filtering or delays
- G10H2210/295—Spatial effects, musical uses of multiple audio channels, e.g. stereo
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/131—Mathematical functions for musical analysis, processing, synthesis or composition
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zakariah et al. | | Digital multimedia audio forensics: past, present and future |
| US9093120B2 (en) | | Audio fingerprint extraction by scaling in time and resampling |
| US8396705B2 (en) | | Extraction and matching of characteristic fingerprints from audio signals |
| Liu et al. | | Detection of double MP3 compression |
| CN103959375A (en) | | Enhanced chroma extraction from an audio codec |
| Lu et al. | | Self-supervised audio spatialization with correspondence classifier |
| Umapathy et al. | | Audio signal processing using time-frequency approaches: coding, classification, fingerprinting, and watermarking |
| Zhu et al. | | Sound texture modeling and time-frequency LPC |
| Liu et al. | | Anti-forensics of fake stereo audio using generative adversarial network |
| US12437213B2 (en) | | Bayesian graph-based retrieval-augmented generation with synthetic feedback loop (BG-RAG-SFL) |
| CN119541516A (en) | | Adaptive audio enhancement method, device, SoC chip and storage medium |
| EP2489036B1 (en) | | Method, apparatus and computer program for processing multi-channel audio signals |
| Nematollahi et al. | | Digital speech watermarking based on linear predictive analysis and singular value decomposition |
| Li et al. | | Research on Audio Processing Method Based on 3D Technology |
| Akesbi | | Audio denoising for robust audio fingerprinting |
| You et al. | | Music Identification System Using MPEG‐7 Audio Signature Descriptors |
| Lan et al. | | Research on improved DNN and MultiResU_Net network speech enhancement effect |
| Yan | | Automatic Annotation Method for Music Post-Production Based on Hybrid Boltzmann Machine and Pitch Features |
| CN120299465B (en) | | Audio data processing method, device, equipment, storage medium and program product |
| CN119601023B (en) | | Bluetooth sound box data processing method, system and storage medium |
| Horsburgh et al. | | Music-inspired texture representation |
| Yang et al. | | Multi-channel object-based spatial parameter compression approach for 3d audio |
| Akesbi et al. | | Music Augmentation And Denoising For Peak-Based Audio Fingerprinting |
| Venkateswaralu et al. | | Audio compression using Munich and Cambridge filters for audio coding with Morlet wavelet |
| Deshmukh et al. | | AutoFoley Sound Synthesis and Analysis: A Survey on Current Status and Its Future Scope |