Kim, 2019 - Google Patents
Hearing aid speech enhancement using phase difference-controlled dual-microphone generalized sidelobe cancellerKim, 2019
View PDF- Document ID
- 6448795683145573591
- Author
- Kim S
- Publication year
- Publication venue
- IEEE Access
External Links
Snippet
This paper proposes a new technique for improving a generalized sidelobe canceller (GSC) for dual-microphone speech enhancement to be applied in an auditory device such as a hearing aid. Here, the GSC is implemented on a 32-channel uniform polyphase discrete …
- 230000003595 spectral 0 abstract description 11
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K11/00—Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
- G10K11/18—Methods or devices for transmitting, conducting, or directing sound
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Parchami et al. | Recent developments in speech enhancement in the short-time Fourier transform domain | |
| Erdogan et al. | Improved MVDR beamforming using single-channel mask prediction networks. | |
| CA2621940C (en) | Method and device for binaural signal enhancement | |
| Kim | Hearing aid speech enhancement using phase difference-controlled dual-microphone generalized sidelobe canceller | |
| US20100217590A1 (en) | Speaker localization system and method | |
| Xiao et al. | The NTU-ADSC systems for reverberation challenge 2014 | |
| Taherian et al. | Deep learning based multi-channel speaker recognition in noisy and reverberant environments | |
| US11380312B1 (en) | Residual echo suppression for keyword detection | |
| Liu et al. | Inplace gated convolutional recurrent neural network for dual-channel speech enhancement | |
| Kim et al. | Factorized MVDR deep beamforming for multi-channel speech enhancement | |
| EP3847645B1 (en) | Determining a room response of a desired source in a reverberant environment | |
| Bohlender et al. | Neural networks using full-band and subband spatial features for mask based source separation | |
| Wisdom et al. | Enhancement and recognition of reverberant and noisy speech by extending its coherence | |
| Kovalyov et al. | Dfsnet: A steerable neural beamformer invariant to microphone array configuration for real-time, low-latency speech enhancement | |
| EP3566228A1 (en) | Audio capture using beamforming | |
| Hwang et al. | Dual microphone speech enhancement based on statistical modeling of interchannel phase difference | |
| Aroudi et al. | Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding | |
| Wang et al. | Dual-channel target speaker extraction based on conditional variational autoencoder and directional information | |
| Kim et al. | Hybrid probabilistic adaptation mode controller for generalized sidelobe cancellers applied to multi-microphone speech enhancement | |
| Liu et al. | A new neural beamformer for multi-channel speech separation | |
| Levi et al. | A robust method to extract talker azimuth orientation using a large-aperture microphone array | |
| Cheng et al. | Speech Enhancement Based on Beamforming and Post-Filtering by Combining Phase Information. | |
| Kim et al. | Noise variance estimation based on dual-channel phase difference for speech enhancement | |
| Zhao et al. | Directional noise suppression based on dual-microphone with desired direction presetting | |
| Šarić et al. | Mask-based Beamforming Applied to the end-fire microphone array |