Aroudi et al., 2020 - Google Patents

Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding

Document ID: 17600080323877210164
Author: Aroudi A; Delcroix M; Nakatani T; Kinoshita K; Araki S; Doclo S
Publication year: 2020
Publication venue: 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing (MLSP)

Snippet

The performance of speech enhancement algorithms in a multi-speaker scenario depends on correctly identifying the target speaker to be enhanced. Auditory attention decoding (AAD) methods make it possible to identify the target speaker to whom the listener is attending from …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/50—Customised settings for obtaining desired overall acoustical characteristics
- H04R25/505—Customised settings for obtaining desired overall acoustical characteristics using digital signal processing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R27/00—Public address systems

Similar Documents

Publication	Publication Date	Title
Hadad et al.	2016	The binaural LCMV beamformer and its performance analysis
Wang et al.	2021	Sequential multi-frame neural beamforming for speech separation and enhancement
Yousefian et al.	2011	A dual-microphone speech enhancement algorithm based on the coherence function
Aroudi et al.	2020	Cognitive-driven binaural beamforming using EEG-based auditory attention decoding
Pedersen et al.	2008	Two-microphone separation of speech mixtures
CN114078481B (en)	2024-12-17	Voice enhancement method and device based on two-channel neural network time-frequency masking and hearing aid equipment
Krueger et al.	2010	Speech enhancement with a GSC-like structure employing eigenvector-based transfer function ratios estimation
Aroudi et al.	2019	Cognitive-driven binaural LCMV beamformer using EEG-based auditory attention decoding
Doclo	2003	Multi-microphone noise reduction and dereverberation techniques for speech applications
Das et al.	2020	Linear versus deep learning methods for noisy speech separation for EEG-informed attention decoding
Schwartz et al.	2016	An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation
Schwartz et al.	2016	Joint estimation of late reverberant and speech power spectral densities in noisy environments using Frobenius norm
Yousefian et al.	2014	A coherence-based noise reduction algorithm for binaural hearing aids
Kleinschmidt et al.	2001	Combining speech enhancement and auditory feature extraction for robust speech recognition
Kim	2019	Hearing aid speech enhancement using phase difference-controlled dual-microphone generalized sidelobe canceller
Aroudi et al.	2020	Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding
Marin-Hurtado et al.	2011	Perceptually inspired noise-reduction method for binaural hearing aids
Pan et al.	2021	A single-input/binaural-output antiphasic speech enhancement method for speech intelligibility improvement
Schwartz et al.	2015	Nested generalized sidelobe canceller for joint dereverberation and noise reduction
Xu et al.	2024	Fovnet: Configurable field-of-view speech enhancement with low computation and distortion for smart glasses
Zohourian et al.	2018	GSC-based binaural speaker separation preserving spatial cues
Azarpour et al.	2017	Binaural noise reduction via cue-preserving MMSE filter and adaptive-blocking-based noise PSD estimation
Hadad et al.	2017	Comparison of two binaural beamforming approaches for hearing aids
Fischer et al.	2020	Robust constrained MFMVDR filters for single-channel speech enhancement based on spherical uncertainty set
Delfarah et al.	2018	Recurrent neural networks for cochannel speech separation in reverberant environments