WO2003015458A2 - Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment - Google Patents
Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment Download PDFInfo
- Publication number
- WO2003015458A2 WO2003015458A2 PCT/EP2002/009029 EP0209029W WO03015458A2 WO 2003015458 A2 WO2003015458 A2 WO 2003015458A2 EP 0209029 W EP0209029 W EP 0209029W WO 03015458 A2 WO03015458 A2 WO 03015458A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- gain
- wave
- audio processor
- wave parameter
- signal
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/004—Monitoring arrangements; Testing arrangements for microphones
- H04R29/005—Microphone arrays
- H04R29/006—Microphone matching
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/40—Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
- H04R2201/403—Linear arrays of transducers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
Definitions
- the present invention relates generally to audio signal processing. More specifically, the present invention relates to an audio processing system that exhibits an arbitrary directivity and gradient response.
- FIG. 1 a block diagram of a sound processing system 10 is shown,
- the system 10 includes at least one microphone 12 that picks up sounds from a sound field in which it is located and converts these sounds to electrical signals.
- a plurality of microphones are depicted and the microphones are numbered from one to Nl .
- the electrical signals from the microphones 12 are preferably input to an audio processor 14.
- the sounds are to be reproduced by one or more output devices 16 such as loudspeakers, earphones, and the like.
- the sound can optionally pass through transmission channels or additional processing before arriving at the output device 16. It may even be recorded and played back before arriving at the output device 16.
- the sound field into which the system 10 is placed contains not only the sounds to be picked up, referred to as a utility signal, but also unwanted sounds, referred to as noise or noise signals.
- noise or noise signals unwanted sounds
- One is known as static beamforming where the signals from two or more microphones are passed through filters and combined to form a single signal.
- the resulting signal will show a sensitivity to sounds that depends upon the direction of the sound incidence as compared to the direction of the microphone assembly.
- the directivity response will take the form of one or more beams. Due to the fact that the lobes of the directivity response have different magnitudes, the beamformer will show a signal to noise improvement when the beam is oriented so that the utility signal falls within the main lobe and the main part of the noise falls outside the main lobe.
- Static beamformers have the disadvantage that, in order to provide substantial noise reduction under general noise conditions, a large number of microphones are required.
- adaptive beamforming is achieved when the filters of a beamformer are variable and controlled by an adaptation process. Normally such an adaptation process works to minimize the output signal power.
- An adaptive beamformer can track noise sources and dynamically adjust the directivity response such that the sensitivity at the direction of the noise incidence is minimized while keeping the sensitivity at the utility direction high.
- Currently known adaptive beamformers show the disadvantage that they are only capable of tracking a limited number of noise sources, mostly only a single. Furthermore adaptive beamformers work with a fairly large time constant in the adaptation process. Therefore they are only able to track quasi-static noise sources.
- the incoming waves are amplified with a gain function based on these properties, that is, the directivity or the gradient.
- Based on the amplified waves an output signal is generated either by synthesizing the amplified waves or by applying filtering to an input signal combination.
- the present invention can operate in a number of applications including hearing aids, directional microphones, microphone arrays, silicon microphone assemblies, headsets, hearing protectors, cordless phones, mobile phones, camcorders, personal computers, laptops, palmtops, and personal digital assistants, ' among others.
- the present invention is especially suited to work with head worn microphones that pick up the speech signal of the wearer.
- the present invention offers a substantially improved noise reduction when compared to conventional solutions with comparable sound quality.
- a sound processing system including at least one microphone, an audio processor, and at least one output device.
- the audio processor includes an analog beamformer, a microphone equalizer, and an apparent incidence processor.
- Two different embodiments of the apparent incidence processor are disclosed, that is, a wave generation method and a forward filtering method. Both embodiments use the same principles to estimate the properties of the individual waves of the sound field.
- the present invention it is possible to implement arbitrary directivity or gradient responses using a small number of microphones only, that is, two or three microphones.
- the present invention offers improved noise reduction also for environments with many independent noise sources. Furthermore, the present invention works for signals and noises with arbitrary statistics.
- FIG. 1 is a block diagram of a sound processing system
- FIG. 2 is a block diagram according to a preferred embodiment of the present invention of the audio processor of FIG. 1;
- FIG. 3 is a block diagram of the analog beamformer of FIG. 2;
- FIG. 4 is a block diagram of the microphone equalizer of FIG. 2;
- FIG. 5 is a block diagram of the microphone equalization updater of FIG.
- FIG. 6 is a block diagram according to a preferred embodiment of the present invention of the apparent incidence processor of FIG. 2;
- FIG. 7 is a block diagram of the analysis beamformer of FIG. 6;
- FIG. 8 is a block diagram of the wave parameter estimator of FIG. 6;
- FIG. 9 is a block diagram of the equation solver of FIG. 8.
- FIG. 10 is a block diagram of an embodiment of the core solver of FIG. 9 using a table look up implementation with optional approximation;
- FIG. 11 is a block diagram of the output generator of FIG. 6;
- FIG. 12 is a block diagram of the statistical evaluator of FIG. 11;
- FIG. 13 is a block diagram of the wave generation gain controller of FIG.
- FIG. 14 is a pair of polar plots of a set of gain versus direction functions
- FIG. 15 is a block diagram of the gain mapper of FIG. 11 ;
- FIG. 16 is a block diagram of the signal generator of FIG. 11;
- FIG. 17 is a block diagram according to another preferred embodiment of the present invention of the apparent incidence processor of FIG. 2;
- FIG. 18 is a block diagram of the forward filter of FIG. 17;
- FIG- 1 is a block diagram of the forward beamformer of FIG. 18;
- FIG. 20 is a block diagram of the adaptor of FIG. 19;
- FIG. 21 is a block diagram of the forward filter gain controller of FIG. 18;
- FIG. 22 is a block diagram of the forward filter gain function applier of FIG. 21;
- FIG. 23 is a block diagram of a multiple output embodiment of the wave generation method
- FIG. 24 is a block diagram of a multiple output embodiment of the forward filtering method
- FIG. 25 is a block diagram of a forward filter/output generator
- FIG. 26 is a block diagram of the wave generator/forward filter gain controller of FIG. 25;
- FIG. 27 is a block diagram of a single combined mathematical transform processor
- FIG. 28 is a block diagram of a near field embodiment of the audio processor of FIG. 1;
- FIG. 29 is a block diagram of the microphone equalizer of FIG. 28;
- FIG. 30 is a block diagram of the microphone equalization updater of FIG. 29;
- FIG. 31 is a block diagram of the beamformer of FIG. 28;
- FIG. 32 is a block diagram of the near field gain controller of FIG. 28;
- FIG. 33 is a block diagram of the statistical evaluator of FIG. 32;
- FIG. 34 is a block diagram of an embodiment of the near field gain function applier of FIG. 32;
- FIG. 35 is a block diagram of an embodiment of the near field gain function applier of FIG 32 using a table look up implementation with subsequent approximation/interpolation;
- FIG. 36 is a pair of graphs of gain function of different widths.
- the components, process steps, and/or data structures may be implemented using various types of operating systems, ' computing platforms, computer programs, and/or general purpose machines.
- operating systems ' computing platforms, computer programs, and/or general purpose machines.
- devices of a less general purpose nature such as hardwired devices, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), or the like, may also be used without departing from the scope and spirit of the inventive concepts disclosed herein.
- FPGAs field programmable gate arrays
- ASICs application specific integrated circuits
- FIG. 2 a block diagram according to a preferred embodiment of the present invention of the audio processor 14 of FIG. 1 is shown.
- the audio processor 14 includes an analog beamformer 18, a microphone equalizer 20, and an apparent incidence processor 22.
- an apparent incidence processor 22 two different embodiments of the apparent incidence processor 22 will be disclosed, that is, a wave generation method and a forward filtering method. Both embodiments use the same principles to estimate the properties of the individual waves of the sound field.
- the method of apparent incidence processing involves complex signal operations. It is therefore possible that the processing will be performed with digital techniques.
- the microphone signals will be converted to digital signals with at least one analog to digital (A/D) converter 24 and the output signal will be converted back to an analog signal, if needed, with a digital to analog (D/A) converter 26.
- A/D analog to digital
- D/A digital to analog
- the analog beamformer 18 provides analog preprocessing according to conventional techniques of the microphone signals that enables the reduction of the resolution of all but one of the A/D converters 24. This can save size and reduce the power consumption. For hearing aids, for example, these properties are highly desirable. Depending on the circumstances however, the analog beamformer 18 may be deleted as unnecessary or too costly. [0019]
- the method of apparent incidence processing requires, as generally do conventional beamformers, that the microphones 12 of FIG. 1 have sensitivities that are matched.
- the microphone equalizer 20 equalizes the signals from the microphones 12 according to conventional techniques. With this equalization, the functioning of the processing downstream will still be possible even if the microphones have different sensitivities and even when the microphone sensitivities change over time. Again, depending on the circumstances however, the microphone equalizer 20 may be deleted as unnecessary or too costly.
- the analog beamformer 18 includes at least one filter 28 and a summing device 30. Each of the output beams of the analog beamformer 18 is derived as the sum of the filtered microphone signals. The variable i indexes the analog beam outputs. Each of the filters 28 will generally be different from microphone to microphone and from beam to beam.
- the beam amic(l) is formed as the sum of all microphone inputs and the other beams are formed as the difference of a specific microphone signal and a reference microphone signal (Microphone (1)).
- the filter transfer functions in the laplace domain that implements such a beamforming are as shown in (1) below.
- Nl N2.
- the generalized beam amic(i) will relate to the microphone signals as follows:
- each of the filters, Filter (j,i) approximates different time delays of the microphone signals, that is, either inverting or non-inverting.
- each of the beams amic(i) implements the same directivity using different microphone combinations.
- the microphones are placed equidistant along a common axis.
- the analog beamformer is defined according to (3) below. The numberingy of the microphones follows their placement along the common axis with number one being closest to the sound source. NB-1 is commonly referred to as the order of the directivity.
- Filter (j, i) Filter (j + 1, i + 1) for all j ⁇ Nl - 1, i ⁇ N2 - 1
- N2 N ⁇ -NB
- FIG. 4 a block diagram of the microphone equalizer 20 of FIG. 2 is shown. While the microphone equalizer 20 could be implemented in the time domain with a FIR or an IIR filter, the preferred scheme of FIG. 4 works in the frequency domain.
- the microphone signals (mic(l) to mic( ⁇ 2)) are first converted to the frequency domain in a plurality of forward transformers 32.
- the equalization is then accomplished by multiplying, in the frequency domain, the microphone signals with at least one equalization function (MicEq(i)) generated by a microphone equalization updater 34.
- the equalized signals are finally converted back to the time domain in a plurality of reverse transformers 36.
- one of the microphone signals is chosen as a reference.
- the reference signal is referred to with index one, rnic(l).
- This reference signal is by definition equalized and is thus passed through the processing of the microphone equalizer 20 unaltered.
- the equalization functions generated by the microphone equalization updater 34 should follow the definition of (4) below. (4) MicEq( ⁇ ) ⁇ S ⁇ V)
- S(l) is the sensitivity defined as the digital value at the input of the microphone equalizer 20 of FIG. 2 divided by the sound pressure of the reference microphone signal and S(i) is the corresponding sensitivity of the other microphones, respectively. All of the terms of (4) are implicit functions of frequency.
- FIG. 5 a block diagram of the microphone equalization updater 34 of FIG. 3 is shown.
- Some of the processing is done in a polar complex, that is, magnitude/phase, format so a rectangular to polar converter 38 and a polar to rectangular converter 40 are provided.
- the rest of the processing except as noted, uses a rectangular, that is, real/imaginary, format for complex numbers.
- a phase accumulator 42 and a magnitude accumulator 44 hold the equalization response of the specific microphone (i).
- the response is updated at regular intervals by accumulating small updates to the phase and magnitude accumulators 42, 44, respectively.
- the current equalized spectrum, CMIC(l) of the reference channel is divided by the equalized spectrum, CMICfi), of the respective channel.
- the quotient is converted to polar format as phase and magnitude in the rectangular to polar converter 38.
- the phase of the quotient is analyzed in a zero phase condition detector 46. If the phase indicates that the sound incidence is from a direction and a distance for which the current microphone signal should have the same magnitude sensitivity as the reference microphone signal, then the zero phase condition detector 46 will output a logic one as a ZeroPhase output signal. If the phase condition does not hold, then the analyzer will output a logic zero.
- a magnitude update value is obtained that is suitable to use to update the magnitude accumulator 44 for the equalizing function.
- phase update both the phase and the magnitude of the quotient are analyzed. Under normal conditions, it will generally not be possible to derive any information regarding a misfit of the phase of the microphone equalization response. But once in a while, triggered by a specific input from a specific direction, the microphone signals will relate to each other in a way that can only be possible if the equalization response is incorrect.
- a compute excess phase monitor 48 the signals are monitored and if such an "impossible condition" is found to exist the amount by which the phase is un-natural will be output as an excess phase signal. If the phase conditions are natural, the compute excess phase monitor 48 will output zero. As the excess phase conditions depend upon signal frequency, the compute excess phase monitor 48 estimates the frequency for its use.
- the phase update signal is scaled with the PAaseCoe/coefficient.
- the signal inband detector 50 outputs a logic one if the power in the current frequency band is contributed mainly by input contents of frequencies falling within the band. Conversely, the signal inband detector 50 outputs a logic zero if the contents are due mainly to input at frequencies outside of the band. It is widely known that most time-to-frequency transforms "spill" energy from the source band to neighboring bands due to windowing effects or similar mechanisms. However, only signals within the frequency band should be allowed to influence the equalizing value for a band. This differentiation is possible through the use of the Inband signal.
- the equalization response will be updated dynamically.
- the phase and the magnitude of the equalization response will be regulated independently. Updating follow through statistical processes that rely on the noise-like nature of the signals that are most likely to be encountered as inputs to the audio processor, such as speech, machine noise, wind noise, and the like.
- the update signals will contain large noise components, therefore they are scaled with small coefficients, that is, PhaseCoef and MagnitudeCoef, respectively, such that the adaptation times are slow. The use of the coefficients prevents noise from entering the forward signals through the equalization processes.
- phase and magnitude accumulators 42 and 44 are divided into static and dynamic parts, where the updates only influence the dynamic parts.
- the effective equalization response is the product of the static and dynamic parts.
- the static part of the equalization response is measured with standard measuring techniques once or regularly at the time of production test or at some other convenient times and saved.
- a forgetting factor is included with the dynamic part of the accumulator.
- the forgetting factor causes the dynamic response to converge towards zero when no updates are received.
- means are provided that can save the accumulated equalization response when the audio processor is powered down and read the saved response again when the processor is powered up the next time.
- the signals mic(i) are all omni directional and the zero phase condition detector 46 is implemented so as to compare the magnitude of the phase with a constant value. If the phase magnitude is smaller than the constant, then a logic one ZeroPhase signal is generated.
- the signals mic(i) are all omni directional and the compute excess phase monitor 48 generates the phase update signal according to (5) below.
- f(i) is the frequency as estimated by the compute excess phase monitor 48.
- a,f and ExcessPhase are all vectors covering the frequency range of the frequency transformation used.
- the * operation in (5) denotes an element-by-element multiplication and not a vector multiply.
- d(i) is the physical spacing between Microphonefi) and the reference microphone, Microphone(l).
- c is the speed of sound, s is a small positive constant.
- the center frequency, f(i), for the equalization is estimated as the center frequency of the band i.
- the frequency, /f ), for the equalization is estimated as in (6) below.
- k is the frequency band index
- fc(k) is the center frequency of band k
- BW(k) is the bandwidth of band k
- b is a positive constant
- the signal inband detector 50 for each frequency band evaluates the absolute value of its input signal in the current band and the two nearest neighboring bands. If the current band carries the highest absolute value, then the Inband signal for the current band is generated as a logic one and otherwise it is generated as a logic zero.
- FIG. 6 a block diagram according to a preferred embodiment of the present invention of the apparent incidence processor 22 of FIG. 2 is shown.
- the wave generation method is shown. The processing runs in three stages. First, analysis beamforming 52 is performed on the equalized microphone signals. Second, the parameters of the incoming sound waves are estimated in a wave parameter estimator 54. Finally, an output generator 56 produces a signal that contains the incoming waves modified in such a way that unwanted waves are attenuated by comparison to the utility waves.
- FIG. 7 a block diagram of the analysis beamformer 52 of FIG. 6 is shown.
- the analysis beamformer 52 is similar to the analog beamformer 18 of FIG. 3 described above but it works in the digital domain.
- the analysis beamformer 52 generates a plurality of abeam signals and for each signal it includes a plurality of filters 58 and a summing device 60.
- the analysis beamformer 52 serves, among other functions, to remove unwanted noise from the signal thus enhancing quality of the wave parameter estimation.
- the analysis beamforming of (7) is combined with the analog beamforming of (1) above. In such a combination, it can then be shown that the output of the analysis beamformer 52 will be estimates of the microphone outputs as described in (8) below.
- abeam( ⁇ ) c AD Microphone( ⁇ )
- C D is the A/D converter conversion gain.
- each of the filters 58, Filter(j,i) approximates different time delays of the microphone signals, that is, either inverting or non-inverting.
- the latest embodiment described is changed so that only two of the Filters (j, ⁇ ) for each beam i are present.
- each of the beams, abeam(i) implements the same directivity using different microphone combinations.
- the microphones are placed equidistant along a common axis.
- the analysis beamformer 52 is defined according to (9) below.
- the numbering j of the microphones follows their placement along the common axis with number one being closest to the sound source.
- NB-1 is commonly referred to as the order of the directivity.
- Filter (j, i) Filter (j + 1, i + 1) for all j ⁇ N2 - 1, i ⁇ Nl - 1
- N1 N2 -NR [0055]
- the analysis beamforming is performed in frequency bands.
- the wave parameter estimator 54 includes a plurality of analysis filters 62, a plurality of forward transformers 64, a normalizer 66, and an equation solver 68.
- the analysis filters 62 are optional and when implemented serve to create additional input signals to the equation solver 68 such that the individual components carry different weights. If the input consists of two or more sinusoidal waves of the same frequency, then it will not be possible to distinguish between the waves. However, if the waves carry different frequency content, then it will be possible to distinguish between the waves. Processing the input with filters of different magnitude responses, phase responses, or both creates additional information for the equation solver 68.
- the equation solver 68 is most efficiently implemented in the frequency domain. Therefore, if it has not been previously performed, the inputs are converted to the frequency domain in the forward transformers 64.
- the equation solver 68 utilizes mathematical functions. Such functions can be included either through table look-up, Taylor-series approximation, or the like. In any case, the dynamic range of the functions may be limited due to hardware constraints. In order to gain maximal use of a limited mathematical dynamic range, the input signals are normalized in the normalizer 66. However normalization may not be necessary or desirable, in which case the normalizer 66 may be omitted. When implemented, the output from the normalizer 66 carries not only the normalized frequency domain signals but also information about the amount by which the signals have been normalized, an exponent, collected in the output signal BeamExp. This exponent enables the recovery of the absolute values from the normalized values. Each beam input and frequency may be equalized independently but the same exponent may also be used across all beams, frequencies, or both.
- the analysis filters 62 perform differentiation with respect to time. Note that differentiation with respect to time can be expressed in the frequency domain with the transfer function of (10) below. In (10), j is the imaginary unit.
- the analysis filters 62 use the difference equation of
- n is the sample index and ES is the sampling frequency.
- a single analysis filters 62 is included to filter the first abeam signals only, that is, abeam(l).
- the forward transformers 64 are FFT based.
- the forward transformers 64 are performed with a time domain filterbank.
- the forward transformers 64 are performed with a time domain filterbank that delivers quadrature outputs from which phase information can be extracted.
- the microphone equalizer 20 of FIG. 4, the analysis beamformer 52 of FIG. 6, and the analysis filters 62 operate in the same domain. In this way, the forward transformers 32 of the microphone equalizer 20 will suffice and the reverse transformers 36 of the microphone equalizer 20 and the forward transformers 64of the wave parameter estimator 54 can be omitted.
- the normalizer 66 independently for each frequency band, finds the complex component, real or imaginary, of the ABEAM(i) signals, with the largest magnitude. A common exponent for all beams is found using this largest component. All beams are then normalized with the common exponent.
- the equation solver 68 uses floating point arithmetic. In such a case, the normalizer 66 also converts each of the beams to floating point notation. [0068] Turning now to FIG. 9, a block diagram of the equation solver 68 of FIG. 8 is shown. As the sound field equations are most efficiently solved with the input signals in a phase, magnitude notation, the signals are converted to such a polar notation before further processing is performed in the equation solver 68 by a plurality of rectangular to polar converters 70. Furthermore, it is convenient to use ratios of the various input signals when solving for angle of incidence and frequency, therefore the needed ratios are computed in an analysis ratio processor 72 and a beam ratios processor 74. The functioning of these processors can be described, respectively, by equations (11) and (12) below.
- a is the analysis filter index and i the abeam index.
- the equation solver 68 may include a time domain integrator 76 which is optional. When implemented, it integrates product factors P(il,al) *P(i2,a2) over time. Through the use of the time domain integrator 76, it may be feasible to enhance the analysis especially for any embodiment using time domain filterbanks as the forward transformers 64 of FIG. 8. [0071]
- the equation solver 68 most importantly includes a core solver 78 which solves the sound field equations.
- a sound field can be described in several ways. The description can be in the time domain or in the frequency domain, among others. Furthermore, the description can involve a potential field describing both pressure and velocity with the same function or the description can have distinct pressure and velocity equations.
- the sound field be described primarily in the frequency domain and simplifications will be used when judged feasible.
- the sound field will be described with two functions.
- the first function is a complex scalar function. It gives the sound pressure as a function of frequency and position.
- the second function is a complex vector function. It gives the sound particle velocity as a function of frequency and position.
- the sound consists of M waves.
- the waves are not required to be plane waves.
- Third, each of the waves in the description may in reality consist of the sum of several waves. The principle of superposition holds for sounds up until very high soimd pressure levels.
- each of the waves is quasi-sinusoidal in the sense that within each frequency band of the analysis the energy is mainly due to a single sinusoid only. Thus each wave may consist of several sinusoids, spread over the frequency range.
- v m is thus the direction of sound incidence of the wave.
- ⁇ m is the damping
- d(i) is the physical distance from the reference microphone, microphone 1, to microphone i.
- the sound field be described primarily in the frequency domain.
- the frequency domain is generally the most advantageous domain in terms of complexity and computing costs. Nevertheless in some cases the processing of the present invention is most feasible performed together with other audio processing applications. If such audio processing runs in the time domain it may prove efficient to implement the apparent incidence audio processing in the time domain as well. In the following, the sound field equations will therefore be stated in the time domain.
- (16) states the sound field equations in the time domain under the assumptions as used in the formulation of (14) above. In (16), p is used to describe the time domain version of P and n is the sample index.
- [0081] (16) can generally not be solved for a single sample measurement only. It is necessary to measure over a number of samples and perform some form of averaging. To enable the solution, new measurement signals are defined in (17) below. In (17), indexes al and a2 may have a value of zero. B is the number of samples to average over and win() is an optional window to weight the measurements with.
- the product terms in the right hand side of (17) will according to (16) be the product of constants and two cosine terms dependent on time through the sample index n.
- the product of two cosine results in a sum of two different cosines, namely at the sum angle and the difference angle.
- the sum and difference angles may be DC terms or AC terms.
- the integral of the product of two cosines goes to zero for large integration times if the cosines are of a different frequencies.
- (18) shows what remains of (17) when B is sufficiently large.
- f c is the center frequency of the individual bands of the frequency transform.
- the waves are assumed to be plane waves with ⁇ m
- the first wave is assumed to impinge from a target direction.
- a further embodiment solves for two waves.
- the first, signal, wave is constrained to have a direction from. within a certain tolerance around a target direction.
- the second, noise, wave is constrained to have a direction from outside the tolerance field.
- a still further embodiment solves for M waves.
- the solutions are not directly constrained but they are ordered so that the waves impinge further away from the target direction the higher the wave number is.
- the parameters to solve for may include the following:
- the first technique is referred to as direct solving. Under special conditions it is possible to solve (14) above directly using arithmetical methods. Such direct solving yields the wanted parameters as mathematical functions of the input signals to the core solver 78.
- the error is found by subtracting the actual measurement from the results that are obtained by inserting the current solution in the right hand side of (14).
- the error can be expressed in a mean square sense but it can also be taken as maximal of any of the measurements. However, the error can also be expressed as the relative difference between two successive solutions.
- the wanted directivity response is symmetric around a
- the error cost function, nerr is defined as the maximal relative difference between the last two iterative solutions as in (24) below.
- Wave iteration may be said to include the following steps:
- the third technique for equation solving is referred to as a full parameter scan. Unlike the iteration solution, this method will always find the global solution when properly set up. The drawback is a greater amount of computation.
- the full parameter scan the possible solution ranges are defined and within these ranges grids are set up. The grids are set to implement the wanted resolution for the respective parameters.
- the error cost function for example as in (22) above, is evaluated for all possible combinations of wave parameters on the grids. All parameter combinations for which the error cost function is smaller than a given threshold are possible solutions.
- the microphone configuration is axis-symmetric and the error cost function is defined as in (22).
- the parameter ranges are set to [0 ..
- full parameter scan is combined with iteration.
- Parameter ranges and grids are set up using a coarse grid. All parameter combinations on the coarse grid are used as starting guesses for an iteration leading to a "local solution" for the wave parameters.
- the local solution with the lowest error cost function is chosen as the correct solution.
- the fourth technique for equation solving is referred to as solution screening/optimizing for minimal power.
- the methods described above may yield more than one possible solution for the sound field equations for a given a set of measurements. Measurement noise from sources such as A/D conversion, microphones, etc. can be sources of ambiguity but the system may also be underestimated. The sources of underestimation are that the number of microphones used is not large enough to solve for the number of waves assumed or that the sound in reality consists of more waves than solved for.
- the chosen solution may not be the correct one, but even if the correct solution is not chosen the strategy results in a system gain for noise components equal to or lower than the noise gain that would have been the result if the correct solution was to be used.
- a specific embodiment uses the full parameter scan method. All parameter sets for which the error cost function is equal to or lower than the minimal cost function encountered plus a threshold are deemed possible solutions. The solution with the lowest
- the fifth technique for equation solving is referred to as solving for a subset.
- knowledge of the full set of wave parameters is not necessary in order to control the wave gain.
- a subset only will suffice. This may simplify the task of solving the sound field equations.
- a first implementation involves the use of conventional computer software.
- application software include mathematical programs such as Mathematica, Matlab, Maple, and the like.
- circuit simulators may be used under certain conditions.
- An embodiment of the system includes a standard computer architecture to do parts of the acoustical sound processing.
- the sound field equations are defined and solved for the wave parameters within a conventional software package on a conventional computer architecture.
- a second implementation involves the use of a conventional table look up.
- a table can be pre-computed that contains the optimal solutions with a certain resolution.
- the table can be computed using any of the solving techniques described. Once the table has been computed, one simply looks the solution up in the table to solve the sound field equations. Adding an iteration or approximation process to the look up process can enhance it by minimizing the size of the storage used for the table.
- FIG. 10 a block diagram of an embodiment of the core solver 78 of FIG. 9 using a table look up implementation with optional approximation is shown.
- the measurements are rounded to a predefined precision in rounder 80.
- the rounded measurements are then mapped to an integer space to yield an address in a map to address mapper 82.
- the address is used by look up 84 to look up in a pre-computed table 86.
- the table 86 may be stored on any storage device including RAM, ROM, hard disk, and the like. To save space, the table 86 may contain wave parameters in an encoded form.
- a map to parameter mapper 88 for mapping back to parameter space may optionally be inserted as shown.
- an interpolation 90 is optionally done to yield the parameter output.
- the table 86 may contain parameter derivatives besides parameter values.
- the approximation/interpolation process can be described with equation (28) below. In 28, the large dot denote inner product, m is the
- WP is the parameters as looked up
- FIG. 11 a block diagram of the output generator 56 of FIG. 6 is shown. Recall that the discussion continues to focus on the wave generation embodiment of the apparent incidence processor 22 of FIG. 2.
- the output generator 56 include a statistical evaluator 92, a wave generation (WG) gain controller 94, a gain smoother 96, a gain mapper 98, and a signal generator 100.
- the statistical evaluator 92 is optional and when implemented it analyzes the waves to obtain measures of the running signal and noise powers of the sound field.
- the WG gain controller 94 the individual waves are analyzed and a gain is attached to each wave. The wave gains are generated so that they attenuate unwanted waves, noise, while preserving utility waves.
- the raw gain output from the WG gain controller 94 is first smoothed and then mapped to the domain of the wave generation.
- the purpose of the gain smoother 96 is to prevent abrupt gain changes from occurring.
- the purpose of the gain mapper 98 is twofold. First, the raw gain may exhibit a frequency/value distribution that would cause time domain aliasing to occur if used in the raw state. Second, the raw gain may be defined. in another domain or with a different resolution than needed for the signal generator 100. In this case, the gain mapper 98 maps the gain to the different domain/resolution. In the signal generator 100, the waves are synthesized and weighted according to the mapped gain. [0119] Turning now to FIG. 12, a block diagram of the statistical evaluator 92 of FIG.
- Each set of wave parameters are analyzed in one of a plurality of signal or noise analyzers 102. For each wave and at each frequency band a decision is made as to whether the wave/band combination carries utility signal or noise information. If the combination carries signal information, then the corresponding part of an IsSignal signal is set to logic one and the corresponding part of an IsNoise signal is set to logic zero. If the combination carries noise, then IsSignal is set to logic zero and IsNoise is set to logic one.
- the IsSignal and IsNoise switch signals are multiplied with the wave powers, that is, the squared wave amplitudes.
- the wave power is summed over all waves in a signal summer 104 and a noise summer 106.
- the summed signal and noise powers are low pass filtered in aNarrowBand filter 108 to yield narrow band estimates of the signal and noise powers.
- the effective integration time of the filter 108 controls the speed of the measurement. It must be set large enough that inaccuracies in the wave parameter estimates are filtered out. The narrow band measurement may thus be relatively slow.
- the WideBandPowers output provides the same measurements as the NarrowBandPowers output with the exception that the measurement has been integrated over wide bands in sum over bands integrators 114 before being low pass filtered in WideBand filter 110. Due to the wide bandwidth the measurement may be performed at a faster rate, that is, a shorter integration time, and with a smaller delay than the narrow band measurement. Note that the dynamic characteristics of filters 108 and 110 control the update speed of the power signals. Therefore the filters will in general have different characteristics. ⁇
- the signal or noise analysis 102 is based on the measured direction of soimd incidence. If this is within a given tolerance equal to a target direction, then the wave/frequency pair is judged to be signal and otherwise it is judged to be noise.
- the signal or noise analysis 102 is based on the measured direction of sound incidence.
- the IsNoise signal is generated with the help of a directivity function as shown, for example, in the polar plots of FIG. 14.
- the IsNoise signal is taken as unity minus the IsSignal signal.
- the signal or noise analysis 102 is based on the measured wave damping. If this is greater than a given threshold, then the wave/frequency pair is judged to be signal and otherwise it is judged to be noise.
- an additional path generating a second NarrowBandPowers signal.
- the two NarrowBandPowers are generated with two different update rates.
- the WG gain controller 94 includes a strategy chooser 116, a gain function chooser 118, and a plurality of gain function appliers 120.
- the strategy chooser 116 an overall strategy is chosen based on the wideband power measurement. Strategies can, for example, be to use omni directional response or to use narrow beam directional response, among others. The strategy is controlled in wide bands.
- the gain function appliers 120 can be thought of as the heart of the processing. They directly control the gain of each wave as a function of some or all of the wave parameters including the direction of the sound incidence, the wave damping, and the frequency and amplitude. It is thus here that the directivity of the processing is implemented.
- the gain function chooser 118 selects the gain function that serves best for the current signal to noise ratio in view of the strategy input that has been chosen.
- the output of the gain function chooser 118 will typically control the width of the main lobe of the directional response.
- two gain strategies are implemented, that is, both omni directional and directional.
- the strategy chooser 116 compares the wideband signal power with the wideband noise power.
- the omni directional strategy is chosen in all narrow frequency bands covered by wide bands where signal power is greater than a predefined constant times the noise power. In all other bands, the directional strategy is chosen.
- the gain function appliers 120 operate by comparing the direction of incidence with a target direction. If the direction of incidence is within a predefined tolerance, the cut-off angle, from the target direction, then the raw gain is set to a predefined maximal gain and otherwise the raw gain is set to a predefined minimal gain. This results in a directivity as shown, for example, in the polar plot of FIG. 14a.
- the gain function chooser 118 outputs the cut-off angle as a GainSelector signal.
- the cut-off angle is controlled as a function of the narrowband band signal to noise ratio.
- the cut-off angle is controlled so as to produce a wide mainlobe of the beam for poor signal to noise ratios and a narrow mainlobe for good signal to noise ratios.
- the cut-off angle is controlled so as to produce a narrow mainlobe of the beam for poor signal to noise ratios and a wide mainlobe for good signal to noise ratios.
- the gain function appliers 120 operate by table lookup and optional approximation/interpolation in a similar way as the table look up process described with respect to FIG. 10 but with the wave parameters as input and the raw gain as output. Furthermore, the table output is mapped to gain space instead of parameter space.
- This embodiment can implement an arbitrary directivity. For example, any of the directivities depicted in the polar plots of FIGs. 14a and 14b can be implemented.
- the GainSelector signal switches between a finite number of different gain functions, for example, those implemented by different tables.
- An example of a set of gain versus direction functions is shown in the polar plot of FIG. 14b.
- the gain function is chosen to attenuate waves where the absolute value of the wave damping is greater than a predefined threshold. Thus waves are attenuated that are not far field waves.
- the gain function is chosen to attenuate waves where the value of the wave damping is lower than a given threshold. ' Thus far field waves are attenuated.
- the wave estimation and gain control processes will normally be performed on a block of samples.
- the duration of the blocks will be so large that it is possibly that the raw gain for a specific frequency band in consecutive blocks will differ significantly.
- an abrupt gain change may cause unwanted audible effects. Therefore it will generally be desirable to prevent abrupt gain changes. This is the purpose of the gain smoother 96 of FIG. 11.
- the gain smoother 96 of FIG. 11 copies its input to its output without making any changes. In effect, this eliminates the function of the gain smoother 96.
- the gain is smoothed in the gain smoother 96 of FIG. 11. The smoothed output is the average of the raw gains of the most recent Msmooth blocks.
- the gain is smoothed with exponential averaging of the raw gains of successive blocks in the gain smoother 96 of FIG. 11.
- the gain mapper 98 includes a reverse analysis transformer 122, a. gain windower 124, and a forward gain transfonner 126.
- the raw gain is first converted from the domain of the wave estimation to the time domain in the reverse analysis transformer 122.
- the converted raw gain is then shortened by applying a window and optionally padding with zeros in the gain windower 124.
- the length of the window is chosen so as not to provoke time domain aliasing artifacts when the gain is applied downstream.
- the windowed filter time (FIR) response is finally converted, by the forward gain transformer 126, to the domain that is to be used by the processing downstream.
- FIR windowed filter time
- the wave estimation is performed in the frequency domain.
- the reverse analysis transformer 122 is thus FFT-based.
- the wave estimation is performed in time domain filter bands.
- the reverse analysis transformer 122 is then implemented as a reconstruction filter bank.
- the wave estimation is performed in the time domain. The reverse analysis transformer 122 is thus omitted.
- the output of the gain mapper 98 is in the frequency domain.
- the forward gain transformer 126 is thus FFT-based.
- the output of the gain mapper 98 is in time domain filter bands.
- the forward gain transformer 126 is thus implemented as an analysis filter bank.
- the output of the gain mapper 98 is in the time domain.
- the forward gain transformer 126 is thus omitted.
- the signal generator 100 includes at least one wave generator 128, at least one gain applier 130, a wave summer 132, and a reverse signal transformer 134. Based on the amplitude, phase, and frequency of the wave, the wave is generated with the wave generator 128. The gain is applied by the gain applier 130, the individual waves are summed by the wave summer 132, and the sum of all the waves is converted, by the reverse signal transformer 134, back to the time domain as the output of signal generator 100 and the apparent incidence processor 22 of FIG. 2.
- the signals are generated in the frequency domain.
- the wave generator 128 thus merely has to output the complex amplitude, A m or abs(A m ) *exp(j* ⁇ m ). Then the gain is applied by multiplying with the frequency domain
- the reverse signal transformer 134 performs an inverse frequency transform.
- the signals are generated in the time domain with sine wave generators for the wave generators 128.
- the reverse signal transformer 134 is omitted.
- the signals are generated in the time domain in narrow frequency bands by the wave generators 128. Then the gain is applied by multiplying in the bands. Finally, the reverse signal transformer 134 is implemented with a reconstruction filter bank.
- FIG. 17 a block diagram according to another preferred embodiment of the present invention of the apparent incidence processor 22 of FIG. 2 is shown.
- the forward filtering method is shown.
- the processing runs in three stages with the first two being the same.
- analysis beamforming 52 is performed on the equalized microphone signals.
- the parameters of the incoming sound waves are estimated in a wave parameter estimator 54.
- the wave parameters are used in the forward filter 136 to generate filter coefficients for a filter that is applied to the input signals.
- the forward filter 136 include a statistical evaluator 92, a gain smoother 96, and a gain mapper 98 like those described above with respect to FIG. 11.
- the forward filter 136 includes a forward beamformer 138, a forward filter (FF) gain controller 140, a signal filter 142, and a beam summer 144.
- the inputs are beamformed by the forward beamformer 138 to produce a number of forward beam signals that are filtered by the signal filter 142 and summed by the beam summer 144 to form the output.
- the filter responses, that the forward beams are filtered with, are controlled by the FF gain controller 140 that in turn uses the wave parameters as well as the statistically evaluated signal and noise powers from the statistical evaluator 92 to calculate the filter responses.
- the forward beamformer 138 is optional and may be deleted leaving the input signals to be directly connected to the signal filter 142. When implemented, the forward beamformer 138 serves to remove noise from the signal thus enhancing the noise reduction performance achieved by the wave parameter controlled gain of the FF gain controller 140.
- the processing is similar to that of the analysis beamformer 52 of FIG. 6.
- the forward beamformer 138 is identical to the analysis beamformer 52 of FIG. 7. In this case, the fbeam signals are taken as the abeam outputs of the analysis beamformer 52. [0159] . In another embodiment, the forward beamformer 138 is in principle identical to the analysis beamformer 52 of FIG. 6 except that where the analysis beamformer 52 is optimized for frequency selectivity the forward beamformer 138 is optimized for low signal delay.
- two fbeam signals are generated, that is, an omni directional signal and a narrow beam, for example a supercardioid.
- FIG. 19 a block diagram of the forward beamformer 138 of FIG. 18 is shown.
- one or more of the fbeam signals may be generated with adaptive beamforming.
- the adaptive beamforming is achieved by first generating, through a plurality of beamformers 146, a number of fixed beam signals. The first being the target beam, pbeam, and the rest being one or more rear beams, rbeam(q).
- the rbeam signals are filtered by filters 148 and subtracted (150) from the pbeam to form the beamforming output
- pbeam is an ordinary beam with full sensitivity at the target direction suppressing other directions to some extent
- rbearn(q) is a number of different beam signals that all have zero sensitivity towards the target direction.
- the rbeam signals can thus be subtracted from the pbeam without influencing the signal coming from the target direction.
- the filter responses used to filter the rbeam signals are adapted by adaptors 152.
- FIG. 20 a block diagram of the adaptor 152 of FIG. 19 is shown.
- the fbeam output and rbeam signal are converted to the frequency domain by forward transformers 154 and correlated by a correlator 156.
- the cross-correlation is scaled by sealer 158 with an adaptation speed constant, mu, and is normalized with a lowpass filtered estimate, from a power filter 160, of the power of the rbeam signal.
- the scaled cross-correlation is integrated, by integrator and limiter 162, to yield the adapted filter response in the time domain. Besides being integrated, the filter response needs to be limited to eliminate convergence and computation noise problems.
- a gain mapping is performed on the adapted response by a gain mapper 164.
- the fbeam and rbeam signals are implemented in the frequency domain.
- the forward transforms 154 are thus not implemented.
- the correlator 156, the power filter 160, and the integrator and limiter 162 are implemented in the time domain.
- the rbeam and fbeam signals are likewise also implemented as time domain signals and the forward transformers 154 are omitted.
- the gain mapper 164 merely windows the filter response.
- the integrator and limiter 162 includes a forgetting factor causing the integrated response to tend towards zero during periods of no signal activity.
- two microphones are positioned along the target axis.
- the forward beams include an adaptive beamfbeam(il).
- pbeamfil is implemented with a beamformer 146 of FIG. 19 generating a supercardioid for the target direction as implemented by the beamformer filters defined in (29) below.
- a single rear beam is used at the adaptive beamforming, rbeam(l,il). It is a cardioid in the reverse direction of the target direction as described by the component filters of (30) below. (29) and (30) describe two beamformers 146 in the frequency domain.
- PBEAM (il) CMIC(l) * HI + CMIC(2) * H2 exp(-y - ⁇ -tO)
- e is constant in the range 0.5 to 1 and d(2) is the microphone spacing.
- d(2) is the microphone spacing.
- the pbeam signal for this forward beam is taken as the microphone signal cmic(l) directly without the use of the beamformer 146.
- FIG. 21 a block diagram of the forward filter gain controller 140 of FIG. 18 is shown.
- the FF gain controller 140 is similar to the WG gain controller 94 of FIG. 13.
- the strategy chooser 116 and the gain function chooser 118 are comparable to those in FIG. 13 above. A few differences exist between the gain controllers though as will be described below.
- the signal filtering in the forward filtering embodiments can be based on already beamformed input signals.
- the FF gain controller 140 therefore has to compensate for the directivity and near field characteristics of the forward beams.
- the BeamPar signal carries enough information about the forward beam that a plurality of FF gain function appliers 166 can compute a gain that implements the target directivity as shown, for example, in the polar plots of FIG. 14. For static forward beamformers 138 of FIG. 18, the BeamPar signal is not needed. .
- the beam directivity can be "hard coded" into the individual FF gain function appliers 166.
- the FF gain function applier 166 includes an amplitude updater 168, a plurality of gain function appliers 170, and a wave gain weighter 172.
- the amplitude updater 168 the wave amplitude is corrected to take the characteristics of the forward beamformer into account.
- the gain function applier 170 implements, for each wave, the directivity, amplitude, damping, and like responses in the same manner as described for the gain function appliers 120 of FIG. 13. Since the forward beam contain all waves but can only be assigned a single gain value, all of the different wave gains must be summarized. This is done in the wave gain weighter 172.
- all of the forward beams are static.
- the beam dependency has been included with the pre-computed tables for the gain function chooser 118 of FIG. 21 and thus the amplitude updater 168 is not used.
- the FORWARD GAIN(i) signal is taken as the maximal of the wave gains GAINRAW(j ,i) .
- the FOR WARDGAINfi) signal is taken as the minimal of the wave gains GAINRAW(j,i).
- the ROR WARDGAIN(i) signal is the power weighted average of the individual wave gains as defined in (31) below.
- At least one static forward beam signal and at least one adaptive beam signal are implemented.
- the FF gain function applier 166 monitors and analyses the BeamPar signal from the adaptive beam over time. When the BeamPar signal is stable, indicating a significant noise signal from a constant direction, then the GainSelector signal is switched to use the adaptive beam mainly to build the output. When the BeamPar signal resembles random noise, then the GainSelector signal is switched to remove the adaptive beam from the output.
- the signal filter 142 is performed in the time domain with FIR or ILR filters. In another embodiment, the signal filter 142 is performed in the time domain within frequency bands. In yet another embodiment, the signal filter 142 is performed in the frequency domain.
- FIGs. 23 and 24 show corresponding multiple output embodiments of the wave generation method and the forward filtering method, respectively.
- the first output contains the sum of the near field waves while the second output contains the sum of the far field waves.
- each output contains the sum of the waves originating from a specific range of directions.
- each output contains the sum of the waves originating from a specific range of directions.
- the wide band power of the waves in the sound field is measured.
- the individual output generation blocks are controlled in a way such that the first output is always generated using a range of directions centered around the origin of the wave with the largest power.
- FIG. 25 a block diagram of a forward filter/output generator 174 is shown.
- the forward filter/output generator 174 is a combination of the output generator 56 of FIG. 11 and the forward filter 136 of FIG. 18.
- the various elements are substantially similar except for a wave generator/forward filter (WGFF) gain controller 176 and an output summer 178.
- the forward filter/output generator 174 contains two output paths. The outputs from both paths are summed by the output summer 178 to yield the combined output.
- WGFF wave generator/forward filter
- FIG. 26 a block diagram of the WGFF gain controller 176 of FIG. 25 is shown. As with the forward filter/output generator 174 of FIG. 25, the functioning of the WGFF gain controller 176 follows from the .descriptions of the WG gain controller 94 of FIG. 13 and the FF gain controller 140 of FIG. 21, respectively.
- the WGFF gain controller 176 chooses the gain function so that: at high signal to noise ratios, forward filtering is the primary contributor to the output; and at low signal to noise ratios, wave generation is the primary contributor to the output.
- the process of finding gains for the waves of the sound has included two main steps, that is, finding the parameters of the waves and deriving a gain based on the parameters found. Both these main processes can be described by mathematical transforms, as depicted in (32) below, and in many cases they are best implemented using techniques known from mathematical pocket calculators Mathematically the gain may be described as a transform directly of the inputs as described in (33) below.
- FIG. 27 a block diagram of a single combined mathematical transform processor 180 is shown.
- the combined processor 180 utilizes both the wave generation method and the forward filtering method to implement the core equation solving and the gain control. This implementation is especially useful with portable devices because the size of the table used for the mathematical transform may be greatly reduced as the gain may be described using fewer bits than needed for the description of the wave parameters.
- the input signals for core solving, P, Q, QA, QB, and BeamExp, as well as the gain control inputs BeamPar and the statistical power measures are inputs to a table lookup and approximation unit 182 similar to that of FIG. 10.
- the table lookup directly yields the raw gains as output.
- the statistical evaluation is also performed with the help of the table lookup and approximation unit 182. It contains a model of the . mapping from input values to wave parameters to power values.
- the BlockNarrowBandPowers and BlockWideBandPowers signals contain the power estimates for the current block of samples.
- the block estimates are low pass filtered with appropriate time constants to yield the narrow band and wide band power signals, respectively. Note that the combined processor 180 still needs to solve for and output the wave parameters that are needed in order for the wave generation method to function. In pure forward filtering method embodiments contrarily, there will be no need to output the wave parameters.
- a single table lookup implements the combined core solving of the sound field equations and the gain control. No statistical evaluation is performed.
- the inputs to the table lookup are the magnitude and the phase of the ratio Q(2) of the quotient signal obtained when, in the frequency domain, the second microphone signal is divided by the first microphone signal.
- the phase of Q(2) is quantized to one of thirty two possible phases covering the total complex phase range.
- the magnitude of Q(2) is quantized to one of 512 possible magnitudes covering the range from 0.01 to 100.
- the gain is stored as a binary value of either one or zero.
- the table thus implemented requires 16 Kb of storage capacity.
- noise-canceling microphones For applications where speech is to be picked up from the wearer of a headset, as in a mobile phone, a hearing protector, or the like, two main directions regarding noise suppression have until now been used. The most effective method has been the use of so-called noise-canceling microphones. These microphones amplify near field signals while attenuating far field signals. Unfortunately, noise-canceling microphones have to be placed no farther than two to three centimeters away from the speech source in order to be effective. This may not always be possible or convenient. Another method has been to use directional microphones pointing towards the mouth of the wearer.
- a directional microphone can make no distinction between near and far field signals and thus it will not offer as large of a noise reduction as is possible with a properly placed noise-canceling microphone.
- a preferred near field embodiment of the present invention enables signal processing methods with which it is possible to produce sound pick-up with near field characteristics. It is possible to obtain noise reduction better than that possible with noise-canceling microphones. Furthermore, it is possible to maintain the near field characteristic with its noise reducing virtues at a distance further away from the speech source than is possible with conventional noise-canceling microphones.
- the near field method works by dividing the input signal into a number of frequency bands. In each band, the input signals are analyzed to see whether the activity in that band is due to near field sources or to far field sources. If the activity is from near field sources, then that band is replicated in the output with a high gain and otherwise it is replicated with a low gain.
- FIG. 28 a block diagram of a near field embodiment of the • audio processor 14 of FIG. 1 is shown.
- the near field processor shown is especially well suited for applications where only sound from sources near to the microphones 12 for FIG. 1 should be amplified. Examples of such applications include mobile phones, headsets, and the like.
- the near field processor includes an analog beamformer 18, at least one A/D converter 24, and at least one D/A converter 26 that are similar to those of FIGs. 2 and 3 above.
- the near field processor includes a gain smoother 96, a gain mapper 98, and a filter 142 that are similar to those of FIG. 18.
- the near field processor includes a microphone equalizer 200, a beamformer 202, and a near field gain controller 204.
- the microphone signals are converted to digital signals, the microphone sensitivities are equalized, and optional beamforming is performed to yield the bmic signals.
- the first output signal is taken as the reference input bmic(l) filtered with the filter response h.
- the near field gain controller 204 derives a gain in frequency bands. This gain directly yields the filter response h when mapped from the domain of the gain control to the domain of the filtering.
- the near field processor utilizes a gain function that maps the input pressures directly into band gain.
- the microphone equalizer 200 includes a plurality of forward transformers 32 and a plurality of reverse transformers 36 that are similar to those in FIG. 4.
- the microphone equalizer 200 includes a plurality of microphone equalization updaters 206.
- one microphone, mic(l) is chosen as the reference.
- the signals from the other microphone inputs are filtered so that the equalized microphone signals, cmic(i), all have the same absolute sensitivity, to sound pressure levels.
- the equalization is performed by multiplying with a frequency dependent gain, MicEqfi), in the frequency domain.
- MicEq(i) can be a static gain, measured and saved, for example, at production test time or MicEq may also be updated dynamically.
- FIG. 30 a block diagram of the microphone equalization updater 206 of FIG. 29 is shown. During operation, the phase of the reference microphone signal is compared with the phase of the normalized signal of the microphone to be equalized. In most applications, to utilize near field microphones, there will exist a limited range of directions of sound incidence for which the sound wave must arrive with the same absolute amplitude at both microphone locations, that is, mic(l) and mic(i).
- the zero phase condition detector 208 When the phase indicates that the direction of sound incidence is such, then the zero phase condition detector 208 outputs a logic one as its ZeroPhase signal output and otherwise the output will be a logic zero.
- the quotient of the magnitudes of the reference microphone and the current microphone minus one is gated with the ZeroPhase switch signal and scaled with a small constant mu, a signal results that is suitable for accumulating in accumulator 210, to yield a MicEq signal that will equalize the current microphone.
- the accumulator 210 is divided into static and dynamic parts, where the updates only influence the dynamic parts.
- the effective equalization response is the product of the static and dynamic parts.
- the static part of the equalization response is measured with standard measuring techniques once at the time of production test or at some other convenient time and saved.
- a forgetting factor is included with the dynamic part of the accumulator 210. The forgetting factor causes the dynamic response to converge towards zero when no updates are received.
- means are provided that can save the accumulated equalization response when the near field audio processor is powered down and read the saved response again when the processor is powered up the next time.
- the microphones used have the same directivity and frequency response except for the small tolerances that the microphone equalizer 200 of FIG. 28 should be able to compensate for.
- the direction of sound incidence of a sound wave is perpendicular to an axis connecting the reference microphone with the current microphone, then the sound wave must arrive with the same amplitude at both microphones.
- This perpendicular condition is detected by comparing the phases of the two microphone signals in the zero phase condition detector 208. If the phases differ by less than a certain tolerance, then the ZeroPhase signal is generated as a logic one and otherwise it is generated as a logic zero.
- the signal inband detector 212 for each frequency band evaluates the absolute value of its input signal in the current band and the two nearest neighboring bands. If the current band carries the highest absolute value, then the Inband signal for the current band is generated as a logic one and otherwise it is generated as a logic zero. [0199] Turning now to FIG. 31, a block diagram of the beamformer 202 of FIG. 28 is shown.
- the beamformer 202 is similar to the beamformer 52 of FIG. 7 and also includes a plurality of filters 214 and a summer 216. Again the beamformer 202 is optional and may be omitted.
- the aim of the beamforming process is to remove noise from the signal prior to the near field gain and filter processing, thereby enhancing the performance of these portions of the process.
- the microphone inputs are filtered with separate filters and summed to yield the beam output.
- microphones are placed along a common axis.
- the microphone signals are pair wise beamformed to yield N-M-l beams with identical directivity response.
- the beams are supercardioids.
- the beams are figures of eight.
- the beamforming is performed in the time domain with FIR or IIR filters.
- the beamforming is performed in the frequency domain.
- the near field gain controller 204 includes a forward transformer 218, a power filter 220, a phase filter 222, a statistical evaluator 224, and a near field gain function applier 226.
- the beam signals are split in frequency bands or converted to the frequency domain in the forward transformer 218.
- the power filter 220 the signal powers are measured with a given time constant.
- the outputs from the power filter 220, R(i) give the ratio between the power of the current microphone signal and the power of the reference microphone signal bmic(l).
- the phase filter 222 the filtered signal phases are compared.
- the PHI(i) outputs give the difference between the unwrapped phase of the current microphone and the unwrapped phase of the reference microphone bmic(l).
- the statistical evaluator 224 measures the signal and noise powers of different bandwidths and time constants. In the near field gain function applier 226, the raw channel gains are derived.
- the gain control processing is performed on blocks of samples. For each block, a single complex signal value per frequency band is computed.
- the power and phase filters 220, 222 only use the values from the current block to compute their respective outputs.
- the power and phase filters 220, 222 averages the signal powers and phases over consecutive blocks.
- phase averaging is power weighted.
- no phase information is utilized.
- the forward transformer 218 is implemented with a time domain filterbank, no phase information is generated or used, and the signal powers are measured with a finite time constant.
- the forward transformer 218 is FFT based.
- FIG. 33 a block diagram of the statistical evaluator 224 of FIG. 32 is shown.
- a signal or noise analyzer 2208 the power and phase inputs are evaluated.
- a decision is made as to whether the signal in the band carries utility signal or noise information. If the band carries signal information, then the corresponding part of the IsSignal signal is set to a logic one and the corresponding part of the IsNoise signal is set to a logic zero. If the signal carries noise, then IsSignal is set to a logic zero and IsNoise is set to a logic one.
- the IsSignal and IsNoise switch signals are multiplied with the wave powers, that is, the squared wave amplitudes.
- the weighted signal and noise powers are low pass filtered in NarrowBand filter 230 to yield narrow band estimates of the signal and noise powers.
- the effective integration time of the filter 230 controls the speed of the measurement. It must be set large enough that inaccuracies in the wave parameter estimates are filtered out. The narrow band measurement may thus be relatively slow.
- the WideBandPowers output provides the same measurements as the NarrowBandPowers output with the exception that the measurement has been integrated over wide bands in sum over bands integrators 240 before being low pass filtered in WideBand filter 232. Due to the wide bandwidth the measurement may be performed at a faster rate, that is, a shorter integration time, and with a smaller delay than the narrow band measurement. Note that the dynamic characteristics of filters 230 and 232 control the update speed of the power signals. Therefore the filters will in general have different characteristics.
- the signal or noise analyzer 228 is based on the R(2) signal. If this signal is less than a predefined threshold, then the signal is judged to be utility signal and otherwise it is judged to be noise.
- NarrowBandPowers signal there is an additional path that generates a second NarrowBandPowers signal.
- the two NarrowBandPowers are generated at two different update rates.
- the near field gain function applier 226 of FIG. 32 provides the core functionality of the near field processing method. It maps a set of level ratios and optionally phase and signal power information into a gain. The gain should provide larger amplification for frequency bands containing mainly near field source material and smaller gain for frequency bands containing mainly far field source material.
- the near field gain function applier 226 of FIG. 32 controls the gain in the frequency bands as a function of the ratio of the microphone powers in the bands as shown, for example, in the graph of FIG. 36a.
- FIG. 34 a block diagram of an embodiment of the near field gain function applier 226 of FIG. 32 is shown.
- the near field gain function applier 226 includes a threshold comparer 242, a combinatorial unit 244, and a gain mapper 246.
- the threshold comparer 242 generates logic signals as defined in (34) below.
- the combinatorial unit 244 performs Boolean algebra on these logic signals to yield an output logic signal, BINGAIN, that indicates whether the respective frequency band should be assigned a high gain for signal or a low gain for noise.
- the gain mapper 246 maps the output logic signal to actual gain values according to (35) below.
- the near field gain function can be written as in (36) below.
- the narrow band powers are evaluated.
- the gain function can be described with (38) below.
- FIG. 35 a block diagram of an embodiment of the near field gain function applier 226 of FIG 32 using a table look up implementation with subsequent approximation interpolation is shown.
- the function inputs are rounded to a predefined precision by rounder 248.
- the rounded inputs are then mapped, by address mapper 250, to an integer space to yield an address.
- the address is used by look up 252 to look up in a pre-computed table 254.
- the table 254 may be stored on any storage device including RAM, ROM, hard disk, and the like.
- the table 254 may contain gain values in an encoded form.
- a gain mapper 256 for mapping back to gain space may optionally be inserted as shown.
- an interpolator 258 is optionally provided to yield the raw gain output.
- the table 254 may contain parameter derivatives in addition to parameter values.
- the WidebandPowers are monitored. At good signal to noise ratios, all of the signals are passed through without attenuation. At poor signal to noise ratios, a near field characteristic is used.
- the NarrowBandPowers are monitored.
- gain functions of different widths are chosen. For example, the gain function of different widths are shown in the graph of FIG. 36b.
- the wideband signal power is compared with the wideband noise power and two gain strategies are implemented, that is, both omni directional and directional.
- the omni directional strategy is chosen in all narrow frequency bands covered by wide bands where signal power is greater than a predefined constant times the noise power. In all other bands, the directional strategy is chosen.
- the filtering is performed in the time domain with a FIR or IIR filter.
- the filtering is performed with a FFT based FIR filtering.
- the filtering is performed with a time domain filterbank.
Landscapes
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Description
Claims
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP02767370A EP1415502A2 (en) | 2001-08-10 | 2002-08-12 | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US92778401A | 2001-08-10 | 2001-08-10 | |
US09/927,784 | 2001-08-10 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2003015458A2 true WO2003015458A2 (en) | 2003-02-20 |
WO2003015458A3 WO2003015458A3 (en) | 2003-10-23 |
Family
ID=25455251
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2002/009029 WO2003015458A2 (en) | 2001-08-10 | 2002-08-12 | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP1415502A2 (en) |
WO (1) | WO2003015458A2 (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008061534A1 (en) * | 2006-11-24 | 2008-05-29 | Rasmussen Digital Aps | Signal processing using spatial filter |
EP1931169A1 (en) * | 2005-09-02 | 2008-06-11 | Japan Advanced Institute of Science and Technology | Post filter for microphone array |
US9467775B2 (en) | 2011-09-02 | 2016-10-11 | Gn Netcom A/S | Method and a system for noise suppressing an audio signal |
CN117896655A (en) * | 2024-03-14 | 2024-04-16 | 深圳前海深蕾半导体有限公司 | Balanced frequency divider circuit and terminal equipment |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3057960A (en) * | 1961-03-13 | 1962-10-09 | Bell Telephone Labor Inc | Normalized sound control system |
US5978490A (en) * | 1996-12-27 | 1999-11-02 | Lg Electronics Inc. | Directivity controlling apparatus |
-
2002
- 2002-08-12 WO PCT/EP2002/009029 patent/WO2003015458A2/en not_active Application Discontinuation
- 2002-08-12 EP EP02767370A patent/EP1415502A2/en not_active Withdrawn
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3057960A (en) * | 1961-03-13 | 1962-10-09 | Bell Telephone Labor Inc | Normalized sound control system |
US5978490A (en) * | 1996-12-27 | 1999-11-02 | Lg Electronics Inc. | Directivity controlling apparatus |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1931169A1 (en) * | 2005-09-02 | 2008-06-11 | Japan Advanced Institute of Science and Technology | Post filter for microphone array |
EP1931169A4 (en) * | 2005-09-02 | 2009-12-16 | Japan Adv Inst Science & Tech | POST-FILTER FOR A MICROPHONE MATRIX |
CN101263734B (en) * | 2005-09-02 | 2012-01-25 | 丰田自动车株式会社 | Post-filter for microphone array |
WO2008061534A1 (en) * | 2006-11-24 | 2008-05-29 | Rasmussen Digital Aps | Signal processing using spatial filter |
AU2007323521B2 (en) * | 2006-11-24 | 2011-02-03 | Sonova Ag | Signal processing using spatial filter |
US8565459B2 (en) | 2006-11-24 | 2013-10-22 | Rasmussen Digital Aps | Signal processing using spatial filter |
US8965003B2 (en) | 2006-11-24 | 2015-02-24 | Rasmussen Digital Aps | Signal processing using spatial filter |
US9467775B2 (en) | 2011-09-02 | 2016-10-11 | Gn Netcom A/S | Method and a system for noise suppressing an audio signal |
CN117896655A (en) * | 2024-03-14 | 2024-04-16 | 深圳前海深蕾半导体有限公司 | Balanced frequency divider circuit and terminal equipment |
Also Published As
Publication number | Publication date |
---|---|
EP1415502A2 (en) | 2004-05-06 |
WO2003015458A3 (en) | 2003-10-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2002331235B2 (en) | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment | |
AU2002331235A1 (en) | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment | |
US10225674B2 (en) | Robust noise cancellation using uncalibrated microphones | |
EP1994788B1 (en) | Noise-reducing directional microphone array | |
EP2848007B1 (en) | Noise-reducing directional microphone array | |
EP0720811B1 (en) | Noise reduction system for binaural hearing aid | |
RU2434262C2 (en) | Near-field vector signal enhancement | |
JP5762956B2 (en) | System and method for providing noise suppression utilizing nulling denoising | |
US10657981B1 (en) | Acoustic echo cancellation with loudspeaker canceling beamformer | |
AU2001245740B2 (en) | Adaptive microphone matching in multi-microphone directional system | |
US8971548B2 (en) | Motor noise reduction circuit | |
JP4973655B2 (en) | Adaptive array control device, method, program, and adaptive array processing device, method, program using the same | |
GB2453118A (en) | Generating a speech audio signal from multiple microphones with suppressed wind noise | |
WO2003015460A2 (en) | Sound processing system including wave generator that exhibits arbitrary directivity and gradient response | |
EP1415502A2 (en) | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment | |
WO2003015459A2 (en) | Sound processing system that exhibits arbitrary gradient response | |
AU2002331236A1 (en) | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment | |
EP1305975A4 (en) | ADAPTIVE MICROPHONE ARRAY SYSTEM WITH CONSERVATION OF BINAURAL NOTES | |
AU2002331238A1 (en) | Sound processing system including wave generator that exhibits arbitrary directivity and gradient response | |
AU2002331237A1 (en) | Sound processing system that exhibits arbitrary gradient response | |
Samtani et al. | FPGA implementation of adaptive beamforming in hearing aids | |
Geiser et al. | A differential microphone array with input level alignment, directional equalization and fast notch adaptation for handsfree communication | |
Zou et al. | A broadband speech enhancement technique based on frequency invariant beamforming and GSC | |
Chintakuntla | Keystroke Resilience in Laptop Acoustics Using Wiener Beamformer and Spectral Subtraction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EC EE ES FI GB GD GE GH HR HU ID IL IN IS JP KE KG KP KR LC LK LR LS LT LU LV MA MD MG MN MW MX MZ NO NZ OM PH PL PT RU SD SE SG SI SK SL TJ TM TN TR TZ UA UG UZ VC VN YU ZA ZM Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2002331236 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2002767370 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2002767370 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
NENP | Non-entry into the national phase in: |
Ref country code: JP |
|
WWW | Wipo information: withdrawn in national office |
Country of ref document: JP |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2002767370 Country of ref document: EP |