CN100580774C - Method and device for eliminating recording surge - Google Patents
Method and device for eliminating recording surge Download PDFInfo
- Publication number
- CN100580774C CN100580774C CN200510097579A CN200510097579A CN100580774C CN 100580774 C CN100580774 C CN 100580774C CN 200510097579 A CN200510097579 A CN 200510097579A CN 200510097579 A CN200510097579 A CN 200510097579A CN 100580774 C CN100580774 C CN 100580774C
- Authority
- CN
- China
- Prior art keywords
- signal
- surge
- voice
- segment
- compensation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 49
- 230000003321 amplification Effects 0.000 claims description 4
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 4
- 230000003213 activating effect Effects 0.000 claims description 3
- 230000006872 improvement Effects 0.000 abstract description 3
- 238000010586 diagram Methods 0.000 description 12
- 230000000694 effects Effects 0.000 description 6
- 238000013461 design Methods 0.000 description 4
- 230000007547 defect Effects 0.000 description 3
- 230000008030 elimination Effects 0.000 description 3
- 238000003379 elimination reaction Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
Images
Landscapes
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Abstract
Description
技术领域 technical field
本发明涉及一种消除录音突波的方法及其装置,特别是涉及一种消除电子设备在一开始录音时所录得的语音讯号上产生的突波的消除录音突波的方法及其装置。The present invention relates to a method and device for eliminating recording surges, in particular to a method and device for eliminating recording surges that are generated on voice signals recorded by electronic equipment at the beginning of recording.
背景技术 Background technique
随着移动(即行动,以下均称为移动)运算技术的不断演进与提升,将语音辨识功能附加在诸如PDA、移动电话等携带式电子设备上已然成为一重要的应用趋势。With the continuous evolution and improvement of mobile (ie, mobile, hereinafter referred to as mobile) computing technology, it has become an important application trend to add the voice recognition function to portable electronic devices such as PDAs and mobile phones.
请参阅图1所示,是具有语音辨识功能的一现有的以往电子设备的内部电路方块示意图。现有的以往具有语音辨识功能的携带式电子装置(例如PDA、移动导航器或移动电话等)1会将其麦克风10收录的语音讯号(例如一语音命令)直接送至其后端的一语音辨识器11进行语音辨识,并将完成语音辨识的语音讯号送至其后端的处理单元12,使其根据辨识出来的语音命令执行对应的功能。Please refer to FIG. 1 , which is a schematic block diagram of an internal circuit of an existing conventional electronic device with a speech recognition function. The existing portable electronic device (such as PDA, mobile navigator or mobile phone, etc.) 1 with voice recognition function in the past will directly send the voice signal (such as a voice command) recorded by its
但是一般设置在携带式电子装置上的麦克风10却与传统独立的麦克风具有不同的电气特性,尤其是当麦克风10刚被启动要进行录音的瞬间(大约前0.7秒期间),会使得在此期间录得的讯号波形产生巨大的偏移,如图2所示,是一发生突波的语音讯号波形图,此现象通常被称为突波(risingbias),突波对于后端的语音辨识主要会产生两个重大的影响。首先,当发生突波的位置没有语音产生时,会让语音辨识器11误认突波是语音讯号而造成语音辨识器11对语音片段的误判;再者,当发生突波的位置有语音产生时,突波会叠加在语音讯号上,破坏该语音讯号的特征,使语音讯号失真,造成语音辨识器11对正确语音的误判。这两者都会对语音辨识的正确率造成巨大的影响,并且因而降低使用者对语音辨识系统的信赖度与亲近度。But the
因此,以往一种消除突波的方法则禁止将麦克风启动瞬间(例如前0.7秒)所录得的讯号送入语音辨识器中,亦即将语音讯号中前0.7秒的讯号片段截掉后再送入语音辨识器进行语音辨识,借此消除突波对语音辫识的影响。但是此种作法对于在麦克风一启动的瞬间即有语音输入的情况来说,则会将部分语音片段连同突波一起截掉,而仍然会造成后端语音辨识器对于不完整语音讯号的误判。Therefore, in the past, a method of eliminating surges prohibited sending the signal recorded at the moment when the microphone was activated (for example, the first 0.7 seconds) to the speech recognizer, that is, the signal segment of the first 0.7 seconds in the voice signal was cut off and then sent to the The voice recognizer performs voice recognition, so as to eliminate the influence of the surge on the voice recognition. However, this method will cut off part of the voice segment together with the surge for the situation where there is voice input at the moment when the microphone is turned on, and still cause the back-end voice recognizer to misjudge the incomplete voice signal .
由此可见,上述现有的消除录音突波的方法及其装置在方法、产品结构及使用上,显然仍存在有不便与缺陷,而亟待加以进一步改进。为了解决消除录音突波的方法及其装置存在的问题,相关厂商莫不费尽心思来谋求解决之道,但长久以来一直未见适用的设计被发展完成,而一般方法及装置又没有适切的方法及结构能够解决上述问题,此显然是相关业者急欲解决的问题。因此如何能创设一种新的消除录音突波的方法及其装置,便成了当前业界极需改进的目标。This shows that the above-mentioned existing method for eliminating recording surge and its device obviously still have inconvenience and defects in method, product structure and use, and need to be further improved urgently. In order to solve the problems of the method and device for eliminating the recording surge, the relevant manufacturers have tried their best to find a solution, but no suitable design has been developed for a long time, and there is no suitable general method and device. The method and structure can solve the above-mentioned problems, and this is obviously a problem that relevant industry players are eager to solve. Therefore, how to create a new method and device for eliminating the recording surge has become a goal that the current industry needs to improve.
有鉴于上述现有的消除录音突波的方法及其装置存在的缺陷,本发明人基于从事此类产品设计制造多年丰富的实务经验及专业知识,并配合学理的运用,积极加以研究创新,以期创设一种新的消除录音突波的方法及其装置,能够改进一般现有的方法及其装置,使其更具有实用性。经过不断的研究、设计,并经反复试作及改进后,终于创设出确具实用价值的本发明。In view of the defects in the above-mentioned existing method for eliminating recording surges and the devices thereof, the inventor has been engaged in the design and manufacture of this type of product for many years with rich practical experience and professional knowledge, and cooperated with the application of theories to actively research and innovate, in order to A new method and device for eliminating recording surge can be created, which can improve the general existing method and device, making it more practical. Through continuous research, design, and after repeated trials and improvements, the present invention with practical value is finally created.
发明内容 Contents of the invention
本发明的目的在于,克服现有的消除录音突波的方法及其装置存在的缺陷,而提供一种能够有效减少携带式电子装置的语音辨识错误率的消除录音突波的方法及其装置,从而更加适于实用。The object of the present invention is to overcome the defects of existing methods and devices for eliminating recording surges, and provide a method and device for eliminating recording surges that can effectively reduce the speech recognition error rate of portable electronic devices. Therefore, it is more suitable for practical use.
本发明的目的及解决其技术问题是采用以下技术方案来实现的。依据本发明提出的一种消除录音突波的方法,应用于一电子设备上,该电子设备具有一用以收录语音以产生一语音讯号的麦克风,该方法包括以下的步骤:(A)、于该语音讯号中截取产生突波的一讯号区段;(B)、将一预设的补偿讯号与该讯号区段进行补偿运算,以消除该突波;以及(C)、组合经过补偿的该讯号区段与经过步骤(A)的被截取该讯号区段的一剩余语音讯号,以形成一完整的语音讯号。The purpose of the present invention and the solution to its technical problems are achieved by adopting the following technical solutions. A method for eliminating recording surge according to the present invention is applied to an electronic device, and the electronic device has a microphone for recording voice to generate a voice signal. The method includes the following steps: (A), Intercepting a signal segment that produces a surge in the voice signal; (B), performing a compensation operation on a preset compensation signal and the signal segment to eliminate the surge; and (C), combining the compensated signal segment The signal segment and a remaining voice signal of the signal segment intercepted through the step (A) to form a complete voice signal.
本发明的目的及解决其技术问题还采用以下技术措施来进一步实现。The purpose of the present invention and the solution to its technical problems also adopt the following technical measures to further realize.
前述的消除录音突波的方法,其中所述的步骤(A)中,该突波是发生在该语音讯号开始后的一预定时间内,且该讯号区段是指在该预定时间内的该语音讯号。The aforementioned method for eliminating recording spikes, wherein in the step (A), the spikes occur within a predetermined time after the start of the voice signal, and the signal segment refers to the voice signal.
前述的消除录音突波的方法,其中所述的步骤(B)中,该补偿讯号是一与该讯号区段的长度相同且突波振幅及位置相同的突波讯号。In the aforementioned method for eliminating recording spikes, in the step (B), the compensation signal is a spike signal with the same length as the signal segment and the same surge amplitude and position.
前述的消除录音突波的方法,其中所述的步骤(B)中,该补偿讯号是一与该讯号区段的长度相同且突波位置及振幅相同但相位相反的突波讯号。In the aforementioned method for eliminating recording spikes, in the step (B), the compensation signal is a spike signal with the same length as the signal segment, the same spike position and amplitude, but opposite phase.
前述的消除录音突波的方法,其中所述的补偿讯号是预先借由反复启动该麦克风以收录复数产生突波的无语音讯号,然后预先经由步骤(A)于该等复数无语音讯号中取出产生突波的复数讯号区段后,将该等复数讯号区段加总并平均后,乘上一放大增益所产生。The aforementioned method for eliminating recording spikes, wherein the compensation signal is to record multiple non-speech signals that generate spikes by repeatedly activating the microphone in advance, and then extract them from the multiple non-speech signals through step (A) in advance After the complex signal segments of the surge are generated, the complex signal segments are summed and averaged, and then multiplied by an amplification gain.
前述的消除录音突波的方法,其中所述的步骤(B)中,是借由将该讯号区段与该补偿讯号相减而消除该讯号区段上的突波。In the aforementioned method for eliminating recording spikes, in the step (B), the spikes on the signal segment are eliminated by subtracting the signal segment from the compensation signal.
前述的消除录音突波的方法,其中所述的步骤(B)中,是借由将该讯号区段与该补偿讯号相加而消除该讯号区段上的突波。In the aforementioned method for eliminating recording spikes, in the step (B), the spikes on the signal segment are eliminated by adding the signal segment to the compensation signal.
本发明的目的及解决其技术问题还采用以下技术方案来实现的。依据本发明提出的一种消除录音突波的装置,设置在一电子设备的一麦克风输出端,用以接收由该麦克风收录的一语音讯号,该装置包括:一讯号截取单元,用以于该语音讯号中截取产生突波的一讯号区段;一放大单元,是根据该麦克风的增益放大一预设的补偿讯号;一补偿单元,对该讯号区段及该放大的补偿讯号进行补偿运算,以消除该讯号区段中的突波;以及一组合单元,用以组合经过补偿的该讯号区段与被截取该讯号区段的一剩余语音讯号,以输出一完整的语音讯号。The purpose of the present invention and the solution to its technical problems are also achieved by the following technical solutions. A device for eliminating recording surges according to the present invention is provided at a microphone output end of an electronic device to receive a voice signal recorded by the microphone, and the device includes: a signal intercepting unit for the Intercepting a signal segment that generates a surge in the voice signal; an amplifying unit that amplifies a preset compensation signal according to the gain of the microphone; a compensation unit that performs a compensation operation on the signal segment and the amplified compensation signal, to eliminate the surge in the signal segment; and a combining unit for combining the compensated signal segment and a residual voice signal of the intercepted signal segment to output a complete voice signal.
本发明的目的及解决其技术问题还采用以下技术措施来进一步实现。The purpose of the present invention and the solution to its technical problems also adopt the following technical measures to further realize.
前述的消除录音突波的装置,其中所述的放大的补偿讯号是一与该讯号区段的长度相同且突波振幅及位置相同的突波讯号。In the aforementioned device for eliminating recording spikes, the amplified compensation signal is a surge signal with the same length as the signal segment and the same surge amplitude and position.
前述的消除录音突波的装置,其中所述的消除录音突波的装置还包括一突波向量资料库,且该补偿讯号是预存在该突波向量资料库中。The aforementioned device for eliminating recording surge, wherein the device for eliminating recording surge further includes a surge vector database, and the compensation signal is pre-stored in the surge vector database.
本发明与现有技术相比具有明显的优点和有益效果。由以上技术方案可知,本发明的主要技术内容如下:Compared with the prior art, the present invention has obvious advantages and beneficial effects. As can be seen from above technical scheme, main technical content of the present invention is as follows:
为了达到上述目的,本发明提供了一种消除录音突波的方法,其应用在一电子设备上,该电子设备具有一用以收录语音以产生一语音讯号的麦克风,该方法包括:(A)由该语音讯号中截取产生突波的一讯号区段;(B)以一预设的补偿讯号与该讯号区段进行补偿运算,以消除该突波;以及(C)组合经过补偿的该讯号区段与经过步骤(A)的被截取该讯号区段的该剩余语音讯号,以形成一完整的语音讯号。In order to achieve the above object, the present invention provides a method for eliminating recording surge, which is applied on an electronic device, and the electronic device has a microphone for recording voice to generate a voice signal, the method comprising: (A) Intercepting a signal segment that generates a spike from the voice signal; (B) performing a compensation operation on the signal segment with a preset compensation signal to eliminate the spike; and (C) combining the compensated signal segment and the remaining voice signal of the intercepted signal segment after step (A) to form a complete voice signal.
该突波是发生在该语音讯号开始后的一预定时间内,且该讯号区段是指在该预定时间内的该语音讯号。The burst occurs within a predetermined time after the start of the voice signal, and the signal segment refers to the voice signal within the predetermined time.
该补偿讯号是一与该讯号区段的长度相同且突波振幅及位置相同的突波讯号。The compensation signal is a surge signal with the same length as the signal section and the same surge amplitude and position.
该补偿讯号是一与该讯号区段的长度相同且突波位置及振幅相同但相位相反的突波讯号。The compensation signal is a surge signal with the same length as the signal section, the same surge position and amplitude but opposite phase.
该补偿讯号是预先借由反复启动该麦克风以收录复数产生突波的无语音讯号,然后预先经由步骤(A)由该等无语音讯号中取出产生突波的复数讯号区段后,将该等讯号区段加总并平均后,乘上一放大增益所产生。The compensation signal is obtained by repeatedly activating the microphone in advance to record a plurality of non-speech signals that generate surges, and then extracting the multiple signal segments that generate surges from the non-speech signals through step (A) in advance, and then After the signal segments are summed and averaged, they are multiplied by an amplification gain.
在步骤(B)中,是借由将该讯号区段与该补偿讯号相减而消除该讯号区段上的突波。In step (B), the glitch on the signal segment is eliminated by subtracting the signal segment from the compensation signal.
在步骤(B)中,是借由将该讯号区段与该补偿讯号相加而消除该讯号区段上的突波。In step (B), the glitch on the signal segment is removed by adding the signal segment to the compensation signal.
另外,为了达到上述目的,本发明还提供了一种用以实现上述方法的消除录音突波的装置,设置在一电子设备的一麦克风输出端,用以接收由该麦克风收录的一语音讯号。该装置包括一讯号截取单元、一放大单元、一补偿单元及一组合单元。该讯号截取单元用以由该语音讯号中截取产生突波的一讯号区段。该放大单元是根据该麦克风的增益放大一预设的补偿讯号。该补偿单元对该讯号区段及该放大的补偿讯号进行补偿运算,以消除该讯号区段中的突波。该组合单元用以组合经过补偿的该讯号区段与被截取该讯号区段的该剩余语音讯号,以输出一完整的语音讯号。In addition, in order to achieve the above object, the present invention also provides a recording surge elimination device for realizing the above method, which is installed at a microphone output end of an electronic device to receive a voice signal recorded by the microphone. The device includes a signal intercepting unit, an amplifying unit, a compensating unit and a combination unit. The signal intercepting unit is used for intercepting a signal segment generating a surge from the voice signal. The amplifying unit amplifies a preset compensation signal according to the gain of the microphone. The compensation unit performs compensation operation on the signal section and the amplified compensation signal to eliminate the surge in the signal section. The combining unit is used for combining the compensated signal segment and the remaining voice signal of the intercepted signal segment to output a complete voice signal.
该放大的补偿讯号是一与该讯号区段的长度相同且突波振幅及位置相同的突波讯号。The amplified compensation signal is a surge signal with the same length as the signal segment and the same surge amplitude and position.
该消除录音突波的装置还包括一突波向量资料库,且该补偿讯号是预存在该突波向量资料库中。The device for eliminating recording surge also includes a surge vector database, and the compensation signal is pre-stored in the surge vector database.
借由上述技术方案,本发明消除录音突波的方法及其装置至少具有下列优点:本发明借由截取语音讯号中发生突波的片段讯号,并以一预先求得的补偿讯号与该片段讯号进行补偿运算,以抵消该片段讯号中的突波并保留其中的语音讯号,使该部分语音特征不会受到突波的破坏,而能够有效提升后端进行语音辨识时的辨识正确率。By means of the above technical solution, the method and device for eliminating recording spikes of the present invention have at least the following advantages: the present invention intercepts the segment signal where the surge occurs in the speech signal, and uses a pre-obtained compensation signal to match the segment signal Compensation calculations are performed to offset the surge in the segment signal and retain the voice signal in it, so that the part of the voice feature will not be damaged by the surge, and can effectively improve the recognition accuracy of the back-end voice recognition.
综上所述,本发明提供了一种能够有效减少携带式电子装置的语音辨识错误率的消除录音突波的方法及其装置。该消除突波的方法,应用在一电子设备上,该电子设备具有一收录语音以产生一语音讯号的麦克风,该方法包括由该语音讯号中截取发生突波的一讯号区段,并以一预设的补偿讯号与该讯号区段进行补偿运算,以消除该讯号区段上的突波,然后再组合经过补偿的该讯号区段与被截取该讯号区段的该剩余语音讯号,以组成一完整的语音讯号;借此,可以保持语音讯号的真实性与完整性,而有助于电子设备进行语音辨识时的辨识正确率的提升。其具有上述诸多的优点及实用价值,不论在方法、产品结构或功能上皆有较大改进,在技术上有较大进步,并产生了好用及实用的效果,且较现有的消除录音突波的方法及其装置具有增进的多项功效,从而更加适于实用,诚为一新颖、进步、实用的新设计。To sum up, the present invention provides a method and device for eliminating recording spikes that can effectively reduce the speech recognition error rate of portable electronic devices. The method for eliminating surge is applied to an electronic device, and the electronic device has a microphone for recording voice to generate a voice signal. The method includes intercepting a signal segment where a surge occurs from the voice signal, and using a The preset compensation signal and the signal segment are compensated to eliminate the surge on the signal segment, and then the compensated signal segment and the remaining voice signal of the intercepted signal segment are combined to form A complete voice signal; thereby, the authenticity and integrity of the voice signal can be maintained, and it is helpful to improve the recognition accuracy of the electronic device when performing voice recognition. It has the above-mentioned many advantages and practical value, no matter in method, product structure or function, it has been greatly improved, and it has made great progress in technology, and has produced easy-to-use and practical effects, and it is better than the existing recording elimination method. The surge method and its device have multiple enhanced effects, so it is more suitable for practical use, and it is a novel, progressive and practical new design.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其他目的、特征和优点能够更明显易懂,以下特举较佳实施例,并配合附图,详细说明如下。The above description is only an overview of the technical solution of the present invention. In order to better understand the technical means of the present invention, it can be implemented according to the contents of the description, and in order to make the above and other purposes, features and advantages of the present invention more obvious and understandable , the following preferred embodiments are specifically cited below, and are described in detail as follows in conjunction with the accompanying drawings.
附图说明 Description of drawings
图1是具有语音辨识功能的一现有的以往电子设备的内部电路方块示意图。FIG. 1 is a schematic block diagram of an internal circuit of a conventional conventional electronic device with a speech recognition function.
图2是一发生突波的语音讯号波形图。FIG. 2 is a waveform diagram of a speech signal in which a burst occurs.
图3是本发明消除录音突波的装置的一实施例应用在一具有语音辨识功能的电子设备上的电路方块示意图。FIG. 3 is a schematic circuit block diagram of an embodiment of the device for eliminating recording spikes of the present invention applied to an electronic device with a voice recognition function.
图4是本实施例消除录音突波的装置的详细电路方块图。Fig. 4 is a detailed circuit block diagram of the device for eliminating recording surge in this embodiment.
图5是本实施例用以产生补偿讯号的电路方块图。FIG. 5 is a block diagram of a circuit for generating compensation signals in this embodiment.
图6是一具有突波的无语音讯号波形图。FIG. 6 is a waveform diagram of a non-speech signal with spikes.
图7是一应用本实施例消除突波后的语音讯号波形图。FIG. 7 is a waveform diagram of a speech signal after the surge is eliminated by applying this embodiment.
具体实施方式 Detailed ways
为更进一步阐述本发明为达成预定发明目的所采取的技术手段及功效,以下结合附图及较佳实施例,对依据本发明提出的消除录音突波的方法及其装置其具体实施方式、方法、步骤、结构、特征及其功效,详细说明如后。In order to further explain the technical means and effects that the present invention takes to achieve the intended purpose of the invention, below in conjunction with the accompanying drawings and preferred embodiments, the specific implementation methods and methods of the method and device for eliminating recording surges proposed according to the present invention , step, structure, feature and effect thereof, detailed description is as follows.
请参阅图3所示,是本发明消除录音突波的装置的一实施例,本实施例消除录音突波的装置3,是设置在一具有语音辨识功能的携带式电子设备2(例如移动电话、移动导航器或个人数位助理(PDA)等移动装置)的收音麦克风20与进行语音辨识的语音辨识器21之间,使用者通常可以借由按压电子设备2上预设的一快速键(图中未示)来启动麦克风20以输入语音(例如一语音命令),且该麦克风20在收录来自外部的一语音命令(通常是一人名、一名词或一句话)并对应产生一语音讯号后即被禁能,直到该快速键再次被按。而由麦克风20产生的语音讯号则被送至消除录音突波的装置3进行突波消除作业后,才送至语音辨识器21进行语音辨识作业。其中,语音辨识器21中包括有一端点侦测器模组(图中未示),其直接接收外来的声音讯号并判别所录到的声音是否为一语音讯号。See also shown in Fig. 3, it is an embodiment of the device for eliminating the recording surge of the present invention, the
请再参阅图4所示,是本实施例消除录音突波的装置的详细电路方块图。本实施例的消除录音突波的装置3,主要包括一讯号截取单元31、一放大单元32、一补偿单元33、一组合单元34以及一突波向量资料库37,其中:Please refer to FIG. 4 again, which is a detailed circuit block diagram of the device for eliminating recording surge in this embodiment. The
该讯号截取单元31,其与麦克风20相连接,用以接收由麦克风收录的一语音讯号23,如图2所示。并且,由于受到麦克风20先天电气特性的影响,该麦克风20在一开始被启动的瞬间会固定在语音讯号23的前端位置(例如前0.7秒期间)产生如图2所示的突波24。因此,为了消除突波24,根据突波发生位置不变的特性,讯号截取单元31在收到来自麦克风20的语音讯号23时,会先对语音讯号23进行取样,以截取语音讯号23的发生突波的片段讯号25(即前0.7秒期间的讯号),且在本实施例中,其是对语音讯号23进行取样(取样11552点,相当于0.7秒的讯号长度),然后将发生突波的片段讯号25送至补偿单元33,并将被截取片段讯号25的剩余语音讯号(即取样点11552点以后的讯号)26送至组合单元34暂存。The
该放大单元32,在本实施例中是一乘法器,其接收该麦克风20的一增益值以及来自突波向量资料库37的一补偿讯号29。其中,该增益值是一变数,其是在麦克风20被启动时,借由读取系统晶片中对麦克风20所设定的增益值而获得。该补偿讯号29是被事先求得,其产生方式如下:The amplifying
如图5所示,是本实施例用以产生补偿讯号的电路方块图。首先借由反复启动麦克风20收录(收集)复数产生突波(与上述语音讯号的突波位置相同)的无语音讯号27(约20~30个),如图6所示,是一具有突波的无语音讯号波形图,然后将该等无语音讯号27送至讯号截取单元31,使一一对该等无语音讯号27进行取样(取11552点),以截取该等无语音讯号的前0.7秒中包含有突波的复数片段讯号28,然后将该等片段讯号28经由一加总及平均电路35进行加总并平均后,再送入一曲线修饰单元36(选择性的,可有可无)中进行波形修饰后,即可获得该补偿讯号29,并将该补偿讯号29预存在突波向量资料库37中。因此,该补偿讯号29实际上是一与该片段讯号25长度相同且突波位置相同的突波讯号。As shown in FIG. 5 , it is a circuit block diagram for generating compensation signals in this embodiment. First by repeatedly starting the
所以,当该补偿讯号29被送入放大单元32时,放大单元32会根据当时麦克风20的增益值(例如增益=6)及补偿讯号29产生当时的麦克风增益值(例如增益=3),将该补偿讯号29乘上一适当增益值(6/3=2)并将其相位反相,使得被放大的补偿讯号29’与片段讯号25的突波振幅相同但相位相反,再将放大的补偿讯号29’输出至补偿单元33。Therefore, when the
且在本实施例中,补偿单元33实际上是一加法器,所以当该放大的补偿讯号29’被送入补偿单元33与片段讯号25相加时,即可与片段讯号25中的突波相加相抵消,而消除片段讯号25中的突波。And in this embodiment, the compensation unit 33 is actually an adder, so when the amplified compensation signal 29' is sent to the compensation unit 33 to be added to the
此外,值得一指出的是,补偿单元33除了使用加法器外,亦可以使用减法器来代替,这时补偿讯号29只需被放大而不需反相,如此放大的补偿讯号被送入补偿单元33中与片段讯号25相减时,片段讯号25中的突波则会因为与补偿讯号相减而被消除。In addition, it is worth pointing out that the compensation unit 33 can also use a subtractor instead of an adder. At this time, the
然后,消除突波后的片段讯号25’被输入组合单元34中与被截取片段讯号25的剩余语音讯号26进行组合,以组成如图7所示的一完整语音讯号23’后,才送入语音辨识器21中进行后续的语音辨识作业。所以,请参阅图7所示,是一应用本实施例消除突波后的语音讯号波形图,由图7显示可知,语音讯号23经过本实施例的突波消除装置3处理后,确实可以将语音讯号23’上的突波消除,而且仍然能够保留原先收录的语音讯号特征。Then, the segment signal 25' after the surge is eliminated is combined with the remaining
又请参阅图3所示,经由语音辨识器21完成语音辨识的语音讯号会被送至其后端的处理单元22,使根据辨识出来的语音执行对应的功能。并且由于语音辨识器21的语音辨识过程是一以往的技术且非本发明的技术特征所在,故在此不再加以详述。Please also refer to FIG. 3 , the voice signal through the
请再参阅表1所示,是未应用本实施例与应用本实施例的前后实验数据比较表。Please refer to Table 1 again, which is a comparison table of experimental data before and after this embodiment is not applied and this embodiment is applied.
表1Table 1
根据表1所示实验数据可知,在尚未解决突波问题的情况下,以麦克风反复收录由两个人轮流说出的384个名词(例如人名)中,语音辨识器21可以正确辨识其中的351个名词(错33个);而当以本实施例的装置3消除语音讯号中的突波后,则麦克风所收录的384个名词中,语音辨识器可以正确辨识其中的358个名词(错26个),由此可见本实施例可以将原先的错误率由33个向上修正7个,亦即可以减少7/33=21.21%的辨识错误率。According to the experimental data shown in Table 1, it can be seen that in the situation where the surge problem has not been solved, the microphone repeatedly records 384 nouns (such as names) spoken by two people in turn, and the
此外,由表1中亦可看出,在本实施例中,补偿讯号不论是否有经过曲线修饰单元36的曲线修饰处理,其所达到的效果相同。In addition, it can also be seen from Table 1 that, in this embodiment, whether the compensation signal is subjected to the curve modification processing by the curve modification unit 36 or not, the effect achieved is the same.
经由上述的说明可知,本实施例借由截取语音讯号23中发生突波的片段讯号24,并以一预先求得的补偿讯号29与该片段讯号25进行补偿运算,以抵消该片段讯号25中的突波并保留其中的语音讯号,再将该补偿后的片段讯号25’与被截取片段讯号25的剩余语音讯号26组合成完整的语音讯号,借此,不但在消除突波的同时,可以保留在该片段讯号中的部分语音讯号,并使该部分语音特征不会受到突波的破坏,而能够有效提升后端进行语音辨识时的辨识正确率。It can be seen from the above description that in this embodiment, the
以上所述,仅是本发明的较佳实施例而已,并非对本发明作任何形式上的限制,虽然本发明已以较佳实施例揭露如上,然而并非用以限定本发明。The above descriptions are only preferred embodiments of the present invention, and do not limit the present invention in any form. Although the present invention has been disclosed as above with preferred embodiments, it is not intended to limit the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200510097579A CN100580774C (en) | 2005-12-30 | 2005-12-30 | Method and device for eliminating recording surge |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200510097579A CN100580774C (en) | 2005-12-30 | 2005-12-30 | Method and device for eliminating recording surge |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1991979A CN1991979A (en) | 2007-07-04 |
CN100580774C true CN100580774C (en) | 2010-01-13 |
Family
ID=38214191
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200510097579A Expired - Fee Related CN100580774C (en) | 2005-12-30 | 2005-12-30 | Method and device for eliminating recording surge |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100580774C (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105719657A (en) * | 2016-02-23 | 2016-06-29 | 惠州市德赛西威汽车电子股份有限公司 | Human voice extracting method and device based on microphone |
TWI827997B (en) * | 2021-11-03 | 2024-01-01 | 大陸商星宸科技股份有限公司 | Recording method, and associated integrated circuit |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1251194A (en) * | 1997-03-25 | 2000-04-19 | 英国国防部 | Recognition system |
CN1296607A (en) * | 1998-02-04 | 2001-05-23 | 夸尔柯姆股份有限公司 | System and method for noise-compensated speech recognition |
JP2003333682A (en) * | 2002-05-15 | 2003-11-21 | Nippon Telegr & Teleph Corp <Ntt> | Signal extraction method and apparatus, signal extraction program, and recording medium recording this program |
CN1585972A (en) * | 2002-08-01 | 2005-02-23 | 松下电器产业株式会社 | Audio decoding apparatus and audio decoding method |
-
2005
- 2005-12-30 CN CN200510097579A patent/CN100580774C/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1251194A (en) * | 1997-03-25 | 2000-04-19 | 英国国防部 | Recognition system |
CN1296607A (en) * | 1998-02-04 | 2001-05-23 | 夸尔柯姆股份有限公司 | System and method for noise-compensated speech recognition |
JP2003333682A (en) * | 2002-05-15 | 2003-11-21 | Nippon Telegr & Teleph Corp <Ntt> | Signal extraction method and apparatus, signal extraction program, and recording medium recording this program |
CN1585972A (en) * | 2002-08-01 | 2005-02-23 | 松下电器产业株式会社 | Audio decoding apparatus and audio decoding method |
Also Published As
Publication number | Publication date |
---|---|
CN1991979A (en) | 2007-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8762137B2 (en) | Target voice extraction method, apparatus and program product | |
US10319391B2 (en) | Impulsive noise suppression | |
Sadjadi et al. | Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification | |
WO2020088154A1 (en) | Method for voice audio noise reduction, storage medium and mobile terminal | |
US8401206B2 (en) | Adaptive beamformer using a log domain optimization criterion | |
US8032364B1 (en) | Distortion measurement for noise suppression system | |
US20200349964A1 (en) | Detection and suppression of keyboard transient noise in audio streams with aux keybed microphone | |
US9311933B2 (en) | Method of processing a voice segment and hearing aid | |
US8582792B2 (en) | Method and hearing aid for enhancing the accuracy of sounds heard by a hearing-impaired listener | |
CN106297772A (en) | Detection method is attacked in the playback of voice signal distorted characteristic based on speaker introducing | |
CN108696648B (en) | Method, device, equipment and storage medium for processing short-time voice signal | |
US20080198779A1 (en) | Multi-channel communication device and methods for reducing echoes by inserting a training sequence under a spectral mask | |
CN108364656B (en) | Feature extraction method and device for voice playback detection | |
JP2008052117A (en) | Noise eliminating device, method and program | |
CN100580774C (en) | Method and device for eliminating recording surge | |
US20180261238A1 (en) | Confused state determination device, confused state determination method, and storage medium | |
US10964307B2 (en) | Method for adjusting voice frequency and sound playing device thereof | |
WO2025043993A1 (en) | Echo cancellation method and apparatus, computer readable storage medium, and terminal device | |
TWI295053B (en) | ||
CN105427864A (en) | Method for adding contact persons through voice and terminal | |
Peng et al. | Effective Phase Encoding for End-To-End Speaker Verification. | |
JP3118023B2 (en) | Voice section detection method and voice recognition device | |
US20080198778A1 (en) | Audio communication device and methods for reducing echoes by inserting a training sequence under a spectral mask | |
CN114171032A (en) | Cross-channel voiceprint model training method, identification method, device and readable medium | |
KR100574883B1 (en) | Speech Extraction Method by Non-Voice Rejection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20100113 Termination date: 20211230 |
|
CF01 | Termination of patent right due to non-payment of annual fee |