CN100580774C

CN100580774C - Method and device for eliminating recording surge

Info

Publication number: CN100580774C
Application number: CN200510097579A
Authority: CN
Inventors: 洪英士
Original assignee: Acer Inc
Current assignee: Acer Inc
Priority date: 2005-12-30
Filing date: 2005-12-30
Publication date: 2010-01-13
Anticipated expiration: 2025-12-30
Also published as: CN1991979A

Abstract

The invention relates to a method and a device for eliminating recording surge. The method is applied to an electronic device having a microphone that records voice to generate a voice signal. It comprises intercepting from the voice signal a signal segment in which the surge occurs, performing a compensation operation on the signal segment with a preset compensation signal to eliminate the surge, and then combining the compensated signal segment with the remaining voice signal left after the interception to form a complete voice signal. The device comprises a signal intercepting unit, an amplifying unit, a compensating unit, and a combining unit that combines the compensated signal segment with the remaining voice signal to output a complete voice signal. The authenticity and integrity of the voice signal are thereby preserved, which helps improve recognition accuracy when the electronic device performs voice recognition.

Description

Method and device for eliminating recording surge

Technical Field

The present invention relates to a method and device for eliminating recording surges, and in particular to a method and device for eliminating the surge that appears on the voice signal recorded by an electronic device at the moment recording begins.

Background Art

With the continuing advance of mobile computing technology, adding voice recognition to portable electronic devices such as PDAs and mobile phones has become an important application trend.

Referring to FIG. 1, a block diagram of the internal circuit of a conventional electronic device with a voice recognition function is shown. A conventional portable electronic device 1 with voice recognition (for example a PDA, a mobile navigator, or a mobile phone) sends the voice signal (for example a voice command) recorded by its microphone 10 directly to a downstream voice recognizer 11. The recognized result is then sent to a downstream processing unit 12, which executes the function corresponding to the recognized voice command.

However, the microphone 10 fitted to a portable electronic device typically has electrical characteristics different from those of a conventional stand-alone microphone. In particular, at the moment the microphone 10 is activated to begin recording (roughly the first 0.7 seconds), the recorded waveform shows a large offset, as in the surge-affected voice signal waveform of FIG. 2. This phenomenon is commonly called a surge (rising bias). A surge affects downstream voice recognition in two major ways. First, when no speech is present where the surge occurs, the voice recognizer 11 may mistake the surge for speech and misjudge the speech segment. Second, when speech is present where the surge occurs, the surge is superimposed on the voice signal, destroying its features and distorting it, so that the voice recognizer 11 misrecognizes correct speech. Both effects greatly reduce recognition accuracy and thereby lower the user's trust in, and affinity for, the voice recognition system.

One conventional approach therefore prevents the signal recorded at the moment of microphone activation (for example the first 0.7 seconds) from reaching the voice recognizer. That is, the first 0.7 seconds of the voice signal are cut off before the signal is sent for recognition, which removes the effect of the surge on voice recognition. However, if speech is input the instant the microphone is activated, this approach cuts off part of the speech together with the surge, and the downstream voice recognizer still misjudges the resulting incomplete voice signal.

Thus the existing methods and devices for eliminating recording surges clearly still have inconveniences and defects in method, product structure, and use, and need further improvement. Manufacturers have sought a solution, but no suitable design has been developed for a long time, and general methods and devices offer no appropriate method or structure for solving the above problems. Creating a new method and device for eliminating recording surges has therefore become a goal the industry urgently needs to achieve.

In view of these defects, the inventor, drawing on many years of practical experience and professional knowledge in designing and manufacturing such products, together with the application of theory, actively pursued research and innovation to create a new method and device for eliminating recording surges that improves on existing methods and devices and is more practical. After continued research and design, and repeated prototyping and improvement, the present invention of practical value was created.

Summary of the Invention

The object of the present invention is to overcome the defects of existing methods and devices for eliminating recording surges, and to provide a method and device for eliminating recording surges that effectively reduce the voice recognition error rate of portable electronic devices and are therefore more suitable for practical use.

The object of the present invention and the solution to its technical problem are achieved by the following technical solution. A method for eliminating recording surge according to the present invention is applied to an electronic device having a microphone that records voice to generate a voice signal. The method comprises the following steps: (A) intercepting, from the voice signal, a signal segment in which a surge occurs; (B) performing a compensation operation on the signal segment with a preset compensation signal to eliminate the surge; and (C) combining the compensated signal segment with the remaining voice signal left after the signal segment was intercepted in step (A), to form a complete voice signal.

The object of the present invention and the solution to its technical problem are further achieved by the following technical measures.

In the aforementioned method for eliminating recording surge, in step (A), the surge occurs within a predetermined time after the start of the voice signal, and the signal segment is the voice signal within that predetermined time.

In the aforementioned method for eliminating recording surge, in step (B), the compensation signal is a surge signal having the same length as the signal segment and the same surge amplitude and position.

In the aforementioned method for eliminating recording surge, in step (B), the compensation signal is a surge signal having the same length as the signal segment and the same surge position and amplitude, but opposite phase.

In the aforementioned method for eliminating recording surge, the compensation signal is generated in advance by repeatedly activating the microphone to record a plurality of non-speech signals in which the surge occurs, extracting from those non-speech signals, through step (A), a plurality of signal segments in which the surge occurs, summing and averaging those signal segments, and multiplying the result by an amplification gain.

In the aforementioned method for eliminating recording surge, in step (B), the surge on the signal segment is eliminated by subtracting the compensation signal from the signal segment.

In the aforementioned method for eliminating recording surge, in step (B), the surge on the signal segment is eliminated by adding the compensation signal to the signal segment.

The object of the present invention and the solution to its technical problem are also achieved by the following technical solution. A device for eliminating recording surge according to the present invention is arranged at the output of a microphone of an electronic device to receive a voice signal recorded by the microphone. The device comprises: a signal intercepting unit for intercepting, from the voice signal, a signal segment in which a surge occurs; an amplifying unit that amplifies a preset compensation signal according to the gain of the microphone; a compensation unit that performs a compensation operation on the signal segment and the amplified compensation signal to eliminate the surge in the signal segment; and a combining unit for combining the compensated signal segment with the remaining voice signal left after the signal segment was intercepted, to output a complete voice signal.

In the aforementioned device for eliminating recording surge, the amplified compensation signal is a surge signal having the same length as the signal segment and the same surge amplitude and position.

The aforementioned device for eliminating recording surge further comprises a surge vector database, and the compensation signal is pre-stored in the surge vector database.

Compared with the prior art, the present invention has clear advantages and beneficial effects. As the above technical solutions show, the main technical content of the present invention is as follows:

To achieve the above object, the present invention provides a method for eliminating recording surge, applied to an electronic device having a microphone that records voice to generate a voice signal. The method comprises: (A) intercepting, from the voice signal, a signal segment in which a surge occurs; (B) performing a compensation operation on the signal segment with a preset compensation signal to eliminate the surge; and (C) combining the compensated signal segment with the remaining voice signal left after the signal segment was intercepted in step (A), to form a complete voice signal.

The surge occurs within a predetermined time after the start of the voice signal, and the signal segment is the voice signal within that predetermined time.

The compensation signal is a surge signal having the same length as the signal segment and the same surge amplitude and position.

Alternatively, the compensation signal is a surge signal having the same length as the signal segment and the same surge position and amplitude, but opposite phase.

The compensation signal is generated in advance by repeatedly activating the microphone to record a plurality of non-speech signals in which the surge occurs, extracting from those non-speech signals, through step (A), a plurality of signal segments in which the surge occurs, summing and averaging those signal segments, and multiplying the result by an amplification gain.

In step (B), the surge on the signal segment is eliminated by subtracting the compensation signal from the signal segment.

Alternatively, in step (B), the surge on the signal segment is eliminated by adding the compensation signal to the signal segment.

In addition, to achieve the above object, the present invention also provides a device for eliminating recording surge that implements the above method. The device is arranged at the output of a microphone of an electronic device to receive a voice signal recorded by the microphone, and comprises a signal intercepting unit, an amplifying unit, a compensation unit, and a combining unit. The signal intercepting unit intercepts, from the voice signal, a signal segment in which a surge occurs. The amplifying unit amplifies a preset compensation signal according to the gain of the microphone. The compensation unit performs a compensation operation on the signal segment and the amplified compensation signal to eliminate the surge in the signal segment. The combining unit combines the compensated signal segment with the remaining voice signal left after the signal segment was intercepted, to output a complete voice signal.

The amplified compensation signal is a surge signal having the same length as the signal segment and the same surge amplitude and position.

The device for eliminating recording surge further comprises a surge vector database, and the compensation signal is pre-stored in the surge vector database.

With the above technical solutions, the method and device for eliminating recording surge of the present invention have at least the following advantage: the invention intercepts the segment of the voice signal in which the surge occurs and performs a compensation operation on that segment with a compensation signal obtained in advance, cancelling the surge in the segment while retaining the speech in it. The speech features in that segment are therefore not damaged by the surge, which effectively improves the accuracy of downstream voice recognition.

In summary, the present invention provides a method and device for eliminating recording surge that effectively reduce the voice recognition error rate of portable electronic devices. The method is applied to an electronic device having a microphone that records voice to generate a voice signal. It comprises intercepting from the voice signal a signal segment in which a surge occurs, performing a compensation operation on the signal segment with a preset compensation signal to eliminate the surge on the segment, and then combining the compensated signal segment with the remaining voice signal left after the interception to form a complete voice signal. The authenticity and integrity of the voice signal are thereby preserved, which helps improve recognition accuracy when the electronic device performs voice recognition. The invention offers the advantages and practical value described above, represents a substantial improvement in method, product structure, and function over existing methods and devices for eliminating recording surges, and is a novel, progressive, and practical design.

The above is only an overview of the technical solution of the present invention. So that the technical means of the invention can be understood more clearly and implemented according to the specification, and so that the above and other objects, features, and advantages of the invention are easier to understand, preferred embodiments are described in detail below with reference to the accompanying drawings.

Brief Description of the Drawings

FIG. 1 is a block diagram of the internal circuit of a conventional electronic device with a voice recognition function.

FIG. 2 is a waveform diagram of a voice signal in which a surge occurs.

FIG. 3 is a circuit block diagram of an embodiment of the device for eliminating recording surge of the present invention, applied to an electronic device with a voice recognition function.

FIG. 4 is a detailed circuit block diagram of the device for eliminating recording surge of this embodiment.

FIG. 5 is a block diagram of the circuit used in this embodiment to generate the compensation signal.

FIG. 6 is a waveform diagram of a non-speech signal with a surge.

FIG. 7 is a waveform diagram of a voice signal after the surge has been eliminated by this embodiment.

Detailed Description

To further explain the technical means adopted by the present invention to achieve its intended object, and their effects, the specific implementation, method, steps, structure, features, and effects of the method and device for eliminating recording surge proposed by the present invention are described in detail below with reference to the accompanying drawings and preferred embodiments.

Referring to FIG. 3, an embodiment of the device for eliminating recording surge of the present invention is shown. The device 3 of this embodiment is arranged in a portable electronic device 2 with a voice recognition function (for example a mobile phone, a mobile navigator, or a personal digital assistant (PDA)), between the recording microphone 20 and the voice recognizer 21 that performs voice recognition. The user typically activates the microphone 20 to input speech (for example a voice command) by pressing a preset hotkey (not shown) on the electronic device 2. After recording an external voice command (usually a name, a noun, or a sentence) and generating the corresponding voice signal, the microphone 20 is disabled until the hotkey is pressed again. The voice signal generated by the microphone 20 is sent to the device 3 for surge elimination, and only then to the voice recognizer 21 for recognition. The voice recognizer 21 includes an endpoint detector module (not shown), which directly receives the incoming sound signal and determines whether the recorded sound is a voice signal.

Referring to FIG. 4, a detailed circuit block diagram of the device for eliminating recording surge of this embodiment is shown. The device 3 of this embodiment mainly comprises a signal intercepting unit 31, an amplifying unit 32, a compensation unit 33, a combining unit 34, and a surge vector database 37, wherein:

The signal intercepting unit 31 is connected to the microphone 20 and receives a voice signal 23 recorded by the microphone, as shown in FIG. 2. Because of the inherent electrical characteristics of the microphone 20, the surge 24 shown in FIG. 2 always appears at the front of the voice signal 23 (for example, during the first 0.7 seconds) at the moment the microphone 20 is activated. To eliminate the surge 24, the signal intercepting unit 31 relies on the fact that the position of the surge does not change: on receiving the voice signal 23 from the microphone 20, it first samples the voice signal 23 to intercept the segment signal 25 in which the surge occurs (the signal during the first 0.7 seconds). In this embodiment it takes 11552 samples, corresponding to a signal length of 0.7 seconds. It then sends the segment signal 25 to the compensation unit 33, and sends the remaining voice signal 26 (the signal after sample 11552) to the combining unit 34 for temporary storage.
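The interception performed by the signal intercepting unit 31 can be sketched in Python as a fixed-length split. This is a minimal illustration rather than the patented circuit: the function name `intercept` and the use of a plain list of samples are assumptions, while the segment length of 11552 samples comes from the embodiment.

```python
SURGE_SAMPLES = 11552  # segment length from the embodiment (about 0.7 s)


def intercept(voice_signal, surge_samples=SURGE_SAMPLES):
    """Split a recording into (surge segment, remaining voice signal).

    `intercept` is a hypothetical helper name; the split point is fixed
    because the surge always sits at the start of the recording.
    """
    segment = voice_signal[:surge_samples]
    remainder = voice_signal[surge_samples:]
    return segment, remainder


# Illustrative input: 20000 samples of silence.
signal = [0.0] * 20000
segment, remainder = intercept(signal)
print(len(segment), len(remainder))
```

Because the split is a plain slice, concatenating the two parts in order reproduces the original recording, which is what the combining unit relies on later.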

The amplifying unit 32 is a multiplier in this embodiment. It receives a gain value of the microphone 20 and a compensation signal 29 from the surge vector database 37. The gain value is a variable, obtained when the microphone 20 is activated by reading the gain value set for the microphone 20 in the system chip. The compensation signal 29 is obtained in advance, as follows:

FIG. 5 is a block diagram of the circuit used in this embodiment to generate the compensation signal. First, the microphone 20 is activated repeatedly to record (collect) a plurality of non-speech signals 27 (about 20 to 30), each containing a surge at the same position as the surge in the voice signal described above. FIG. 6 shows the waveform of one such non-speech signal with a surge. These non-speech signals 27 are then sent to the signal intercepting unit 31, which samples each of them (taking 11552 samples) to intercept the first 0.7 seconds, yielding a plurality of segment signals 28 that contain the surge. The segment signals 28 are summed and averaged by a summing and averaging circuit 35, and the result is optionally passed through a curve modification unit 36 for waveform modification, producing the compensation signal 29, which is pre-stored in the surge vector database 37. The compensation signal 29 is therefore a surge signal with the same length as the segment signal 25 and the same surge position.
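The summing and averaging step can be sketched in Python as a sample-wise mean over the intercepted silent segments. This is only an illustration under stated assumptions: `build_compensation` is a hypothetical name, the optional curve modification unit 36 is omitted because the source does not specify its operation, and the tiny input values are made up for readability.

```python
def build_compensation(silent_recordings, surge_samples):
    """Average the leading surge segments of several speech-free recordings.

    `build_compensation` is a hypothetical helper; each recording must be
    at least `surge_samples` long.
    """
    if not silent_recordings:
        raise ValueError("at least one recording is required")
    segments = [rec[:surge_samples] for rec in silent_recordings]
    if any(len(seg) != surge_samples for seg in segments):
        raise ValueError("every recording must cover the surge segment")
    count = len(segments)
    # zip(*segments) walks the segments sample by sample.
    return [sum(samples) / count for samples in zip(*segments)]


# Three made-up recordings; only the first 4 samples form the surge segment.
recordings = [
    [1.0, 2.0, 3.0, 4.0, 0.0],
    [3.0, 2.0, 1.0, 4.0, 0.0],
    [2.0, 2.0, 2.0, 4.0, 9.0],
]
compensation = build_compensation(recordings, surge_samples=4)
print(compensation)  # [2.0, 2.0, 2.0, 4.0]
```

In the embodiment the same procedure would run over about 20 to 30 recordings of 11552 samples each, with the result stored in the surge vector database 37.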

Therefore, when the compensation signal 29 is sent to the amplifying unit 32, the amplifying unit 32 uses the current gain value of the microphone 20 (for example, gain = 6) and the microphone gain value in effect when the compensation signal 29 was generated (for example, gain = 3) to multiply the compensation signal 29 by the appropriate gain ratio (6/3 = 2) and invert its phase. The amplified compensation signal 29' thus has the same surge amplitude as the segment signal 25 but opposite phase, and it is output to the compensation unit 33.
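The gain scaling and phase inversion described above can be sketched in Python as follows. The function name `amplify` and its parameters are illustrative assumptions, and the sketch assumes, as the text implies, that the surge amplitude scales in proportion to the microphone gain.

```python
def amplify(compensation, current_gain, reference_gain, invert=True):
    """Scale the stored compensation signal to the current microphone gain.

    `reference_gain` is the gain in effect when the compensation signal
    was recorded. With invert=True the result is phase-inverted so that
    it can be fed to an adder. All names here are hypothetical.
    """
    if reference_gain == 0:
        raise ValueError("reference_gain must be non-zero")
    scale = current_gain / reference_gain
    if invert:
        scale = -scale
    return [scale * sample for sample in compensation]


# Example from the text: current gain 6, reference gain 3, ratio 6/3 = 2.
amplified = amplify([2.0, 1.0, 0.5], current_gain=6, reference_gain=3)
print(amplified)  # [-4.0, -2.0, -1.0]
```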

In this embodiment the compensation unit 33 is an adder. When the amplified compensation signal 29' is sent to the compensation unit 33 and added to the segment signal 25, it cancels the surge in the segment signal 25, thereby eliminating it.

It is also worth noting that the compensation unit 33 may use a subtractor instead of an adder. In that case the compensation signal 29 only needs to be amplified, not inverted. When the amplified compensation signal is sent to the compensation unit 33 and subtracted from the segment signal 25, the surge in the segment signal 25 is eliminated by the subtraction.
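The equivalence of the adder and subtractor variants can be checked with a short Python sketch. The function `compensate`, its `mode` parameter, and the sample values are illustrative assumptions, chosen so that the arithmetic is exact in floating point.

```python
def compensate(segment, amplified, mode="add"):
    """Apply the compensation operation sample by sample.

    mode="add": `amplified` is expected to be phase-inverted already.
    mode="subtract": `amplified` is expected to be non-inverted.
    `compensate` is a hypothetical helper name.
    """
    if len(segment) != len(amplified):
        raise ValueError("segment and compensation must have equal length")
    if mode == "add":
        return [s + c for s, c in zip(segment, amplified)]
    if mode == "subtract":
        return [s - c for s, c in zip(segment, amplified)]
    raise ValueError("mode must be 'add' or 'subtract'")


segment = [4.5, 1.5, 1.25]        # speech plus surge (made-up values)
surge_estimate = [4.0, 2.0, 1.0]  # amplified, non-inverted compensation
inverted = [-c for c in surge_estimate]

via_adder = compensate(segment, inverted, mode="add")
via_subtractor = compensate(segment, surge_estimate, mode="subtract")
print(via_adder, via_subtractor)
```

Both calls leave the same residual, which is why the choice between adder and subtractor only changes whether the amplifying unit has to invert the phase.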

The surge-free segment signal 25' is then input to the combining unit 34 and combined with the remaining voice signal 26 left after the segment signal 25 was intercepted, forming the complete voice signal 23' shown in FIG. 7, which is then sent to the voice recognizer 21 for subsequent voice recognition. FIG. 7 is a waveform diagram of the voice signal after the surge has been eliminated by this embodiment. It shows that, after the voice signal 23 is processed by the surge elimination device 3 of this embodiment, the surge is removed from the resulting voice signal 23' while the features of the originally recorded voice signal are retained.
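Steps (A) to (C) can be chained into one end-to-end Python sketch. This is an illustration under stated assumptions, not the patented implementation: `remove_surge` is a hypothetical name, the subtractor variant is used, the surge is assumed to scale linearly with microphone gain, and the sample values are made up so that the cancellation is exact.

```python
def remove_surge(voice, compensation, current_gain, reference_gain):
    """Intercept, compensate, and recombine a recording (steps A to C)."""
    length = len(compensation)
    if len(voice) < length:
        raise ValueError("recording is shorter than the surge segment")
    # Step (A): split off the leading segment that contains the surge.
    segment, remainder = voice[:length], voice[length:]
    # Step (B): subtract the compensation signal scaled to the current gain.
    scale = current_gain / reference_gain
    compensated = [s - scale * c for s, c in zip(segment, compensation)]
    # Step (C): rejoin the compensated segment with the untouched remainder.
    return compensated + remainder


speech = [0.5, -0.5, 0.25, 0.125, -0.125]  # made-up clean speech
surge_at_gain_6 = [4.0, 2.0, 1.0]          # made-up surge on the first 3 samples
recorded = [s + b for s, b in zip(speech, surge_at_gain_6)] + speech[3:]
compensation = [2.0, 1.0, 0.5]             # stored at reference gain 3

cleaned = remove_surge(recorded, compensation, current_gain=6, reference_gain=3)
print(cleaned == speech)  # True
```

Unlike the conventional approach of discarding the first 0.7 seconds, the output has the same length as the input, so speech that overlaps the surge is kept.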

Referring again to FIG. 3, the result of voice recognition by the voice recognizer 21 is sent to the downstream processing unit 22, which executes the function corresponding to the recognized speech. Because the recognition process of the voice recognizer 21 is conventional technology and not a technical feature of the present invention, it is not described in detail here.

Table 1 compares the experimental data obtained without and with this embodiment applied.

Table 1

According to the experimental data in Table 1, with the surge problem unresolved, the voice recognizer 21 correctly recognized 351 of 384 nouns (for example names) spoken in turn by two people and recorded repeatedly through the microphone (33 errors). After the device 3 of this embodiment removed the surge from the voice signals, the voice recognizer correctly recognized 358 of the 384 recorded nouns (26 errors). This embodiment thus eliminated 7 of the original 33 errors, a 7/33 = 21.21% reduction in recognition errors.
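The arithmetic behind these figures can be reproduced with a few lines of Python. The counts are the ones stated above, and the variable names are illustrative.

```python
total = 384            # nouns recorded in the experiment
correct_before = 351   # recognized correctly without surge elimination
correct_after = 358    # recognized correctly with surge elimination

errors_before = total - correct_before  # 33
errors_after = total - correct_after    # 26
errors_removed = errors_before - errors_after

# Relative reduction in recognition errors.
reduction = errors_removed / errors_before
print(f"{errors_removed} of {errors_before} errors removed: {reduction:.2%}")
# prints "7 of 33 errors removed: 21.21%"
```

The 21.21% figure is a relative reduction in the number of errors. The absolute recognition accuracy rises from 351/384 to 358/384.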

此外，由表1中亦可看出，在本实施例中，补偿讯号不论是否有经过曲线修饰单元36的曲线修饰处理，其所达到的效果相同。In addition, it can also be seen from Table 1 that, in this embodiment, whether the compensation signal is subjected to the curve modification processing by the curve modification unit 36 or not, the effect achieved is the same.

经由上述的说明可知，本实施例借由截取语音讯号23中发生突波的片段讯号24，并以一预先求得的补偿讯号29与该片段讯号25进行补偿运算，以抵消该片段讯号25中的突波并保留其中的语音讯号，再将该补偿后的片段讯号25’与被截取片段讯号25的剩余语音讯号26组合成完整的语音讯号，借此，不但在消除突波的同时，可以保留在该片段讯号中的部分语音讯号，并使该部分语音特征不会受到突波的破坏，而能够有效提升后端进行语音辨识时的辨识正确率。It can be seen from the above description that in this embodiment, the segment signal 24 in which a surge occurs in the voice signal 23 is intercepted, and a pre-obtained compensation signal 29 is used to perform a compensation operation on the segment signal 25 to cancel the segment signal 25. surge and retain the voice signal therein, and then combine the compensated segment signal 25' with the remaining voice signal 26 of the intercepted segment signal 25 to form a complete voice signal, thereby not only eliminating the surge, but also A part of the voice signal in the segment signal is retained, and the part of the voice feature is not damaged by the surge, which can effectively improve the recognition accuracy rate when the back end performs voice recognition.

The above is only a preferred embodiment of the present invention and does not limit the invention in any form. Although the invention has been disclosed above by way of a preferred embodiment, that embodiment is not intended to limit the invention.

Claims

1. A method for eliminating recording surge, applied to an electronic device having a microphone for recording voice to generate a voice signal, characterized in that the method comprises the following steps:

(A) intercepting a signal segment of the voice signal in which a surge occurs;

(B) performing a compensation operation on the signal segment with a preset compensation signal to eliminate the surge on the signal segment; and

(C) combining the compensated signal segment with the remaining voice signal left after the signal segment is intercepted in step (A), to form a complete voice signal.

2. The method for eliminating recording surge according to claim 1, characterized in that in the step (A), the surge occurs within a predetermined time after the start of the voice signal, and the signal segment is the voice signal within the predetermined time.

3. The method for eliminating recording surge according to claim 1, characterized in that in the step (B), the compensation signal is a surge signal having the same length as the signal segment and the same surge amplitude and position.

4. The method for eliminating recording surge according to claim 1, characterized in that in the step (B), the compensation signal is a surge signal having the same length as the signal segment and the same surge position and amplitude, but opposite phase.

5. The method for eliminating recording surge according to claim 3 or 4, characterized in that the compensation signal is obtained by repeatedly activating the microphone in advance to record a plurality of non-speech signals in which the surge occurs, extracting through the step (A) a plurality of signal segments in which the surge occurs from the plurality of non-speech signals, summing and averaging the plurality of signal segments, and then multiplying the result by an amplification gain.

6. The method for eliminating recording surge according to claim 3, characterized in that in the step (B), the surge on the signal segment is eliminated by subtracting the compensation signal from the signal segment.

7. The method for eliminating recording surge according to claim 4, characterized in that in the step (B), the surge on the signal segment is eliminated by adding the compensation signal to the signal segment.

8. A device for eliminating recording surge, installed at a microphone output end of an electronic device to receive a voice signal recorded by the microphone, characterized in that the device comprises:

a signal interception unit, which intercepts a signal segment of the voice signal in which a surge occurs;

an amplifying unit, which amplifies a preset compensation signal according to the gain of the microphone;

a compensation unit, which performs a compensation operation on the signal segment and the amplified compensation signal, so as to eliminate the surge in the signal segment; and

a combination unit, which combines the compensated signal segment with the remaining voice signal left after the signal segment is intercepted, to output a complete voice signal.

9. The device for eliminating recording surge according to claim 8, characterized in that the amplified compensation signal is a surge signal having the same length as the signal segment and the same surge amplitude and position.

10. The device for eliminating recording surge according to claim 8, characterized in that the device further comprises a surge vector database, in which the compensation signal is pre-stored.
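The way claim 5 derives the compensation signal, and the equivalence of the subtraction in claim 6 and the opposite-phase addition in claim 7, can be sketched as follows. This is an illustrative sketch only: the function name `build_compensation_signal`, the sample values, the gain of 2.0, and the assumption that the surge occupies the first `segment_length` samples of each non-speech recording are hypothetical and not taken from the patent.

```python
def build_compensation_signal(non_speech_recordings, segment_length, gain=1.0):
    """Average the surge segments of several non-speech recordings,
    then multiply the average by an amplification gain (hypothetical value)."""
    if not non_speech_recordings:
        raise ValueError("at least one non-speech recording is required")

    # Extract the surge segment from each recording (assumed: leading samples).
    segments = [rec[:segment_length] for rec in non_speech_recordings]

    # Sum and average the segments sample by sample.
    count = len(segments)
    averaged = [sum(samples) / count for samples in zip(*segments)]

    # Scale by the amplification gain.
    return [gain * a for a in averaged]


# Hypothetical non-speech recordings made by activating the microphone twice.
recordings = [[4, 2, 0, 0], [6, 4, 2, 0]]
template = build_compensation_signal(recordings, segment_length=3, gain=2.0)

# Same-phase template: subtract it from the segment (claim 6).
segment = [12, 7, 3]
by_subtraction = [s - c for s, c in zip(segment, template)]

# Opposite-phase template: add it to the segment (claim 7).
inverted = [-c for c in template]
by_addition = [s + c for s, c in zip(segment, inverted)]

print(template, by_subtraction, by_addition)
```

Both variants yield the same compensated segment, so the choice between them is only a matter of whether the stored template is kept in phase with the surge or inverted.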