WO2019228135A1 - Method and device for adjusting matching threshold, storage medium and electronic device - Google Patents
Method and device for adjusting matching threshold, storage medium and electronic device Download PDFInfo
- Publication number
- WO2019228135A1 WO2019228135A1 PCT/CN2019/085534 CN2019085534W WO2019228135A1 WO 2019228135 A1 WO2019228135 A1 WO 2019228135A1 CN 2019085534 W CN2019085534 W CN 2019085534W WO 2019228135 A1 WO2019228135 A1 WO 2019228135A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- preset
- target
- matching
- matching threshold
- value
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 57
- 238000012795 verification Methods 0.000 claims description 41
- 238000004590 computer program Methods 0.000 claims description 12
- 238000011084 recovery Methods 0.000 claims 1
- 230000001755 vocal effect Effects 0.000 abstract 3
- 230000008569 process Effects 0.000 description 16
- 239000000284 extract Substances 0.000 description 11
- 238000010586 diagram Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 4
- 201000007100 Pharyngitis Diseases 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 206010013952 Dysphonia Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 208000027498 hoarse voice Diseases 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
- G06F21/32—User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
Definitions
- the present application belongs to the technical field of terminals, and in particular, to a method, an apparatus, a storage medium, and an electronic device for adjusting a matching threshold.
- a user can control a terminal by using a voice, that is, after receiving a voice message sent by the user, the terminal can analyze the voice information to obtain a control instruction. Before executing the control instruction, the terminal needs to extract voiceprint features from the voice information, and perform voiceprint recognition for the user according to the extracted voiceprint features. Only after the voiceprint recognition is passed, the terminal will execute the control instruction corresponding to the voice information.
- the embodiments of the present application provide a method, a device, a storage medium, and an electronic device for adjusting a matching threshold, which can improve the flexibility of voiceprint recognition.
- An embodiment of the present application provides a method for adjusting a matching threshold, including:
- the value of the preset matching threshold is adjusted.
- An embodiment of the present application provides a device for adjusting a matching threshold, including:
- An acquisition module configured to acquire voiceprint feature information of the current user
- a matching module configured to match the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree
- a verification module configured to perform identity verification on the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold
- An adjustment module is configured to adjust the value of the preset matching threshold after the identity verification is passed.
- An embodiment of the present application provides a storage medium having a computer program stored thereon.
- the computer program When the computer program is executed on a computer, the computer is caused to execute a process in a method for adjusting a matching threshold provided by an embodiment of the present application.
- An embodiment of the present application further provides an electronic device including a memory and a processor, where the processor executes a process in a method for adjusting a matching threshold provided by an embodiment of the present application by calling a computer program stored in the memory.
- FIG. 1 is a schematic flowchart of a method for adjusting a matching threshold according to an embodiment of the present application.
- FIG. 2 is another schematic flowchart of a method for adjusting a matching threshold according to an embodiment of the present application.
- 3 to 5 are schematic diagrams of a method for adjusting a matching threshold according to an embodiment of the present application.
- FIG. 6 is a schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application.
- FIG. 7 is another schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application.
- FIG. 8 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
- FIG. 9 is another schematic structural diagram of a mobile terminal according to an embodiment of the present application.
- An embodiment of the present application provides a method for adjusting a matching threshold, including:
- the value of the preset matching threshold is adjusted.
- performing identity verification on the current user may include: if the target matching degree is less than the preset matching threshold Obtaining the difference between the preset matching threshold and the target matching degree; if it is detected that the difference is less than or equal to the preset difference, determining between the target matching degree and the preset matching threshold The preset conditions are met, and the current user is authenticated.
- adjusting the value of the preset matching threshold may include: acquiring multiple voices of the current user, and extracting target voiceprint feature information from each voice; The pattern feature information is matched with the preset voiceprint feature information one by one to obtain a plurality of standby matching degrees; an average value of the plurality of standby matching degrees is calculated; and the value of the preset matching threshold is changed from the current The target value is adjusted to the average value.
- the method may further include: setting a valid period; and when the valid period ends, setting the preset period The value of the matching threshold is restored from the average value to the target value.
- the method before acquiring the multi-segment voice of the current user, the method further includes: calculating a percentage difference between the preset matching threshold and the target matching degree; and converting the percentage difference into one hundred Is the fraction of the denominator; the number of the numerator of the fraction is determined as the target value.
- acquiring the multi-segment voices of the current user may include: acquiring multiple multi-segment voices whose number is the target value.
- the method may further include: setting a preset value
- acquiring the multi-segment voices of the current user may include: acquiring the multi-segment voices whose quantity is the preset value.
- performing identity verification on the current user may include: performing identity verification on the current user through fingerprint recognition, face recognition, or iris recognition.
- the execution subject of the embodiment of the present application may be a terminal device such as a smart phone or a tablet computer.
- FIG. 1 is a schematic flowchart of a matching threshold adjustment method according to an embodiment of the present application.
- the process may include:
- a user can control a terminal by using a voice, that is, after receiving a voice message sent by the user, the terminal can analyze the voice information to obtain a control instruction. Before executing the control instruction, the terminal needs to extract voiceprint features from the voice information, and perform voiceprint recognition for the user according to the extracted voiceprint features. Only after the voiceprint recognition is passed, the terminal will execute the control instruction corresponding to the voice message.
- the terminal has poor flexibility when performing voiceprint recognition on the user.
- the terminal may obtain voiceprint feature information of the current user. For example, the terminal may first obtain a voice uttered by the current user, and then extract voiceprint feature information from the voice.
- the voiceprint feature information of the current user is matched with the preset voiceprint feature information to obtain a corresponding target matching degree.
- the terminal may match the voiceprint feature information of the current user with the preset voiceprint feature information to obtain a corresponding target matching degree.
- the preset voiceprint feature information may be voiceprint feature information of the terminal owner and / or voiceprint feature information of a user (such as the owner ’s family, etc.) permitted by the terminal owner.
- the terminal may detect whether the target matching degree and the preset matching threshold value meet a preset condition.
- the preset matching threshold value is a value used to determine whether the voiceprint recognition is passed (that is, the preset matching threshold value is a value used to determine whether the voiceprint feature information of the current user and the preset voiceprint feature information are successfully matched), that is, During voiceprint recognition, if the degree of matching between the user's voiceprint feature information and the preset voiceprint feature information reaches a preset matching threshold, the terminal may determine that the voiceprint recognition passes.
- the terminal may perform other operations.
- step 103 is performed.
- step S104 after the identity verification is passed, the value of the preset matching threshold is adjusted.
- 103 and 104 can include:
- the terminal detects that a preset condition is satisfied between the target matching degree and a preset matching threshold. In this case, the terminal can authenticate the current user to verify whether the current user is the terminal owner or other legal user.
- the terminal can preset a value of the matching threshold value for adjustment. After that, the terminal can perform voiceprint recognition according to the preset matching threshold adjusted by the value.
- the terminal when the terminal detects that the matching degree between the voiceprint feature information and the preset voiceprint feature information of the current user satisfies a preset condition and presets a matching threshold, The current user is authenticated. After the identity verification is passed, the terminal can adjust the value of the preset matching threshold, so that the terminal can subsequently perform voiceprint recognition according to the preset matching threshold adjusted by the value. Since the preset matching threshold for judging whether the voiceprint recognition passes can be adjusted in this embodiment, this embodiment can improve the flexibility of the terminal when performing voiceprint recognition.
- FIG. 2 is another schematic flowchart of a matching threshold adjustment method according to an embodiment of the present application.
- the process may include:
- the terminal acquires voiceprint feature information of the current user.
- the terminal may obtain voiceprint characteristic information of the current user.
- the terminal matches the voiceprint feature information of the current user with the preset voiceprint feature information to obtain a corresponding target matching degree.
- the terminal may match the voiceprint feature information of the current user with the preset voiceprint feature information to obtain a corresponding target matching degree.
- the preset voiceprint feature information may be voiceprint feature information of the terminal owner and / or voiceprint feature information of a user (such as the owner ’s family, etc.) permitted by the terminal owner.
- the terminal may compare the target matching degree with a preset matching threshold.
- the preset matching threshold is a value used to determine whether voiceprint recognition is passed. That is, when the voiceprint recognition is performed, if the degree of matching between the user's voiceprint feature information and the preset voiceprint feature information reaches a preset matching threshold, Then the terminal may determine that the voiceprint recognition is passed.
- the terminal can execute the control instruction corresponding to the voice issued by the current user.
- the process may proceed to 203.
- the terminal obtains a difference between the preset matching threshold and the target matching degree.
- the terminal determines that a preset condition is satisfied between the target matching degree and the preset matching threshold, and performs identity verification on the current user.
- 203 and 204 can include:
- the terminal detects that the target matching degree between the voiceprint feature information of the current user and the preset voiceprint feature information is 86%, which is less than a preset matching threshold of 90%.
- the terminal may obtain a difference between the preset matching threshold and the target matching degree.
- the terminal obtains that the difference between the preset matching threshold and the target matching degree is 4% (90% -86%).
- the terminal can detect whether the difference is less than or equal to a preset difference.
- the terminal may perform other operations.
- the terminal can determine that the target matching degree between the voiceprint feature information and the preset voiceprint feature information of the current user and the preset matching threshold satisfy a preset condition. In this case, it can be considered that the voiceprint feature information of the current user is close to the preset voiceprint feature information. At this point, the terminal can be triggered to authenticate the current user.
- the terminal may perform identity authentication on the current user by using other methods than voiceprint recognition, such as fingerprint recognition, face recognition, or iris recognition.
- the current user may be considered as an illegal user, and the terminal may not execute the voice instruction sent by the current user at this time.
- the terminal obtains multiple pieces of speech of the current user, and extracts target voiceprint feature information from each piece of speech.
- the terminal matches all the target voiceprint feature information with the preset voiceprint feature information one by one, and obtains multiple standby matching degrees.
- the terminal calculates an average value of the plurality of standby matching degrees.
- the terminal adjusts the value of the preset matching threshold value from the current target value to the average value.
- 205, 206, 207, and 208 can include:
- the terminal can obtain multiple pieces of voice of the current user and extract voiceprint feature information from each voice, that is, target voiceprint feature information.
- the terminal can match all target voiceprint feature information with the preset voiceprint feature information one by one, that is, the terminal can match each target voiceprint feature information with the preset voiceprint feature information, thereby obtaining multiple matching degrees. , That is, multiple in-use matches.
- the terminal may calculate an average of the standby matching degrees. Then, the terminal may adjust the value of the preset matching threshold from the current target value to the average value.
- the terminal may acquire 5 segments of voice of the current user, and extract target voiceprint feature information from each segment of voice. Then, the terminal can match each target voiceprint feature information with preset voiceprint feature information, thereby obtaining 5 standby matching degrees. For example, the five standby matches are 88%, 87%, 85%, 85%, 87%, respectively. After that, the terminal can calculate an average of the five standby matching degrees, for example, the average is 86.4%. After calculating the average value, the terminal may adjust the value of the preset matching threshold from the current 90% to 86.4%.
- the terminal sets a validity period, and when the validity period ends, the preset matching threshold value is restored from the average value to the target value.
- the terminal may set a validity period.
- the terminal may restore the value of the preset matching threshold from 86.4% to 90%.
- the terminal may set the validity period for the value adjustment to 5 days, that is, the terminal needs to restore the preset matching threshold value from 86.4% after 5 days 90%.
- the process may further include the following process:
- the terminal calculates a percentage difference between a preset matching threshold and a target matching degree
- the terminal converts the percentage difference into a fraction with a hundred as a denominator
- the terminal determines the number of the numerator of the score as the target value.
- the process of obtaining the multi-segment voice of the current user by the terminal in 205 may include: The terminal obtains the multi-segment voice whose quantity is the target value.
- the terminal may first calculate a percentage difference between a preset matching threshold and a target matching degree. For example, if the preset matching threshold is 90% and the target matching degree is 86%, then the percentage difference between the preset matching threshold and the target matching degree is 4%.
- the terminal may convert the percentage difference between the preset matching threshold and the target matching degree into a score with a hundred as the denominator, and determine the number of the numerator of the score as the target value. For example, after a percentage difference of 4% is converted to a fraction with a hundred as the denominator, the numerator of the fraction is 4. Then, the target value can be determined to be 4.
- the terminal can obtain the multi-segment voice whose number is the target value. For example, if the target value is 4, then the terminal can obtain 4 segments of voice of the current user.
- the terminal may determine specifically how many pieces of speech to acquire according to the difference between the preset matching threshold and the target matching degree.
- the terminal In the case where the difference between the preset matching threshold and the target matching degree is small, that is, the change of the voiceprint characteristics of the current user is small, the terminal can obtain fewer segments of speech.
- the terminal when the difference between the preset matching threshold and the target matching degree is large, that is, when the voiceprint feature of the current user changes greatly, the terminal can obtain more voices, so that the terminal can more accurately
- the voiceprint changes to adjust the value of the preset matching threshold and also increases the flexibility of adjusting the preset matching threshold.
- the terminal may also determine the specific needs to acquire several voices by:
- the terminal may set a preset value in advance, and when the terminal needs to obtain multiple segments of speech, the terminal obtains multiple segments of speech with the preset value. For example, if the preset value is 5, then the terminal can obtain the 5 voices of the current user, and extract the target voiceprint feature information of the current user from each of the 5 voices, and set each target voiceprint feature information. Match with the preset voiceprint feature information to get 5 standby matching degrees. Then, the terminal may calculate an average value of the five standby matching degrees, and adjust the preset matching threshold value from the current value to the average value.
- the terminal may randomly generate 5 segments of speech, and ask the user to pronounce these 5 segments of speech, and then extract target voiceprint feature information of the current user from each of the 5 segments of speech spoken by the user, Each target voiceprint feature information is matched with preset voiceprint feature information to obtain 5 standby matching degrees. Then, the terminal may calculate an average value of the five standby matching degrees, and adjust the preset matching threshold value from the current value to the average value. In one embodiment, the terminal may also obtain the required 5 segments of user voice when performing voice interaction with the user, and execute subsequent related processes.
- the terminal may first analyze the voice issued by the current user. If the control instruction corresponding to the voice includes two or more alternative operations, the terminal may also determine the specific needs to obtain several segments of voice according to the number of alternative operations. For example, if the user's voice is "Open Music Application", then if the terminal detects that the terminal contains 3 music applications, such as A application, B application, and C application, then the terminal can determine that it needs to obtain 3 segments of voice. Then, the terminal can sequentially generate 3 pieces of voice information for asking the user whether to open an application. When receiving the answer voice of the current user, the terminal may extract the voiceprint feature information of the current user from it.
- the terminal may extract the voiceprint feature information of the current user from it.
- FIG. 3 to FIG. 5 are schematic diagrams of a method for adjusting a matching threshold provided by an embodiment of the present application.
- the user's voiceprint characteristics are not static. In some cases, there are some changes in the user's voiceprint characteristics. For example, when the user's throat is inflamed, the user's voice will become hoarse, or when the user is eating and talking, the user's voice will also change slightly.
- the current user's voice is hoarse due to throat inflammation, which causes the voiceprint characteristics of the current user to change.
- the current user needs to use the terminal's voice assistant function at this time.
- the current user sends the following voice "Xiaoou, please open the music application" to the terminal, as shown in Figure 3.
- the terminal After receiving the voice "Xiaoou, please open the music application", the terminal can extract the voiceprint feature information of the current user from the voice, and then match the voiceprint feature information of the current user with the preset voiceprint feature information. And get the corresponding target match.
- the preset voiceprint feature information is the voiceprint feature information of the owner of the terminal. For example, due to the hoarse voice of the current user, the target matching degree between the voiceprint feature information of the current user and the preset voiceprint feature information is only 86%, and the preset matching threshold is not reached 90%, that is, the voiceprint recognition of the current user Fail.
- the terminal may calculate a difference between the target matching degree and a preset matching threshold.
- the difference between the target matching degree and the preset matching threshold is 4% (90% -86%).
- the terminal After obtaining the difference between the target matching degree and the preset matching threshold, the terminal can detect whether the difference is less than or equal to the preset difference. For example, the preset difference is 5%. Therefore, in this embodiment, the terminal can detect that the difference between the target matching degree and the preset matching threshold 4% is less than the preset difference 5%. In this case, the terminal may determine that the target matching degree and the preset matching threshold satisfy a preset condition, that is, the terminal may consider that the voiceprint feature information of the current user and the preset voiceprint feature information are close to match. The terminal can then authenticate the current user. For example, the terminal may authenticate the current user in one or more of fingerprint recognition, face recognition, and iris recognition to verify whether the current user is the terminal owner.
- the identity verification of the current user passes, that is, the terminal can confirm that the current user is the terminal owner.
- the terminal can obtain multiple voices of the current user and extract target voiceprint feature information from each voice. For example, after receiving the voice “Xiaoou, please open the music application” from the user, the terminal parses the voice and detects that the terminal contains two music applications, namely, the A application and the B application. At this point, the terminal can issue the following voice to the user: "The current terminal contains A music application and B music application, may I ask which application you need to open now", as shown in Figure 4.
- the terminal After hearing the voice from the terminal, the current user answers the following voice "Open A application", as shown in FIG. 5. After receiving the user's response and determining that the current user is the owner, the terminal can open the A application.
- the terminal can continue to obtain the multi-segment voice of the current user. Then, the terminal may select a plurality of pieces of speech with a preset number from the terminal. For example, if the preset value is 5, then the terminal can obtain 5 segments of voice of the current user and extract target voiceprint feature information of the current user from each segment of the voice. Then, the terminal may match the target voiceprint feature information with the preset voiceprint feature information, thereby obtaining a corresponding matching degree. That is, the terminal may obtain multiple matching degrees and determine the multiple matching degrees as the standby matching degrees. After that, the terminal may calculate an average value of the plurality of standby matching degrees, and adjust the value of the preset matching threshold value from the current target value to the average value.
- the terminal when the current user uses a voice assistant to control the terminal to play music, the terminal obtains 5 segments of voice and extracts 5 target voiceprint feature information from the terminal.
- the matching degree between the five target voiceprint feature information and the preset voiceprint feature information is 88%, 87%, 85%, 85%, 87% in order.
- the terminal can calculate the average of the five matching degrees, for example, the average is 86.4%.
- the terminal may adjust the value of the preset matching threshold from the current 90% to 86.4%.
- the terminal can then set a validity period. For example, if the terminal is set to be valid for 5 days, then in the next 5 days, the terminal can perform voiceprint quilts on the current user according to a matching threshold of 86.4%, and after 5 days, the terminal can set a preset matching threshold The value was restored from 86.4% to 90%.
- the terminal can set a validity period, which can correspond to The time it takes for the user's throat to heal.
- the terminal may restore the value of the preset matching threshold.
- the matching threshold adjustment device 300 may include: an obtaining module 301, a matching module 302, a verification module 303, and an adjustment module 304.
- the obtaining module 301 is configured to obtain voiceprint feature information of a current user.
- the matching module 302 is configured to match the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree.
- the verification module 303 is configured to perform identity verification on the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold.
- the adjustment module 304 is configured to adjust the value of the preset matching threshold after the identity verification is passed.
- the verification module 303 may be configured to:
- the target matching degree is less than the preset matching threshold, obtaining a difference between the preset matching threshold and the target matching degree;
- the adjustment module 304 may be configured to:
- the adjustment module is further configured to:
- the verification module 303 may be configured to:
- FIG. 7 is another schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application.
- the matching threshold adjustment device 300 may further include a restoration module 305 and a determination module 306.
- the restoration module 305 is configured to set a validity period, and when the validity period ends, restore the value of the preset matching threshold value from the average value to the target value.
- a determining module 306 configured to: calculate a percentage difference between the preset matching threshold and the target matching degree; convert the percentage difference into a fraction with a hundred denominator; determine the number of the numerator of the fraction Is the target value.
- the adjustment module 304 may be configured to: obtain a plurality of pieces of speech whose quantity is the target value.
- An embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed on a computer, the computer is caused to execute the method in the method for adjusting a matching threshold provided in this embodiment. step.
- An embodiment of the present application further provides an electronic device including a memory and a processor, where the processor executes steps in the method for adjusting a matching threshold provided by the embodiment by calling a computer program stored in the memory.
- the above electronic device may be a mobile terminal such as a tablet computer or a smart phone.
- a mobile terminal such as a tablet computer or a smart phone.
- FIG. 8 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
- the mobile terminal 400 may include components such as a microphone 401, a memory 402, and a processor 403. Those skilled in the art can understand that the structure of the mobile terminal shown in FIG. 8 does not constitute a limitation on the mobile terminal, and may include more or fewer components than those shown in the figure, or some components may be combined, or different components may be arranged.
- the microphone 401 may be used to pick up a voice and the like emitted by a user.
- the memory 402 may be used to store application programs and data.
- the application program stored in the memory 402 contains executable code.
- Applications can be composed of various functional modules.
- the processor 403 executes various functional applications and data processing by running application programs stored in the memory 402.
- the processor 403 is the control center of the mobile terminal. It uses various interfaces and lines to connect various parts of the entire mobile terminal, and executes the mobile terminal by running or executing application programs stored in the memory 402, and calling data stored in the memory 402. Various functions and processing data to monitor the mobile terminal as a whole.
- the processor 403 in the mobile terminal loads the executable code corresponding to the process of one or more applications into the memory 402 according to the following instructions, and the processor 403 runs and stores the memory in the memory. 402 application to achieve the steps:
- the value of the preset matching threshold is adjusted.
- the mobile terminal 500 may include a microphone 501, a memory 502, a processor 503, an input unit 504, an output unit 505, a speaker 506, and other components.
- the microphone 501 may be used to pick up a voice and the like emitted by a user.
- the memory 502 may be used to store application programs and data.
- the application program stored in the memory 502 contains executable code.
- Applications can be composed of various functional modules.
- the processor 503 executes various functional applications and data processing by running an application program stored in the memory 502.
- the processor 503 is the control center of the mobile terminal. It uses various interfaces and lines to connect various parts of the entire mobile terminal, and executes the mobile terminal by running or executing application programs stored in the memory 502, and calling data stored in the memory 502. Various functions and processing data to monitor the mobile terminal as a whole.
- the input unit 504 can be used to receive inputted numbers, character information, or user characteristic information (such as fingerprints), and generate keyboard, mouse, joystick, optical, or trackball signal inputs related to user settings and function control.
- user characteristic information such as fingerprints
- the output unit 505 may be used to display information input by the user or information provided to the user and various graphical user interfaces of the mobile terminal. These graphical user interfaces may be composed of graphics, text, icons, videos, and any combination thereof.
- the output unit may include a display panel.
- the processor 503 in the mobile terminal loads the executable code corresponding to the process of one or more applications into the memory 502 according to the following instructions, and the processor 503 runs the stored code in the memory 502 application to achieve the steps:
- the value of the preset matching threshold is adjusted.
- the processor 503 when the processor 503 executes the step of authenticating the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold, the processor 503 may perform: if the target If the degree of matching is less than the preset matching threshold, a difference between the preset matching threshold and the target matching degree is obtained; if it is detected that the difference is less than or equal to the preset difference, determining the target matching degree And a preset condition is met between the preset matching threshold and the current user is authenticated.
- the processor 503 may perform: obtaining multiple voices of the current user, and extracting a target voiceprint from each voice Feature information; matching all of the target voiceprint feature information with the preset voiceprint feature information one by one, and obtaining multiple standby matching degrees; calculating an average of the multiple standby matching degrees; The value of the preset matching threshold is adjusted from the current target value to the average value.
- the processor 503 may further perform: setting a validity period; when the validity period ends, Restoring the value of the preset matching threshold from the average value to the target value.
- the processor 503 executes
- the processor 503 may further perform: calculating a percentage difference between the preset matching threshold and the target matching degree; and dividing the percentage difference Converting the value to a fraction with a hundred as the denominator; determining the number of the numerator of the fraction as the target value;
- the processor 503 executes the step of acquiring the multi-segment voice of the current user, it may execute: acquiring the multi-segment voice whose quantity is the target value.
- the processor 503 may further perform: setting a preset value.
- the processor 503 executes acquiring the multi-segment voice of the current user, it may execute: acquiring the multi-segment voice whose quantity is the preset value.
- the processor 503 when the processor 503 performs identity verification on the current user, the processor 503 may execute: identity verification on the current user through fingerprint recognition, face recognition, or iris recognition.
- the apparatus for adjusting the matching threshold provided in the embodiment of the present application and the method for adjusting the matching threshold in the foregoing embodiment belong to the same concept.
- the apparatus for adjusting the matching threshold may be run in the embodiment of the method for adjusting the matching threshold.
- the computer program may be stored in a computer-readable storage medium, such as stored in the memory, and executed by at least one processor, and may include adjustment of the matching threshold during the execution process
- the storage medium may be a magnetic disk, an optical disk, a read only memory (ROM, Read Only Memory), a random access memory (RAM, Random Access Memory), and the like.
- the device for adjusting the matching threshold may be integrated into one processing chip, or each module may exist separately physically, or two or more modules may be integrated into one module.
- the above integrated modules can be implemented in the form of hardware or software functional modules. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium, such as a read-only memory, a magnetic disk, or an optical disk. .
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computer Security & Cryptography (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computer Hardware Design (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- General Engineering & Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
- Collating Specific Patterns (AREA)
Abstract
A method for adjusting a matching threshold, comprising: acquiring vocal print feature information of a current user (101); matching the vocal print feature information of the current user with preset vocal print feature information to obtain a corresponding target matching rate (102); if the target matching rate and a preset matching threshold satisfy a present condition, performing authentication on the current user (103); and adjusting a numerical value of the preset matching threshold after authentication is passed.
Description
本申请要求于2018年05月30日提交中国专利局、申请号为201810542931.2、申请名称为“匹配阈值的调整方法、装置、存储介质及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on May 30, 2018, with an application number of 201810542931.2, and the application name is "Adjustment method, device, storage medium, and electronic device for matching threshold". Citations are incorporated in this application.
本申请属于终端技术领域,尤其涉及一种匹配阈值的调整方法、装置、存储介质及电子设备。The present application belongs to the technical field of terminals, and in particular, to a method, an apparatus, a storage medium, and an electronic device for adjusting a matching threshold.
随着技术的发展,人机之间的交互方式变得越来越丰富。相关技术中,用户可以通过语音对终端进行控制,即终端在接收到用户发出的语音信息后,可以对该语音信息进行解析,得到控制指令。在执行该控制指令之前,终端需要从该语音信息中提取声纹特征,并根据提取到的声纹特征对用户进行声纹识别。只有在声纹识别通过后,终端才会执行语音信息对应的控制指令。With the development of technology, the interaction modes between human and machine have become more and more abundant. In related technology, a user can control a terminal by using a voice, that is, after receiving a voice message sent by the user, the terminal can analyze the voice information to obtain a control instruction. Before executing the control instruction, the terminal needs to extract voiceprint features from the voice information, and perform voiceprint recognition for the user according to the extracted voiceprint features. Only after the voiceprint recognition is passed, the terminal will execute the control instruction corresponding to the voice information.
发明内容Summary of the Invention
本申请实施例提供一种匹配阈值的调整方法、装置、存储介质及电子设备,可以提高声纹识别的灵活性。The embodiments of the present application provide a method, a device, a storage medium, and an electronic device for adjusting a matching threshold, which can improve the flexibility of voiceprint recognition.
本申请实施例提供一种匹配阈值的调整方法,包括:An embodiment of the present application provides a method for adjusting a matching threshold, including:
获取当前用户的声纹特征信息;Obtain the voiceprint feature information of the current user;
将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;Matching the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;
若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;If a preset condition is satisfied between the target matching degree and a preset matching threshold, performing identity verification on the current user;
在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。After the identity verification is passed, the value of the preset matching threshold is adjusted.
本申请实施例提供一种匹配阈值的调整装置,包括:An embodiment of the present application provides a device for adjusting a matching threshold, including:
获取模块,用于获取当前用户的声纹特征信息;An acquisition module, configured to acquire voiceprint feature information of the current user;
匹配模块,用于将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;A matching module, configured to match the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;
验证模块,用于若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;A verification module, configured to perform identity verification on the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold;
调整模块,用于在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。An adjustment module is configured to adjust the value of the preset matching threshold after the identity verification is passed.
本申请实施例提供一种存储介质,其上存储有计算机程序,当所述计算机程序在计算机上执行时,使得所述计算机执行本申请实施例提供的匹配阈值的调整方法中的流程。An embodiment of the present application provides a storage medium having a computer program stored thereon. When the computer program is executed on a computer, the computer is caused to execute a process in a method for adjusting a matching threshold provided by an embodiment of the present application.
本申请实施例还提供一种电子设备,包括存储器,处理器,所述处理器通过调用所述存储器中存储的计算机程序,用于执行本申请实施例提供的匹配阈值的调整方法中的流程。An embodiment of the present application further provides an electronic device including a memory and a processor, where the processor executes a process in a method for adjusting a matching threshold provided by an embodiment of the present application by calling a computer program stored in the memory.
下面结合附图,通过对本申请的具体实施方式详细描述,将使本申请的技术方案及其有益效果显而易见。The following detailed description of specific embodiments of the present application will make the technical solution of the present application and its beneficial effects obvious with reference to the accompanying drawings.
图1是本申请实施例提供的匹配阈值的调整方法的流程示意图。FIG. 1 is a schematic flowchart of a method for adjusting a matching threshold according to an embodiment of the present application.
图2是本申请实施例提供的匹配阈值的调整方法的另一流程示意图。FIG. 2 is another schematic flowchart of a method for adjusting a matching threshold according to an embodiment of the present application.
图3至图5是本申请实施例提供的匹配阈值的调整方法的场景示意图。3 to 5 are schematic diagrams of a method for adjusting a matching threshold according to an embodiment of the present application.
图6是本申请实施例提供的匹配阈值的调整装置的结构示意图。FIG. 6 is a schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application.
图7是本申请实施例提供的匹配阈值的调整装置的另一结构示意图。FIG. 7 is another schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application.
图8是本申请实施例提供的移动终端的结构示意图。FIG. 8 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
图9是本申请实施例提供的移动终端的另一结构示意图。FIG. 9 is another schematic structural diagram of a mobile terminal according to an embodiment of the present application.
请参照图示,其中相同的组件符号代表相同的组件,本申请的原理是以实施在一适当的运算环境中来举例说明。以下的说明是基于所例示的本申请具体实施例,其不应被视为限制本申请未在此详述的其它具体实施例。Please refer to the figure, wherein the same component symbols represent the same components. The principle of the present application is illustrated by implementing in an appropriate computing environment. The following description is based on the exemplified specific embodiments of the present application, which should not be construed as limiting other specific embodiments that are not detailed herein.
本申请实施例提供一种匹配阈值的调整方法,包括:An embodiment of the present application provides a method for adjusting a matching threshold, including:
获取当前用户的声纹特征信息;Obtain the voiceprint feature information of the current user;
将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;Matching the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;
若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;If a preset condition is satisfied between the target matching degree and a preset matching threshold, performing identity verification on the current user;
在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。After the identity verification is passed, the value of the preset matching threshold is adjusted.
在一种实施方式中,若所述目标匹配度和预设匹配阈值之间满足预设条件则对所述当前用户进行身份验证,可以包括:若所述目标匹配度小于所述预设匹配阈值,则获取所述预设匹配阈值与所述目标匹配度的差值;若检测到所述差值小于或等于预设差值,则确定所述目标匹配度和所述预设匹配阈值之间满足预设条件,并对所述当前用户进行身份验证。In an implementation manner, if the target matching degree and a preset matching threshold satisfy a preset condition, performing identity verification on the current user may include: if the target matching degree is less than the preset matching threshold Obtaining the difference between the preset matching threshold and the target matching degree; if it is detected that the difference is less than or equal to the preset difference, determining between the target matching degree and the preset matching threshold The preset conditions are met, and the current user is authenticated.
在一种实施方式中,对所述预设匹配阈值的数值进行调整,可以包括:获取所述当前用户的多段语音,并从每段语音中提取目标声纹特征信息;将所有所述目标声纹特征信息与所述预设声纹特征信息进行逐一匹配,并得到多个待用匹配度;计算所述多个待用匹配度的平均值;将所述预设匹配阈值的数值由当前的目标值调整为所述平均值。In one embodiment, adjusting the value of the preset matching threshold may include: acquiring multiple voices of the current user, and extracting target voiceprint feature information from each voice; The pattern feature information is matched with the preset voiceprint feature information one by one to obtain a plurality of standby matching degrees; an average value of the plurality of standby matching degrees is calculated; and the value of the preset matching threshold is changed from the current The target value is adjusted to the average value.
在一种实施方式中,在将所述预设匹配阈值的数值由当前的目标值调整为所述平均值之后,还可以包括:设置一有效期;当所述有效期结束时,将所述预设匹配阈值的数值由所述平均值恢复为所述目标值。In an implementation manner, after the value of the preset matching threshold is adjusted from the current target value to the average value, the method may further include: setting a valid period; and when the valid period ends, setting the preset period The value of the matching threshold is restored from the average value to the target value.
在一种实施方式中,在获取所述当前用户的多段语音之前,还包括:计算所述预设匹配阈值与所述目标匹配度的百分比差值;将所述百分比差值转换为以一百为分母的分数;将所述分数的分子的数字确定为目标数值。In an implementation manner, before acquiring the multi-segment voice of the current user, the method further includes: calculating a percentage difference between the preset matching threshold and the target matching degree; and converting the percentage difference into one hundred Is the fraction of the denominator; the number of the numerator of the fraction is determined as the target value.
那么,获取所述当前用户的多段语音,可以包括:获取数量为所述目标数值的多段语音。Then, acquiring the multi-segment voices of the current user may include: acquiring multiple multi-segment voices whose number is the target value.
在一种实施方式中,在获取所述当前用户的多段语音之前,还可以包括:设定预设数值;In an implementation manner, before acquiring the multi-segment voice of the current user, the method may further include: setting a preset value;
那么,获取所述当前用户的多段语音,可以包括:获取数量为所述预设数值的多段语音。Then, acquiring the multi-segment voices of the current user may include: acquiring the multi-segment voices whose quantity is the preset value.
在一种实施方式中,对所述当前用户进行身份验证,可以包括:通过指纹识别或者人脸识别或者虹膜识别对所述当前用户进行身份验证。In an implementation manner, performing identity verification on the current user may include: performing identity verification on the current user through fingerprint recognition, face recognition, or iris recognition.
可以理解的是,本申请实施例的执行主体可以是诸如智能手机或平板电脑等的终端设备。It can be understood that the execution subject of the embodiment of the present application may be a terminal device such as a smart phone or a tablet computer.
请参阅图1,图1是本申请实施例提供的匹配阈值的调整方法的流程示意图,流程可以包括:Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a matching threshold adjustment method according to an embodiment of the present application. The process may include:
在101中,获取当前用户的声纹特征信息。In 101, voiceprint feature information of the current user is acquired.
随着技术的发展,人机之间的交互方式变得越来越丰富。相关技术中,用户可以通过语音对终端进行控制,即终端在接收到用户发出的语音信息后,可以对该语音信息进行解析,得到控制指令。在执行该控制指令之前,终端需要从该语音信息中提取声纹特征,并根据提取到的声纹特征对用户进行声纹识别。只有在声纹识别通过后,终端才会执行语音 信息对应的控制指令。然而相关技术中,终端在对用户进行声纹识别时的灵活性较差。With the development of technology, the interaction modes between human and machine have become more and more abundant. In related technology, a user can control a terminal by using a voice, that is, after receiving a voice message sent by the user, the terminal can analyze the voice information to obtain a control instruction. Before executing the control instruction, the terminal needs to extract voiceprint features from the voice information, and perform voiceprint recognition for the user according to the extracted voiceprint features. Only after the voiceprint recognition is passed, the terminal will execute the control instruction corresponding to the voice message. However, in the related art, the terminal has poor flexibility when performing voiceprint recognition on the user.
在本申请的101中,比如,在用户使用终端提供的语音助手来操作终端,即需要进行声纹识别时,终端可以获取当前用户的声纹特征信息。例如,终端可以先获取当前用户发出的语音,然后从该语音中提取声纹特征信息。In 101 of this application, for example, when a user uses a voice assistant provided by the terminal to operate the terminal, that is, when voiceprint recognition is required, the terminal may obtain voiceprint feature information of the current user. For example, the terminal may first obtain a voice uttered by the current user, and then extract voiceprint feature information from the voice.
在102中,将该当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度。In 102, the voiceprint feature information of the current user is matched with the preset voiceprint feature information to obtain a corresponding target matching degree.
比如,在获取到当前用户的声纹特征信息后,终端可以将当前用户的声纹特征信息与预设声纹特征信息进行匹配,从而得到相应的目标匹配度。在一种实施方式中,预设声纹特征信息可以是本终端机主的声纹特征信息和/或经过终端机主允许的用户(如机主的家人等)的声纹特征信息。For example, after acquiring the voiceprint feature information of the current user, the terminal may match the voiceprint feature information of the current user with the preset voiceprint feature information to obtain a corresponding target matching degree. In one embodiment, the preset voiceprint feature information may be voiceprint feature information of the terminal owner and / or voiceprint feature information of a user (such as the owner ’s family, etc.) permitted by the terminal owner.
在得到当前用户的声纹特征信息与预设声纹特征信息的目标匹配度之后,终端可以检测该目标匹配度与预设匹配阈值之间是否满足预设条件。其中,该预设匹配阈值为用于判断声纹识别是否通过的数值(即预设匹配阈值为用于判断当前用户的声纹特征信息与预设声纹特征信息是否匹配成功的数值),即在进行声纹识别时,若用户的声纹特征信息和预设声纹特征信息的匹配度达到预设匹配阈值,则终端可以确定声纹识别通过。After obtaining the target matching degree between the voiceprint feature information of the current user and the preset voiceprint feature information, the terminal may detect whether the target matching degree and the preset matching threshold value meet a preset condition. The preset matching threshold value is a value used to determine whether the voiceprint recognition is passed (that is, the preset matching threshold value is a value used to determine whether the voiceprint feature information of the current user and the preset voiceprint feature information are successfully matched), that is, During voiceprint recognition, if the degree of matching between the user's voiceprint feature information and the preset voiceprint feature information reaches a preset matching threshold, the terminal may determine that the voiceprint recognition passes.
若检测到该目标匹配度与预设匹配阈值之间不满足预设条件,那么终端可以执行其它操作。If it is detected that the preset condition is not satisfied between the target matching degree and the preset matching threshold, the terminal may perform other operations.
若检测到该目标匹配度与预设匹配阈值之间满足预设条件,那么进入103。If it is detected that the preset condition is satisfied between the target matching degree and the preset matching threshold, then step 103 is performed.
在103中,若该目标匹配度和预设匹配阈值之间满足预设条件,则对该当前用户进行身份验证。In 103, if a preset condition is satisfied between the target matching degree and a preset matching threshold, the current user is authenticated.
在步骤S104中,在该身份验证通过后,对该预设匹配阈值的数值进行调整。In step S104, after the identity verification is passed, the value of the preset matching threshold is adjusted.
比如,103和104可以包括:For example, 103 and 104 can include:
终端检测到目标匹配度与预设匹配阈值之间满足预设条件。在这种情况下,终端可以对当前用户进行身份验证,以验证当前用户是否为终端机主或其他合法用户。The terminal detects that a preset condition is satisfied between the target matching degree and a preset matching threshold. In this case, the terminal can authenticate the current user to verify whether the current user is the terminal owner or other legal user.
在当前用户的身份验证通过,即在终端确定出当前用户为终端机主或其他合法用户后,终端可以预设匹配阈值的数值进行调整。之后,终端就可以根据数值调整后的预设匹配阈值来进行声纹识别。After the identity verification of the current user is passed, that is, after the terminal determines that the current user is the terminal owner or another legal user, the terminal can preset a value of the matching threshold value for adjustment. After that, the terminal can perform voiceprint recognition according to the preset matching threshold adjusted by the value.
可以理解的是,本申请实施例中,终端可以在检测到当前用户的声纹特征信息和预设声纹特征信息这二者的匹配度与预设匹配阈值之间满足预设条件时,对当前用户进行身份验证。当身份验证通过后,终端可以对预设匹配阈值的数值进行调整,以使终端后续可以根据数值调整后的预设匹配阈值来进行声纹识别。由于本实施例可以对用于判断声纹识别是否通过的预设匹配阈值进行调整,因此本实施例可以提高终端在进行声纹识别时的灵活性。It can be understood that, in the embodiment of the present application, when the terminal detects that the matching degree between the voiceprint feature information and the preset voiceprint feature information of the current user satisfies a preset condition and presets a matching threshold, The current user is authenticated. After the identity verification is passed, the terminal can adjust the value of the preset matching threshold, so that the terminal can subsequently perform voiceprint recognition according to the preset matching threshold adjusted by the value. Since the preset matching threshold for judging whether the voiceprint recognition passes can be adjusted in this embodiment, this embodiment can improve the flexibility of the terminal when performing voiceprint recognition.
请参阅图2,图2为本申请实施例提供的匹配阈值的调整方法的另一流程示意图,流程可以包括:Please refer to FIG. 2. FIG. 2 is another schematic flowchart of a matching threshold adjustment method according to an embodiment of the present application. The process may include:
在201中,终端获取当前用户的声纹特征信息。In 201, the terminal acquires voiceprint feature information of the current user.
比如,在用户使用终端提供的语音助手来操作终端,即需要进行声纹识别时,终端可以获取当前用户的声纹特征信息。For example, when a user uses a voice assistant provided by the terminal to operate the terminal, that is, when voiceprint recognition is required, the terminal may obtain voiceprint characteristic information of the current user.
在202中,终端将该当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度。In 202, the terminal matches the voiceprint feature information of the current user with the preset voiceprint feature information to obtain a corresponding target matching degree.
比如,在获取到当前用户的声纹特征信息后,终端可以将当前用户的声纹特征信息与预设声纹特征信息进行匹配,从而得到相应的目标匹配度。在一种实施方式中,预设声纹特征信息可以是本终端机主的声纹特征信息和/或经过终端机主允许的用户(如机主的家人等)的声纹特征信息。For example, after acquiring the voiceprint feature information of the current user, the terminal may match the voiceprint feature information of the current user with the preset voiceprint feature information to obtain a corresponding target matching degree. In one embodiment, the preset voiceprint feature information may be voiceprint feature information of the terminal owner and / or voiceprint feature information of a user (such as the owner ’s family, etc.) permitted by the terminal owner.
在得到当前用户的声纹特征信息与预设声纹特征信息的目标匹配度之后,终端可以将该目标匹配度与预设匹配阈值进行比较。其中,该预设匹配阈值为用于判断声纹识别是否通过的数值,即在进行声纹识别时,若用户的声纹特征信息和预设声纹特征信息的匹配度达到预设匹配阈值,则终端可以确定声纹识别通过。After obtaining the target matching degree between the voiceprint feature information of the current user and the preset voiceprint feature information, the terminal may compare the target matching degree with a preset matching threshold. The preset matching threshold is a value used to determine whether voiceprint recognition is passed. That is, when the voiceprint recognition is performed, if the degree of matching between the user's voiceprint feature information and the preset voiceprint feature information reaches a preset matching threshold, Then the terminal may determine that the voiceprint recognition is passed.
若检测到目标匹配度达到该预设匹配阈值,例如当前用户的声纹特征信息与预设声纹特征信息的匹配度为95%,而预设匹配阈值为90%,即目标匹配度超过了预设匹配阈值,那么可以认为当前用户为合法用户。在这种情况下,终端可以执行当前用户发出的语音所对应的控制指令。If it is detected that the target matching degree reaches the preset matching threshold, for example, the matching degree of the voiceprint feature information of the current user and the preset voiceprint feature information is 95%, and the preset matching threshold is 90%, that is, the target matching degree exceeds If the preset matching threshold is set, the current user can be regarded as a legitimate user. In this case, the terminal can execute the control instruction corresponding to the voice issued by the current user.
若检测到目标匹配度小于该预设匹配阈值,那么可以进入203。If it is detected that the target matching degree is less than the preset matching threshold, then the process may proceed to 203.
在203中,若该目标匹配度小于预设匹配阈值,则终端获取该预设匹配阈值与该目标匹配度的差值。In 203, if the target matching degree is less than a preset matching threshold, the terminal obtains a difference between the preset matching threshold and the target matching degree.
在204中,若检测到该差值小于或等于预设差值,则终端确定该目标匹配度和该预设匹配阈值之间满足预设条件,并对该当前用户进行身份验证。In 204, if it is detected that the difference is less than or equal to a preset difference, the terminal determines that a preset condition is satisfied between the target matching degree and the preset matching threshold, and performs identity verification on the current user.
比如,203和204可以包括:For example, 203 and 204 can include:
终端检测到当前用户的声纹特征信息与预设声纹特征信息的目标匹配度为86%,小于预设匹配阈值90%。在这种情况下,终端可以获取预设匹配阈值与该目标匹配度的差值。例如,终端获取到预设匹配阈值与该目标匹配度的差值为4%(90%-86%)。The terminal detects that the target matching degree between the voiceprint feature information of the current user and the preset voiceprint feature information is 86%, which is less than a preset matching threshold of 90%. In this case, the terminal may obtain a difference between the preset matching threshold and the target matching degree. For example, the terminal obtains that the difference between the preset matching threshold and the target matching degree is 4% (90% -86%).
之后,终端可以检测该差值是否小于或等于预设差值。After that, the terminal can detect whether the difference is less than or equal to a preset difference.
若检测到该差值大于预设差值,那么终端可以执行其它操作。If the difference is detected to be greater than a preset difference, the terminal may perform other operations.
若检测到该差值小于或等于预设差值,例如预设差值为5%,而当前用户的声纹特征信息与预设声纹特征信息的匹配度86%与预设匹配阈值90%的差值为4%,那么终端可以确定出当前用户的声纹特征信息和预设声纹特征信息的目标匹配度与预设匹配阈值之间满足预设条件。在这种情况下,可以认为当前用户的声纹特征信息与预设声纹特征信息接近于匹配。此时,可以触发终端对当前用户进行身份验证。If the difference is detected to be less than or equal to a preset difference, for example, the preset difference is 5%, and the matching degree of the voiceprint feature information of the current user with the preset voiceprint feature information is 86% and the preset matching threshold is 90% The difference is 4%, then the terminal can determine that the target matching degree between the voiceprint feature information and the preset voiceprint feature information of the current user and the preset matching threshold satisfy a preset condition. In this case, it can be considered that the voiceprint feature information of the current user is close to the preset voiceprint feature information. At this point, the terminal can be triggered to authenticate the current user.
在一种实施方式中,终端可以通过诸如指纹识别、人脸识别或者虹膜识别等除声纹识别之外的其他方式对当前用户进行身份验证。In one embodiment, the terminal may perform identity authentication on the current user by using other methods than voiceprint recognition, such as fingerprint recognition, face recognition, or iris recognition.
如果当前用户的身份验证未通过,那么可以认为当前用户是非法用户,此时终端可以不执行当前用户发出的语音指令。If the authentication of the current user fails, the current user may be considered as an illegal user, and the terminal may not execute the voice instruction sent by the current user at this time.
如果当前用户的身份验证通过,那么可以进入205。If the authentication of the current user passes, then 205 can be entered.
在205中,在该身份验证通过后,终端获取该当前用户的多段语音,并从每段语音中提取目标声纹特征信息。In 205, after the identity verification is passed, the terminal obtains multiple pieces of speech of the current user, and extracts target voiceprint feature information from each piece of speech.
在206中,终端将所有该目标声纹特征信息与该预设声纹特征信息进行逐一匹配,并得到多个待用匹配度。In 206, the terminal matches all the target voiceprint feature information with the preset voiceprint feature information one by one, and obtains multiple standby matching degrees.
在207中,终端计算该多个待用匹配度的平均值。In 207, the terminal calculates an average value of the plurality of standby matching degrees.
在208中,终端将该预设匹配阈值的数值由当前的目标值调整为该平均值。In 208, the terminal adjusts the value of the preset matching threshold value from the current target value to the average value.
比如,205、206、207和208可以包括:For example, 205, 206, 207, and 208 can include:
在当前用户的身份验证通过后,终端可以获取当前用户的多段语音,并从每段语音中提取声纹特征信息,即目标声纹特征信息。After the identity verification of the current user is passed, the terminal can obtain multiple pieces of voice of the current user and extract voiceprint feature information from each voice, that is, target voiceprint feature information.
然后,终端可以将所有目标声纹特征信息与该预设声纹特征信息进行逐一匹配,即终端可以将各个目标声纹特征信息均与预设声纹特征信息进行匹配,从而得到多个匹配度,即多个待用匹配度。Then, the terminal can match all target voiceprint feature information with the preset voiceprint feature information one by one, that is, the terminal can match each target voiceprint feature information with the preset voiceprint feature information, thereby obtaining multiple matching degrees. , That is, multiple in-use matches.
在得到多个待用匹配度之后,终端可以计算这些待用匹配度的平均值。然后,终端可以将预设匹配阈值的数值由当前的目标值调整为该平均值。After obtaining multiple standby matching degrees, the terminal may calculate an average of the standby matching degrees. Then, the terminal may adjust the value of the preset matching threshold from the current target value to the average value.
例如,终端可以获取当前用户的5段语音,并从每段语音中提取目标声纹特征信息。 然后,终端可以将每一目标声纹特征信息与预设声纹特征信息进行匹配,从而得到5个待用匹配度。例如,这5个待用匹配度分别为88%、87%、85%、85%、87%。之后,终端可以计算这5个待用匹配度的平均值,例如平均值为86.4%。在计算得到平均值之后,终端可以将预设匹配阈值的数值由当前的90%调整为86.4%。For example, the terminal may acquire 5 segments of voice of the current user, and extract target voiceprint feature information from each segment of voice. Then, the terminal can match each target voiceprint feature information with preset voiceprint feature information, thereby obtaining 5 standby matching degrees. For example, the five standby matches are 88%, 87%, 85%, 85%, 87%, respectively. After that, the terminal can calculate an average of the five standby matching degrees, for example, the average is 86.4%. After calculating the average value, the terminal may adjust the value of the preset matching threshold from the current 90% to 86.4%.
在209中,终端设置一有效期,并当该有效期结束时将该预设匹配阈值的数值由该平均值恢复为该目标值。In 209, the terminal sets a validity period, and when the validity period ends, the preset matching threshold value is restored from the average value to the target value.
比如,在将预设匹配阈值由当前的90%调整为86.4%之后,终端可以设置一有效期。当检测到该有效期结束时,终端可以将预设匹配阈值的数值由86.4%恢复为90%。For example, after adjusting the preset matching threshold from the current 90% to 86.4%, the terminal may set a validity period. When the end of the validity period is detected, the terminal may restore the value of the preset matching threshold from 86.4% to 90%.
例如,在将预设匹配阈值由当前的90%调整为86.4%之后,终端可以为本次数值调整设置有效期为5天,即在5天之后终端需要将预设匹配阈值的数值由86.4%恢复为90%。For example, after adjusting the preset matching threshold from the current 90% to 86.4%, the terminal may set the validity period for the value adjustment to 5 days, that is, the terminal needs to restore the preset matching threshold value from 86.4% after 5 days 90%.
在一种实施方式中,在205中终端获取当前用户的多段语音的流程之前,还可以包括如下流程:In an implementation manner, before the process for the terminal to obtain the multi-segment voice of the current user in 205, the process may further include the following process:
终端计算预设匹配阈值与目标匹配度的百分比差值;The terminal calculates a percentage difference between a preset matching threshold and a target matching degree;
终端将该百分比差值转换为以一百为分母的分数;The terminal converts the percentage difference into a fraction with a hundred as a denominator;
终端将该分数的分子的数字确定为目标数值。The terminal determines the number of the numerator of the score as the target value.
那么,205中终端获取当前用户的多段语音的流程,可以包括:终端获取数量为该目标数值的多段语音。Then, the process of obtaining the multi-segment voice of the current user by the terminal in 205 may include: The terminal obtains the multi-segment voice whose quantity is the target value.
比如,在身份验证通过后,终端可以先计算预设匹配阈值与目标匹配度的百分比差值。例如,预设匹配阈值为90%,目标匹配度为86%,那么预设匹配阈值与目标匹配度的百分比差值为4%。For example, after the identity verification is passed, the terminal may first calculate a percentage difference between a preset matching threshold and a target matching degree. For example, if the preset matching threshold is 90% and the target matching degree is 86%, then the percentage difference between the preset matching threshold and the target matching degree is 4%.
之后,终端可以将预设匹配阈值和目标匹配度的百分比差值转换为以一百为分母的分数,并将该分数的分子的数字确定为目标数值。例如,百分比差值4%转换为以一百为分母的分数后,该分数的分子的数字为4。那么,目标数值可以确定为4。After that, the terminal may convert the percentage difference between the preset matching threshold and the target matching degree into a score with a hundred as the denominator, and determine the number of the numerator of the score as the target value. For example, after a percentage difference of 4% is converted to a fraction with a hundred as the denominator, the numerator of the fraction is 4. Then, the target value can be determined to be 4.
之后,当终端需要获取当前用户的多段语音时,终端就可以获取数量为目标数值的多段语音。例如,目标数值为4,那么终端就可以获取当前用户的4段语音。After that, when the terminal needs to obtain the multi-segment voice of the current user, the terminal can obtain the multi-segment voice whose number is the target value. For example, if the target value is 4, then the terminal can obtain 4 segments of voice of the current user.
可以理解的是,本实施例中,终端可以根据预设匹配阈值与目标匹配度之间的差值,来确定具体获取几段语音。在预设匹配阈值与目标匹配度之间的差值较小,即当前用户的声纹特征变化较小的情况下,终端可以获取较少段的语音。而在预设匹配阈值与目标匹配度之间的差值较大,即当前用户的声纹特征变化较大的情况下,终端可以获取较多段的语音,从而使得终端可以更加准确地根据用户的声纹变化来调整预设匹配阈值的数值,同时也增加了预设匹配阈值调整的灵活性。It can be understood that, in this embodiment, the terminal may determine specifically how many pieces of speech to acquire according to the difference between the preset matching threshold and the target matching degree. In the case where the difference between the preset matching threshold and the target matching degree is small, that is, the change of the voiceprint characteristics of the current user is small, the terminal can obtain fewer segments of speech. However, when the difference between the preset matching threshold and the target matching degree is large, that is, when the voiceprint feature of the current user changes greatly, the terminal can obtain more voices, so that the terminal can more accurately The voiceprint changes to adjust the value of the preset matching threshold, and also increases the flexibility of adjusting the preset matching threshold.
在一些实施方式中,当需要获取当前用户的多段语音并根据这多段语音来调整预设匹配阈值的数值时,终端也可以通过如下方式来确定具体需要获取几段语音:In some implementations, when it is necessary to obtain multiple voices of the current user and adjust the value of the preset matching threshold according to the multiple voices, the terminal may also determine the specific needs to acquire several voices by:
比如,终端可以预先设定一个预设数值,那么当终端需要获取多段语音时,终端就获取数量为该预设数值的多段语音。例如预设数值为5,那么终端就可以获取当前用户的5段语音,并从这5段语音中的每一段语音中提取当前用户的目标声纹特征信息,并将每一目标声纹特征信息与预设声纹特征信息进行匹配,从而得到5个待用匹配度。然后,终端可以计算这5个待用匹配度的平均值,并将预设匹配阈值由当前值调整为该平均值。For example, the terminal may set a preset value in advance, and when the terminal needs to obtain multiple segments of speech, the terminal obtains multiple segments of speech with the preset value. For example, if the preset value is 5, then the terminal can obtain the 5 voices of the current user, and extract the target voiceprint feature information of the current user from each of the 5 voices, and set each target voiceprint feature information. Match with the preset voiceprint feature information to get 5 standby matching degrees. Then, the terminal may calculate an average value of the five standby matching degrees, and adjust the preset matching threshold value from the current value to the average value.
在一种实施方式中,终端可以随机生成5段语音,并要求用户念出这5段语音,然后从用户念出的5段语音中的每一段语音中提取当前用户的目标声纹特征信息,并将每一目标声纹特征信息与预设声纹特征信息进行匹配,从而得到5个待用匹配度。然后,终端可以计算这5个待用匹配度的平均值,并将预设匹配阈值由当前值调整为该平均值。在一种实施方式中,终端也可以在与用户进行语音交互时,获取所需的5段用户语音,并执行后续的相关流程。In one embodiment, the terminal may randomly generate 5 segments of speech, and ask the user to pronounce these 5 segments of speech, and then extract target voiceprint feature information of the current user from each of the 5 segments of speech spoken by the user, Each target voiceprint feature information is matched with preset voiceprint feature information to obtain 5 standby matching degrees. Then, the terminal may calculate an average value of the five standby matching degrees, and adjust the preset matching threshold value from the current value to the average value. In one embodiment, the terminal may also obtain the required 5 segments of user voice when performing voice interaction with the user, and execute subsequent related processes.
或者,终端可以先对当前用户发出的语音进行解析,若该语音对应的控制指令包含两个以上的备选操作,那么终端也可以根据备选操作的数量来确定具体需要获取几段语音。例如,用户发出的语音为“打开音乐应用”,那么如果终端检测到本终端中包含3款音乐应用,例如分别为A应用、B应用和C应用,那么终端可以确定出需要获取3段语音。然后,终端可以依次生成3段用于询问用户是否打开某一应用的语音信息。当接收到当前用户的回答语音时,终端可以从中提取当前用户的声纹特征信息。Alternatively, the terminal may first analyze the voice issued by the current user. If the control instruction corresponding to the voice includes two or more alternative operations, the terminal may also determine the specific needs to obtain several segments of voice according to the number of alternative operations. For example, if the user's voice is "Open Music Application", then if the terminal detects that the terminal contains 3 music applications, such as A application, B application, and C application, then the terminal can determine that it needs to obtain 3 segments of voice. Then, the terminal can sequentially generate 3 pieces of voice information for asking the user whether to open an application. When receiving the answer voice of the current user, the terminal may extract the voiceprint feature information of the current user from it.
请参阅图3至图5,图3至图5为本申请实施例提供的匹配阈值的调整方法的场景示意图。Please refer to FIG. 3 to FIG. 5, which are schematic diagrams of a method for adjusting a matching threshold provided by an embodiment of the present application.
在日常生活中,用户的声纹特征并不是一成不变的。在一些情况下,用户的声纹特征会发生一些变化。比如,当用户咽喉发炎时用户的声音会变得沙哑,或者当用户边吃食物边说话时用户的声音也会发生轻微变化。In daily life, the user's voiceprint characteristics are not static. In some cases, there are some changes in the user's voiceprint characteristics. For example, when the user's throat is inflamed, the user's voice will become hoarse, or when the user is eating and talking, the user's voice will also change slightly.
例如,本实施例中当前用户因咽喉发炎引起声音沙哑,导致当前用户的声纹特征发生变化。在这种情况下,比如此时当前用户需要使用终端的语音助手功能。例如,当前用户对着终端发出如下语音“小欧,请打开音乐应用”,如图3所示。For example, in this embodiment, the current user's voice is hoarse due to throat inflammation, which causes the voiceprint characteristics of the current user to change. In this case, for example, the current user needs to use the terminal's voice assistant function at this time. For example, the current user sends the following voice "Xiaoou, please open the music application" to the terminal, as shown in Figure 3.
终端在接收到上述语音“小欧,请打开音乐应用”后,可以从该语音中提取当前用户的声纹特征信息,然后将该当前用户的声纹特征信息与预设声纹特征信息进行匹配,并得到相应的目标匹配度。其中,预设声纹特征信息为本终端机主的声纹特征信息。例如,由于当前用户的声音沙哑,导致当前用户的声纹特征信息与预设声纹特征信息的目标匹配度仅有86%,未达到预设匹配阈值90%,即目前当前用户的声纹识别不通过。After receiving the voice "Xiaoou, please open the music application", the terminal can extract the voiceprint feature information of the current user from the voice, and then match the voiceprint feature information of the current user with the preset voiceprint feature information. And get the corresponding target match. The preset voiceprint feature information is the voiceprint feature information of the owner of the terminal. For example, due to the hoarse voice of the current user, the target matching degree between the voiceprint feature information of the current user and the preset voiceprint feature information is only 86%, and the preset matching threshold is not reached 90%, that is, the voiceprint recognition of the current user Fail.
之后,终端可以计算目标匹配度与预设匹配阈值之间的差值。例如,目标匹配度与预设匹配阈值之间的差值为4%(90%-86%)。After that, the terminal may calculate a difference between the target matching degree and a preset matching threshold. For example, the difference between the target matching degree and the preset matching threshold is 4% (90% -86%).
在得到目标匹配度与预设匹配阈值的差值后,终端可以检测该差值是否小于或等于预设差值。例如,预设差值为5%。因此,本实施例中,终端可以检测到目标匹配度与预设匹配阈值的差值4%小于预设差值5%。在这种情况下,终端可以确定目标匹配度与预设匹配阈值满足预设条件,即可以认为当前用户的声纹特征信息与预设声纹特征信息接近于匹配。然后,终端可以对当前用户进行身份验证。例如,终端可以通过指纹识别、人脸识别以及虹膜识别中的一种或多种方式对当前用户进行身份验证,以验证当前用户是否为终端机主。After obtaining the difference between the target matching degree and the preset matching threshold, the terminal can detect whether the difference is less than or equal to the preset difference. For example, the preset difference is 5%. Therefore, in this embodiment, the terminal can detect that the difference between the target matching degree and the preset matching threshold 4% is less than the preset difference 5%. In this case, the terminal may determine that the target matching degree and the preset matching threshold satisfy a preset condition, that is, the terminal may consider that the voiceprint feature information of the current user and the preset voiceprint feature information are close to match. The terminal can then authenticate the current user. For example, the terminal may authenticate the current user in one or more of fingerprint recognition, face recognition, and iris recognition to verify whether the current user is the terminal owner.
例如,本实施例,当前用户的身份验证通过,即终端可以确认当前用户为终端机主。在这种情况下,终端可以获取当前用户的多段语音,并从每段语音中提取目标声纹特征信息。例如,终端在接收到用户发出的语音“小欧,请打开音乐应用”后,对该语音进行了解析,并检测到终端中包含两款音乐应用,分别为A应用和B应用。此时,终端可以向用户发出如下语音“当前终端中包含A音乐应用和B音乐应用,请问您现在需要打开哪个应用”,如图4所示。For example, in this embodiment, the identity verification of the current user passes, that is, the terminal can confirm that the current user is the terminal owner. In this case, the terminal can obtain multiple voices of the current user and extract target voiceprint feature information from each voice. For example, after receiving the voice “Xiaoou, please open the music application” from the user, the terminal parses the voice and detects that the terminal contains two music applications, namely, the A application and the B application. At this point, the terminal can issue the following voice to the user: "The current terminal contains A music application and B music application, may I ask which application you need to open now", as shown in Figure 4.
例如,当前用户在听到终端发出的语音后,回答如下语音“打开A应用”,如图5所示。在接收到用户的回答并确定当前用户为机主的情况下,终端可以打开A应用。For example, after hearing the voice from the terminal, the current user answers the following voice "Open A application", as shown in FIG. 5. After receiving the user's response and determining that the current user is the owner, the terminal can open the A application.
之后,用户可以根据需要继续使用语音助手来控制终端完成相应的操作。终端在这一过程中,可以继续获取到当前用户的多段语音。然后,终端可以从中选取数量为预设数值的多段语音。例如,预设数值为5,那么终端可以获取当前用户的5段语音,并从每段语音中提取当前用户的目标声纹特征信息。然后,终端可以将各目标声纹特征信息与预设声纹特征信息进行匹配,从而得到相应的匹配度。即,终端可以得到多个匹配度,并将这多个匹配度确定为待用匹配度。之后,终端可以计算这多个待用匹配度的平均值,并将预设匹配阈值的数值由当前的目标值调整为该平均值。After that, the user can continue to use the voice assistant to control the terminal to complete corresponding operations as needed. In this process, the terminal can continue to obtain the multi-segment voice of the current user. Then, the terminal may select a plurality of pieces of speech with a preset number from the terminal. For example, if the preset value is 5, then the terminal can obtain 5 segments of voice of the current user and extract target voiceprint feature information of the current user from each segment of the voice. Then, the terminal may match the target voiceprint feature information with the preset voiceprint feature information, thereby obtaining a corresponding matching degree. That is, the terminal may obtain multiple matching degrees and determine the multiple matching degrees as the standby matching degrees. After that, the terminal may calculate an average value of the plurality of standby matching degrees, and adjust the value of the preset matching threshold value from the current target value to the average value.
例如,在当前用户使用语音助手控制终端播放音乐的过程中,终端获取到5段语音,并从中提取了5个目标声纹特征信息。这5个目标声纹特征信息与预设声纹特征信息的匹 配度依次为88%、87%、85%、85%、87%。之后,终端可以计算这5个匹配度的平均值,例如平均值为86.4%。在计算得到平均值之后,终端可以将预设匹配阈值的数值由当前的90%调整为86.4%。For example, when the current user uses a voice assistant to control the terminal to play music, the terminal obtains 5 segments of voice and extracts 5 target voiceprint feature information from the terminal. The matching degree between the five target voiceprint feature information and the preset voiceprint feature information is 88%, 87%, 85%, 85%, 87% in order. After that, the terminal can calculate the average of the five matching degrees, for example, the average is 86.4%. After calculating the average value, the terminal may adjust the value of the preset matching threshold from the current 90% to 86.4%.
然后,终端可以设置一个有效期。例如,终端设置得有效期为5天,那么在接下来的5天中,终端可以按照86.4%的匹配阈值来对当前用户进行声纹是被,而在5天之后,终端可以将预设匹配阈值的数值由86.4%恢复为90%。The terminal can then set a validity period. For example, if the terminal is set to be valid for 5 days, then in the next 5 days, the terminal can perform voiceprint quilts on the current user according to a matching threshold of 86.4%, and after 5 days, the terminal can set a preset matching threshold The value was restored from 86.4% to 90%.
可以理解的是,一般的,在用户因咽喉发炎引起声音沙哑导致声纹发生变化的情况下,用户的咽喉发炎在经过一段时间后就会痊愈,因此终端可以设置一个有效期,该有效期可以对应于用户咽喉发炎痊愈所需的时间。在该有效期到期时,终端可以将预设匹配阈值的数值恢复。It can be understood that, in general, in the case that the user's throat sound changes due to throat inflammation caused by the inflammation of the throat, the user's throat inflammation will heal after a period of time, so the terminal can set a validity period, which can correspond to The time it takes for the user's throat to heal. When the validity period expires, the terminal may restore the value of the preset matching threshold.
请参阅图6,图6为本申请实施例提供的匹配阈值的调整装置的结构示意图。匹配阈值的调整装置300可以包括:获取模块301,匹配模块302,验证模块303,以及调整模块304。Please refer to FIG. 6, which is a schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application. The matching threshold adjustment device 300 may include: an obtaining module 301, a matching module 302, a verification module 303, and an adjustment module 304.
获取模块301,用于获取当前用户的声纹特征信息。The obtaining module 301 is configured to obtain voiceprint feature information of a current user.
匹配模块302,用于将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度。The matching module 302 is configured to match the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree.
验证模块303,用于若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证。The verification module 303 is configured to perform identity verification on the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold.
调整模块304,用于在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。The adjustment module 304 is configured to adjust the value of the preset matching threshold after the identity verification is passed.
在一种实施方式中,所述验证模块303,可以用于:In one embodiment, the verification module 303 may be configured to:
若所述目标匹配度小于所述预设匹配阈值,则获取所述预设匹配阈值与所述目标匹配度的差值;If the target matching degree is less than the preset matching threshold, obtaining a difference between the preset matching threshold and the target matching degree;
若检测到所述差值小于或等于预设差值,则确定所述目标匹配度和所述预设匹配阈值之间满足预设条件,并对所述当前用户进行身份验证。If it is detected that the difference is less than or equal to a preset difference, it is determined that a preset condition is satisfied between the target matching degree and the preset matching threshold, and the current user is authenticated.
在一种实施方式中,所述调整模块304,可以用于:In an implementation manner, the adjustment module 304 may be configured to:
获取所述当前用户的多段语音,并从每段语音中提取目标声纹特征信息;Acquiring a plurality of voices of the current user, and extracting target voiceprint feature information from each voice;
将所有所述目标声纹特征信息与所述预设声纹特征信息进行逐一匹配,并得到多个待用匹配度;Matching all the target voiceprint feature information with the preset voiceprint feature information one by one, and obtaining multiple standby matching degrees;
计算所述多个待用匹配度的平均值;Calculating an average value of the plurality of standby matching degrees;
将所述预设匹配阈值的数值由当前的目标值调整为所述平均值。Adjusting the value of the preset matching threshold from the current target value to the average value.
在一种实施方式中,所述调整模块还用于:In one embodiment, the adjustment module is further configured to:
设定预设数值;Set a preset value;
获取数量为所述预设数值的多段语音。Acquire multiple pieces of speech with the preset value.
在一种实施方式中,所述验证模块303可以用于:In an implementation manner, the verification module 303 may be configured to:
通过指纹识别或者人脸识别或者虹膜识别对所述当前用户进行身份验证。Performing identity verification on the current user through fingerprint recognition, face recognition or iris recognition.
请一并参阅图7,图7为本申请实施例提供的匹配阈值的调整装置的另一结构示意图。在一实施例中,匹配阈值的调整装置300还可以包括:恢复模块305,和确定模块306。Please refer to FIG. 7 together. FIG. 7 is another schematic structural diagram of a matching threshold adjustment device according to an embodiment of the present application. In an embodiment, the matching threshold adjustment device 300 may further include a restoration module 305 and a determination module 306.
恢复模块305,用于:设置一有效期,当所述有效期结束时,将所述预设匹配阈值的数值由所述平均值恢复为所述目标值。The restoration module 305 is configured to set a validity period, and when the validity period ends, restore the value of the preset matching threshold value from the average value to the target value.
确定模块306,用于:计算所述预设匹配阈值与所述目标匹配度的百分比差值;将所述百分比差值转换为以一百为分母的分数;将所述分数的分子的数字确定为目标数值。A determining module 306, configured to: calculate a percentage difference between the preset matching threshold and the target matching degree; convert the percentage difference into a fraction with a hundred denominator; determine the number of the numerator of the fraction Is the target value.
那么,所述调整模块304可以用于:获取数量为所述目标数值的多段语音。Then, the adjustment module 304 may be configured to: obtain a plurality of pieces of speech whose quantity is the target value.
本申请实施例提供一种计算机可读的存储介质,其上存储有计算机程序,当所述计算机程序在计算机上执行时,使得所述计算机执行如本实施例提供的匹配阈值的调整方法中 的步骤。An embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed on a computer, the computer is caused to execute the method in the method for adjusting a matching threshold provided in this embodiment. step.
本申请实施例还提供一种电子设备,包括存储器,处理器,所述处理器通过调用所述存储器中存储的计算机程序,用于执行本实施例提供的匹配阈值的调整方法中的步骤。An embodiment of the present application further provides an electronic device including a memory and a processor, where the processor executes steps in the method for adjusting a matching threshold provided by the embodiment by calling a computer program stored in the memory.
例如,上述电子设备可以是诸如平板电脑或者智能手机等移动终端。请参阅图8,图8为本申请实施例提供的移动终端的结构示意图。For example, the above electronic device may be a mobile terminal such as a tablet computer or a smart phone. Please refer to FIG. 8, which is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
该移动终端400可以包括麦克风401、存储器402、处理器403等部件。本领域技术人员可以理解,图8中示出的移动终端结构并不构成对移动终端的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。The mobile terminal 400 may include components such as a microphone 401, a memory 402, and a processor 403. Those skilled in the art can understand that the structure of the mobile terminal shown in FIG. 8 does not constitute a limitation on the mobile terminal, and may include more or fewer components than those shown in the figure, or some components may be combined, or different components may be arranged.
麦克风401可以用于拾取用户发出的语音等。The microphone 401 may be used to pick up a voice and the like emitted by a user.
存储器402可用于存储应用程序和数据。存储器402存储的应用程序中包含有可执行代码。应用程序可以组成各种功能模块。处理器403通过运行存储在存储器402的应用程序,从而执行各种功能应用以及数据处理。The memory 402 may be used to store application programs and data. The application program stored in the memory 402 contains executable code. Applications can be composed of various functional modules. The processor 403 executes various functional applications and data processing by running application programs stored in the memory 402.
处理器403是移动终端的控制中心,利用各种接口和线路连接整个移动终端的各个部分,通过运行或执行存储在存储器402内的应用程序,以及调用存储在存储器402内的数据,执行移动终端的各种功能和处理数据,从而对移动终端进行整体监控。The processor 403 is the control center of the mobile terminal. It uses various interfaces and lines to connect various parts of the entire mobile terminal, and executes the mobile terminal by running or executing application programs stored in the memory 402, and calling data stored in the memory 402. Various functions and processing data to monitor the mobile terminal as a whole.
在本实施例中,移动终端中的处理器403会按照如下的指令,将一个或一个以上的应用程序的进程对应的可执行代码加载到存储器402中,并由处理器403来运行存储在存储器402中的应用程序,从而实现步骤:In this embodiment, the processor 403 in the mobile terminal loads the executable code corresponding to the process of one or more applications into the memory 402 according to the following instructions, and the processor 403 runs and stores the memory in the memory. 402 application to achieve the steps:
获取当前用户的声纹特征信息;Obtain the voiceprint feature information of the current user;
将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;Matching the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;
若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;If a preset condition is satisfied between the target matching degree and a preset matching threshold, performing identity verification on the current user;
在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。After the identity verification is passed, the value of the preset matching threshold is adjusted.
请参阅图9,移动终端500可以包括麦克风501、存储器502、处理器503、输入单元504、输出单元505、扬声器506等部件。Referring to FIG. 9, the mobile terminal 500 may include a microphone 501, a memory 502, a processor 503, an input unit 504, an output unit 505, a speaker 506, and other components.
麦克风501可以用于拾取用户发出的语音等。The microphone 501 may be used to pick up a voice and the like emitted by a user.
存储器502可用于存储应用程序和数据。存储器502存储的应用程序中包含有可执行代码。应用程序可以组成各种功能模块。处理器503通过运行存储在存储器502的应用程序,从而执行各种功能应用以及数据处理。The memory 502 may be used to store application programs and data. The application program stored in the memory 502 contains executable code. Applications can be composed of various functional modules. The processor 503 executes various functional applications and data processing by running an application program stored in the memory 502.
处理器503是移动终端的控制中心,利用各种接口和线路连接整个移动终端的各个部分,通过运行或执行存储在存储器502内的应用程序,以及调用存储在存储器502内的数据,执行移动终端的各种功能和处理数据,从而对移动终端进行整体监控。The processor 503 is the control center of the mobile terminal. It uses various interfaces and lines to connect various parts of the entire mobile terminal, and executes the mobile terminal by running or executing application programs stored in the memory 502, and calling data stored in the memory 502. Various functions and processing data to monitor the mobile terminal as a whole.
输入单元504可用于接收输入的数字、字符信息或用户特征信息(比如指纹),以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。The input unit 504 can be used to receive inputted numbers, character information, or user characteristic information (such as fingerprints), and generate keyboard, mouse, joystick, optical, or trackball signal inputs related to user settings and function control.
输出单元505可用于显示由用户输入的信息或提供给用户的信息以及移动终端的各种图形用户接口,这些图形用户接口可以由图形、文本、图标、视频和其任意组合来构成。输出单元可包括显示面板。The output unit 505 may be used to display information input by the user or information provided to the user and various graphical user interfaces of the mobile terminal. These graphical user interfaces may be composed of graphics, text, icons, videos, and any combination thereof. The output unit may include a display panel.
在本实施例中,移动终端中的处理器503会按照如下的指令,将一个或一个以上的应用程序的进程对应的可执行代码加载到存储器502中,并由处理器503来运行存储在存储器502中的应用程序,从而实现步骤:In this embodiment, the processor 503 in the mobile terminal loads the executable code corresponding to the process of one or more applications into the memory 502 according to the following instructions, and the processor 503 runs the stored code in the memory 502 application to achieve the steps:
获取当前用户的声纹特征信息;Obtain the voiceprint feature information of the current user;
将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;Matching the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;
若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;If a preset condition is satisfied between the target matching degree and a preset matching threshold, performing identity verification on the current user;
在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。After the identity verification is passed, the value of the preset matching threshold is adjusted.
在一种实施方式中,处理器503执行所述若所述目标匹配度和预设匹配阈值之间满足预设条件则对所述当前用户进行身份验证的步骤时,可以执行:若所述目标匹配度小于所述预设匹配阈值,则获取所述预设匹配阈值与所述目标匹配度的差值;若检测到所述差值小于或等于预设差值,则确定所述目标匹配度和所述预设匹配阈值之间满足预设条件,并对所述当前用户进行身份验证。In one embodiment, when the processor 503 executes the step of authenticating the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold, the processor 503 may perform: if the target If the degree of matching is less than the preset matching threshold, a difference between the preset matching threshold and the target matching degree is obtained; if it is detected that the difference is less than or equal to the preset difference, determining the target matching degree And a preset condition is met between the preset matching threshold and the current user is authenticated.
在一种实施方式中,处理器503执行所述对所述预设匹配阈值的数值进行调整的步骤时,可以执行:获取所述当前用户的多段语音,并从每段语音中提取目标声纹特征信息;将所有所述目标声纹特征信息与所述预设声纹特征信息进行逐一匹配,并得到多个待用匹配度;计算所述多个待用匹配度的平均值;将所述预设匹配阈值的数值由当前的目标值调整为所述平均值。In an implementation manner, when the processor 503 executes the step of adjusting the value of the preset matching threshold, the processor 503 may perform: obtaining multiple voices of the current user, and extracting a target voiceprint from each voice Feature information; matching all of the target voiceprint feature information with the preset voiceprint feature information one by one, and obtaining multiple standby matching degrees; calculating an average of the multiple standby matching degrees; The value of the preset matching threshold is adjusted from the current target value to the average value.
在一种实施方式中,在将所述预设匹配阈值的数值由当前的目标值调整为所述平均值的步骤之后,处理器503还可以执行:设置一有效期;当所述有效期结束时,将所述预设匹配阈值的数值由所述平均值恢复为所述目标值。处理器503执行In an implementation manner, after the step of adjusting the value of the preset matching threshold value from the current target value to the average value, the processor 503 may further perform: setting a validity period; when the validity period ends, Restoring the value of the preset matching threshold from the average value to the target value. The processor 503 executes
在一种实施方式中,在获取所述当前用户的多段语音的步骤之前,处理器503还可以执行:计算所述预设匹配阈值与所述目标匹配度的百分比差值;将所述百分比差值转换为以一百为分母的分数;将所述分数的分子的数字确定为目标数值;In an embodiment, before the step of obtaining the multi-segment voice of the current user, the processor 503 may further perform: calculating a percentage difference between the preset matching threshold and the target matching degree; and dividing the percentage difference Converting the value to a fraction with a hundred as the denominator; determining the number of the numerator of the fraction as the target value;
那么,处理器503执行获取所述当前用户的多段语音的步骤时,可以执行:获取数量为所述目标数值的多段语音。Then, when the processor 503 executes the step of acquiring the multi-segment voice of the current user, it may execute: acquiring the multi-segment voice whose quantity is the target value.
在一种实施方式中,在获取所述当前用户的多段语音之前,处理器503还可以执行:设定预设数值。In an implementation manner, before acquiring the multi-segment voice of the current user, the processor 503 may further perform: setting a preset value.
那么,处理器503执行获取所述当前用户的多段语音时,可以执行:获取数量为所述预设数值的多段语音。Then, when the processor 503 executes acquiring the multi-segment voice of the current user, it may execute: acquiring the multi-segment voice whose quantity is the preset value.
在一种实施方式中,处理器503在执行对所述当前用户进行身份验证时,可以执行:通过指纹识别或者人脸识别或者虹膜识别对所述当前用户进行身份验证。In an embodiment, when the processor 503 performs identity verification on the current user, the processor 503 may execute: identity verification on the current user through fingerprint recognition, face recognition, or iris recognition.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见上文针对匹配阈值的调整方法的详细描述,此处不再赘述。In the above embodiments, the description of each embodiment has its own emphasis. For the parts that are not described in detail in an embodiment, please refer to the detailed description of the adjustment method for the matching threshold above, which will not be repeated here.
本申请实施例提供的所述匹配阈值的调整装置与上文实施例中的匹配阈值的调整方法属于同一构思,在所述匹配阈值的调整装置上可以运行所述匹配阈值的调整方法实施例中提供的任一方法,其具体实现过程详见所述匹配阈值的调整方法实施例,此处不再赘述。The apparatus for adjusting the matching threshold provided in the embodiment of the present application and the method for adjusting the matching threshold in the foregoing embodiment belong to the same concept. The apparatus for adjusting the matching threshold may be run in the embodiment of the method for adjusting the matching threshold. For the specific implementation process of any method provided, see the embodiment of the method for adjusting the matching threshold in detail, and details are not described herein again.
需要说明的是,对本申请实施例所述匹配阈值的调整方法而言,本领域普通技术人员可以理解实现本申请实施例所述匹配阈值的调整方法的全部或部分流程,是可以通过计算机程序来控制相关的硬件来完成,所述计算机程序可存储于一计算机可读取存储介质中,如存储在存储器中,并被至少一个处理器执行,在执行过程中可包括如所述匹配阈值的调整方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储器(ROM,Read Only Memory)、随机存取记忆体(RAM,Random Access Memory)等。It should be noted that, for the method for adjusting the matching threshold described in the embodiments of the present application, those skilled in the art can understand that all or part of the process for implementing the method for adjusting the matching threshold described in the embodiments of the present application can be implemented by a computer program. Controlling related hardware to complete, the computer program may be stored in a computer-readable storage medium, such as stored in the memory, and executed by at least one processor, and may include adjustment of the matching threshold during the execution process The flow of an embodiment of the method. The storage medium may be a magnetic disk, an optical disk, a read only memory (ROM, Read Only Memory), a random access memory (RAM, Random Access Memory), and the like.
对本申请实施例的所述匹配阈值的调整装置而言,其各功能模块可以集成在一个处理芯片中,也可以是各个模块单独物理存在,也可以两个或两个以上模块集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中,所述存储介质譬如为只读存储器,磁盘或光盘等。For the device for adjusting the matching threshold according to the embodiment of the present application, its functional modules may be integrated into one processing chip, or each module may exist separately physically, or two or more modules may be integrated into one module. . The above integrated modules can be implemented in the form of hardware or software functional modules. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium, such as a read-only memory, a magnetic disk, or an optical disk. .
以上对本申请实施例所提供的一种匹配阈值的调整方法、装置、存储介质以及电子设 备进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The method, device, storage medium and electronic device for adjusting the matching threshold provided in the embodiments of the present application have been described in detail above. The specific examples are used in this document to explain the principle and implementation of the present application. The description is only used to help understand the method of the application and its core ideas; at the same time, for those skilled in the art, according to the ideas of the application, there will be changes in the specific implementation and application scope. In summary, The contents of this description should not be construed as limiting the present application.
Claims (20)
- 一种匹配阈值的调整方法,其中,包括:A method for adjusting a matching threshold, including:获取当前用户的声纹特征信息;Obtain the voiceprint feature information of the current user;将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;Matching the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;If a preset condition is satisfied between the target matching degree and a preset matching threshold, performing identity verification on the current user;在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。After the identity verification is passed, the value of the preset matching threshold is adjusted.
- 根据权利要求1所述的匹配阈值的调整方法,其中,若所述目标匹配度和预设匹配阈值之间满足预设条件则对所述当前用户进行身份验证,包括:The method for adjusting a matching threshold according to claim 1, wherein, if the preset condition is satisfied between the target matching degree and the preset matching threshold, authenticating the current user comprises:若所述目标匹配度小于所述预设匹配阈值,则获取所述预设匹配阈值与所述目标匹配度的差值;If the target matching degree is less than the preset matching threshold, obtaining a difference between the preset matching threshold and the target matching degree;若检测到所述差值小于或等于预设差值,则确定所述目标匹配度和所述预设匹配阈值之间满足预设条件,并对所述当前用户进行身份验证。If it is detected that the difference is less than or equal to a preset difference, it is determined that a preset condition is satisfied between the target matching degree and the preset matching threshold, and the current user is authenticated.
- 根据权利要求1所述的匹配阈值的调整方法,其中,对所述预设匹配阈值的数值进行调整,包括:The method for adjusting a matching threshold according to claim 1, wherein adjusting the value of the preset matching threshold comprises:获取所述当前用户的多段语音,并从每段语音中提取目标声纹特征信息;Acquiring a plurality of voices of the current user, and extracting target voiceprint feature information from each voice;将所有所述目标声纹特征信息与所述预设声纹特征信息进行逐一匹配,并得到多个待用匹配度;Matching all the target voiceprint feature information with the preset voiceprint feature information one by one, and obtaining multiple standby matching degrees;计算所述多个待用匹配度的平均值;Calculating an average value of the plurality of standby matching degrees;将所述预设匹配阈值的数值由当前的目标值调整为所述平均值。Adjusting the value of the preset matching threshold from the current target value to the average value.
- 根据权利要求3所述的匹配阈值的调整方法,其中,在将所述预设匹配阈值的数值由当前的目标值调整为所述平均值之后,还包括:The method for adjusting a matching threshold according to claim 3, wherein after adjusting a value of the preset matching threshold from a current target value to the average value, further comprising:设置一有效期;Set a validity period;当所述有效期结束时,将所述预设匹配阈值的数值由所述平均值恢复为所述目标值。When the validity period ends, the value of the preset matching threshold is restored from the average value to the target value.
- 根据权利要求3所述的匹配阈值的调整方法,其中,在获取所述当前用户的多段语音之前,还包括:The method for adjusting a matching threshold according to claim 3, wherein before acquiring the multi-segment voice of the current user, further comprising:计算所述预设匹配阈值与所述目标匹配度的百分比差值;Calculating a percentage difference between the preset matching threshold and the target matching degree;将所述百分比差值转换为以一百为分母的分数;Converting the percentage difference into a fraction with a hundred denominator;将所述分数的分子的数字确定为目标数值;Determining the number of the numerator of the score as the target value;获取所述当前用户的多段语音,包括:获取数量为所述目标数值的多段语音。Acquiring the multi-segment speech of the current user includes: acquiring a multi-segment speech whose number is the target value.
- 根据权利要求3所述的匹配阈值的调整方法,其中,在获取所述当前用户的多段语音之前,还包括:设定预设数值;The method for adjusting the matching threshold according to claim 3, wherein before acquiring the multi-segment voice of the current user, further comprising: setting a preset value;获取所述当前用户的多段语音,包括:获取数量为所述预设数值的多段语音。Acquiring the multi-segment speech of the current user includes: acquiring a plurality of multi-segment speeches with the preset value.
- 根据权利要求1所述的匹配阈值的调整方法,其中,对所述当前用户进行身份验证,包括:The method for adjusting a matching threshold according to claim 1, wherein the authenticating the current user comprises:通过指纹识别或者人脸识别或者虹膜识别对所述当前用户进行身份验证。Performing identity verification on the current user through fingerprint recognition, face recognition or iris recognition.
- 一种匹配阈值的调整装置,其中,包括:A device for adjusting a matching threshold, including:获取模块,用于获取当前用户的声纹特征信息;An acquisition module, configured to acquire voiceprint feature information of the current user;匹配模块,用于将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;A matching module, configured to match the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;验证模块,用于若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;A verification module, configured to perform identity verification on the current user if a preset condition is satisfied between the target matching degree and a preset matching threshold;调整模块,用于在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。An adjustment module is configured to adjust the value of the preset matching threshold after the identity verification is passed.
- 根据权利要求8所述的匹配阈值的调整装置,其中,所述验证模块,用于:The apparatus for adjusting a matching threshold according to claim 8, wherein the verification module is configured to:若所述目标匹配度小于所述预设匹配阈值,则获取所述预设匹配阈值与所述目标匹配度的差值;If the target matching degree is less than the preset matching threshold, obtaining a difference between the preset matching threshold and the target matching degree;若检测到所述差值小于或等于预设差值,则确定所述目标匹配度和所述预设匹配阈值之间满足预设条件,并对所述当前用户进行身份验证。If it is detected that the difference is less than or equal to a preset difference, it is determined that a preset condition is satisfied between the target matching degree and the preset matching threshold, and the current user is authenticated.
- 根据权利要求9所述的匹配阈值的调整装置,其中,所述调整模块,用于:The apparatus for adjusting a matching threshold according to claim 9, wherein the adjustment module is configured to:获取所述当前用户的多段语音,并从每段语音中提取目标声纹特征信息;Acquiring a plurality of voices of the current user, and extracting target voiceprint feature information from each voice;将所有所述目标声纹特征信息与所述预设声纹特征信息进行逐一匹配,并 得到多个待用匹配度;Matching all the target voiceprint feature information with the preset voiceprint feature information one by one, and obtaining multiple standby matching degrees;计算所述多个待用匹配度的平均值;Calculating an average value of the plurality of standby matching degrees;将所述预设匹配阈值的数值由当前的目标值调整为所述平均值。Adjusting the value of the preset matching threshold from the current target value to the average value.
- 根据权利要求10所述的匹配阈值的调整装置,其中,所述装置还包括恢复模块,用于:The apparatus for adjusting a matching threshold according to claim 10, wherein the apparatus further comprises a recovery module, configured to:设置一有效期;Set a validity period;当所述有效期结束时,将所述预设匹配阈值的数值由所述平均值恢复为所述目标值。When the validity period ends, the value of the preset matching threshold is restored from the average value to the target value.
- 根据权利要求10所述的匹配阈值的调整装置,其中,所述装置还包括确定模块,用于:计算所述预设匹配阈值与所述目标匹配度的百分比差值;将所述百分比差值转换为以一百为分母的分数;将所述分数的分子的数字确定为目标数值;The device for adjusting a matching threshold according to claim 10, wherein the device further comprises a determining module, configured to: calculate a percentage difference between the preset matching threshold and the target matching degree; and set the percentage difference Convert to a fraction with one hundred as the denominator; determine the number of the numerator of the fraction as the target value;所述调整模块,用于获取数量为所述目标数值的多段语音。The adjustment module is configured to obtain a plurality of pieces of speech whose quantity is the target value.
- 一种存储介质,其上存储有计算机程序,其中,当所述计算机程序在计算机上执行时,使得所述计算机执行如权利要求1至7中任一项所述的方法。A storage medium having stored thereon a computer program, wherein when the computer program is executed on a computer, the computer is caused to perform the method according to any one of claims 1 to 7.
- 一种电子设备,包括存储器,处理器,其中,所述处理器通过调用所述存储器中存储的计算机程序,用于执行:An electronic device includes a memory and a processor, wherein the processor is configured to execute a computer program by calling a computer program stored in the memory:获取当前用户的声纹特征信息;Obtain the voiceprint feature information of the current user;将所述当前用户的声纹特征信息与预设声纹特征信息进行匹配,得到相应的目标匹配度;Matching the voiceprint feature information of the current user with preset voiceprint feature information to obtain a corresponding target matching degree;若所述目标匹配度和预设匹配阈值之间满足预设条件,则对所述当前用户进行身份验证;If a preset condition is satisfied between the target matching degree and a preset matching threshold, performing identity verification on the current user;在所述身份验证通过后,对所述预设匹配阈值的数值进行调整。After the identity verification is passed, the value of the preset matching threshold is adjusted.
- 根据权利要求14所述的电子设备,其中,所述处理器用于执行:The electronic device according to claim 14, wherein the processor is configured to perform:若所述目标匹配度小于所述预设匹配阈值,则获取所述预设匹配阈值与所述目标匹配度的差值;If the target matching degree is less than the preset matching threshold, obtaining a difference between the preset matching threshold and the target matching degree;若检测到所述差值小于或等于预设差值,则确定所述目标匹配度和所述预设匹配阈值之间满足预设条件,并对所述当前用户进行身份验证。If it is detected that the difference is less than or equal to a preset difference, it is determined that a preset condition is satisfied between the target matching degree and the preset matching threshold, and the current user is authenticated.
- 根据权利要求14所述的电子设备,其中,所述处理器用于执行:The electronic device according to claim 14, wherein the processor is configured to perform:获取所述当前用户的多段语音,并从每段语音中提取目标声纹特征信息;Acquiring a plurality of voices of the current user, and extracting target voiceprint feature information from each voice;将所有所述目标声纹特征信息与所述预设声纹特征信息进行逐一匹配,并得到多个待用匹配度;Matching all the target voiceprint feature information with the preset voiceprint feature information one by one, and obtaining multiple standby matching degrees;计算所述多个待用匹配度的平均值;Calculating an average value of the plurality of standby matching degrees;将所述预设匹配阈值的数值由当前的目标值调整为所述平均值。Adjusting the value of the preset matching threshold from the current target value to the average value.
- 根据权利要求16所述的电子设备,其中,所述处理器用于执行:The electronic device of claim 16, wherein the processor is configured to perform:设置一有效期;Set a validity period;当所述有效期结束时,将所述预设匹配阈值的数值由所述平均值恢复为所述目标值。When the validity period ends, the value of the preset matching threshold is restored from the average value to the target value.
- 根据权利要求16所述的电子设备,其中,所述处理器用于执行:The electronic device of claim 16, wherein the processor is configured to perform:计算所述预设匹配阈值与所述目标匹配度的百分比差值;Calculating a percentage difference between the preset matching threshold and the target matching degree;将所述百分比差值转换为以一百为分母的分数;Converting the percentage difference into a fraction with a hundred denominator;将所述分数的分子的数字确定为目标数值;Determining the number of the numerator of the score as the target value;获取数量为所述目标数值的多段语音。Acquire multiple pieces of speech with the target value.
- 根据权利要求16所述的电子设备,其中,所述处理器用于执行:The electronic device of claim 16, wherein the processor is configured to perform:设定预设数值;Set a preset value;获取数量为所述预设数值的多段语音。Acquire multiple pieces of speech with the preset value.
- 根据权利要求14所述的电子设备,其中,所述处理器用于执行:The electronic device according to claim 14, wherein the processor is configured to perform:通过指纹识别或者人脸识别或者虹膜识别对所述当前用户进行身份验证。Performing identity verification on the current user through fingerprint recognition, face recognition or iris recognition.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810542931.2 | 2018-05-30 | ||
CN201810542931.2A CN108766443A (en) | 2018-05-30 | 2018-05-30 | Matching threshold adjusting method and device, storage medium and electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019228135A1 true WO2019228135A1 (en) | 2019-12-05 |
Family
ID=64004565
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/085534 WO2019228135A1 (en) | 2018-05-30 | 2019-05-05 | Method and device for adjusting matching threshold, storage medium and electronic device |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN108766443A (en) |
WO (1) | WO2019228135A1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108766443A (en) * | 2018-05-30 | 2018-11-06 | Oppo广东移动通信有限公司 | Matching threshold adjusting method and device, storage medium and electronic equipment |
CN112102821B (en) * | 2019-06-18 | 2024-01-12 | 北京京东尚科信息技术有限公司 | Data processing method, device, system and medium applied to electronic equipment |
CN111816178B (en) * | 2020-07-07 | 2024-09-06 | 云知声智能科技股份有限公司 | Control method, device and equipment of voice equipment |
CN112417412A (en) * | 2020-11-23 | 2021-02-26 | 中国农业银行股份有限公司陕西省分行 | Bank account balance inquiry method, device and system |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080312924A1 (en) * | 2007-06-13 | 2008-12-18 | At&T Corp. | System and method for tracking persons of interest via voiceprint |
CN101887722A (en) * | 2009-06-18 | 2010-11-17 | 博石金(北京)信息技术有限公司 | Rapid voiceprint authentication method |
CN102142254A (en) * | 2011-03-25 | 2011-08-03 | 北京得意音通技术有限责任公司 | Voiceprint identification and voice identification-based recording and faking resistant identity confirmation method |
CN102404287A (en) * | 2010-09-14 | 2012-04-04 | 盛乐信息技术(上海)有限公司 | Voiceprint authentication system and method for determining voiceprint authentication threshold value by data multiplexing method |
US20140348308A1 (en) * | 2013-05-22 | 2014-11-27 | Nuance Communications, Inc. | Method And System For Speaker Verification |
CN107491671A (en) * | 2016-06-13 | 2017-12-19 | 中兴通讯股份有限公司 | A kind of safe login method and device |
CN108040032A (en) * | 2017-11-02 | 2018-05-15 | 阿里巴巴集团控股有限公司 | A kind of voiceprint authentication method, account register method and device |
CN108766443A (en) * | 2018-05-30 | 2018-11-06 | Oppo广东移动通信有限公司 | Matching threshold adjusting method and device, storage medium and electronic equipment |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5320894B2 (en) * | 2008-08-06 | 2013-10-23 | 富士通株式会社 | Personal authentication device, personal authentication method, and personal authentication program |
CN103000176B (en) * | 2012-12-28 | 2014-12-10 | 安徽科大讯飞信息科技股份有限公司 | Speech recognition method and system |
CN104331651A (en) * | 2014-10-08 | 2015-02-04 | 无锡指网生物识别科技有限公司 | Fingerprint- and voice recognition-based control system and equipment |
CN105550561B (en) * | 2015-12-14 | 2019-03-15 | Oppo广东移动通信有限公司 | A kind of identification method and device of mobile terminal |
CN107146613A (en) * | 2017-04-10 | 2017-09-08 | 北京猎户星空科技有限公司 | A kind of voice interactive method and device |
CN107864121A (en) * | 2017-09-30 | 2018-03-30 | 上海壹账通金融科技有限公司 | User ID authentication method and application server |
CN107943536B (en) * | 2017-11-14 | 2020-05-12 | Oppo广东移动通信有限公司 | Method and device for adjusting threshold, storage medium and electronic equipment |
-
2018
- 2018-05-30 CN CN201810542931.2A patent/CN108766443A/en active Pending
-
2019
- 2019-05-05 WO PCT/CN2019/085534 patent/WO2019228135A1/en active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080312924A1 (en) * | 2007-06-13 | 2008-12-18 | At&T Corp. | System and method for tracking persons of interest via voiceprint |
CN101887722A (en) * | 2009-06-18 | 2010-11-17 | 博石金(北京)信息技术有限公司 | Rapid voiceprint authentication method |
CN102404287A (en) * | 2010-09-14 | 2012-04-04 | 盛乐信息技术(上海)有限公司 | Voiceprint authentication system and method for determining voiceprint authentication threshold value by data multiplexing method |
CN102142254A (en) * | 2011-03-25 | 2011-08-03 | 北京得意音通技术有限责任公司 | Voiceprint identification and voice identification-based recording and faking resistant identity confirmation method |
US20140348308A1 (en) * | 2013-05-22 | 2014-11-27 | Nuance Communications, Inc. | Method And System For Speaker Verification |
CN107491671A (en) * | 2016-06-13 | 2017-12-19 | 中兴通讯股份有限公司 | A kind of safe login method and device |
CN108040032A (en) * | 2017-11-02 | 2018-05-15 | 阿里巴巴集团控股有限公司 | A kind of voiceprint authentication method, account register method and device |
CN108766443A (en) * | 2018-05-30 | 2018-11-06 | Oppo广东移动通信有限公司 | Matching threshold adjusting method and device, storage medium and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN108766443A (en) | 2018-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US12230279B1 (en) | User authentication for voice-input devices | |
US11289072B2 (en) | Object recognition method, computer device, and computer-readable storage medium | |
CN109428719B (en) | Identity verification method, device and equipment | |
RU2697736C1 (en) | Speaker verification | |
JP6621536B2 (en) | Electronic device, identity authentication method, system, and computer-readable storage medium | |
WO2019228135A1 (en) | Method and device for adjusting matching threshold, storage medium and electronic device | |
WO2020181824A1 (en) | Voiceprint recognition method, apparatus and device, and computer-readable storage medium | |
WO2021159688A1 (en) | Voiceprint recognition method and apparatus, and storage medium and electronic apparatus | |
US11252152B2 (en) | Voiceprint security with messaging services | |
US10096321B2 (en) | Reverberation compensation for far-field speaker recognition | |
US20170236520A1 (en) | Generating Models for Text-Dependent Speaker Verification | |
WO2017012496A1 (en) | User voiceprint model construction method, apparatus, and system | |
EP3989217B1 (en) | Method for detecting an audio adversarial attack with respect to a voice input processed by an automatic speech recognition system, corresponding device, computer program product and computer-readable carrier medium | |
US9646613B2 (en) | Methods and systems for splitting a digital signal | |
CN109462482B (en) | Voiceprint recognition method, voiceprint recognition device, electronic equipment and computer readable storage medium | |
CN113330511B (en) | Speech recognition method, device, storage medium and electronic device | |
US10916249B2 (en) | Method of processing a speech signal for speaker recognition and electronic apparatus implementing same | |
US20180285068A1 (en) | Processing method of audio control and electronic device thereof | |
US12217751B2 (en) | Digital signal processor-based continued conversation | |
WO2021169711A1 (en) | Instruction execution method and apparatus, storage medium, and electronic device | |
CN110544468A (en) | Application wake-up method, device, storage medium and electronic device | |
CN108847243B (en) | Voiceprint feature updating method and device, storage medium and electronic equipment | |
US20230289420A1 (en) | Method for multifactor authentication using bone conduction and audio signals | |
WO2022017152A1 (en) | Resource transfer method and apparatus, computer device, and storage medium | |
CN117378006A (en) | Hybrid multilingual text-dependent and text-independent speaker verification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19811948 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19811948 Country of ref document: EP Kind code of ref document: A1 |