CN109935225A - Word information processing device and method, computer storage medium and mobile terminal - Google Patents
Word information processing device and method, computer storage medium and mobile terminal Download PDFInfo
- Publication number
- CN109935225A CN109935225A CN201711352721.9A CN201711352721A CN109935225A CN 109935225 A CN109935225 A CN 109935225A CN 201711352721 A CN201711352721 A CN 201711352721A CN 109935225 A CN109935225 A CN 109935225A
- Authority
- CN
- China
- Prior art keywords
- originator
- voice
- speech
- sound
- information processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Machine Translation (AREA)
Abstract
Description
技术领域technical field
本发明涉及一种数据处理技术,具体涉及一种文字信息处理装置、文字信息处理装置方法及计算机存储介质。The present invention relates to a data processing technology, in particular to a text information processing device, a text information processing device method and a computer storage medium.
背景技术Background technique
现有的社交软件,例如微信、QQ,都仅能接收文字信息或语音信息,而且需要随时查看文字信息或点击语音信息来收听才能知道信息内容。当信息发送者发送文字信息时而接收者不方便查看,例如开车的时候,可能就会错过一些重要的信息。有些软件能通过文字转语音(TTS)的方式将文字播出,但语音由本地合成,接受者需要根据语音内容先确认发送者身份再确认发送内容信息,影响信息接收的效率。Existing social software, such as WeChat and QQ, can only receive text messages or voice messages, and it is necessary to view the text messages or click on the voice messages at any time to listen to the contents of the messages. When the sender of the message sends a text message and the recipient is inconvenient to view it, such as when driving, some important information may be missed. Some software can broadcast text through text-to-speech (TTS), but the voice is synthesized locally. The receiver needs to confirm the sender's identity and then confirm the content information according to the voice content, which affects the efficiency of information reception.
发明内容SUMMARY OF THE INVENTION
鉴于上述状况,有必要提供一种可以在不方便查看信息时快速获取信息的文字信息处理装置、方法及计算机存储介质。In view of the above situation, it is necessary to provide a word information processing device, method and computer storage medium that can quickly obtain information when it is inconvenient to view the information.
一种文字信息处理方法,应用于一文字信息处理装置中,所述文字信息处理装置存储有语音合成数据库,该方法包括:接收一文字信息并记录发信者;在所述语音合成数据库中查找所述发信者的个人语音数据;根据所述发信者的个人语音数据将所述文字信息转换为语音信息;及播放所述语音信息。A text information processing method, applied in a text information processing device, wherein the text information processing device stores a speech synthesis database, the method comprises: receiving a text message and recording the sender; searching the speech synthesis database for the sender the personal voice data of the sender; converting the text information into voice information according to the personal voice data of the sender; and playing the voice information.
进一步地,所述方法还包括以下步骤:Further, the method also includes the following steps:
判断是否有所述发信者的个人语音数据;及determine whether there is personal voice data of the sender; and
记录所述发信者的个人语音数据。The personal voice data of the sender is recorded.
进一步地,所述记录所述发信者的个人语音数据包括:Further, the recording of the sender's personal voice data includes:
识别所述发信者;identify said sender;
记录一指定文字的语音信息;及record a voice message in a specified text; and
提取声音特性并存入所述发信者的个人语音数据。Voice characteristics are extracted and stored in the sender's personal voice data.
进一步地,所述提取声音特性包括:Further, the extracted sound characteristics include:
将声音特性与装置预设的默认语音的声音特性做比对;及Compare the sound characteristics with the sound characteristics of the default voice preset by the device; and
利用比对的声音特性差异,对默认语音的声音特性进行修改并生成发信者的个人语音。Using the difference of the compared voice characteristics, the voice characteristics of the default voice are modified and the sender's personal voice is generated.
进一步地,所述记录所述发信者的个人语音数据包括:Further, the recording of the sender's personal voice data includes:
记录发信者对声母、韵母及声调的读音数据;及Record the sender's reading data of initials, finals and tones; and
储存所述读音数据为所述发信者的个人语音。The pronunciation data is stored as the sender's personal voice.
一种文字信息处理装置,存储有语音合成数据库,该文字信息处理装置包括:A text information processing device storing a speech synthesis database, the text information processing device comprising:
接收模块,用以接收一文字信息并记录发信者;The receiving module is used to receive a text message and record the sender;
查找模块,用以在所述语音合成数据库中查找所述发信者的个人语音数据;a search module for searching the personal voice data of the sender in the speech synthesis database;
转换模块,用以根据所述发信者的个人语音数据将所述文字信息转换为语音信息;及a conversion module for converting the text information into voice information according to the personal voice data of the sender; and
播放模块,用以播放所述语音信息。A playing module is used to play the voice information.
进一步地,所述文字信息处理装置还包括:Further, the word information processing device also includes:
识别模块,用以识别所述发信者;an identification module for identifying the sender;
记录模块,用以记录一指定文字的语音信息;a recording module for recording the voice information of a specified text;
提取模块,用以提取声音特性并存入所述发信者的个人语音数据;及an extraction module for extracting voice characteristics and storing in the sender's personal voice data; and
处理模块,用以将声音特性与装置预设的默认语音的声音特性做比对,及利用比对的声音特性差异,对默认语音的声音特性进行修改并生成发信者的个人语音数据。The processing module is used for comparing the sound characteristics with the sound characteristics of the default voice preset by the device, and using the difference of the compared voice characteristics to modify the sound characteristics of the default voice and generate personal voice data of the sender.
进一步地,所述记录模块还用以记录发信者针对对应语言基本读音单元的读音数据;所述文字信息处理装置还包括储存模块,所述储存模块用以储存所述读音数据为对应的个人语音至所述语音合成数据库中。Further, the recording module is also used to record the pronunciation data of the sender for the basic pronunciation unit of the corresponding language; the text information processing device also includes a storage module, and the storage module is used to store the pronunciation data as a corresponding personal voice. into the speech synthesis database.
一种计算机存储介质,该计算机存储介质存储多条指令,所述多条指令适于由处理器加载并执行上述文字信息处理方法。A computer storage medium storing a plurality of instructions suitable for being loaded by a processor and executing the above word information processing method.
一种移动终端,包括:A mobile terminal, comprising:
语音合成数据库,用以储存个人语音数据;A speech synthesis database for storing personal speech data;
处理器,用以实现一条或一条以上指令;及a processor to implement one or more instructions; and
计算机存储介质,用以存储多条指令,所述多条指令适于由处理器加载并执行上述文字信息处理方法。A computer storage medium for storing a plurality of instructions suitable for being loaded by a processor and executing the above word information processing method.
上述文字信息处理装置及方法,能够在收到信息时直接播放出来供用户知道,不需要打开手机查看,就能知道信息内容,防止在不方便查看手机的时候错过信息。The above-mentioned text information processing device and method can directly play out the information for the user to know when receiving the information, and the information content can be known without opening the mobile phone to view, so as to prevent the information from being missed when it is inconvenient to view the mobile phone.
附图说明Description of drawings
图1为本发明一实施方式中文字信息处理装置与一发送终端的模块连接框图。FIG. 1 is a block diagram of a module connection between a text information processing apparatus and a sending terminal in an embodiment of the present invention.
图2为本发明一实施方式中文字信息处理方法的步骤流程图。FIG. 2 is a flow chart of steps of a text information processing method according to an embodiment of the present invention.
图3为图2文字信息处理方法的一实施方式的一记录发信者的个人语音数据的步骤流程图。FIG. 3 is a flow chart of steps for recording personal voice data of a sender according to an embodiment of the text information processing method of FIG. 2 .
图4为图3文字信息处理方法的另一实施方式的记录发信者的个人语音数据的步骤流程图。FIG. 4 is a flowchart of steps of recording personal voice data of a sender according to another embodiment of the text information processing method of FIG. 3 .
图5为图1中文字信息处理装置的模块连接框图。FIG. 5 is a block diagram of module connections of the word information processing device in FIG. 1 .
图6为图1中文字信息处理装置的内部结构连接框图。FIG. 6 is a block diagram showing the connection of the internal structure of the word information processing apparatus in FIG. 1 .
主要元件符号说明Description of main component symbols
如下具体实施方式将结合上述附图进一步说明本发明。The following specific embodiments will further illustrate the present invention in conjunction with the above drawings.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
需要说明的是,当一个元件被认为是“连接”另一个元件,它可以是直接连接到另一个元件或者可能同时存在居中设置的元件。当一个元件被认为是“设置于”另一个元件,它可以是直接设置在另一个元件上或者可能同时存在居中设置的元件。It should be noted that when an element is referred to as being "connected" to another element, it can be directly connected to the other element or intervening elements may also be present. When an element is referred to as being "disposed on" another element, it can be directly disposed on the other element or intervening elements may also be present.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。本文所使用的术语“及/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
请参阅图1,本发明提供一实施方式中的一种文字信息处理方法可将文字信息转换为个人语音信息并播出。该方法应用于一文字信息处理装置100中。本发明一实施方式中的文字信息处理系统包括一发送终端10及所述文字信息处理装置100。所述发送终端10用以向所述文字信息处理装置100发送文字信息。所述文字信息处理装置100存储有语音合成数据库31。Referring to FIG. 1 , the present invention provides a text information processing method in an embodiment that can convert text information into personal voice information and broadcast it. The method is applied in a word information processing device 100 . The text information processing system in an embodiment of the present invention includes a transmitting terminal 10 and the text information processing device 100 . The sending terminal 10 is used for sending text information to the text information processing device 100 . The text information processing device 100 stores a speech synthesis database 31 .
请同时参阅图2,该文字信息处理方法包括以下步骤:Please refer to FIG. 2 at the same time, the text information processing method includes the following steps:
步骤S201:接收一文字信息并记录发信者,所述发信者包括发信者名称及头像等;Step S201: Receive a text message and record the sender, where the sender includes the sender's name and avatar;
步骤S202:在语音合成数据库中查找所述发信者的个人语音数据;Step S202: searching the personal voice data of the sender in the speech synthesis database;
步骤S203:判断是否有所述发信者的个人语音数据,如果否,则执行步骤S204;如果是,执行步骤S205;Step S203: Determine whether there is personal voice data of the sender, if not, go to Step S204; if yes, go to Step S205;
步骤S204:记录发信者的个人语音数据,在一实施方式中,所述个人语音数据包括对应语言的基本单元读音,例如中文为21个声母、37韵母、5个声调组合成的多个读音;Step S204: record the personal voice data of the sender, in one embodiment, the personal voice data includes the basic unit pronunciation of the corresponding language, for example, Chinese is a plurality of pronunciations composed of 21 initials, 37 finals, and 5 tones;
步骤S205:根据所述发信者的个人语音数据将所述文字信息转换为语音信息;及Step S205: converting the text information into voice information according to the personal voice data of the sender; and
步骤S206:播放语音信息。Step S206: Play the voice information.
请参阅图3,具体实现中,所述步骤S204包括:Please refer to FIG. 3. In a specific implementation, the step S204 includes:
步骤S301:识别所述发信者;Step S301: Identify the sender;
步骤S302:记录一指定文字的语音信息;Step S302: record voice information of a specified text;
步骤S303:提取声音特性,所述声音特征包括音高、音色及音调等;Step S303: extracting sound characteristics, the sound characteristics include pitch, timbre and tone, etc.;
步骤S304:将声音特性与装置预设的默认语音的声音特性做比对;及Step S304: Compare the sound characteristics with the sound characteristics of the default voice preset by the device; and
步骤S305:利用比对的声音特性差异,对默认语音的声音特性进行修改并生成发信者的个人语音。Step S305: Using the difference of the compared voice characteristics, modify the voice characteristics of the default voice and generate the sender's personal voice.
请参阅图4,为了达到更好的个人语音效果,所述步骤S204还可以包括:Please refer to Fig. 4, in order to achieve better personal voice effect, the step S204 may further include:
步骤S401:记录发信者对声母、韵母及声调组合成的多个读音数据;及Step S401: record the multiple pronunciation data composed of the initial consonant, the final consonant and the tone by the sender; and
步骤S402:储存这些读音数据为对应的个人语音。Step S402: Store these pronunciation data as corresponding personal voices.
所述文字信息处理方法还包括:设置播放方式。设置播放方式包括打开或关闭自动播放语音开关及选择合成语音的对象。The text information processing method further includes: setting a playback mode. Setting the playback method includes turning on or off the auto-play voice switch and selecting the object to synthesize voice.
当打开自动播放语音开关时,才会自动播放语音信息,否则需点击语音信息才可播放。When the auto-play voice switch is turned on, the voice message will be played automatically, otherwise, you need to click the voice message to play.
所述选择合成语音的对象包括选择发信者个人语音及系统默认语音。所述系统默认语音存储在所述语音合成数据库31中,当设置以默认语音播放时,则语音合成时,仅需要调取所述默认语音。所述默认语音包括以一特定声音特性朗读的21个声母、37韵母、5个声调组合成的多个读音。当语音合成时,将与各个文字对应的读音连贯起来形成语音信息,再配以特定的语速。系统默认语音对象可以是机器语音、动画人物或名人等。The object for selecting the synthesized voice includes selecting the sender's personal voice and the system default voice. The system default voice is stored in the voice synthesis database 31, and when the default voice is set to be played, only the default voice needs to be retrieved during voice synthesis. The default speech includes a plurality of pronunciations composed of 21 initials, 37 finals, and 5 tones read with a specific sound characteristic. When the speech is synthesized, the pronunciation corresponding to each character is coherently formed into speech information, and then matched with a specific speech rate. The system default speech object can be machine speech, animated characters or celebrities, etc.
所述文字信息处理方法还包括存储文字信息及语音信息,以及显示文字信息和语音信息在聊天界面上。The text information processing method further includes storing the text information and the voice information, and displaying the text information and the voice information on the chat interface.
请同时参阅图5,所述发送终端10可为手机或平板电脑。Please also refer to FIG. 5 , the sending terminal 10 may be a mobile phone or a tablet computer.
所述文字信息处理装置100包括:The word information processing device 100 includes:
接收模块51,用以接收来自所述发送终端10的文字信息并记录发信者;The receiving module 51 is used to receive the text information from the sending terminal 10 and record the sender;
查找模块52:用以在语音合成数据库中查找所述发信者的个人语音数据;Searching module 52: used to search the personal voice data of the sender in the speech synthesis database;
判断模块53:用以判断语音合成数据库中是否有所述发信者的个人语音数据;Judging module 53: for judging whether there is personal voice data of the sender in the speech synthesis database;
记录模块54:用以在没有所述发信者的个人语音数据时记录发信者的个人语音数据;Recording module 54: for recording the personal voice data of the sender when there is no personal voice data of the sender;
转换模块55:用以根据所述发信者的个人语音数据将所述文字信息转换为语音信息;及Conversion module 55: for converting the text information into voice information according to the personal voice data of the sender; and
播放模块56:用以播放语音信息。Playing module 56: used for playing voice information.
所述文字信息处理装置100还包括:The word information processing device 100 further includes:
识别模块57:用以识别所述发信者;Identifying module 57: used to identify the sender;
记录模块54还用以记录一指定文字的语音信息;The recording module 54 is also used to record the voice information of a specified text;
提取模块58:用以提取声音特性,所述声音特征包括音色及音调等;Extraction module 58: used to extract sound characteristics, the sound characteristics include timbre and pitch, etc.;
比对模块59:用以将声音特性与装置预设的默认语音的声音特性做比对;及Comparison module 59: used to compare the sound characteristics with the sound characteristics of the default voice preset by the device; and
生成模块61:用以利用比对的声音特性差异,对默认语音的声音特性进行修改并生成发信者的个人语音。Generating module 61: used to modify the sound characteristics of the default voice and generate the sender's personal voice by using the difference of the compared voice characteristics.
为了达到更好的个人语音效果,所述记录模块54还用以记录发信者对应语言的基本读音单元的读音数据,所述文字信息处理装置100还包括存储模块63,用以储存这些读音数据为对应的个人语音至语音合成数据库31中。In order to achieve a better personal voice effect, the recording module 54 is also used to record the pronunciation data of the basic pronunciation unit of the language corresponding to the sender, and the word information processing device 100 further includes a storage module 63 to store these pronunciation data as The corresponding personal speech is put into the speech synthesis database 31 .
所述文字信息处理装置100还包括设置模块65,用以设置播放方式包括打开或关闭自动播放语音开关及选择合成语音的对象。The word information processing apparatus 100 further includes a setting module 65 for setting the playback mode including turning on or off the automatic playback voice switch and selecting an object to synthesize voice.
当打开自动播放语音开关时,才会自动播放语音信息,否则需点击语音信息才可播放。When the auto-play voice switch is turned on, the voice message will be played automatically, otherwise, you need to click the voice message to play.
所述选择合成语音的对象包括选择发信者个人语音及系统默认语音。所述系统默认语音存储在所述语音合成数据库31中,当设置以默认语音播放时,则语音合成时,仅需要调取所述默认语音。所述默认语音包括以一特定声音特性朗读的基本读音单元的多个读音。当语音合成时,将与各个文字对应的读音连贯起来形成语音信息,再配以特定的语速。系统默认语音对象可以是机器语音、动画人物或名人等。The object for selecting the synthesized voice includes selecting the sender's personal voice and the system default voice. The system default voice is stored in the voice synthesis database 31, and when the default voice is set to be played, only the default voice needs to be retrieved during voice synthesis. The default speech includes a plurality of pronunciations of basic phonetic units read with a specific sound characteristic. When the speech is synthesized, the pronunciation corresponding to each character is coherently formed into speech information, and then matched with a specific speech rate. The system default speech object can be machine speech, animated characters or celebrities, etc.
所述存储模块65还用以存储文字信息及语音信息,以及显示文字信息和语音信息在聊天界面上。The storage module 65 is also used for storing text information and voice information, and displaying the text information and voice information on the chat interface.
请同时参阅图6,所述文字信息处理装置100的内部结构可包括至少一个处理器(processor)71(图中以一个处理器71为例);显示屏72;以及计算机存储介质(memory)73,还可以包括通信接口(Communications Interface)74和总线75。其中,处理器71、显示屏72、计算机存储介质73和通信接口74可以通过总线75完成相互间的通信。显示屏72设置为显示初始设置模式中预设的用户引导界面。通信接口74可以传输信息。处理器71可以调用计算机存储介质73中的逻辑指令,以执行上述实施例中的方法。Please also refer to FIG. 6 , the internal structure of the word information processing device 100 may include at least one processor 71 (in the figure, one processor 71 is taken as an example); a display screen 72 ; and a computer storage medium (memory) 73 , and may also include a Communications Interface 74 and a bus 75 . The processor 71 , the display screen 72 , the computer storage medium 73 and the communication interface 74 can communicate with each other through the bus 75 . The display screen 72 is arranged to display the user guide interface preset in the initial setup mode. Communication interface 74 may transmit information. The processor 71 may invoke logic instructions in the computer storage medium 73 to perform the methods in the above-described embodiments.
此外,上述的计算机存储介质73中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机存储介质中。In addition, the above-mentioned logic instructions in the computer storage medium 73 can be implemented in the form of software functional units and can be stored in a computer storage medium when sold or used as an independent product.
计算机存储介质73可设置为存储软件程序、计算机可执行程序,如本公开实施例中的方法对应的程序指令或模块。处理器71通过运行存储在计算机存储介质73中的软件程序、指令或模块,从而执行功能应用以及数据处理,即实现上述实施例中的方法。The computer storage medium 73 may be configured to store software programs, computer-executable programs, such as program instructions or modules corresponding to the methods in the embodiments of the present disclosure. The processor 71 executes functional applications and data processing by running software programs, instructions or modules stored in the computer storage medium 73 , that is, implements the methods in the above embodiments.
计算机存储介质73可包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序;存储数据区可存储根据终端设备的使用所创建的数据等。此外,计算机存储介质73可以包括高速随机存取计算机存储介质,还可以包括非易失性计算机存储介质。例如,U盘、移动硬盘、只读计算机存储介质(Read-Only Memory,ROM)、随机存取计算机存储介质(Random Access Memory,RAM)、磁碟或者光盘等多种可以存储程序代码的介质,也可以是暂态存储介质。The computer storage medium 73 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created according to the use of the terminal device, and the like. Additionally, computer storage media 73 may include high-speed random access computer storage media, and may also include non-volatile computer storage media. For example, U disk, mobile hard disk, read-only computer storage medium (Read-Only Memory, ROM), random access computer storage medium (Random Access Memory, RAM), magnetic disk or optical disk and other media that can store program codes, It can also be a transient storage medium.
此外,上述存储介质以及移动终端中的多条指令处理器加载并执行的具体过程在In addition, the specific process of loading and executing the above-mentioned storage medium and the multiple instruction processor in the mobile terminal is as follows:
上述方法中已经详细说明,在这里就不再一一陈述。The above method has been described in detail, and will not be described one by one here.
在一实施方式中,所述文字信息处理装置100包括一移动终端及一服务器。所述服务器包括所述处理器及计算机存储介质。所述移动终端可以是手机或者平板电脑。In one embodiment, the word information processing apparatus 100 includes a mobile terminal and a server. The server includes the processor and a computer storage medium. The mobile terminal may be a mobile phone or a tablet computer.
所述处理器加载并执行计算机存储介质中存放的一条或一条以上指令,以实现上述图2-图4所示方法流程的相应步骤;具体实现中,计算机存储介质中的一条或一条以上指令由处理器加载并执行如下步骤:The processor loads and executes one or more instructions stored in the computer storage medium to realize the corresponding steps of the method flow shown in the above-mentioned Figures 2-4; in specific implementation, one or more instructions in the computer storage medium are represented by The processor loads and performs the following steps:
步骤S201:接收一文字信息并记录发信者;Step S201: Receive a text message and record the sender;
步骤S202:在语音合成数据库中查找所述发信者的个人语音数据;Step S202: searching the personal voice data of the sender in the speech synthesis database;
步骤S203:判断是否有所述发信者的个人语音数据,如果否,则执行步骤S204;如果是,执行步骤S205;Step S203: Determine whether there is personal voice data of the sender, if not, go to Step S204; if yes, go to Step S205;
步骤S204:记录发信者的个人语音数据;Step S204: record the personal voice data of the sender;
步骤S205:根据所述发信者的个人语音数据将所述文字信息转换为语音信息;及Step S205: converting the text information into voice information according to the personal voice data of the sender; and
发送语音信息至接收终端。Send voice information to the receiving terminal.
所述接收终端200接收语音信息并播放语音信息。The receiving terminal 200 receives the voice information and plays the voice information.
所述计算机存储介质中的一条或一条以上指令由处理器加载并进一步执行所述步骤S204所包括的:One or more instructions in the computer storage medium are loaded by the processor and further perform the steps included in step S204:
步骤S301:识别所述发信者;Step S301: Identify the sender;
步骤S302:记录一指定文字的语音信息;Step S302: record voice information of a specified text;
步骤S303:提取声音特性,所述声音特征包括音色及音调等;Step S303: extracting sound characteristics, the sound characteristics include timbre and pitch, etc.;
步骤S304:将声音特性与装置预设的默认语音的声音特性做比对;及Step S304: Compare the sound characteristics with the sound characteristics of the default voice preset by the device; and
步骤S305:利用比对的声音特性差异,对默认语音的声音特性进行修改并生成发信者的个人语音。Step S305: Using the difference of the compared voice characteristics, modify the voice characteristics of the default voice and generate the sender's personal voice.
计算机存储介质中的一条或一条以上指令由处理器加载并进一步执行所述步骤S204包括的:One or more instructions in the computer storage medium are loaded by the processor and further perform the step S204 including:
步骤S401:记录发信者对应语言的基本读音单元的多个读音数据;及Step S401: record multiple pronunciation data of the basic pronunciation unit of the language corresponding to the sender; and
步骤S402:储存这些读音数据为对应的个人语音。Step S402: Store these pronunciation data as corresponding personal voices.
所述接收终端200包括终端处理器及终端计算机存储介质,所述终端计算机存储介质的一条或一条以上指令由终端处理器加载并执行步骤:设置播放方式。设置播放方式包括打开或关闭自动播放语音开关及选择合成语音的对象。The receiving terminal 200 includes a terminal processor and a terminal computer storage medium, and one or more instructions of the terminal computer storage medium are loaded by the terminal processor and execute the step: setting a playback mode. Setting the playback method includes turning on or off the auto-play voice switch and selecting the object to synthesize voice.
当打开自动播放语音开关时,才会自动播放语音信息,否则需点击语音信息才可播放。When the auto-play voice switch is turned on, the voice message will be played automatically, otherwise, you need to click the voice message to play.
所述选择合成语音的对象包括选择发信者个人语音及系统默认语音。所述系统默认语音存储在所述语音合成数据库31中,当设置以默认语音播放时,则语音合成时,仅需要调取所述默认语音。所述默认语音包括以一特定声音特性朗读的对应语言的基本读音单元的多个读音。当语音合成时,将与各个文字对应的读音连贯起来形成语音信息,再配以特定的语速。系统默认语音对象可以是机器语音、动画人物或名人等。The object for selecting the synthesized voice includes selecting the sender's personal voice and the system default voice. The system default voice is stored in the voice synthesis database 31, and when the default voice is set to be played, only the default voice needs to be retrieved during voice synthesis. The default speech includes a plurality of pronunciations of basic pronunciation units of the corresponding language read with a specific sound characteristic. When the speech is synthesized, the pronunciation corresponding to each character is coherently formed into speech information, and then matched with a specific speech rate. The default speech object of the system can be machine speech, animated characters or celebrities, etc.
在另一实施方式中,所述文字信息处理装置100仅为一移动终端。该移动终端可以是手机或者平板电脑。所述移动终端包括处理器71及计算机存储介质73。In another embodiment, the word information processing apparatus 100 is only a mobile terminal. The mobile terminal may be a mobile phone or a tablet computer. The mobile terminal includes a processor 71 and a computer storage medium 73 .
所述处理器加载并执行计算机存储介质中存放的一条或一条以上指令,以实现上述图2-图4所示方法流程的相应步骤;具体实现中,计算机存储介质中的一条或一条以上指令由处理器加载并执行如下步骤:The processor loads and executes one or more instructions stored in the computer storage medium to realize the corresponding steps of the method flow shown in the above-mentioned Fig. 2-Fig. 4; The processor loads and performs the following steps:
步骤S201:接收一文字信息并记录发信者;Step S201: Receive a text message and record the sender;
步骤S202:在语音合成数据库中查找所述发信者的个人语音数据;Step S202: searching the personal voice data of the sender in the speech synthesis database;
步骤S203:判断是否有所述发信者的个人语音数据,如果否,则执行步骤S204;如果是,执行步骤S205;Step S203: Determine whether there is personal voice data of the sender, if not, go to Step S204; if yes, go to Step S205;
步骤S204:记录发信者的个人语音数据;Step S204: record the personal voice data of the sender;
步骤S205:根据所述发信者的个人语音数据将所述文字信息转换为语音信息;及Step S205: converting the text information into voice information according to the personal voice data of the sender; and
步骤S206:播放语音信息。Step S206: Play the voice information.
所述计算机存储介质中的一条或一条以上指令由处理器加载并进一步执行所述步骤S204所包括的:One or more instructions in the computer storage medium are loaded by the processor and further perform the steps included in step S204:
步骤S301:识别所述发信者;Step S301: Identify the sender;
步骤S302:记录一指定文字的语音信息;Step S302: record voice information of a specified text;
步骤S303:提取声音特性,所述声音特征包括音色及音调等;Step S303: extracting sound characteristics, the sound characteristics include timbre and pitch, etc.;
步骤S304:将声音特性与装置预设的默认语音的声音特性做比对;及Step S304: Compare the sound characteristics with the sound characteristics of the default voice preset by the device; and
步骤S305:利用比对的声音特性差异,对默认语音的声音特性进行修改并生成发信者的个人语音。Step S305: Using the difference of the compared voice characteristics, modify the voice characteristics of the default voice and generate the sender's personal voice.
计算机存储介质中的一条或一条以上指令由处理器加载并进一步执行所述步骤S204包括的:One or more instructions in the computer storage medium are loaded by the processor and further perform the step S204 including:
步骤S401:记录发信者对应语言的基本读音单元的多个读音数据;及Step S401: record multiple pronunciation data of the basic pronunciation unit of the language corresponding to the sender; and
步骤S402:储存这些读音数据为对应的个人语音。Step S402: Store these pronunciation data as corresponding personal voices.
计算机存储介质中的一条或一条以上指令由处理器还加载并执行步骤:设置播放方式。设置播放方式包括打开或关闭自动播放语音开关及选择合成语音的对象。One or more instructions in the computer storage medium are also loaded by the processor and execute the step: setting a playback mode. Setting the playback method includes turning on or off the auto-play voice switch and selecting the object to synthesize voice.
当打开自动播放语音开关时,才会自动播放语音信息,否则需点击语音信息才可播放。When the auto-play voice switch is turned on, the voice message will be played automatically, otherwise, you need to click the voice message to play.
所述选择合成语音的对象包括选择发信者个人语音及系统默认语音。所述系统默认语音存储在所述语音合成数据库31中,当设置以默认语音播放时,则语音合成时,仅需要调取所述默认语音。所述默认语音包括以一特定声音特性朗读的21个声母、37韵母、5个声调组合成的多个读音。当语音合成时,将与各个文字对应的读音连贯起来形成语音信息,再配以特定的语速。系统默认语音对象可以是机器语音、动画人物或名人等。The objects for selecting the synthesized voice include selecting the sender's personal voice and the system default voice. The system default voice is stored in the voice synthesis database 31, and when the default voice is set to be played, only the default voice needs to be retrieved during voice synthesis. The default speech includes a plurality of pronunciations composed of 21 initials, 37 finals, and 5 tones read with a specific sound characteristic. When the speech is synthesized, the pronunciation corresponding to each character is coherently formed into speech information, and then matched with a specific speech rate. The default speech object of the system can be machine speech, animated characters or celebrities, etc.
另外,本领域技术人员还可在本发明精神内做其它变化,当然,这些依据本发明精神所做的变化,都应包含在本发明所要求保护的范围内。In addition, those skilled in the art can also make other changes within the spirit of the present invention. Of course, these changes made according to the spirit of the present invention should all be included in the scope of the claimed protection of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201711352721.9A CN109935225A (en) | 2017-12-15 | 2017-12-15 | Word information processing device and method, computer storage medium and mobile terminal |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201711352721.9A CN109935225A (en) | 2017-12-15 | 2017-12-15 | Word information processing device and method, computer storage medium and mobile terminal |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN109935225A true CN109935225A (en) | 2019-06-25 |
Family
ID=66980306
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201711352721.9A Pending CN109935225A (en) | 2017-12-15 | 2017-12-15 | Word information processing device and method, computer storage medium and mobile terminal |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109935225A (en) |
Citations (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1379391A (en) * | 2001-04-06 | 2002-11-13 | 国际商业机器公司 | Method of producing individual characteristic speech sound from text |
| CN1534595A (en) * | 2003-03-28 | 2004-10-06 | 中颖电子(上海)有限公司 | Speech sound change over synthesis device and its method |
| CN101064104A (en) * | 2006-04-24 | 2007-10-31 | 中国科学院自动化研究所 | Emotion voice creating method based on voice conversion |
| CN201107040Y (en) * | 2007-08-15 | 2008-08-27 | 深圳华强信息产业有限公司 | Vehicular GPS terminal with automatic voice prompting function |
| CN101308652A (en) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | Synthesizing method of personalized singing voice |
| CN101359473A (en) * | 2007-07-30 | 2009-02-04 | 国际商业机器公司 | Auto speech conversion method and apparatus |
| CN101751922A (en) * | 2009-07-22 | 2010-06-23 | 中国科学院自动化研究所 | Text-independent speech conversion system based on HMM model state mapping |
| CN102568472A (en) * | 2010-12-15 | 2012-07-11 | 盛乐信息技术(上海)有限公司 | Voice synthesis system with speaker selection and realization method thereof |
| CN202563577U (en) * | 2012-05-04 | 2012-11-28 | 山东卡尔电气股份有限公司 | Intelligent monitoring management system for vehicles |
| CN202602766U (en) * | 2012-05-24 | 2012-12-12 | 四川华立德科技有限公司 | Mobile phone capable of searching voice information |
| CN102982809A (en) * | 2012-12-11 | 2013-03-20 | 中国科学技术大学 | Conversion method for sound of speaker |
| CN104123932A (en) * | 2014-07-29 | 2014-10-29 | 科大讯飞股份有限公司 | Voice conversion system and method |
| CN104348250A (en) * | 2013-07-24 | 2015-02-11 | 江苏永迅电气有限公司 | Voice broadcasting type dual-power automatic diverter switch |
| CN104464716A (en) * | 2014-11-20 | 2015-03-25 | 北京云知声信息技术有限公司 | Voice broadcasting system and method |
| CN104900226A (en) * | 2014-03-03 | 2015-09-09 | 联想(北京)有限公司 | Information processing method and device |
| CN105261355A (en) * | 2015-09-02 | 2016-01-20 | 百度在线网络技术(北京)有限公司 | Voice synthesis method and apparatus |
| CN105721292A (en) * | 2016-03-31 | 2016-06-29 | 宇龙计算机通信科技(深圳)有限公司 | Information reading method, device and terminal |
| CN106205602A (en) * | 2015-05-06 | 2016-12-07 | 上海汽车集团股份有限公司 | Speech playing method and system |
| CN106452990A (en) * | 2016-09-05 | 2017-02-22 | 中山海倍瑞智能软件科技有限公司 | Method and device for controlling voice broadcast of household electrical appliances, and household electrical appliances |
| CN106558304A (en) * | 2015-09-22 | 2017-04-05 | 重庆双东机械制造有限公司 | Automobile production information learns automatically and studies translation system |
| CN106652996A (en) * | 2016-12-23 | 2017-05-10 | 北京奇虎科技有限公司 | Prompt tone generating method and device and mobile terminal |
| CN106685808A (en) * | 2017-01-17 | 2017-05-17 | 百度在线网络技术(北京)有限公司 | Live event flow message passing method and device |
-
2017
- 2017-12-15 CN CN201711352721.9A patent/CN109935225A/en active Pending
Patent Citations (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1379391A (en) * | 2001-04-06 | 2002-11-13 | 国际商业机器公司 | Method of producing individual characteristic speech sound from text |
| CN1534595A (en) * | 2003-03-28 | 2004-10-06 | 中颖电子(上海)有限公司 | Speech sound change over synthesis device and its method |
| CN101064104A (en) * | 2006-04-24 | 2007-10-31 | 中国科学院自动化研究所 | Emotion voice creating method based on voice conversion |
| CN101359473A (en) * | 2007-07-30 | 2009-02-04 | 国际商业机器公司 | Auto speech conversion method and apparatus |
| CN201107040Y (en) * | 2007-08-15 | 2008-08-27 | 深圳华强信息产业有限公司 | Vehicular GPS terminal with automatic voice prompting function |
| CN101308652A (en) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | Synthesizing method of personalized singing voice |
| CN101751922A (en) * | 2009-07-22 | 2010-06-23 | 中国科学院自动化研究所 | Text-independent speech conversion system based on HMM model state mapping |
| CN102568472A (en) * | 2010-12-15 | 2012-07-11 | 盛乐信息技术(上海)有限公司 | Voice synthesis system with speaker selection and realization method thereof |
| CN202563577U (en) * | 2012-05-04 | 2012-11-28 | 山东卡尔电气股份有限公司 | Intelligent monitoring management system for vehicles |
| CN202602766U (en) * | 2012-05-24 | 2012-12-12 | 四川华立德科技有限公司 | Mobile phone capable of searching voice information |
| CN102982809A (en) * | 2012-12-11 | 2013-03-20 | 中国科学技术大学 | Conversion method for sound of speaker |
| CN104348250A (en) * | 2013-07-24 | 2015-02-11 | 江苏永迅电气有限公司 | Voice broadcasting type dual-power automatic diverter switch |
| CN104900226A (en) * | 2014-03-03 | 2015-09-09 | 联想(北京)有限公司 | Information processing method and device |
| CN104123932A (en) * | 2014-07-29 | 2014-10-29 | 科大讯飞股份有限公司 | Voice conversion system and method |
| CN104464716A (en) * | 2014-11-20 | 2015-03-25 | 北京云知声信息技术有限公司 | Voice broadcasting system and method |
| CN106205602A (en) * | 2015-05-06 | 2016-12-07 | 上海汽车集团股份有限公司 | Speech playing method and system |
| CN105261355A (en) * | 2015-09-02 | 2016-01-20 | 百度在线网络技术(北京)有限公司 | Voice synthesis method and apparatus |
| CN106558304A (en) * | 2015-09-22 | 2017-04-05 | 重庆双东机械制造有限公司 | Automobile production information learns automatically and studies translation system |
| CN105721292A (en) * | 2016-03-31 | 2016-06-29 | 宇龙计算机通信科技(深圳)有限公司 | Information reading method, device and terminal |
| CN106452990A (en) * | 2016-09-05 | 2017-02-22 | 中山海倍瑞智能软件科技有限公司 | Method and device for controlling voice broadcast of household electrical appliances, and household electrical appliances |
| CN106652996A (en) * | 2016-12-23 | 2017-05-10 | 北京奇虎科技有限公司 | Prompt tone generating method and device and mobile terminal |
| CN106685808A (en) * | 2017-01-17 | 2017-05-17 | 百度在线网络技术(北京)有限公司 | Live event flow message passing method and device |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9715873B2 (en) | Method for adding realism to synthetic speech | |
| JP4619623B2 (en) | Voice message processing system and method | |
| CN106663430B (en) | Keyword detection for speaker-independent keyword models using user-specified keywords | |
| KR101628050B1 (en) | Animation system for reproducing text base data by animation | |
| US9135740B2 (en) | Animated messaging | |
| US10140989B2 (en) | Method and system for speech recognition processing | |
| US20090198497A1 (en) | Method and apparatus for speech synthesis of text message | |
| US11423113B2 (en) | Contextual deep bookmarking | |
| KR101513888B1 (en) | Apparatus and method for generating multimedia email | |
| US20060210028A1 (en) | System and method for personalized text-to-voice synthesis | |
| US20060069567A1 (en) | Methods, systems, and products for translating text to speech | |
| US20110007077A1 (en) | Animated messaging | |
| CN1946065A (en) | Method and system for remarking instant messaging by audible signal | |
| CN104050966A (en) | Voice interaction method of terminal equipment and terminal equipment employing voice interaction method | |
| KR102312993B1 (en) | Method and apparatus for implementing interactive message using artificial neural network | |
| CN107622768B (en) | Audio cutting device | |
| CN103873642A (en) | Method and device for recording call log | |
| KR20150017662A (en) | Method, apparatus and storing medium for text to speech conversion | |
| CN110740212B (en) | Call answering method and device based on intelligent voice technology and electronic equipment | |
| WO2020188885A1 (en) | Information processing method, program, and terminal | |
| CN106601251A (en) | Generating method and system of communication text | |
| JP2005065252A (en) | Cell phone | |
| JP2016109784A (en) | Information processing device, information processing method, interactive system and control program | |
| CN108364638A (en) | A kind of voice data processing method, device, electronic equipment and storage medium | |
| EP1703492A1 (en) | System and method for personalised text-to-voice synthesis |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190625 |
|
| WD01 | Invention patent application deemed withdrawn after publication |