JPH06332945A

JPH06332945A - Device and method for processing natural language

Info

Publication number: JPH06332945A
Application number: JP5117178A
Authority: JP
Inventors: Yuji Ikeda; 裕治池田; Keiichi Sakai; 桂一酒井; Minoru Fujita; 稔藤田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1993-05-19
Filing date: 1993-05-19
Publication date: 1994-12-02

Abstract

(57)【要約】【目的】応答文の聞きやすさを判定し、ユーザがより
聞きやすいものに修正して出力する自然言語インターフ
ェースを提供する。【構成】自然言語インターフェース装置に、自然言語
によって質問文を入力する入力部１０１と、入力された
質問文を解析する文解析部１０２と、その解析結果に基
づいて情報を検索する検索部１０４と、その検索結果か
ら自然言語による応答文を生成する応答文生成部１０６
と、生成される応答文の聞きやすさを判定する判定部１
０７と、その判定結果に基づいて生成部１０６により生
成される応答文を修正する応答文変更部１１２と、修正
された応答文を出力する合成音声出力部１１４とを具え
る。 (57) [Summary] [Purpose] To provide a natural language interface that judges the easiness of listening to a response sentence, and modifies the response sentence to make it easier for the user to output. [Structure] An input unit 101 for inputting a question sentence in a natural language, a sentence analysis unit 102 for analyzing the input question sentence, and a search unit 104 for searching information based on the analysis result in a natural language interface device. , A response sentence generation unit 106 that generates a response sentence in natural language from the search result
And a determination unit 1 that determines the easiness of listening to the generated response sentence.
07, a response sentence changing unit 112 that corrects the response sentence generated by the generating unit 106 based on the determination result, and a synthetic voice output unit 114 that outputs the corrected response sentence.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、自然言語よりなる文章
を入力して、データベースを検索してその結果を合成音
声で出力する自然言語インタフェース装置のごとき自然
言語処理装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a natural language processing device such as a natural language interface device for inputting a sentence in natural language, searching a database and outputting the result as synthetic speech.

【０００２】[0002]

【従来の技術】近年、自然言語よりなる入力文章に基づ
き、データベースから情報を検索して、その結果を合成
音声で出力する自然言語インタフェース装置が研究開発
されている。かかる従来の自然言語インタフェース装置
の構成及び動作を図３と図４を用いて説明する。2. Description of the Related Art In recent years, a natural language interface device has been researched and developed which retrieves information from a database based on an input sentence in natural language and outputs the result as synthetic speech. The configuration and operation of such a conventional natural language interface device will be described with reference to FIGS.

【０００３】図３は、従来の自然言語インタフェース装
置の構成を示す図である。同図において、３０１は、ユ
ーザから自然言語文による質問を受け取る入力部であ
る。３０２は、入力部３０１から得られた自然言語文を
解析して、文の意味する内容を解析する文解析部であ
る。３０３は、文解析部３０２から得られたユーザの質
問内容をデータベース検索部３０４に問い合わせ、その
問い合わせに対する答えを話題の履歴として保存すると
ともに応答文生成部３０６に送る対話管理部である。３
０４は、上記対話管理部３０３からユーザの質問内容を
受け取り、その質問内容に対応するデータをデータベー
ス３０５から検索するデータベース検索部である。３０
５は、データベースである。FIG. 3 is a diagram showing the configuration of a conventional natural language interface device. In the figure, 301 is an input unit for receiving a question in a natural language sentence from a user. A sentence analysis unit 302 analyzes the natural language sentence obtained from the input unit 301 and analyzes the meaning of the sentence. A dialogue management unit 303 inquires of the user's question content obtained from the sentence analysis unit 302 to the database search unit 304, stores an answer to the inquiry as a history of topics, and sends it to the response sentence generation unit 306. Three
A database search unit 04 receives the user's question content from the dialogue management unit 303 and searches the database 305 for data corresponding to the question content. Thirty
5 is a database.

【０００４】また３０６は、対話管理部３０３から受け
取ったユーザの質問に対する答えとなる応答文を、その
読み及びアクセント情報とともに生成する応答文生成部
である。３０７は、応答文生成部３０６で生成された文
の情報から合成音声波形を生成する音響処理部である。
３０８は、音響処理部３０７で生成された音声波形をス
ピーカに出力して合成音声を出力する音声出力部であ
る。Reference numeral 306 is a response sentence generation unit for generating a response sentence, which is an answer to the user's question received from the dialogue management unit 303, together with its reading and accent information. Reference numeral 307 is an acoustic processing unit that generates a synthetic speech waveform from the sentence information generated by the response sentence generation unit 306.
A voice output unit 308 outputs the voice waveform generated by the sound processing unit 307 to a speaker and outputs a synthetic voice.

【０００５】図４は、図３の構成で、ユーザと自然言語
インタフェース装置が対話する過程を表したフローチャ
ートである。まず、入力文受け取り処理ステップＳ１２
でユーザの入力した自然言語による質問文を受け取り、
文解析処理ステップＳ１３に渡す。文解析処理ステップ
Ｓ１３では、この自然言語文の意味を解析し、解析され
たユーザの質問の内容をデータベース検索処理ステップ
Ｓ１４に渡す。データベース検索処理ステップＳ１４で
は、データベース３０５に照らし合わせて、適当なデー
タを選択して、話題管理処理ステップＳ１５に受け渡
す。FIG. 4 is a flow chart showing a process in which the user interacts with the natural language interface device in the configuration of FIG. First, the input sentence receiving process step S12
Received the question text in natural language entered by the user at
It is passed to the sentence analysis processing step S13. In the sentence analysis processing step S13, the meaning of the natural language sentence is analyzed, and the content of the analyzed question of the user is passed to the database search processing step S14. In the database search processing step S14, appropriate data is selected with reference to the database 305, and is transferred to the topic management processing step S15.

【０００６】話題管理処理ステップＳ１５は、その時点
のユーザの質問と、データベース検索処理ステップＳ１
４によって得られたデータを、その質問応答で対象にな
ったデータとともに、話題の履歴として、対話管理部３
０３に保存する。この情報は、次のユーザの質問の内容
を解析するときの文脈として、文解析処理ステップＳ１
３において使用する。話題管理処理ステップＳ１５は、
話題の履歴を保持した後に、答えとしてのデータの情報
を応答文生成処理ステップＳ１６に受け渡す。応答文生
成処理ステップＳ１６は、話題管理処理ステップＳ１５
から得られたデータをもとに応答文を生成して、その読
み及びアクセントの情報とともに音響処理ステップＳ１
７に渡す。The topic management processing step S15 is a question of the user at that time and the database search processing step S1.
The data obtained by the step 4 together with the data targeted by the question and answer are used as the history of the topic, and the dialogue management unit 3
Save in 03. This information is used as a context when the content of the question of the next user is analyzed, and the sentence analysis processing step S1.
Used in 3. The topic management processing step S15 is
After holding the history of topics, information of data as an answer is passed to the response sentence generation processing step S16. The response sentence generation processing step S16 is the topic management processing step S15.
A response sentence is generated on the basis of the data obtained from, and the acoustic processing step S1 together with the reading and accent information.
Pass to 7.

【０００７】音響処理ステップＳ１７は、応答文生成処
理ステップＳ１６から渡された文とその読みとアクセン
ト情報から、音声波形を生成して、合成音声出力処理ス
テップＳ１８に渡す。合成音声出力処理ステップＳ１８
は、渡された音声波形をスピーカに音声として出力す
る。そしてステップＳ１９で質問応答を終了するか否か
の判断を行ない、終了させる場合には終了する。もし終
了させない場合には、ステップＳ１２の入力受け取り処
理に戻る。この繰り返しの処理が、自然言語インタフェ
ース装置により質問応答する過程の処理の流れである。In the sound processing step S17, a voice waveform is generated from the sentence passed from the response sentence generation processing step S16, its reading and accent information, and passed to the synthetic voice output processing step S18. Synthetic voice output processing step S18
Outputs the passed voice waveform to the speaker as voice. Then, in step S19, it is determined whether or not the question answering is ended, and if it is ended, the processing is ended. If not ended, the process returns to the input receiving process of step S12. This repetitive process is a process flow of a process of answering a question by the natural language interface device.

【０００８】[0008]

【発明が解決しようとしている課題】しかしながら、上
記従来例では、自然言語インタフェース装置がユーザに
返す応答は、データベースから検索されたデータを元に
生成された自然言語文の応答文をそのまま音声合成した
ものを出力するので、ユーザがシステムからの応答を声
として聞いたときに以下のような欠点があった。（１）システムが提示する情報の中で、どの部分が重要
なのかが、音声で聞いたときには判断しにくい。（２）システムが提示しうる複数の情報の読みが類似し
ている場合、単調に音声合成された音声では区別が困難
である。（３）常に同じ発声速度で応答文を返すと、ユーザを苛
立たせてしまうことがある。（４）データベースの検索に時間が係っている場合に
は、その間システムが黙り込んでしまうために、ユーザ
に不安を抱かせるような間が空いてしまう。However, in the above-mentioned conventional example, the natural language interface device returns the response to the user by synthesizing the natural language response sentence generated based on the data retrieved from the database as it is. Since it outputs the output, there are the following drawbacks when the user hears the response from the system as a voice. (1) It is difficult to judge which part of the information presented by the system is important when heard by voice. (2) When the readings of a plurality of information that can be presented by the system are similar, it is difficult to distinguish the voices that are monotonically synthesized. (3) If the response sentence is always returned at the same speech rate, the user may be annoyed. (4) If it takes time to search the database, the system shuts down during that time, so that there is a time to make the user anxious.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決するため
に、本発明の自然言語処理装置は、自然言語によって質
問文を入力する入力手段と、該入力手段より入力された
質問文を解析する解析手段と、該解析手段による解析結
果に基づいて情報を検索する検索手段と、該検索手段の
検索結果から自然言語による応答文を生成する生成手段
と、該生成手段により生成される応答文の把握しやすさ
を判定する判定手段と、該判定手段の判定結果に基づい
て前記生成手段により生成される応答文を修正する修正
手段と、該修正手段により修正された応答文を出力する
出力手段とを具える。In order to solve the above-mentioned problems, a natural language processing apparatus of the present invention analyzes an input means for inputting a question text in natural language and a question text input by the input means. Analysis means, search means for searching information based on the analysis result by the analysis means, generation means for generating a response sentence in natural language from the search result by the search means, and response sentence generated by the generation means Judgment means for judging easiness of grasping, correction means for correcting the response sentence generated by the generation means based on the judgment result of the determination means, and output means for outputting the response sentence corrected by the correction means And with.

【００１０】また、本発明の他の態様によれば、自然言
語処理方法に、自然言語によって質問文を入力する入力
工程と、該入力工程において入力された質問文を解析す
る解析工程と、該解析工程による解析結果に基づいて情
報を検索する検索工程と、該検索工程の検索結果から自
然言語による応答文を生成する生成工程と、該生成工程
により生成される応答文の把握しやすさを判定する判定
工程と、該判定工程の判定結果に基づいて前記生成工程
により生成される応答文を修正する修正工程と、該修正
工程により修正された応答文を出力する出力工程とを具
える。According to another aspect of the present invention, in the natural language processing method, an input step of inputting a question sentence in natural language, an analysis step of analyzing the question sentence input in the input step, A search step of searching information based on the analysis result of the analysis step, a generation step of generating a response sentence in natural language from the search result of the search step, and an easy grasp of the response sentence generated by the generation step. The determination step includes a determination step, a correction step for correcting the response sentence generated by the generation step based on the determination result of the determination step, and an output step for outputting the response sentence corrected by the correction step.

【００１１】[0011]

【作用】上記自然言語処理装置は、入力手段より入力さ
れた自然言語による質問文を解析手段により解析した結
果に基づいて、検索手段が情報を検索し、その検索結果
から生成手段により生成される自然言語による応答文の
把握しやすさを判定手段により判定し、その判定結果に
基づいて、生成される応答文を修正手段により修正して
出力手段より出力する。In the natural language processing apparatus, the search means searches for information based on the result of the analysis means analyzing the question sentence in the natural language input from the input means, and the search result is generated by the generating means. The easiness of grasping the response sentence in natural language is determined by the determination means, and the generated response sentence is corrected by the correction means based on the determination result and output from the output means.

【００１２】上記自然言語処理方法は、入力工程におい
て入力された自然言語による質問文を解析工程により解
析した結果に基づいて、検索工程で情報を検索し、その
検索結果から生成工程により生成される自然言語による
応答文の把握しやすさを判定工程により判定し、その判
定結果に基づいて、生成される応答文を修正工程により
修正して出力工程にて出力する。In the natural language processing method, information is searched for in the search step based on the result of analysis of the question sentence in the natural language input in the input step in the analysis step, and the information is generated in the generation step from the search result. The easiness of grasping the response sentence in natural language is determined in the determination step, and the generated response sentence is corrected in the correction step based on the determination result and output in the output step.

【００１３】[0013]

【実施例】以下、図面を参照して本発明を詳細に説明す
る。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will be described in detail below with reference to the drawings.

【００１４】図１は、本発明の一実施例に係る装置の構
成を示すブロック図である。１０１は、ユーザから自然
言語による質問文を受け取る入力部であり、文字を入力
するためのキーボード、あるいは、音声を入力するため
のマイクロフォン及び音声認識装置などにより構成され
る。１０２は、入力部１０１から得られた自然言語文を
解析して、文の意味する内容を解析する文解析部であ
る。１０３は、文解析部１０２から得られたユーザの質
問内容をデータベース検索部１０４に問い合わせ、その
問い合わせに対する答えを話題の履歴として保存すると
ともに応答文生成部１０６に送る対話管理部である。１
０４は、上記対話管理部１０３からユーザの質問内容を
受け取り、その質問内容に対応するデータをデータベー
ス１０５から検索するデータベース検索部である。１０
５は、データベースである。FIG. 1 is a block diagram showing the configuration of an apparatus according to an embodiment of the present invention. Reference numeral 101 denotes an input unit that receives a question sentence in natural language from a user, and includes a keyboard for inputting characters, a microphone for inputting voice, a voice recognition device, and the like. A sentence analysis unit 102 analyzes the natural language sentence obtained from the input unit 101 and analyzes the meaning of the sentence. Reference numeral 103 is a dialogue management unit that inquires the database search unit 104 about the question content of the user obtained from the sentence analysis unit 102, saves the answer to the inquiry as a history of topics, and sends the response sentence generation unit 106. 1
A database search unit 04 receives the user's question content from the dialogue management unit 103 and searches the database 105 for data corresponding to the question content. 10
5 is a database.

【００１５】１０６は、対話管理部１０３から、ユーザ
の質問に対する答えとなる応答文を、その読み及びアク
セント情報とともに生成する応答文生成部である。１０
７は、応答文生成部１０６で生成された文の情報と、対
話管理部１０３で管理されている対話の内容及びユーザ
とシステムとの対話の履歴の情報に基づいて、生成され
た応答文をユーザが合成音声で聞いたときの聞きやすさ
を判定する聞きやすさ判定部である。この聞きやすさ判
定部１０７は、焦点判定部１０８あるいは、類似読み判
定部１０９あるいは、入力速度判定部１１０あるいは、
タイマ１１１のいずれかからなる。本実施例では、聞き
やすさ判定部１０７が焦点判定部１０８で構成される場
合を以下に説明する。Reference numeral 106 denotes a response sentence generation unit for generating, from the dialogue management unit 103, a response sentence serving as an answer to the user's question together with its reading and accent information. 10
Reference numeral 7 indicates the generated response sentence based on the information of the sentence generated by the response sentence generation unit 106, the content of the dialogue managed by the dialogue management unit 103, and the history information of the dialogue between the user and the system. This is an easiness of hearing determination unit that determines easiness of hearing when the user listens to the synthesized voice. The easiness of hearing determination unit 107 includes a focus determination unit 108, a similar reading determination unit 109, an input speed determination unit 110,
It is composed of one of the timers 111. In the present embodiment, a case where the easiness of listening determination unit 107 is composed of the focus determination unit 108 will be described below.

【００１６】焦点判定部１０８は、データベース１０５
から検索されたデータを応答文として生成する際に、ど
のデータがユーザにとって重要であるかを、対話管理部
１０３に保持された対話の履歴の情報を用いて判定す
る。この焦点判定部１０８で構成される聞きやすさ判定
部１０７は、判定結果である重要データ部分と応答文生
成部１０６で生成された応答文の情報を応答文変更部１
１２に渡す。応答文変更部１１２は、聞きやすさ判定部
１０７から渡された判定結果と応答文の情報を元に、応
答文を音声合成したときに聞きやすい文、すなわち、こ
の場合は重要データが分かりやすく聞きやすい文になる
ように応答文を変更する応答文変更部である。The focus determination unit 108 is a database 105.
When the data retrieved from is generated as a response sentence, which data is important to the user is determined by using the history information of the dialogue held in the dialogue management unit 103. The easiness of listening determination unit 107 configured by the focus determination unit 108 receives the important data portion as the determination result and the information of the response sentence generated by the response sentence generation unit 106 from the response sentence changing unit 1
Pass to 12. Based on the determination result and the information of the response sentence passed from the easiness of listening determination unit 107, the response sentence changing unit 112 makes the sentence easy to hear when the response sentence is speech-synthesized, that is, the important data in this case is easy to understand. It is a response sentence changing unit that changes the response sentence so that the sentence becomes easy to hear.

【００１７】１１３は、応答文変更部１１２で変更され
た文の情報から合成音声波形を生成する音響処理部であ
る。１１４は、音響処理部１１３で生成された音声波形
をスピーカに出力して合成音声を出力する音声出力部で
ある。Reference numeral 113 is an acoustic processing unit for generating a synthetic speech waveform from the sentence information changed by the response sentence changing unit 112. Reference numeral 114 is a voice output unit that outputs the voice waveform generated by the sound processing unit 113 to a speaker and outputs a synthetic voice.

【００１８】図２は、図１のように構成された自然言語
インタフェース装置が、ユーザと対話する過程を表した
フローチャートである。まず、入力文受け取り処理ステ
ップＳ１でユーザの入力した自然言語による質問文を受
け取り、文解析処理ステップＳ２に渡す。文解析処理ス
テップＳ２は、この自然言語文の意味を解析し、ユーザ
の質問の内容をデータベース検索処理ステップＳ３に渡
す。データベース検索処理ステップＳ３は、データベー
ス１０５に照らし合わせて、適当なデータを選択して、
話題管理処理ステップＳ４に受け渡す。FIG. 2 is a flowchart showing a process in which the natural language interface device configured as shown in FIG. 1 interacts with a user. First, in the input sentence receiving process step S1, the question sentence in the natural language input by the user is received and passed to the sentence analyzing process step S2. The sentence analysis processing step S2 analyzes the meaning of the natural language sentence and passes the content of the user's question to the database search processing step S3. The database search processing step S3 selects appropriate data with reference to the database 105,
It is passed to the topic management processing step S4.

【００１９】話題管理処理ステップＳ４は、その時点の
ユーザの質問と、データベース検索処理ステップＳ３に
よって得られたデータを、その質問応答で対象になった
データとともに、話題の履歴として、対話管理部１０３
に保存する。この情報は、次のユーザの質問の内容を解
析するときの文脈として、文解析処理ステップＳ２にお
いて使用する。話題管理処理ステップＳ４は、話題の履
歴を保存した後に、答えとしてのデータの情報を応答文
生成処理ステップＳ５に受け渡す。応答文生成処理ステ
ップＳ５は、話題管理処理ステップＳ４から得られたデ
ータをもとに応答文を生成して、生成された応答文をそ
の読みとアクセントの情報とともに聞きやすさ判定処理
ステップＳ６に渡す。In the topic management processing step S4, the dialogue management unit 103 stores the question of the user at that time and the data obtained in the database search processing step S3, together with the data targeted by the question answer, as a history of the topic.
Save to. This information is used in the sentence analysis processing step S2 as the context when the content of the question of the next user is analyzed. In the topic management processing step S4, after the history of the topic is stored, information of data as an answer is transferred to the response sentence generation processing step S5. The response sentence generation processing step S5 generates a response sentence based on the data obtained from the topic management processing step S4, and sends the generated response sentence to the easiness of hearing determination processing step S6 together with the reading and accent information. hand over.

【００２０】聞きやすさ判定処理ステップＳ６は、話題
管理処理ステップＳ４で得られた話題の履歴と、応答文
生成処理ステップＳ６で生成された応答文とその読み及
びアクセントの情報から、応答文のどの部分がユーザに
とって重要かを判定して、その結果を焦点をあてる情報
としてステップＳ７に渡す。ステップＳ７では、聞きや
すさ判定処理ステップＳ６から渡された情報に改良点で
ある焦点情報があるかを判断する。焦点情報がある、す
なわち改良点がある場合には、その焦点情報と共に、応
答文と読みとアクセントの情報を応答文変更処理ステッ
プＳ８に渡す。改良点がないばあいには、応答文と読み
とアクセントの情報を音響処理ステップＳ９に渡す。In the easiness of hearing judgment processing step S6, a response sentence is extracted from the history of the topic obtained in the topic management processing step S4, the response sentence generated in the response sentence generation processing step S6, and its reading and accent information. It is determined which part is important to the user, and the result is passed to step S7 as focused information. In step S7, it is determined whether or not the information passed from the listening comprehension determination processing step S6 includes focus information that is an improvement point. If there is focus information, that is, if there is an improvement, the response sentence and the reading and accent information are passed to the response sentence change processing step S8 together with the focus information. If there is no improvement, the response sentence and the reading and accent information are passed to the acoustic processing step S9.

【００２１】応答文変更処理ステップＳ８は、ステップ
Ｓ７の判定によってあると判定され、渡された改良点の
情報である焦点情報と、応答文と読みとアクセントの情
報から、応答文の中で焦点をあてる部分の語のアクセン
トを強くしたり、その部分だけをゆっくり発声するよう
にする情報を付加した応答文を生成する。そしてその応
答文を、音響処理ステップＳ９に渡す。音響処理ステッ
プＳ９は、応答文生成部ステップＳ８から渡された文と
その読みとアクセント情報から、音声波形を生成して、
合成音声出力処理ステップＳ１０に渡す。合成音声出力
処理ステップＳ１０は、渡された音声波形をスピーカに
音声として出力する。そしてステップＳ１１で質問応答
を終了するか否かの判断を行ない、終了させる場合には
終了する。もし終了させない場合には、ステップＳ１２
の入力受け取り処理に戻る。この繰り返しの処理が、自
然言語インタフェース装置により質問応答する過程の処
理の流れである。The response sentence change processing step S8 is determined by the determination of step S7, and the focus information in the response sentence, which is the information on the improvement point passed, and the response sentence, the reading and the accent information, are used as the focus in the response sentence. Generates a response sentence to which the accent of the word to which is applied is strengthened or information is added so that only that part is uttered slowly. Then, the response sentence is passed to the acoustic processing step S9. The sound processing step S9 generates a voice waveform from the sentence passed from the response sentence generation unit step S8, its reading and accent information,
The synthesized voice output processing is passed to step S10. In the synthesized voice output processing step S10, the delivered voice waveform is output to the speaker as voice. Then, in step S11, it is determined whether or not the question answering is ended, and if it is ended, the processing is ended. If not ended, step S12
Return to the input receiving process of. This repetitive process is a process flow of a process of answering a question by the natural language interface device.

【００２２】図１の自然言語インタフェース装置におい
て、文解析部１０２などの機能は、図２のフローチャー
トに対応するプログラムをメモリに記憶させておき、Ｃ
ＰＵがメモリ上のワークエリアを利用しながらこのプロ
グラムを実行することにより実現することもできる。In the natural language interface device of FIG. 1, the function of the sentence analysis unit 102 and the like stores a program corresponding to the flowchart of FIG.
It can also be realized by executing this program while the PU uses the work area on the memory.

【００２３】図５と図６は、図１の構成の自然言語イン
タフェース装置で、聞きやすさを判定して聞きやすい応
答文を生成した例である。図５では、（ユーザ１）の質
問文「伊豆あたりにあって腰痛に効く温泉を知りたい」
に対して、場所が伊豆という条件と、効能が腰痛である
温泉という条件とのＡＮＤ条件でデータベースを検索し
た結果を元に、（システム１）の応答文が生成される。
この場合、場所については「伊豆あたり」という表現を
用いていることから、聞きやすさ判定処理ステップＳ６
は、「温泉」の条件のほうが焦点をあてるべき部分であ
ると判定して、図中ＡとＢで示した温泉の名前に焦点を
あてるという判定を行なう。それによって、応答文変更
処理ステップＳ８は、このＡとＢの温泉とホテルの名前
の部分の読みに、強いアクセントを示す情報、あるい
は、スピードを遅くするという情報、あるいは両方の情
報を追加した応答文を生成する。これにより（システム
１）の応答では、ＡとＢの部分が強調されて音声合成さ
れる。FIGS. 5 and 6 show an example in which the natural language interface device having the configuration shown in FIG. In Figure 5, the question sentence of (User 1) "I want to know the hot springs around Izu that are effective for back pain."
On the other hand, the response sentence of (system 1) is generated based on the result of searching the database under the AND condition of the condition that the place is Izu and the condition that the effect is a hot spring with low back pain.
In this case, since the expression "per Izu" is used for the place, the easiness of hearing determination processing step S6
Determines that the condition of "hot spring" is the portion to be focused, and determines that the name of the hot spring shown by A and B in the figure is focused. As a result, the response sentence change processing step S8 adds the information indicating the strong accent, the information indicating that the speed is slowed down, or both of the information added to the reading of the names of the hot springs and the hotel of A and B. Generate a sentence. As a result, in the response of (system 1), the portions A and B are emphasized and speech synthesis is performed.

【００２４】また、図６では、（ユーザ２）の質問文
「東伊豆で東京からすぐ行ける温泉で、腰痛に効くのを
知りたい」に対して、図５と同じ内容の応答文（システ
ム２）を生成するが、図５の場合とは異なり、「東伊豆
で東京からすぐ行ける」という表現を用いていることか
ら、場所の条件に焦点をあてるべきであると判定して、
Ｃの「熱海に」という場所の部分を強調した合成音声を
生成する。以上のような聞きやすさ判定部と応答文変更
部を用いることによって、本実施例の装置では、応答文
の中で重要な情報の部分が強調された聞きやすい合成音
声を得ることが可能になった。In addition, in FIG. 6, in response to the question sentence (user 2) "I want to know that it is effective for low back pain at a hot spring in Higashiizu, which can be reached from Tokyo immediately," the same response sentence (system 2) ) Is generated, but unlike the case of FIG. 5, since the expression “you can go from Tokyo in Higashiizu” is used, it is determined that the condition of the place should be focused,
A synthetic voice is generated by emphasizing the portion of the location "Atami" in C. By using the easiness of listening determination unit and the response sentence changing unit as described above, in the device of the present embodiment, it is possible to obtain an easy-to-hear synthetic voice in which a portion of important information in the response sentence is emphasized. became.

【００２５】〔他の実施例〕上記実施例では、聞きやす
さ判定部１０７として、焦点判定部１０８を用いて聞き
やすさの判定を行なっていたが、これに限らず、聞きや
すさの判定基準として、類似読み判定部１０９を用いる
ことができる。図７は、類似読み判定部１０９を用いた
ときの対話例である。（ユーザ３）の質問「多摩方向に
あるゴルフ場を教えて」に対して（システム３）の応答
文を生成しているのだが、この場合、検索結果データの
ゴルフ場「東多摩カントリークラブ」「多摩ゴルフクラ
ブ」「ゴルフコース南多摩」の３つを応答している。[Other Embodiments] In the above-described embodiment, the focus determination unit 108 is used as the audibility determination unit 107 to determine the easiness of hearing, but the present invention is not limited to this. As a reference, the similar reading determination unit 109 can be used. FIG. 7 is an example of a dialogue when the similar reading determination unit 109 is used. The response sentence of (system 3) is generated in response to the question (tell me the golf course in the Tama direction) from (user 3). In this case, the golf course "Higashitama Country Club" in the search result data is generated. We are responding to three things: "Tama Golf Club" and "Golf Course Minamitama".

【００２６】ここで、この３つの名前にはどれも「多
摩」という読みの入っているので、類似読み判定部１０
９により、「多摩」という読みの部分が類似していると
判定される。このような判定結果を用いれば、応答文変
更部では、この３つの名前で異なる読みの部分につい
て、強さあるいはスピードを変えて読むようにして、Ｄ
とＥとＦの部分を強調して音声合成を行なうことが可能
になる。これによって、ユーザは類似した読みに注意が
向けることができる。Here, since the reading "Tama" is included in all of these three names, the similar reading determination unit 10
According to 9, it is determined that the reading part “Tama” is similar. If such a determination result is used, the response sentence changing unit reads the different reading portions with these three names with different strengths or speeds, and D
It becomes possible to perform speech synthesis by emphasizing the portions E, F and F. This allows the user to focus on similar readings.

【００２７】また、上記実施例では、聞きやすさ判定部
１０７として、類似読み判定部１０９を用いて聞きやす
さの判定を行なっていたが、これに限らず、聞きやすさ
の判定基準として、入力速度判定部１１０を用いること
ができる。ユーザによっては、同じスピードや強さでの
合成音声を用いた質問応答を行なうと、苛立つことがあ
る。そもそも気の短いユーザは、ゆっくり話されると苛
立ってしまう場合がある。Further, in the above embodiment, the easiness of listening is determined by using the similar reading determining unit 109 as the easiness of listening determining unit 107, but the present invention is not limited to this. The input speed determination unit 110 can be used. Depending on the user, it may be frustrating to perform question answering using synthetic voice at the same speed and strength. In the first place, short-minded users may get frustrated if they speak slowly.

【００２８】このような場合に、質問文の入力速度を、
キーボード入力の時にはそのキー入力の速度を計測した
り、音声入力の場合には、ユーザの話している時間と、
その文の長さから、早口であるかを判定できる。この条
件と用いると、判定の結果、入力速度が速い場合にはも
ともと気の短いユーザあるいは、ユーザが苛立ってお
り、応答文の速度を上げたほうがユーザによって聞きや
すいと判定して、応答文全体の発声速度を速くすること
が可能である。In such a case, the input speed of the question sentence is
In case of keyboard input, the speed of the key input is measured, and in the case of voice input, the talking time of the user,
From the length of the sentence, it is possible to determine whether it is quick. If this condition is used, the result of the determination is that if the input speed is fast, the user who is originally short-minded or the user is irritated, and it is easier for the user to hear the response text It is possible to increase the vocalization speed of.

【００２９】また、上記実施例では、聞きやすさ判定部
１０７として、入力速度判定部１１０を用いて聞きやす
さの判定を行なっていたが、これに限ることなく、聞き
やすさの判定基準として、タイマ１１１を用いることが
できる。大量データの検索が必要な質問が入力された場
合には、システムが応答の合成音声をしばらく返さない
ので、ユーザに不安を抱かせることもある。このような
場合、タイマ１１１によって、一定の時間システムが応
答を返せないときには、間をつなぐための応答文を出す
ことができる。Further, in the above embodiment, the easiness of hearing is determined by using the input speed determination unit 110 as the easiness of listening determination unit 107. , The timer 111 can be used. When a question that requires a large amount of data retrieval is input, the system does not return the synthetic voice of the response for a while, which may cause anxiety to the user. In such a case, when the system cannot return a response for a certain period of time, the timer 111 can issue a response sentence for connecting the two.

【００３０】図８は、聞きやすさ判定部１０７としてタ
イマ１１１を用いたときの対話例である。（ユーザ４）
の質問に対して（システム５）の応答文を生成する間
に、質問の入力から所定の時間を要したことをタイマが
検知する。そしてその応答までの間を埋めるために、あ
らかじめ用意された（システム４−１）や（システム４
−２）や（システム４−３）の応答を生成する。これに
より、話の途切れのない、聞きやすい対話を実現するこ
とができる。FIG. 8 is an example of a dialogue when the timer 111 is used as the easiness of listening determination unit 107. (User 4)
While the response sentence of (system 5) is generated in response to the question, the timer detects that it takes a predetermined time from the input of the question. Then, in order to fill the time until the response, (system 4-1) or (system 4) prepared in advance
-2) and the response of (system 4-3) are generated. As a result, it is possible to realize an easy-to-listen dialogue without interruption of the conversation.

【００３１】上記実施例では、聞きやすさ判定部１０７
として、焦点判定部１０８、類似読み判定部１０９、入
力速度判定部１１０、タイマ１１１を単独で用いていて
聞きやすさの判定を行なっていたが、これに限らず、聞
きやすさ判定部１０７として、これらの判定部を１つ以
上組み合わせて聞きやすさの判定を行うことができる。In the above-described embodiment, the easiness of listening determination unit 107 is used.
As the focus determination unit 108, the similar reading determination unit 109, the input speed determination unit 110, and the timer 111, the ease of listening is determined, but the present invention is not limited to this. The easiness of hearing can be determined by combining one or more of these determination units.

【００３２】また、以上の実施例では、音声出力の場合
について説明したが、これに限らず、応答文を表示出力
する場合でも、聞きやすさと同様に読みやすさを判定し
て読みやすい応答文を作成することもできる。すなわ
ち、焦点判定部１０８あるいは類似読み判定部１０９の
判定結果に基づいて、強調すべき部分を、まさに図５〜
図７に示されるように下線を付したり、あるいは文字
色、文字サイズ、輝度を変えるなどして、容易に識別可
能とすることができる。また、タイマ１１１を用いて、
所定の時間の経過に応じて、図８に示すごとき応答文を
順に表示することもできる。In the above embodiments, the case of voice output has been described, but the present invention is not limited to this, and even when a response sentence is displayed and output, the readability is determined in the same manner as the readability by determining the readability. Can also be created. That is, based on the determination result of the focus determination unit 108 or the similar reading determination unit 109, the portion to be emphasized is exactly as shown in FIG.
As shown in FIG. 7, it is possible to easily identify by underlining or changing the character color, character size, and brightness. Also, using the timer 111,
It is also possible to sequentially display response sentences as shown in FIG. 8 according to the passage of a predetermined time.

【００３３】[0033]

【発明の効果】以上説明したように、応答文の把握しや
すさを判定し、判定結果に基づいて応答文を修正するこ
とによって、ユーザが把握しやすい応答文が得られると
いう効果がある。As described above, by determining the easiness of grasping the response sentence and correcting the response sentence based on the determination result, there is an effect that the response sentence which the user can easily grasp is obtained.

【００３４】特に、以下の効果が得られる。（１）システムが伝えたい情報の中で重要な部分が強調
された応答文が得られる。（２）類似した読みを持つ複数の回答において、それら
の差を区別できるように、互いに異なる部分が強調され
た応答が得られる。（３）ユーザからの質問文の入力速度に応じて、音声応
答における出力音声の速度が変更できる。（４）システムがデータベースを検索してその結果を返
すまでに長時間を要す場合に、その間を埋める応答をす
ることで、途切れのない対話処理が実現できる。In particular, the following effects can be obtained. (1) A response sentence in which an important part of the information that the system wants to convey is emphasized can be obtained. (2) In a plurality of answers having similar readings, a response in which different parts are emphasized is obtained so that the difference between them can be distinguished. (3) The speed of the output voice in the voice response can be changed according to the input speed of the question sentence from the user. (4) When it takes a long time for the system to search the database and return the result, by providing a response to fill the gap, seamless dialogue processing can be realized.

[Brief description of drawings]

【図１】本発明の実施例の自然言語インタフェース装置
のブロック図である。FIG. 1 is a block diagram of a natural language interface device according to an embodiment of the present invention.

【図２】本発明の実施例の自然言語インタフェース装置
の処理過程を示すフローチャートである。FIG. 2 is a flowchart showing a processing procedure of the natural language interface device according to the embodiment of the present invention.

【図３】従来の自然言語インタフェース装置のブロック
図である。FIG. 3 is a block diagram of a conventional natural language interface device.

【図４】従来の自然言語インタフェース装置の処理過程
を示すフローチャートである。FIG. 4 is a flowchart showing a processing process of a conventional natural language interface device.

【図５】実施例のユーザとシステムの対話例を示す図で
ある。FIG. 5 is a diagram showing an example of interaction between a user and a system according to an embodiment.

【図６】実施例のユーザとシステムの対話例を示す図で
ある。FIG. 6 is a diagram showing an example of interaction between a user and a system according to an embodiment.

【図７】他の実施例のユーザとシステムの対話例を示す
図である。FIG. 7 is a diagram showing an example of interaction between a user and a system according to another embodiment.

【図８】他の実施例のユーザとシステムの対話例を示す
図である。FIG. 8 is a diagram showing an example of interaction between a user and a system according to another embodiment.

[Explanation of symbols]

１０１文入力部１０２文解析部１０３対話管理部１０４データベース検索部１０５データベース１０６応答文生成部１０７聞きやすさ判定部１０８焦点判定部１０９類似読み判定部１１０入力速度判定部１１１タイマ１１２応答文変更部１１３音響処理部１１４合成音声出力部 101 sentence input unit 102 sentence analysis unit 103 dialogue management unit 104 database search unit 105 database 106 response sentence generation unit 107 audibility determination unit 108 focus determination unit 109 similar reading determination unit 110 input speed determination unit 111 timer 112 response sentence change unit 113 sound processing unit 114 synthetic speech output unit

Claims

[Claims]

1. An input unit for inputting a question sentence in natural language, an analyzing unit for analyzing the question sentence input by the input unit, and a searching unit for retrieving information based on an analysis result by the analyzing unit. Generating means for generating a response sentence in natural language from the search result of the searching means; determining means for determining easiness of grasping the response sentence generated by the generating means; and A natural language processing apparatus comprising: a correction unit that corrects a response sentence generated by the generation unit; and an output unit that outputs the response sentence corrected by the correction unit.

2. The output means outputs the response sentence by voice information, and the correction means corrects the accent, volume, or output speed of a part or all of the response sentence. The natural language processing device according to claim 1.

3. The storage device manages and saves the topic related to the question sentence and the response sentence based on the analysis result of the analysis device and the search result of the search device, and the determination device stores the contents of the storage device. Determining the important part in the response sentence generated by the generating means with reference, and modifying the important part determined by the determining means so as to be distinguishable from other parts in the response sentence. The natural language processing device according to claim 1.

4. The determining means determines the existence of similar portions in the response sentence generated by the generating means, and the correcting means can distinguish the similar portions determined by the determining means. The natural language processing apparatus according to claim 1, wherein the response sentence is modified so that

5. The determination means measures the search execution time by the search means to determine whether a predetermined time has passed, and the correction means determines that the predetermined time has passed by the determination means. The natural language processing device according to claim 1, wherein the response sentence is modified so that a predetermined response sentence is output prior to the response sentence generated by the generating means.

6. The determination means determines the input speed of the question sentence by the input means, and the correction means, based on the determination result of the determination means,
3. The natural language processing apparatus according to claim 2, wherein the response sentence is modified so that the voice output speed of the response sentence from the output unit is changed.

7. An input step of inputting a question text in natural language, an analysis step of analyzing the question text input in the input step, and a search step of searching information based on the analysis result of the analysis step. A generation step of generating a response sentence in natural language from the search result of the search step, a determination step of determining the easiness of grasping the response sentence generated by the generation step, and A natural language processing method comprising: a correction step of correcting a response sentence generated by the generation step; and an output step of outputting a response sentence corrected by the correction step.

8. In the output step, the response sentence is output by voice information, and in the correction step, a part or all of the response sentence is accented, volume, or
The natural language processing method according to claim 7, wherein the output speed is corrected.

9. A storage step of managing and storing topics related to the question text and the response text based on the analysis result of the analysis step and the search result of the search step, wherein the determination step is stored by the storage step. The important part in the response sentence generated by the generating step is referred to with reference to the content, and the correcting step is performed so that the important part determined by the determining means can be distinguished from other parts in the response sentence. The natural language processing method according to claim 7, which is modified.

10. The determining step determines the presence of similar parts in the response sentence generated by the generating step, and the correcting step can distinguish the similar parts determined by the determining step. 8. The natural language processing method according to claim 7, wherein the response sentence is modified so that

11. The determination step measures the search execution time by the search step to determine whether a predetermined time has passed, and the correction step determines whether the predetermined time has passed by the determination step. 8. The natural language processing method according to claim 7, wherein the response sentence is modified so that a predetermined response sentence is output prior to the response sentence generated by the generating step.

12. The determination step determines the input speed of the question sentence in the input step, and the correction step, based on the determination result of the determination step,
9. The natural language processing method according to claim 8, wherein the response sentence is modified so that the voice output speed of the response sentence from the output unit is changed.