WO2002061609A1 - Database retrieval system - Google Patents
Database retrieval system
- Publication number
- WO2002061609A1 (PCT/AU2001/000428)
- Authority
- WO
- WIPO (PCT)
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3322—Query formulation using system suggestions
Definitions
- The present invention relates generally to a database retrieval system, and more specifically to such a system which uses a natural language search request interface.
- The invention has particular application as an interface for an internet search facility, and is herein described in that context. However, it is to be appreciated that the invention has broader application and is not limited to that particular use.
- In this specification, the term "natural language" refers to word statements which are represented in a sentence structure, or in a structure close to a sentence structure, and which can be readily understood by a non-instructed person to whom the statements are presented.
- Database retrieval systems using a natural language format have been gaining popularity in recent times. Such systems have been used as internet search facility interfaces and also as interfaces for other types of databases, such as online help systems accompanying computer software applications or as part of automated customer service systems.
- In such systems, users are directed to enter their search requests in the form of a complete question expressed in everyday conversational language.
- The database retrieval system then analyses the question submitted by the user and returns a list of database records which the system has deemed to be a possible match to the user's search request.
- Natural language interfaces have been developed primarily to make the computer more "humanised" as compared to traditional interfaces using a Boolean search structure or the like. In this way, the interface is designed to be easier to use and less intimidating, particularly to users who are not experienced in computer-based searching.
- The present invention relates to a system operative to enable a user to produce a search request for retrieving information from a database, the system including: storage means including a plurality of query terms, each being operative to form part of a said search request to retrieve the information from the database; processing means operative to generate a plurality of request lists, each including selected ones of the query terms; means for making the request lists available to the user; means for receiving a user selection of at least one query term in each request list; and means operative to cause each selected query term to be represented to the user in a controlled order as part of the search request. The processing means is operative to select query terms to generate at least one said request list in response to the user selection of at least one query term from a previous request list. When represented to the user in the controlled order, the search request including the selected query terms is in a natural language format.
- An advantage of the present invention is that the system provides a search request interface which guides the user in developing the complete search request, and which also represents that search request in a natural language format. In this way, the user not only readily understands the search request that is being constructed, but the search request also has a higher probability of achieving a successful outcome. This results from the guidance provided by the system in constructing the request, and from the design of the system, which provides a more intuitive approach and can be made specifically compatible with the database from which the information is accessed. This guiding capability is enabled by the system generating lists of query terms, from which the user may select, based on previously chosen query terms.
- Request lists generated by this process are thereby filtered so that the query terms included are appropriate for selection by the user based on the part of the search request that has already been constructed. Further, the representation to the user of the request lists may be ordered so that as individual query terms are selected, the concatenation of the selected query terms in their selected order builds up a natural language search request.
- The storage means includes the full range of possible search query terms. At least some of these query terms are linked to other specific query terms, so that when a specific query term is selected by the user, the query terms linked to it are subsequently represented to the user as a request list. In this way, different categories are formed by virtue of this interlinking of the query terms, with the terms in each category having a common linked query term.
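The interlinking of query terms described above can be pictured as a lookup from a selected term to the request list it unlocks. The following is a minimal sketch only: the names `LINKS` and `next_request_list` are hypothetical, and the example terms are those used later in the description of Table 1.

```python
# Hypothetical link table: each query term maps to the request list that
# is represented to the user once that term has been selected.
LINKS = {
    "listen to": ["music", "comedy", "news", "interview", "sounds", "radio"],
    "music": ["rock genre", "pop genre", "jazz genre", "grunge genre"],
}


def next_request_list(selected_term, links=LINKS):
    """Return the request list linked to the selected term (empty if none)."""
    return list(links.get(selected_term, []))


if __name__ == "__main__":
    print(next_request_list("listen to"))
    print(next_request_list("music"))
```

All terms in one returned list share a common linked term, which is what forms a category in the sense used above.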
- The system may be operated in various media, including audio, visual or a combination of these.
- The system is used on a computer or interactive television, typically through an Internet Web page or similar graphical user interface.
- The user commences a search request which, when initiated, displays an initial list of query terms.
- The user then chooses a query term in that list, and this prompts the system to display a new list of query terms, one or more of which is selected.
- This prompts yet a further list of query terms, one or more of which is selected, which in turn prompts a further list of query terms, and so on until the search request is completed.
- Some of the request lists generated may be dependent on previously selected query terms while some may not.
- On completion of the search request, the system generates the results.
- The system may be designed so that each of the query terms forms part of a predefined search path.
- The number of search paths is dependent on the number of possible query term combinations and would typically be in the order of hundreds of thousands.
- Each predetermined path has specific information relating to the information database linked to it. This information is displayed as the results of the search. It may be in the form of a specific URL in an internet search facility, or in the form of recorded information or the like.
- The system effectively provides a means by which a user may select a predetermined search path from a store of many such paths.
- The selection process is not a one-off step; rather, the selection is built up as the search request is constructed and the user chooses the individual query terms.
- This approach enables the user to effectively manage and choose appropriately from a vast number of predetermined search paths.
- The query terms are used to interrogate a separate database containing discrete information. Typically, each item of discrete information is individually tagged, and when the selected query terms match the tags on a particular item, that item, or selected parts of it, is displayed as a result.
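The tag matching described above can be sketched as a subset test: an item is returned when every selected query term appears among its tags. This is an illustrative sketch, not the patented implementation; the `Record` type, the sample records and the `example.com` URLs are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """One item of discrete information with its tags (hypothetical layout)."""
    title: str
    url: str
    tags: frozenset


# Hypothetical sample data, for illustration only.
RECORDS = [
    Record("Rock radio", "https://example.com/rock",
           frozenset({"listen to", "music", "rock genre"})),
    Record("Jazz archive", "https://example.com/jazz",
           frozenset({"listen to", "music", "jazz genre"})),
]


def match_records(selected_terms, records):
    """Return the records whose tags include every selected query term."""
    wanted = set(selected_terms)
    return [record for record in records if wanted <= record.tags]


if __name__ == "__main__":
    for record in match_records(["listen to", "music"], RECORDS):
        print(record.title, record.url)
```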
- The process of selecting individual query terms from the displayed category of terms may take any suitable form. For example, these query terms may be entered through activation of a remote device such as a keyboard or mouse connected to the computer, through menu control, by touching a screen, or by voice activation, assuming the computer interface has appropriate capabilities.
- The system is also operative to represent the selected query terms to the user, in a controlled manner, in a natural language format.
- Each of the query terms is in the form of a part sentence.
- The selected query terms are then displayed in the controlled order so that the concatenation of the selected query terms, together with any embedded terms in the interface, constructs a sentence.
- The individual query terms take different forms. Some query terms are in the form of verbs, whilst others are in the form of phrases or nouns.
- As soon as a query term is selected, it is represented to the user as part of the search request. With this arrangement, the search request that is being constructed is displayed to the user at all times. This has particular advantage, as it enables the user to see exactly the logic behind the search request which is being constructed with the aid of the system. As this search request is in a natural language format, it is easily understood by even an inexperienced user.
- The system may be used purely through an audio medium such as a telephone or the like. In that application, the user is prompted by the system voicing each of the query terms in the lists selected by the system at any one time.
- The user selects one or more of the prompted query terms, either by voice activation or through the telephone keypad or the like, which then causes the system to tell the user the query terms of the next selected list.
- The controlled order in which the search request terms are represented to the user may vary.
- The search terms are displayed sequentially so that the search request can be viewed as it is being constructed.
- The arrangement of the display may vary from a strict linear sentence structure to a more liberal format where the search request may be represented in a paragraph, as bullet points or the like. In each case it may still constitute a natural language format.
- The order may be changed from sequential to improve the language of the search request and so aid the user's understanding.
- The present invention provides a method of guiding a user to construct a search request for retrieving information from a database, the method including the steps of: (a) providing a plurality of query terms;
- Figure 1 illustrates a schematic view of the components of a database retrieval system according to a preferred embodiment of the invention;
- Figures 2 to 4 are user interfaces for the system of Figure 1;
- Figure 5 is a flow chart illustrating the steps performed by the system of Figure 1.
- Figure 1 illustrates, in schematic view, a system 10 which is operative to retrieve information from a database 50.
- The system 10 is operative over a distributed network of computers such as is available over the Internet through the World Wide Web.
- The system is accessible by a user from a remote client device 11, and includes a server 12 and a search database 13.
- The system 10 is operative to establish a search request, which is then used to retrieve information from the database 50. This information is typically web site characterisation data which allows a user to locate associated web pages on the World Wide Web.
- The databases 13 and 50 are shown as separate items, although it is appreciated that they may be incorporated into a single database as explained in more detail below.
- The remote user device 11 communicates with the server 12 through a distributed computer network 14 such as the Internet.
- The remote device 11 is typically in the form of a PC, although it may be any other suitable device, such as a personal digital assistant or mobile phone, with appropriate communication protocols to enable access over the Internet.
- The server 12 is operative to send to the remote device, on request, a user interface 30, such as the web pages illustrated in Figures 2 to 4, to enable a user to construct a search request. The search request is then processed by the server 12, which in turn accesses databases 13 and 50.
- The system 10 is structured such that the search database 13 includes a store of query terms. These terms are associated with different category fields. At least some of the query terms are interlinked so that when a specific query term is selected by the user under one category field, the linked query terms are subsequently represented to the user as a list of terms in a subsequent category field. In this way, the terms in many of the request lists each have a common linked query term, which provides a filtering process for displaying only relevant query terms.
- The search database 13 is structured to define a plurality of predetermined search paths.
- Each of these search paths, which typically number in the tens to hundreds of thousands, has specific web site characterisation data associated with it, which is stored in database 50.
- This characterisation data is typically specific URLs or the like which enable a user to locate specific sites on the World Wide Web.
- The system may of course be applied to search a database containing other information and is not limited to an Internet search engine as described in the present embodiment.
- Table 1 illustrates the structure of the search database 13, identifying the request lists, category fields and predetermined search paths.
- The database 13 is structured so that there are six category fields, namely Action, Content Type, General Topic, Specific Topic, Region, and Match Phrase Description. These are represented by the columns of Table 1.
- The rows of the Table are the individual search paths, while the Table entries are the query terms which are grouped together in the different request lists.
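One way to read the Table 1 layout is that a filtered request list can be derived directly from the rows: keep the rows that agree with the selections made so far, then collect the distinct entries in the next column. The sketch below assumes that reading. `SEARCH_PATHS` and `request_list` are hypothetical names, only the first three category fields are shown, and the final row is invented purely to demonstrate filtering.

```python
# Each row is one predetermined search path, truncated here to the first
# three category fields (Action, Content Type, General Topic).
SEARCH_PATHS = [
    ("listen to", "music", "rock genre"),
    ("listen to", "music", "pop genre"),
    ("listen to", "music", "jazz genre"),
    ("listen to", "music", "grunge genre"),
    ("read", "news", "sport"),  # hypothetical row, not taken from Table 1
]


def request_list(paths, selected):
    """Return the distinct terms available in the next category field.

    Only rows whose leading entries equal the terms selected so far are
    considered, so the list is filtered by the earlier selections.
    """
    column = len(selected)
    prefix = tuple(selected)
    terms = []
    for row in paths:
        if column < len(row) and row[:column] == prefix and row[column] not in terms:
            terms.append(row[column])
    return terms


if __name__ == "__main__":
    print(request_list(SEARCH_PATHS, []))
    print(request_list(SEARCH_PATHS, ["listen to"]))
    print(request_list(SEARCH_PATHS, ["listen to", "music"]))
```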
- The order of the category fields is structured to guide the user in developing a natural language search request. The request lists are displayed to the user, as the user progresses through the category fields, in an order such that the concatenation of the selected query terms in their selected order builds up a natural language search request.
- In each category field there are multiple query terms, which may be displayed together in one list or in different filtered lists depending on previously selected query terms.
- In the Action category field, the terms are displayed in one list, whereas in most of the other category fields the request lists are filtered.
- The Action request list consists of query terms including "listen to", "view", "read", "buy", "sell" etc.
- In the Content Type category field, a filtered request list can be generated based on each query term in the Action field. For example, for the query term "listen to" there is a linked request list which includes the query terms "music", "comedy", "news", "interview", "sounds", "radio".
- In the next category field, there is a linked request list for each of the query terms in each of the request lists of Content Type.
- For example, a request list including "rock genre", "pop genre", "jazz genre", and "grunge genre" is generated under the General Topic category field, and is linked to the query term "music" in one of the request lists under the Content Type field.
- This structure of linking specific query terms to generated filtered request lists carries through the other category fields (Specific Topic and Match Phrase Description). Because it only includes a limited number of query terms, the Region category field does not need to be filtered, and therefore the request list generated is not dependent on the selected term in the previous category fields. It is noted that the system 10 is able to include a free form query term which allows keywords to be included in the search request. This free form query would appear as a query term in the selected category, and is indicated in the arrangement of Table 1 under the "Match Phrase Description" category field. Table 1 also illustrates two embedded search request terms which appear on the user interface 30. These embedded terms are "I wish to" and "specifically". The system 10 is structured so that the embedded terms, in conjunction with the query terms in the category fields, form a complete search request in a natural language format, making it easy for a user to understand the nature of the specific request.
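The way embedded terms and selected query terms combine into a sentence can be sketched as a simple concatenation in category-field order. This is an illustration under stated assumptions: the source names the embedded terms "I wish to" and "specifically" but does not say where "specifically" sits, so placing it before the Specific Topic term is a guess, and the Specific Topic value used in the usage example is hypothetical.

```python
CATEGORY_ORDER = [
    "Action",
    "Content Type",
    "General Topic",
    "Specific Topic",
    "Region",
    "Match Phrase Description",
]


def build_request(selections):
    """Concatenate embedded terms and selected query terms in field order.

    `selections` maps a category field name to the chosen query term.
    Fields with no selection yet are skipped, so the partial request can be
    redisplayed after every selection.
    """
    parts = ["I wish to"]  # embedded term that opens the request
    for field in CATEGORY_ORDER:
        term = selections.get(field)
        if not term:
            continue
        if field == "Specific Topic":
            parts.append("specifically")  # assumed position of this embedded term
        parts.append(term)
    return " ".join(parts)


if __name__ == "__main__":
    print(build_request({"Action": "listen to"}))
    print(build_request({
        "Action": "listen to",
        "Content Type": "music",
        "General Topic": "rock genre",
        "Specific Topic": "live recordings",  # hypothetical term
    }))
```

Because the function accepts a partial set of selections, the same call can serve to redisplay the request after each selection.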
- Each completed search request forms a predetermined search path, and associated with each search path is appropriate web site characterisation data from the database 50.
- This association of the web site characterisation data can be obtained by modifying and manipulating an existing dump of web site characterisation data from the Internet. Alternatively, it could be built up using appropriate automated search tools or manually.
- The ongoing maintenance of the categories and the links directory may be performed manually (either on line by the web site owners or by internal staff), or by using suitable automated tools that may be available.
- FIGS 2 to 4 illustrate the construction on the user interface 30 of the following search request:
- The interface 30 is structured so that the embedded terms 31 appear on the screen.
- The interface 30 also includes six separate entry field boxes.
- The first box is the Action field box 32.
- The second is the Content Type field box 33.
- The third is the General Topic field box 34.
- The fourth is the Specific Topic field box 35.
- The fifth is the Region field box 36.
- The sixth is the Match Phrase Description field box 37.
- To commence a search, a user goes first to the Action field box 32, where clicking a cursor on the box prompts a pop-up menu 38 to appear, displaying the request list of query terms in that category field.
- The user is then able to select one of the query terms within the menu box 38 by highlighting it.
- The selected query term is then represented in the field box 32. In the present example the selected query term is "listen to".
- FIG. 5 is a flow chart illustrating the steps performed by the system as outlined above. As can be seen at step 20, the user commences the search request by initiating the system in some appropriate way, such as clicking on the arrow in the first field 32.
- On initiation of the search request, the system 10 at step 21 displays a first list of query terms. The user then selects one of the query terms in the displayed list, as indicated at step 22. Once the query term is selected, the display changes to remove the first list of query terms and displays the selected query term adjacent to the embedded words "I wish to" to form part of the constructed search request. This occurs at step 23.
- The selected query term is then sent to the server 12, which determines whether the selection of that term represents the completed search request. If the server determines that the search request is completed, then at step 25 the query term(s) is used to interrogate the search database, which provides the matching linked results for that complete search request, and those results are sent back to the remote device at step 26.
- If the search request is not complete, the query term is used to interrogate the search database to select a subsequent list of query terms from which the user is able to select. This may occur automatically or by an initiating action by the user, such as clicking on the next empty category field. This occurs at step 27.
- At step 28, the subsequent selected list of query terms is displayed in a similar manner as at step 21, in the form of a drop-down box or the like.
- The user is then again in a position to select a query term (step 22) and the cycle continues.
- The next selected query term is displayed as a continuation of the search request, and that query term is forwarded to the server, which first determines whether it completes the requested search and then adopts either steps 25 and 26 or steps 27 and 28 accordingly.
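The server-side branch in the flow chart can be sketched as a single function that either returns results (steps 25 and 26) or the next request list (steps 27 and 28). The sketch assumes a request is complete when the sequence of selected terms matches a stored search path, which is one possible reading of the description; `LINKS`, `RESULTS`, `handle_selection` and the `example.com` URL are hypothetical.

```python
# Hypothetical stores: LINKS stands in for the linked request lists of the
# search database, RESULTS for the data linked to each completed path.
LINKS = {
    "listen to": ["music", "comedy", "news", "interview", "sounds", "radio"],
    "music": ["rock genre", "pop genre", "jazz genre", "grunge genre"],
}
RESULTS = {
    ("listen to", "music", "rock genre"): ["https://example.com/rock-radio"],
}


def handle_selection(selections, links=LINKS, results=RESULTS):
    """Process the terms selected so far and decide what to send back.

    Returns the linked results when the selections form a completed search
    path, otherwise the next request list for the user to choose from.
    """
    if not selections:
        raise ValueError("at least one query term must have been selected")
    path = tuple(selections)
    if path in results:
        # Completed request: interrogate the store and return the results.
        return {"complete": True, "results": list(results[path])}
    # Not complete: select the subsequent list linked to the latest term.
    return {"complete": False, "request_list": list(links.get(selections[-1], []))}


if __name__ == "__main__":
    print(handle_selection(["listen to"]))
    print(handle_selection(["listen to", "music", "rock genre"]))
```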
- The system 10 is able to include a free form query term which allows keywords to be included in the search request. If the user makes a free form entry, the system then performs a keyword search directly on the database 50, taking into account all the category selections and the keywords that the user has entered.
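A free form entry can be pictured as a keyword filter applied on top of the category selections. The sketch below is an assumption about how the two might be combined, since the source does not specify the matching rule: a record must carry every selected category term as a tag and contain every keyword in its description. The record layout, sample data and `keyword_search` name are hypothetical.

```python
# Hypothetical records standing in for entries in the information database.
RECORDS = [
    {"url": "https://example.com/rock-live",
     "tags": {"listen to", "music", "rock genre"},
     "description": "Live rock concerts recorded in Melbourne"},
    {"url": "https://example.com/rock-studio",
     "tags": {"listen to", "music", "rock genre"},
     "description": "Studio rock albums"},
    {"url": "https://example.com/jazz-live",
     "tags": {"listen to", "music", "jazz genre"},
     "description": "Live jazz sessions"},
]


def keyword_search(records, selections, keywords):
    """Return URLs of records matching both the selections and the keywords."""
    wanted = set(selections)
    words = [keyword.lower() for keyword in keywords]
    hits = []
    for record in records:
        if not wanted <= record["tags"]:
            continue  # fails the category selections
        text = record["description"].lower()
        if all(word in text for word in words):
            hits.append(record["url"])
    return hits


if __name__ == "__main__":
    print(keyword_search(RECORDS, ["listen to", "music", "rock genre"], ["live"]))
```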
- The present invention allows the construction of a natural language format search request.
- The user is guided to select each query term sequentially in order to gradually construct a very precise natural language request, with the interface continually displaying the concatenation of the already selected query terms and relevant embedded text at any point in time.
- The guided structure not only greatly assists the user to enter appropriate query terms for retrieving meaningful content, but also allows the system to use a standard request structure. This is distinct from more recent natural language structures, which require specific systems to analyse the content and syntax of the question and then generate appropriate query terms to interrogate the query database.
- Other advantages include the fact that the user merely needs to choose from a selected list of query terms at any one time, thereby restricting the potential for user entry error. Further, both guiding the user and displaying the search request as it is being constructed give even inexperienced users a much greater probability of finding relevant material as compared to a free form natural language search request.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AUPQ6862A AUPQ686200A0 (en) | 2000-04-12 | 2000-04-12 | Database retrieval system |
AUPQ6862 | 2000-04-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2002061609A1 true WO2002061609A1 (fr) | 2002-08-08 |
Family
ID=3820954
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/AU2001/000428 WO2002061609A1 (fr) | 2000-04-12 | 2001-04-12 | Systeme d'extraction dans une base de donnees |
Country Status (2)
Country | Link |
---|---|
AU (1) | AUPQ686200A0 (fr) |
WO (1) | WO2002061609A1 (fr) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110307461A1 (en) * | 2010-06-11 | 2011-12-15 | Microsoft Corporation | Query context selection using graphical properties |
CN102385483A (zh) * | 2010-09-03 | 2012-03-21 | Sap股份公司 | 基于上下文的用户接口、搜索和导航 |
CN110781370A (zh) * | 2019-10-16 | 2020-02-11 | 杭州云深科技有限公司 | 一种移动终端信息查询方法和计算机设备 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5414838A (en) * | 1991-06-11 | 1995-05-09 | Logical Information Machine | System for extracting historical market information with condition and attributed windows |
US6026388A (en) * | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
GB2343530A (en) * | 1998-11-07 | 2000-05-10 | Int Computers Ltd | Nested graphical representations of Boolean expressions assist database querying |
EP1033662A2 (fr) * | 1999-03-01 | 2000-09-06 | Canon Kabushiki Kaisha | Méthode et appareil de recherche en langage naturel |
-
2000
- 2000-04-12 AU AUPQ6862A patent/AUPQ686200A0/en not_active Abandoned
-
2001
- 2001-04-12 WO PCT/AU2001/000428 patent/WO2002061609A1/fr active Application Filing
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110307461A1 (en) * | 2010-06-11 | 2011-12-15 | Microsoft Corporation | Query context selection using graphical properties |
US8239363B2 (en) * | 2010-06-11 | 2012-08-07 | Microsoft Corporation | Query context selection using graphical properties |
CN102385483A (zh) * | 2010-09-03 | 2012-03-21 | Sap股份公司 | 基于上下文的用户接口、搜索和导航 |
CN110781370A (zh) * | 2019-10-16 | 2020-02-11 | 杭州云深科技有限公司 | 一种移动终端信息查询方法和计算机设备 |
CN110781370B (zh) * | 2019-10-16 | 2022-07-29 | 杭州云深科技有限公司 | 一种移动终端信息查询方法和计算机设备 |
Also Published As
Publication number | Publication date |
---|---|
AUPQ686200A0 (en) | 2000-05-11 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: JP |