WO2006006287A1 - Goods/services search system on the web - Google Patents
- Publication number
- WO2006006287A1 (application PCT/JP2005/007163)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- page
- search
- price
- web
- product
- Prior art date
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/954—Navigation, e.g. using categorised browsing
Definitions
- the present invention relates to a product search service on the web, and more particularly to a system for searching for products and services provided for a fee to users on the web.
- the present invention has been made in view of such a situation, and an object of the present invention is to provide a product/service search system that enables a user to easily search for information on products and services provided for a fee on the web, and that makes it easy to visit the websites offering those products and services.
- the first aspect of the present invention provides a search system for products and services provided for a fee on the web, the search system comprising:
- a robot that searches the web and obtains the source code of pages on the web, and a price-containing page search means that searches the source code of each acquired page and extracts only the pages that contain a price;
- an index search means that extracts the product name or service name and the price as an index from each extracted page, and stores the extracted product name or service name and price in a database in association with the URL of the page; and
- a search result providing means that reads the product name or service name, its price, and the URL from the database and provides, on the terminal at which information was entered on the search page, the read product name or service name and its price with a link to the corresponding page.
- the second aspect of the present invention provides a search system for products and services provided for a fee on the web, the search system comprising:
- a robot that searches the web and obtains the source code of pages on the web, the robot being equipped with a price-containing page search means that searches the source code of each page and acquires only the pages that include a price;
- an index search means that extracts the product name or service name and the price as an index from each acquired page, and stores the extracted product name or service name and price in a database in association with the URL of the page; and
- a search result providing means that provides a search page for entering information, reads the product name or service name, its price, and the URL from the database, and provides them on the terminal at which information was entered on the search page.
- the price-containing page search means is preferably configured to extract only pages that include a price character string in which an integer is arranged before or after a currency symbol.
- the price-containing page search means is configured to extract only a page including a definition tag representing a price.
- the search system preferably further includes a purchase intention page deletion means that deletes pages including a term indicating purchase intention from the pages acquired by the price-containing page search means, and the index search means is preferably configured to extract the product name or service name and the price as an index from the pages that have not been deleted by the purchase intention page deletion means.
- the search system preferably further includes a keyword search means that extracts product sales pages by finding, through keyword search, the purchase intention pages among the pages extracted by the price-containing page search means and deleting them,
- and that separates the extracted product sales pages into e-commerce pages and auction pages by keyword search, and the index search means is preferably configured to extract the index only from the product sales pages.
- the present invention, configured as described above, searches for pages using a price-specific keyword that is used universally around the world, builds a database, and stores the product or service name found on each page.
- the load on the server is drastically reduced compared to the conventional example.
- the load on the robot itself is small, and the time required to find a new page or an updated page is significantly reduced compared to the conventional example.
- FIG. 1 is a block diagram showing a configuration of a search system according to the present invention.
- FIG. 2 is an explanatory diagram for explaining the operation of the search system according to the present invention.
- FIG. 3 is an explanatory diagram illustrating a user search input page and a search result display page provided by the search system according to the present invention.
- the product search service on the web according to the present invention was developed with a focus on the observation that, when the content provided by a website contains a character string with an integer before or after the currency symbol of some country, the website is likely to be offering products or services. In other words, if the content of a web page contains a string with an integer before or after a currency symbol, that string represents a price, so it is highly probable that the website providing the page is an e-commerce mall or an auction mall.
- the present invention searches for web pages including such a price character string (a currency symbol adjacent to an integer), finds on each page the product or service name corresponding to the price, and provides the product or service name and its price, together with the URL of the page and a link to it, in a form that the user can easily search.
- hereinafter, a “product” or “service” provided for a fee is referred to simply as a “product”.
- a character string accompanied by an integer before or after a currency symbol is referred to as a “price character string”.
- FIG. 1 shows the basic configuration of a product/service search system on the web (hereinafter simply referred to as the “search system”) according to the present invention.
- reference numeral 10 denotes a search service site that provides the search system.
- reference numeral 20 denotes the Internet.
- Search service site 10 has a robot 1, a web information database (DB) 2, a price string search engine 3, a price string containing page DB4, a keyword search engine 5, a product sales page DB6, an index search engine 7, an index DB8, and a user search service engine 9.
- the robot 1 crawls the Internet 20 and searches for new and updated pages on the Internet 20.
- the source code of the entire page obtained by the robot search is stored in the web information DB2.
- when the search system is first started up, every page on the web counts as a new page, so the source code of each page obtained by the search of the robot 1 is stored in the web information DB2 accordingly.
- the price string search engine 3 searches the source code of the pages newly stored or updated in the web information DB2 and extracts the source code of only those pages that include a price string.
- the source code of the extracted pages is stored in the price string containing page DB4. For example, in a page with source code as shown in FIG. 2, “¥6,930” is found as a price character string, so the source code of this page is stored in the price string containing page DB4.
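As a rough illustration of the price string check described above, the following Python sketch flags a page when its source code contains an integer immediately before or after a currency symbol. The set of currency symbols, the regular expression, and the function names `find_price_strings` and `contains_price_string` are assumptions made for this example; the patent text does not specify an implementation.

```python
import re
from typing import List

# Illustrative subset of currency symbols (an assumption, not a list from the patent).
CURRENCY_SYMBOLS = "¥$€£"

# An integer, optionally written with thousands separators such as "6,930".
_INTEGER = r"(?:\d{1,3}(?:,\d{3})+|\d+)"
_SYMBOL = f"[{re.escape(CURRENCY_SYMBOLS)}]"

# A price string: currency symbol followed by an integer, or an integer
# followed by a currency symbol (at most one whitespace character between).
PRICE_STRING = re.compile(rf"{_SYMBOL}\s?{_INTEGER}|{_INTEGER}\s?{_SYMBOL}")


def find_price_strings(source_code: str) -> List[str]:
    """Return every price string found in a page's source code."""
    return PRICE_STRING.findall(source_code)


def contains_price_string(source_code: str) -> bool:
    """Return True if the page should go into the price-string page store."""
    return PRICE_STRING.search(source_code) is not None
```

A page would be kept for the next stage only when `contains_price_string` returns `True`. A real deployment would likely need a much larger symbol set and handling for currency codes, which this sketch leaves out.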
- the robot 1 itself may have a price character string search function.
- in that case, the web information DB2 becomes unnecessary.
- the keyword search engine 5 searches the source code of the pages stored in the price string containing page DB4 using keywords, and extracts the pages indicating purchase intention (for example, reverse pages).
- as keywords, terms indicating an intention to purchase a product, such as “Want”, “Want” and “Ikaga”, are used, and pages including any of these terms are retrieved.
- the source code of the purchase intention page extracted by such keyword search is deleted, and the source code of the remaining pages is stored in the product sales page DB6.
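The keyword-based removal of purchase intention pages can be sketched as follows. This is a minimal sketch under stated assumptions: the term list reuses the translated placeholder terms from the text (the original Japanese terms are not recoverable here), matching is a plain case-insensitive substring test, and the function names and example URLs are hypothetical.

```python
from typing import Dict, Iterable, Tuple

# Placeholder terms taken from the translated text; the real keyword list
# is an assumption and would be in the language of the crawled pages.
PURCHASE_INTENT_TERMS = ("want", "ikaga")


def is_purchase_intention_page(
    source_code: str, terms: Iterable[str] = PURCHASE_INTENT_TERMS
) -> bool:
    """Return True if the page contains any purchase-intention term."""
    text = source_code.lower()
    return any(term.lower() in text for term in terms)


def split_by_purchase_intention(
    pages: Dict[str, str],
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Split {url: source_code} into (product sales pages, purchase intention pages)."""
    sales: Dict[str, str] = {}
    intention: Dict[str, str] = {}
    for url, source_code in pages.items():
        target = intention if is_purchase_intention_page(source_code) else sales
        target[url] = source_code
    return sales, intention
```

Returning the purchase intention pages instead of discarding them keeps the option of storing them separately for later matching against sales pages.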
- most of the pages stored in the product sales page DB6 are directly related to e-commerce and auction.
- the keyword search engine 5 further identifies whether each page stored in the product sales page DB6 is for e-commerce or for auction. For this identification, it is determined whether a keyword associated with auctions, such as “current price”, is included in the page, or whether “auction” or “auctions” is included in the URL. If this determination is affirmative, the page is determined to be for auction, and the corresponding page stored in the product sales page DB6 is marked to show that its sales form is an auction. Pages that are not marked are for e-commerce.
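The auction check described above might look like the following sketch. The keyword “current price” and the URL markers come from the text; the function name `classify_sales_form`, the return labels, and the case-insensitive matching are assumptions for illustration.

```python
# Keywords associated with auctions; only "current price" is named in the text.
AUCTION_KEYWORDS = ("current price",)

# "auction" as a substring also covers URLs that contain "auctions".
AUCTION_URL_MARKER = "auction"


def classify_sales_form(url: str, source_code: str) -> str:
    """Return "auction" if the page looks like an auction page, else "e-commerce"."""
    text = source_code.lower()
    if any(keyword in text for keyword in AUCTION_KEYWORDS):
        return "auction"
    if AUCTION_URL_MARKER in url.lower():
        return "auction"
    # Unmarked pages are treated as ordinary e-commerce pages.
    return "e-commerce"
```

For example, `classify_sales_form("http://example.com/auctions/123", "<p>Keyboard</p>")` is classified as an auction because of its URL alone.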
- the system may also be configured so that the source code of the purchase intention pages is stored in a database separate from the e-commerce and auction pages, the contents of the e-commerce and auction pages are checked against the contents of the purchase intention pages, and, if matching pages are found, the providers of both pages are notified.
- alternatively, the product sales page DB6 may be omitted, and a flag indicating that a page is a product sales page may instead be added to the corresponding page in the price string containing page DB4.
- the index search engine 7 analyzes the source code of the e-commerce and auction pages extracted in this way and stored in the product sales page DB6, extracts from each page the information corresponding to “product name” and “amount”, and stores this information and the URL of the page in the index DB8 as index information.
- the index DB8 also stores the sales form, that is, whether the page is for e-commerce or for auction.
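One way to picture the index information is as a small record keyed by page URL. The sketch below stores such records in an in-memory SQLite database; the table name `index_info`, the column names, and the helper functions are assumptions for illustration, and the step of extracting the product name from a page is deliberately left out because the text does not describe it.

```python
import sqlite3
from dataclasses import astuple, dataclass


@dataclass(frozen=True)
class IndexEntry:
    product_name: str
    price: int
    url: str
    sales_form: str  # "e-commerce" or "auction"


def create_index_db() -> sqlite3.Connection:
    """Create an in-memory stand-in for the index database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE index_info ("
        "product_name TEXT, price INTEGER, url TEXT PRIMARY KEY, sales_form TEXT)"
    )
    return conn


def store_entry(conn: sqlite3.Connection, entry: IndexEntry) -> None:
    """Insert an entry, replacing any older entry for the same URL."""
    conn.execute("INSERT OR REPLACE INTO index_info VALUES (?, ?, ?, ?)", astuple(entry))


def delete_entry(conn: sqlite3.Connection, url: str) -> None:
    """Remove the entry for a page later found not to be a product sales page."""
    conn.execute("DELETE FROM index_info WHERE url = ?", (url,))
```

Keying on the URL means that a page re-crawled after an update overwrites its earlier index entry rather than producing a duplicate.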
- the index information is as follows.
- the user search service engine 9 has a function of providing the index information stored in the index DB8 on the web in a form that is easy for a user to search. To this end, a user search input page as shown in FIG. 3(A) is provided to the user on the web. The user search input page has, for example, input fields for “product name”, “amount”, “sales method”, and “payment method”.
- the “sales method” input field is for entering the sales form, that is, whether the sale is e-commerce or an auction.
- the user search service engine 9 searches the index DB8 according to its search algorithm and displays the search results for the entered search keys/queries in the specified display method.
- for example, if the user enters “Orange Keyboard” in the product name field and “¥6800” to “¥7300” in the price field, the user search service engine 9 provides a product information display page, as shown in FIG. 3(B), that matches the entered conditions.
- in the example of FIG. 3(B), the orange keyboard dealers are listed in order from the lowest price. If no amount is entered on the user search input page, the product information display page lists all the stores selling the entered product name “Orange Keyboard”, along with their prices, in an appropriate order. The corresponding URL may also be displayed.
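The search behaviour in this example (match on product name, optional price range, cheapest first) can be sketched as below. The `Offer` record, the function `search_offers`, the substring matching rule, and the example URLs are all assumptions for illustration; the text only says that the index is searched according to a search algorithm.

```python
from typing import Iterable, List, NamedTuple, Optional


class Offer(NamedTuple):
    product_name: str
    price: int
    url: str


def search_offers(
    offers: Iterable[Offer],
    product_name: str,
    min_price: Optional[int] = None,
    max_price: Optional[int] = None,
) -> List[Offer]:
    """Return offers matching the name and optional price range, cheapest first."""
    needle = product_name.lower()
    hits = [
        offer
        for offer in offers
        if needle in offer.product_name.lower()
        and (min_price is None or offer.price >= min_price)
        and (max_price is None or offer.price <= max_price)
    ]
    return sorted(hits, key=lambda offer: offer.price)
```

Leaving both price bounds as `None` corresponds to the case where the user enters no amount, so every store selling the product is returned.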
- the “sales method” and “payment method” fields on the user search input page in FIG. 3(A) are pull-down menus from which one item is selected as appropriate.
- sales methods include normal sales methods and auctions.
- payment methods include credit card payments, merchandise exchanges, and transfers.
- Input fields for “sales method” and “payment method” need not be provided. Even if these input fields are provided, the search results can be displayed on the product information display page without any user input in them.
- Each store/price pair on the product information display page is linked to the URL of the store. When the user selects, for example, the lowest-priced pair, the session switches to the Orange House orange keyboard sales page. As a result, the user can obtain detailed information on the orange keyboard on the sales page and, having decided to buy, can complete the purchase procedure on that web page.
- the price string search engine 3 and the keyword search engine 5 are used so that only product sales pages are extracted and used. Even with these search engines, the extracted pages may include pages other than product sales pages. Such a possibility is extremely small, but when it becomes clear that a page is not a product sales page, the index information corresponding to that page may be deleted from the index DB8 as appropriate.
- in the example described above, the robot crawls the web to acquire the source code of pages on the web, and the price string search engine then extracts the pages containing a price string; however, as noted above, the robot itself may have a price string search function so that it acquires only the pages that include a price string.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004204961A JP2006031108A (ja) | 2004-07-12 | 2004-07-12 | ウエブ上の商品・サービスの検索システム |
JP2004-204961 | 2004-07-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006006287A1 (fr) | 2006-01-19 |
Family
ID=35783648
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2005/007163 WO2006006287A1 (fr) | 2004-07-12 | 2005-04-13 | Système de recherche de marchandises/services sur le web |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP2006031108A (fr) |
WO (1) | WO2006006287A1 (fr) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008014702A1 (fr) * | 2006-07-25 | 2008-02-07 | Beijing Sogou Technology Development Co., Ltd. | Procédé et système d'extraction de mots nouveaux |
CN103186618A (zh) * | 2011-12-30 | 2013-07-03 | 北京新媒传信科技有限公司 | 正确数据的获取方法和装置 |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5000999B2 (ja) * | 2006-02-08 | 2012-08-15 | 美恵子 露崎 | 情報更新システム及び情報取得システム |
JP4987434B2 (ja) * | 2006-11-15 | 2012-07-25 | 株式会社日立製作所 | 電文データの監査用保管・検索システム、電文データの監査用保管・検索方法、および電文データの監査用保管・検索プログラム |
JP5077300B2 (ja) * | 2009-06-24 | 2012-11-21 | 富士通株式会社 | ショッピングサイトの価格調査方法及び情報処理装置 |
CN102456057B (zh) * | 2010-11-01 | 2016-08-17 | 阿里巴巴集团控股有限公司 | 基于网上交易平台的检索方法、装置和服务器 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11259498A (ja) * | 1998-03-10 | 1999-09-24 | Fujitsu Ltd | 文書処理装置および記録媒体 |
JP2000172722A (ja) * | 1998-12-01 | 2000-06-23 | Korea Electronics Telecommun | オンライン商店上の製品情報自動索引方法及びシステム |
JP2002133290A (ja) * | 2000-10-20 | 2002-05-10 | Matsushita Electric Works Ltd | 電子商取引を支援する方法、および電子商取引支援システム |
JP2002318814A (ja) * | 2001-04-19 | 2002-10-31 | Tadashi Goino | 商品検索方法、商品検索装置及びプログラム |
-
2004
- 2004-07-12 JP JP2004204961A patent/JP2006031108A/ja active Pending
-
2005
- 2005-04-13 WO PCT/JP2005/007163 patent/WO2006006287A1/fr active Application Filing
Non-Patent Citations (1)
Title |
---|
HONMA Y.: "[Shin Service] Hikaku Kensaku Serivr Kakaku Sokyu no Shin. Shukyaku Tool EC Shien ya Kokyaku Doko Data mo Teikyo", NIKKEI NETBUSINESS, NIPON, NIKKEI BUSINESS PUBLICATIONS INC., no. 64, 15 October 2000 (2000-10-15), pages 110 - 113, XP002996668 * |
Also Published As
Publication number | Publication date |
---|---|
JP2006031108A (ja) | 2006-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7865407B2 (en) | System and method for automating association of retail items to support shopping proposals | |
US6611814B1 (en) | System and method for using virtual wish lists for assisting shopping over computer networks | |
US7698172B2 (en) | Methods for running an on-line shopping mall with updated price notification | |
US8793239B2 (en) | Method and system for form-filling crawl and associating rich keywords | |
US9928525B2 (en) | Method, medium, and system for promoting items based on event information | |
US20030225632A1 (en) | Method and system for providing personalized online shopping service | |
US20030195877A1 (en) | Search query processing to provide category-ranked presentation of search results | |
CN105164710A (zh) | 实体投标 | |
JP5241903B2 (ja) | レビュー文章出力システム、レビュー文章出力方法、プログラム及びコンピュータ可読情報記憶媒体 | |
WO2010056862A1 (fr) | Système et procédé pour réaliser des courses localisées en ligne, et publicité juste-à-temps | |
WO2012103530A2 (fr) | Systèmes et procédés d'appariement en ligne de clients et de détaillants | |
US12174894B2 (en) | Computer implemented system and methods for implementing advertisement placement via internet | |
KR20140073256A (ko) | 상품 추천 서비스 제공 방법 및 장치 | |
WO2017126707A1 (fr) | Système d'aide à l'achat de marchandises | |
WO2006006287A1 (fr) | Système de recherche de marchandises/services sur le web | |
KR101979237B1 (ko) | 쇼핑 정보 제공 방법 및 장치 | |
JP5596101B2 (ja) | 商品検索支援サーバ、商品検索支援方法、商品検索支援プログラム、及びそのプログラムを記憶するコンピュータ読み取り可能な記録媒体 | |
US20110246301A1 (en) | Methods to access product placement data | |
KR20120135790A (ko) | 메타 정보 서버의 운영 방법 | |
Najjar | Designing e-commerce user interfaces | |
JP6110999B1 (ja) | 情報処理装置、情報処理方法及び情報処理プログラム | |
JP2005222154A (ja) | 情報配信システム | |
JP2005070928A (ja) | 商品の代金還元システム | |
KR20020004650A (ko) | 물품 구매정보 제공을 위한 포탈 웹사이트 운영 방법 | |
JP2002269451A (ja) | ネット商品検索システム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AK | Designated states | Kind code of ref document: A1. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
 | AL | Designated countries for regional patents | Kind code of ref document: A1. Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | WWW | Wipo information: withdrawn in national office | Country of ref document: DE |
 | 122 | Ep: pct application non-entry in european phase | |