US20080065609A1 - System and method for providing a searchable library of electronic documents to a user - Google Patents
System and method for providing a searchable library of electronic documents to a user Download PDFInfo
- Publication number
- US20080065609A1 US20080065609A1 US11/873,681 US87368107A US2008065609A1 US 20080065609 A1 US20080065609 A1 US 20080065609A1 US 87368107 A US87368107 A US 87368107A US 2008065609 A1 US2008065609 A1 US 2008065609A1
- Authority
- US
- United States
- Prior art keywords
- user
- search
- books
- book
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 238000006243 chemical reaction Methods 0.000 claims description 8
- 238000012544 monitoring process Methods 0.000 claims 1
- 238000012549 training Methods 0.000 abstract description 2
- 238000004891 communication Methods 0.000 description 18
- 238000012795 verification Methods 0.000 description 14
- 230000004044 response Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 239000000470 constituent Substances 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
Definitions
- the invention relates generally to a system and method for providing a searchable library of electronic documents to a user and more specifically to a system and method for providing a searchable library of electronic books which can be accessed by a user over a global computer network such as the Internet.
- Book publishers publish many books only in hard copy form. In many instances these hard copy books are not available to individuals until the books reach retail stores or are available for purchase through mail order or on-line services. In order for an individual to have access to the contents of the hard copy book, the individual must either go to a retail store and purchase the book or order the book and wait for the book to be delivered. In both instances the book is not immediately available to the individual upon the individual desiring the book. In certain instances this causes a problem for an individual who requires instant information at time of day when retail bookstores are not open for business.
- What is desired then is a system and method for providing a searchable library of electronic documents which is accessible to an individual at any time.
- What is also desired is a system and method for publishing a hard copy document in an electronic form which may be searched by an individual. The present invention permits such functionality.
- the invention relates to a system and method for providing a searchable library of electronic documents to a user.
- the electronic library system includes an electronic document database, a search module, a user database, a user verification module, a web site server, a communication network and at least one user computer.
- the electronic library system is used for providing a searchable library of electronic documents to a user.
- the library of electronic documents is made accessible to a user upon the user obtaining a subscription to the electronic document library service.
- the user pays a monthly fee in order to access documents stored in the electronic document library.
- the electronic documents may be books, magazines, or any other document which is in electronic form.
- the electronic books are technical and computer books.
- the electronic documents may be accessed by a user computer by accessing the electronic document library's web site through a communication network.
- the communication network is a global computer network such as the Internet.
- the invention also relates to a method for searching the electronic document database.
- the user enters a key word query to search for documents in the electronic document database.
- the search module searches the document database in response to the query entered by the user.
- the search module retrieves all the documents which satisfy the requirements of the key word query.
- the search module retrieves the documents which are most relevant to the user's request.
- the search module expands the search parameters in order to locate relevant documents.
- the invention further relates to a system and method for publishing a hard copy document in an electronic form which may be searched by a user.
- the electronic library system includes electronic document conversion computers which convert documents from hard copy form into electronic form.
- the electronic document conversion computers convert documents which are in an electronic form used by publishers to print hard copy documents into a form for electronic publication.
- the electronic form of documents used by publishers may be files from layout programs such as PageMaker, QuarkExpress, FrameMaker, or any other publishing program.
- the present invention has the advantage of providing a searchable library of electronic documents which is accessible to a user at any time of the day.
- the invention has the further advantage of providing a library containing a quantity of documents which the user may not otherwise have access to and which is searchable by a user through key word searches. The invention therefore allows a user to have access to information at anytime that information is required and provides the user with a convenient and time efficient method for searching for the documents which are the most relevant to the user's needs.
- FIG. 1 is a schematic block diagram of an embodiment of an electronic document library system according to the invention.
- FIG. 2 is a flowchart illustrating the steps performed in an embodiment of the invention for a user to gain access to the documents stored in the electronic document library;
- FIG. 3 is a diagram of the options available to a user through an embodiment of a main menu of the electronic document library system of the invention
- FIG. 4 is a pictorial view of an embodiment of a screen display which is presented to a user upon accessing the web site of the electronic document library;
- FIG. 5 is a flowchart illustrating the steps performed by a user to gain access to the documents in the electronic document library
- FIG. 6 is a pictorial view of an embodiment of a login screen display
- FIG. 7 is a pictorial view of an embodiment of a registration screen display
- FIG. 8 is a pictorial view of an embodiment of a search screen display
- FIG. 9 is a pictorial view of an embodiment of a sub-topic list screen display
- FIG. 10 is a pictorial view of an embodiment of a search results screen display
- FIG. 11A is a pictorial view of another embodiment of a search results screen display
- FIGS. 11B and 11C are a pictorial view of an embodiment of a screen displaying the contents of an electronic document
- FIG. 12 is a pictorial view of another embodiment of a screen displaying the contents of an electronic document
- FIG. 13 is a pictorial view of an embodiment of menu presented to a user to navigate the contents of an electronic document
- FIG. 14 is a flowchart illustrating the steps performed by the search module to return the most relevant documents in response to a user's query
- FIG. 15 is a block diagram illustrating the hierarchy of elements contained in typical hard copy books
- FIG. 16 is a flowchart illustrating the steps performed by the search module to improve the relevancy of the search results returned to a user
- FIG. 17 is a pictorial view of an embodiment a virtual bookshelf
- FIG. 18 is a pictorial view of an embodiment of a screen display presented to a user to create a virtual bookmark
- FIG. 19 is a pictorial view of a screen display showing the manifest created by an embodiment of the tool for converting a publisher's file to an electronic document file;
- FIG. 20 is pictorial view of a screen display showing the list of tables created by an embodiment of the tool for converting a publisher's file to an electronic document file.
- an embodiment of an electronic document library system 10 includes an electronic document database 12 , a search module 14 , a user database 16 , a user verification module 18 , a web site server 20 , a communication network 22 and at least one user computer 24 .
- the electronic document library system 10 is used for providing a searchable library of electronic documents to a user.
- the library of electronic documents 26 is made accessible to a user upon the user obtaining a subscription to the electronic document library service.
- the user pays a monthly fee in order to access documents stored in the electronic document library 26 .
- the user pays a yearly fee in order to gain access.
- the user pays a fee based on another predetermined period of time.
- the documents available in the document database 12 are documents which are commercially available from publishers.
- the electronic document library provider obtains the right to electronically publish the documents from the publisher.
- the electronic document library provider provides the publisher with compensation in return for the right to electronically publish the document.
- the compensation is a flat fee.
- the compensation is a royalty-based fee.
- the electronic document library provider may pay each publisher a percentage of the subscription fees obtained from users. The fee may be based on the number of documents the publisher provides.
- the fee paid to the publisher is based on the number of users who access the publisher's documents.
- the fee paid to the publisher is based on the number of times the publisher's documents are accessed.
- the electronic document library provider negotiates with each publisher the percentage of the electronic document library provider's revenue reserved for royalties.
- the electronic document library provider's royalty pool size is then a blended average of the negotiated rates with each publisher.
- the publisher competes based on content usage (i.e., retrievals) for a piece of their view of the royalty pool.
- the usage calculation is weighted: page retrievals, placing a book on a bookshelf, and the list price of the book all factor into usage points for a publisher.
- the electronic document library provider then divides up the royalty pool as seen by each publisher by getting a percentage of their usage points versus total usage points for all publishers.
- the royalty payments are made quarterly.
- the document database 12 provides a central location for storing the electronic documents that may be accessed by a user.
- the electronic documents may be books, magazines, or any other document which is in electronic form.
- the electronic books are technical and computer books.
- the electronic documents may be accessed by a user computer 24 as described in more detail below.
- the document database 12 is maintained by a third party at the electronic document library 26 .
- the document database 12 may be a single database or may be a plurality of databases which are in communication with each other.
- the document database 12 is in communication with the search module 14 .
- the search module 14 searches the document database 12 in response to queries entered by a user computer 24 .
- the search module 14 retrieves the documents which are most relevant to the user's request. An embodiment of a process for retrieving the most relevant documents will be described in more detail below in the description of FIGS. 14-16 .
- the search module 14 is in communication with the web site server 20 .
- the web site server 20 generates an interactive screen display for interacting with a user computer 24 .
- the web site server 20 is connected to the communication network 22 .
- the communication network 22 is a global computer network, such as the Internet.
- the communication network 22 is a telephone network.
- the communication network 22 is an electronic network dedicated to the communication of information between the web site server 20 and the user computer 24 .
- the communication network 22 may be any system for communicating data between two computers.
- the user computer 24 may be any computer which is capable of being connected to the communication network 22 .
- the user computer 24 may be the user's own personal computer or any other computer or terminal which the user may access to connect to the communication network 22 .
- the user computer 24 may be a computer which the user has access to at the user's workplace.
- the user computer 24 is not a single computer, but changes as the user accesses different computers.
- the user database 18 provides a central location for storing user information.
- the user database 18 stores a user name and a password for each user authorized to access documents stored in the document database 12 .
- the user database 18 stores personal information regarding authorized users, such as name, address, telephone number, e-mail address, occupation, etc.
- the user database 18 stores billing information for each user. For example, the user database 18 may store information indicating that the user's personal credit card account should be charged to keep the user's subscription current. Alternatively, the user database 18 may store information indicating that the user's employer should be charged to keep the user's subscription current.
- the user database 18 is in communication with a user verification module 16 which receives user information from the user computer 24 through the communication network 22 and the web site server 20 .
- the user verification module 16 compares the user identification information received from the user computer 24 to the entries in the user database 18 and determines whether a user should be granted access to the document database 12 .
- a user is allowed to search the document database 12 without being authorized by the user verification module 16 .
- the user is verified by the user verification module 16 upon requesting to view the contents of an electronic document.
- the user is verified by the user verification module 16 before being able to search the document database 12 .
- the user is able to view only certain documents in the document database 12 without being verified by the user verification module 16 .
- the electronic library system 10 includes a registration module 28 .
- the registration module 28 enables users who are not currently included in the user database 18 to register for the service and to be added to the user database 18 .
- the user contacts the third party maintaining the electronic library system 10 in order to register for the service and be added to the user database 18 . The registration process will be described in more detail below in the discussion of FIG. 7 .
- the electronic library system 10 includes electronic document conversion computers 30 which convert documents from hard copy form into electronic form.
- the electronic document conversion computers 30 convert documents which are in an electronic form used by publishers to print hard copy documents into a form for electronic publication.
- the electronic form of documents used by publishers may be files from layout programs such as PageMaker, QuarkExpress, FrameMaker, or any other publishing program. The process for converting books into a standard electronic form will be described in more detail below.
- a flowchart illustrates the series of steps performed for a user to gain access to the document database 12 .
- a user has access to a computer 24 , such as a personal computer.
- the user connects the computer 24 to the communication network 22 .
- the user connects the computer 24 to a global computer network, such as the Internet, through a service provider.
- the user selects the web site address of the electronic document library 26 .
- the web site server 20 transmits the screen display for the home page to the user computer 24 for display on the monitor of the user computer 24 .
- the home page displays the main menu 37 from which the user may select.
- the main menu 37 provides the user with a central page from which all the major functions of the system 10 can be reached.
- the main menu screen presents the user with the different functions which may be selected by pointing and clicking on an appropriate button or icon.
- FIG. 3 illustrates the options from which a user may select upon entering the main menu page.
- the options include: register 38 , new documents 39 , most popular documents 40 , login 41 , search documents 42 , personal bookshelf 43 , help 44 and feedback 45 . Each of these options will be described in detail below.
- FIG. 4 presents one embodiment of a screen display 50 which may be presented to a user as a home page or main menu page. Referring back to FIG. 2 , in step 46 , the user reviews the options and selects the option of interest. In response to the users selection, the web site server 20 determines which option the user selected in step 48 and selects the appropriate page to present to the user.
- FIG. 5 is one embodiment of a login page screen display 58 .
- the login page screen display 58 queries the user for a user name and a password.
- the login page screen display 58 includes a user name field 60 and a password field 62 .
- the user selects the “Login” button 64 or hits a “Return/Enter” key.
- the user computer 24 transmits the user identification information to the web site server 20 in step 66 .
- the web site server 20 receives the user identification information entered into the login screen 58 , in step 68 , the web site server 20 transfers the user identification information to the user verification module 16 .
- the user verification module 16 searches the user database 18 to determine if the user name is contained in the user database 18 and to determine if the entered password is valid.
- the user verification module 16 grants the user access to the full contents of the documents in the electronic document database 12 in step 70 . In one embodiment, if the user name is valid, but the associated password is invalid, the user verification module 16 sends an error message to the user computer 24 through the web site server 20 and requests the user to login again. If the user name is not valid, the user verification module 16 denies the user access to the electronic document database 12 in step 72 . In another embodiment, the user is granted access to search the document database 12 , but may not view the full contents of the electronic documents. In another embodiment, if the user is denied access in step 72 , the user is given an opportunity to obtain a subscription to the electronic document library service by registering with the electronic document library 26 .
- the user may set the user's preferences to automatic login in order to avoid entering a user name and password each time the user accesses the web site server 20 .
- the user verification module 16 still verifies the user name and password, each time the user logs into the web site server 20 in order to verify that the user's subscription is still valid.
- FIG. 7 shows one embodiment of a registration screen 74 .
- the registration screen 74 allows a user to request a subscription to the electronic document library 26 .
- the registration screen 74 queries the user for user identification information, company information, telephone number information, e-mail address information and number of subscriptions requested.
- the registration screen 74 also queries the user for billing information, such as a credit card number.
- the registration screen 74 includes user name fields 76 , 78 , a job title field 80 , an e-mail address field 82 , a telephone number field 84 , a country field 86 , a company name field 88 , a department name field 90 , a department size field 92 and a number of seats requested field 94 .
- the number of seats requested field enables a user to purchase multiple subscriptions at the same time. For example, a company may wish to purchase subscriptions for several of its employees.
- the country field 86 is a drop down menu from which a user may select a country.
- the registration screen 74 also includes fields for the user's address and billing information, such as a credit card number and expiration date.
- the registration screen 74 includes a field for the user to enter a preferred password. Once the user has entered all of the required information into the registration screen 74 , the user selects the OK button 96 or hits the “Return/Enter” key.
- the user computer 24 transmits the information to the web site server 20 .
- the web site server 20 transfers the information to the registration module 28 which determines if the user should be granted a subscription. If the user is granted a subscription, the registration module 28 adds a user name and a password to the user database 18 .
- the registration module 28 uses the user name entered by the user in the user name fields 76 , 78 .
- the registration module 28 selects a user name for the user.
- the registration module 28 uses the password entered by the user as a preferred password.
- the registration module 28 selects a password to be associated with the user name.
- the registration module 28 contacts the user and informs the user of the user name and password to use when logging into the electronic document library 26 .
- the registration module 28 sends an e-mail message to the user.
- the e-mail message contains the user name and password to be entered by the user when logging into the electronic document library 26 .
- the user is given the opportunity to change the user name and password.
- the web site server 20 proceeds to step 98 and displays information regarding documents recently added to the document database 12 . If the user selects the most popular documents option 40 , the web site server 20 displays information regarding documents which have been accessed by the greatest number of users or which have been accessed the greatest number of times during a predetermined period. In one embodiment, the predetermined period is the previous week.
- FIG. 8 shows one embodiment of a find books page screen 102 .
- the find books screen 102 presents the user with three options for searching the document database 12 for electronic documents.
- the user is able to browse a topic list 104 .
- the topic list 104 of FIG. 8 identifies topics which may be of interest to information technology and computer specialists. Other embodiments of the topic list 104 may contain topics of interest to individuals in other fields.
- the user is presented with a list of sub-topics.
- the user For example, if a user selects the “Databases” topic 106 , the user is presented with the sub-topic list 108 shown in FIG. 9 . The user then has the option to return to the main topic list 110 , select one of the sub-topics 108 or go to related topics 112 .
- the search module 14 identifies the documents categorized under the selected sub-topic and the user is presented with a list of documents which are categorized under the selected sub-topic. For example, if the user selects the “Access” sub-topic 114 , the user is presented with a list of documents in the form of the screen display 116 shown in FIG. 10 . The user may then select one of the documents or return to the sub-topic list 108 or main topic list 104 .
- the user is able to search for a particular document by supplying information regarding the particular document.
- the user may search for a document by entering words in the document's title, the author's name, the publisher or the ISBN.
- the user enters the document information into the lookup field 118 .
- the user selects the “Lookup” button 120 or hits the “Return/Enter” key.
- the search module 14 searches the document database 12 for documents which correspond to the user's query and returns a list of the relevant documents.
- the user is able to search for an electronic document in the document database 12 by entering key words which might occur in the text of the document.
- the user enters the key words into the key word field 122 .
- the user selects the “Search” button 124 or hits the “Return/Enter” key.
- the search module 14 searches the document database 12 for documents which contain the key words.
- the user is able to instruct the search module 14 to search for documents in the entire document database 12 , in a particular category from the main topic list 104 or sub-topic list 108 , in a results list from a previous search or in books present in the user's personal bookshelf.
- the user has entered the key words “java jar files” in the key word field 122 .
- the user has also selected to search all documents.
- the search module 14 searched the document database 12 and found 21 documents.
- the search module 14 presents a list of the found documents to the user.
- the search module 14 searches the document database 12 for the documents which contain information most relevant to the user's query. An embodiment of a method for determining which documents are most relevant will be described in detail below in the discussion of FIGS. 14-16 .
- the search module presents the results of the search in order of relevance.
- the search module 14 indicates the relevance of documents as “High”, “Medium”, or “Low”.
- the search module 14 indicates the sections of the document which are most relevant. In yet another embodiment, the search module 14 presents the most relevant documents and links to the most relevant sections within these documents. In the embodiment shown in FIG. 11A , the search module 14 indicates the relevancy of different sections by using a thermometer type gauge 126 . A thermometer gauge which the most filled has the highest relevancy. For example, in FIG. 11A , the section “Minimizing Applet Loading Times” 128 has a higher relevancy than the section “Using the Java Development Kit (JDK)” 130 . In other embodiments, other types of level indicators are used to indicate relevancy.
- JDK Java Development Kit
- the user may select to view an electronic document. For example, the user may select to view the book “Mastering Java 2” by clicking on the underlined title.
- the web site server 20 displays the book to the user.
- the web site server 20 displays the entire content of the book to the user, including text, graphics, tables, code listings, etc.
- the web site server 20 displays the contents of the book just as the user would see the contents if viewing the hard copy version.
- the web site server 20 displays a synopsis of the book 132 , the table of contents of the book 134 and the information on the back cover of the book 136 .
- the web site server 20 also provides links to colleague comments about the book and public comments about the book.
- the web site server 20 also displays the thermometer gauges next to the most relevant sections of the book.
- the user may also select a link that leads directly to the relevant section of a document.
- a link For example, the user may select the link “Minimizing Applet Loading Times” 128 to go directly to that section of the document.
- FIG. 12 shows an example of a screen 137 which displays the content of the document section selected by the user.
- the web site server highlights the key words 138 entered by the user.
- FIG. 13 shows an embodiment of a menu 140 presented to the user along with the contents of the selected document.
- the menu 140 identifies the portion 142 of the document the user is viewing and presents the user with links to other portions of the document.
- FIG. 15 illustrates a possible hierarchy for breaking down a book 146 into its constituent parts.
- each part 147 being composed of chapters 148
- each chapter 148 being composed of sections 150
- each section 150 being composed of sub-sections 152 .
- the sub-sections may contain text, tables, graphics, code listings, etc.
- Other documents may be composed of other elements.
- FIG. 14 shows a flowchart illustrating the steps performed by the search module 14 to locate the most relevant documents in the document database 12 in response to the key words entered by the user in the key word field 122 .
- the search module 14 receives the search terms from the user.
- the search module 14 searches the lowest level elements of the documents. For example, if the document is a book 146 , the search module 14 searches the subsections 152 for the key words entered by the user.
- the search module ranks the relevancy of the lowest level elements located.
- the search module may use any of the relevancy ranking algorithms known in the art to rank the relevancy of the lowest level elements located. All of the objects within the element are included in the calculation.
- the search module 14 calculates the number of times a search term appears in the element. After the relevancy of the lowest level elements has been ranked, in step 158 the search module 14 sums the relevancy rankings of the lowest level elements to determine the ranking for the next higher element. For the book example shown in FIG. 15 , the search module would rank the relevancy rankings for Part 1 , and then sum the relevancy rankings for the parts 147 to determine a relevancy ranking subsections 152 to determine a relevancy ranking for Section 1 . Next, in step 160 , the search module 14 determines if the relevancy ranking for the highest level element has been calculated. For the book example of FIG.
- the search module 14 determines if the relevancy ranking has been calculated for the book 146 . If the relevancy ranking for the highest level element has not been calculated, the search module 14 returns to step 158 and repeats steps 158 and 160 until the relevancy ranking for the highest level element is calculated. Referring again to the book example of FIG. 15 , the search module 14 would sum the relevancy rankings for sections 150 to determine a relevancy ranking for Chapter 1 , and then sum the relevancy rankings for the chapters 148 to determine a relevancy ranking for the book 146 . Once the relevancy ranking has been determined for the highest most elements, the search module 14 returns a list of the documents with the highest ratings to the user in step 162 .
- the search module 14 assigns a relevancy ranking based on the number of hits and the quality of the hits (i.e. exact words in key word search). In another embodiment, the search module 14 assigns a relevancy ranking to a document according to how many times the document has been accessed. In another embodiment, the search module 14 assigns a relevancy ranking according to whether the user performing the search has accessed the document previously or has stored the document on the user's personal bookshelf.
- the search module 14 evaluates the search results before returning the results to the user. In one such embodiment, before returning the results to the user in step 162 , the search module 14 performs the steps illustrated in the flowchart of FIG. 16 . In step 164 , the search module 14 evaluates the search results. In certain embodiments, to evaluate the search results, the search module 14 determines whether the exact key words entered by the user appeared in the located documents, the number of times the key words appeared in the documents, whether the key words appeared in the titles of the different elements of the document or simply within the text, and the number of elements within a document the key words appeared. Based on this evaluation, in step 166 the search module 14 suppresses low ranking results. In one embodiment, the search module 14 suppresses low ranking documents.
- the search module 14 suppresses low ranking elements within a document.
- the search module 14 determines whether to expand the search parameters. The search module 14 may determine to expand the search parameters if none of the documents located had a high relevancy ranking or if the number of documents located was low.
- search module 14 determines the search results are sufficient and the search parameters do not need to be expanded, the search module 14 returns to step 154 of FIG. 14 and returns the results to the user. If the search module 14 determines that the search results are not sufficient and the search parameters should be expanded, the search module 14 proceeds to step 170 and employs fallback strategies to expand the search results.
- the search module 14 In determining whether to implement fallback strategies, the search module 14 must balance precision and recall. Precision refers to the accuracy of the documents located. If specific key words are important, to be precise the search results must contain those key words. Recall refers to the scope of the documents located. For example, by searching for only specific key words, a user may inadvertently omit documents which are relevant. Often users are not able to formulate a search which will return the most relevant documents. A user may not think to enter certain key words. The search module 14 balances precision and recall.
- the search module 14 begins by searching for highly precise results, e.g., results containing the exact key words in the exact manner entered by the user. If adequate results are not returned, the search module 14 works toward high recall.
- the search module 14 simulates the type of searching performed by information librarians.
- the strategies for expanding the search results may include stemming the key words entered by the user, implementing “near” operators, and implementing “and” and “or” operators. If the user has entered more than one key word, the search module 14 implements “near” operators by searching for the key words within a certain distance within from each other rather than searching for a phrase containing the key words directly next to each other.
- the search module 14 looks for both key words being in the document rather than being directly next to each other.
- the search module 14 searches for documents containing at least one of the key words entered by the user.
- the search module 14 employs different strategies for expanding the search and employs the strategies in different orders. The purpose of expanding the search is to provide the user with the optimum results.
- the search module 14 determines a strategy for expanding the search parameters, the search module 14 returns to step 154 of FIG. 14 and repeats steps 154 - 168 until the search module 14 determines not to expand the search parameters.
- the ranking assigned to a document element is dependent upon the search performed. For example, a document containing an exact phrase entered by a user receives a higher relevancy ranking than a document containing each word of the phrase in separate parts of the document.
- the search module 14 displays the actual search conducted to locate the search results. For example, if the user entered the key words “java jar files” and the search module 14 returned documents containing the words “java” or “jar” or “file”, the search module would inform the user that the search conducted was: java or jar or file.
- a user may create a personal virtual “bookshelf” of documents.
- the bookshelf contains documents that the user accesses frequently or desires to be able to access without searching the document database 12 .
- a user upon accessing an electronic document, a user is presented with the option of adding the document to the user's personal bookshelf through an “Add to Bookshelf” option 172 .
- the document is added to the user's personal bookshelf.
- FIG. 12 when viewing a portion of a document, the user is presented with an option of creating a virtual bookmark to that portion.
- the web site server 20 displays a list of the documents in the user's personal bookshelf and a list of the bookmarks created by the user (step 176 in FIG. 2 ).
- FIG. 17 shows an example of a screen displaying a document 180 and a bookmark 182 stored on the user's bookshelf.
- the create bookmark option 174 is selected by the user, in addition to simply adding a bookmark to the user's bookshelf, the web site server 20 presents the user with a several options for personalizing the bookmark.
- FIG. 18 shows one embodiment of a screen display 184 for allowing a user to personalize a bookmark.
- the screen 184 includes a bookmark note field 186 into which the user may enter a comment about the document.
- the screen 184 also allows the user to keep the bookmark private or share the bookmark with others.
- the screen 186 allows a user to select an e-mail option 188 . If the user selects the e-mail option 188 , an e-mail notification is sent to the individuals identified by the user informing the individuals of the bookmark.
- the electronic document library system 10 includes electronic document conversion computers 30 for creating digital versions of existing paper-based books which are aggregated, categorized and added to the document database 12 .
- Many hard copy book publishers used different desktop publishing software programs to produce books. Examples of different programs include PageMaker, QuarkExpress, FrameMaker and others. These publishing programs enable books to be published as hard copy books by printing presses. These files are not in a form capable of electronically publishing the documents.
- the electronic document conversion computers 30 execute an algorithm which converts files from different desktop publishing programs into a standard electronic format.
- Attachment 1 is a user guide which describes the process for creating an electronic book from a file obtained from a publisher.
- the user guide describes the steps a user, such as a person who is converting an electronic book file from a publisher's press-ready or pre-publication digital data format to a searchable electronic library format of the present invention, performs, using an embodiment of a document conversion tool to create an electronic book.
- FIG. 19 shows a screen display illustrating the manifest created by an embodiment of the tool for converting a publisher's file to an electronic document file.
- the manifest lists all of the files composing the document.
- FIG. 20 shows a screen display illustrating the list of tables included in a book created by an embodiment of the tool for converting a publisher's file to an electronic document file.
- the present invention provides a method and system for publishing a plurality of books for user access to information.
- the system includes selecting a plurality of books, converting each book from a layout or publication digital data form, e.g., by training a tool to detect characteristic features (such as layout, typeface, and hierarchical or organizational features such as chapter headings, captions, drawings and tables), and extracting text or data information of the book tagged with the features.
- characteristic features such as layout, typeface, and hierarchical or organizational features such as chapter headings, captions, drawings and tables
- This produces a searchable publishing database, e.g., arranged as an xml database indexed by book structure such that a user may remotely, e.g., over the internet or other network, access the database, search desired text data, and view an image of a portion of the book with the desired data.
- the system includes a user registration module to identify an authorized user, and may maintain a personal bookshelf for the user.
- a search engine may rank search results based on their position in the hierarchy or other factors, determining degree of relevance of text or data information located by the search engine. The other factors may include position of located search data in the hierarchy, identification of search data in the user's personal library, identification of search data in a prior search by the user or degree of match of data identified in the search.
- the search engine may be a commercially available search engine together with an interface that operates, when provided a search query by a user, to search for an exact match and to score hits for relevance, and in the event an exact match is not found, operate to expand the query and return scored search results located by the search engine together with an indication of the expanded search such that the user may ascertain a degree of likely relevance of returned text or data information.
- the relational database may include hyperlinks to related section heading data, such that a user accessing a page of a book may immediately view related data and context of a page.
- the relational database is indexed by logical subunits of the book, so searches for boolean combinations or proximity of elements span page breaks of book text to identify all instances of the desired search data.
- the search engine may employ a graded or adaptive search strategy, and may expand a search if all hits have low ranking, and suppress hits of low ranking when the search produces hits of high ranking.
- the search engine may be adapted to inspect the content of tables, formulae and drawings of the converted book files to further enhance utility of the system as a technical book library. This may be done, for example, by parsing data structures in appearing in a programming language, such as C that define the tables, formulae or drawings.
- Attachment 2 is an example of a chapter of a book that has been converted to electronic form using the program described in the user guide of Attachment 1. Attachment 2 shows examples of the different tags applied by the program to identify different elements of the book.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
- This application is related to and claims the priority under 35 U.S.C. 119(e) of U.S. Provisional Patent Application Ser. No. 60/170,038 filed Dec. 10, 1999. That application is hereby incorporated herein by reference in its entirety.
- The invention relates generally to a system and method for providing a searchable library of electronic documents to a user and more specifically to a system and method for providing a searchable library of electronic books which can be accessed by a user over a global computer network such as the Internet.
- Book publishers publish many books only in hard copy form. In many instances these hard copy books are not available to individuals until the books reach retail stores or are available for purchase through mail order or on-line services. In order for an individual to have access to the contents of the hard copy book, the individual must either go to a retail store and purchase the book or order the book and wait for the book to be delivered. In both instances the book is not immediately available to the individual upon the individual desiring the book. In certain instances this causes a problem for an individual who requires instant information at time of day when retail bookstores are not open for business.
- In addition, in order for an individual to determine if a book in hard copy form contains desired information, the individual must read the back cover, a summary of the book or scan the table of contents. This is a time consuming process and the information desired by the individual may still be located in the book, even if it is not identified in the summary or in the table of contents. There is no automatic mechanism for searching for key words in the text of a hard copy book. Further, many books published on the same topic may contain different information which is valuable to the individual. In many instances it is prohibitively expensive for an individual to purchase all the relevant books, and new books are constantly being published.
- What is desired then is a system and method for providing a searchable library of electronic documents which is accessible to an individual at any time. What is also desired is a system and method for publishing a hard copy document in an electronic form which may be searched by an individual. The present invention permits such functionality.
- The invention relates to a system and method for providing a searchable library of electronic documents to a user. The electronic library system includes an electronic document database, a search module, a user database, a user verification module, a web site server, a communication network and at least one user computer. The electronic library system is used for providing a searchable library of electronic documents to a user. The library of electronic documents is made accessible to a user upon the user obtaining a subscription to the electronic document library service. In one embodiment, the user pays a monthly fee in order to access documents stored in the electronic document library. The electronic documents may be books, magazines, or any other document which is in electronic form. In one embodiment, the electronic books are technical and computer books. The electronic documents may be accessed by a user computer by accessing the electronic document library's web site through a communication network. In one embodiment the communication network is a global computer network such as the Internet.
- The invention also relates to a method for searching the electronic document database. In one embodiment, the user enters a key word query to search for documents in the electronic document database. The search module searches the document database in response to the query entered by the user. In one embodiment, the search module retrieves all the documents which satisfy the requirements of the key word query. In another embodiment, the search module retrieves the documents which are most relevant to the user's request. In yet another embodiment, the search module expands the search parameters in order to locate relevant documents.
- The invention further relates to a system and method for publishing a hard copy document in an electronic form which may be searched by a user. In one embodiment, the electronic library system includes electronic document conversion computers which convert documents from hard copy form into electronic form. In one such embodiment, the electronic document conversion computers convert documents which are in an electronic form used by publishers to print hard copy documents into a form for electronic publication. The electronic form of documents used by publishers may be files from layout programs such as PageMaker, QuarkExpress, FrameMaker, or any other publishing program.
- The present invention has the advantage of providing a searchable library of electronic documents which is accessible to a user at any time of the day. The invention has the further advantage of providing a library containing a quantity of documents which the user may not otherwise have access to and which is searchable by a user through key word searches. The invention therefore allows a user to have access to information at anytime that information is required and provides the user with a convenient and time efficient method for searching for the documents which are the most relevant to the user's needs.
- The invention is pointed out with particularity in the appended claims. The above and further advantages of this invention may be better understood by referring to the following description taken in conjunction with the accompanying drawings, in which:
-
FIG. 1 is a schematic block diagram of an embodiment of an electronic document library system according to the invention; -
FIG. 2 is a flowchart illustrating the steps performed in an embodiment of the invention for a user to gain access to the documents stored in the electronic document library; -
FIG. 3 is a diagram of the options available to a user through an embodiment of a main menu of the electronic document library system of the invention; -
FIG. 4 is a pictorial view of an embodiment of a screen display which is presented to a user upon accessing the web site of the electronic document library; -
FIG. 5 is a flowchart illustrating the steps performed by a user to gain access to the documents in the electronic document library; -
FIG. 6 is a pictorial view of an embodiment of a login screen display; -
FIG. 7 is a pictorial view of an embodiment of a registration screen display; -
FIG. 8 is a pictorial view of an embodiment of a search screen display; -
FIG. 9 is a pictorial view of an embodiment of a sub-topic list screen display; -
FIG. 10 is a pictorial view of an embodiment of a search results screen display; -
FIG. 11A is a pictorial view of another embodiment of a search results screen display; -
FIGS. 11B and 11C are a pictorial view of an embodiment of a screen displaying the contents of an electronic document; -
FIG. 12 is a pictorial view of another embodiment of a screen displaying the contents of an electronic document; -
FIG. 13 is a pictorial view of an embodiment of menu presented to a user to navigate the contents of an electronic document; -
FIG. 14 is a flowchart illustrating the steps performed by the search module to return the most relevant documents in response to a user's query; -
FIG. 15 is a block diagram illustrating the hierarchy of elements contained in typical hard copy books; -
FIG. 16 is a flowchart illustrating the steps performed by the search module to improve the relevancy of the search results returned to a user; -
FIG. 17 is a pictorial view of an embodiment a virtual bookshelf; -
FIG. 18 is a pictorial view of an embodiment of a screen display presented to a user to create a virtual bookmark; -
FIG. 19 is a pictorial view of a screen display showing the manifest created by an embodiment of the tool for converting a publisher's file to an electronic document file; and -
FIG. 20 is pictorial view of a screen display showing the list of tables created by an embodiment of the tool for converting a publisher's file to an electronic document file. - Like reference characters in the respective drawn figures indicate corresponding parts.
- In broad overview, and referring to
FIG. 1 , an embodiment of an electronicdocument library system 10 according to the present invention includes anelectronic document database 12, asearch module 14, auser database 16, auser verification module 18, aweb site server 20, acommunication network 22 and at least oneuser computer 24. The electronicdocument library system 10 is used for providing a searchable library of electronic documents to a user. The library ofelectronic documents 26 is made accessible to a user upon the user obtaining a subscription to the electronic document library service. In one embodiment, the user pays a monthly fee in order to access documents stored in theelectronic document library 26. In another embodiment, the user pays a yearly fee in order to gain access. In yet other embodiments, the user pays a fee based on another predetermined period of time. - In one embodiment, the documents available in the
document database 12 are documents which are commercially available from publishers. In one such embodiment, the electronic document library provider obtains the right to electronically publish the documents from the publisher. The electronic document library provider provides the publisher with compensation in return for the right to electronically publish the document. In one embodiment the compensation is a flat fee. In another embodiment, the compensation is a royalty-based fee. For example, the electronic document library provider may pay each publisher a percentage of the subscription fees obtained from users. The fee may be based on the number of documents the publisher provides. In another embodiment the fee paid to the publisher is based on the number of users who access the publisher's documents. In yet another embodiment, the fee paid to the publisher is based on the number of times the publisher's documents are accessed. - In still another embodiment, the electronic document library provider negotiates with each publisher the percentage of the electronic document library provider's revenue reserved for royalties. The electronic document library provider's royalty pool size is then a blended average of the negotiated rates with each publisher. The publisher then competes based on content usage (i.e., retrievals) for a piece of their view of the royalty pool. The usage calculation is weighted: page retrievals, placing a book on a bookshelf, and the list price of the book all factor into usage points for a publisher. The electronic document library provider then divides up the royalty pool as seen by each publisher by getting a percentage of their usage points versus total usage points for all publishers. The royalty payments are made quarterly.
- The
document database 12 provides a central location for storing the electronic documents that may be accessed by a user. The electronic documents may be books, magazines, or any other document which is in electronic form. In one embodiment, the electronic books are technical and computer books. The electronic documents may be accessed by auser computer 24 as described in more detail below. In one embodiment, thedocument database 12 is maintained by a third party at theelectronic document library 26. Thedocument database 12 may be a single database or may be a plurality of databases which are in communication with each other. - The
document database 12 is in communication with thesearch module 14. Thesearch module 14 searches thedocument database 12 in response to queries entered by auser computer 24. In one embodiment thesearch module 14 retrieves the documents which are most relevant to the user's request. An embodiment of a process for retrieving the most relevant documents will be described in more detail below in the description ofFIGS. 14-16 . Thesearch module 14 is in communication with theweb site server 20. Theweb site server 20 generates an interactive screen display for interacting with auser computer 24. Theweb site server 20 is connected to thecommunication network 22. In one embodiment, thecommunication network 22 is a global computer network, such as the Internet. In another embodiment, thecommunication network 22 is a telephone network. In yet another embodiment, thecommunication network 22 is an electronic network dedicated to the communication of information between theweb site server 20 and theuser computer 24. Thecommunication network 22 may be any system for communicating data between two computers. - The
user computer 24 may be any computer which is capable of being connected to thecommunication network 22. Theuser computer 24 may be the user's own personal computer or any other computer or terminal which the user may access to connect to thecommunication network 22. For example, theuser computer 24 may be a computer which the user has access to at the user's workplace. In one embodiment, theuser computer 24 is not a single computer, but changes as the user accesses different computers. - The
user database 18 provides a central location for storing user information. In one embodiment, theuser database 18 stores a user name and a password for each user authorized to access documents stored in thedocument database 12. In another embodiment, theuser database 18 stores personal information regarding authorized users, such as name, address, telephone number, e-mail address, occupation, etc. In another embodiment, theuser database 18 stores billing information for each user. For example, theuser database 18 may store information indicating that the user's personal credit card account should be charged to keep the user's subscription current. Alternatively, theuser database 18 may store information indicating that the user's employer should be charged to keep the user's subscription current. Theuser database 18 is in communication with auser verification module 16 which receives user information from theuser computer 24 through thecommunication network 22 and theweb site server 20. Theuser verification module 16 compares the user identification information received from theuser computer 24 to the entries in theuser database 18 and determines whether a user should be granted access to thedocument database 12. - In one embodiment, a user is allowed to search the
document database 12 without being authorized by theuser verification module 16. In this embodiment, the user is verified by theuser verification module 16 upon requesting to view the contents of an electronic document. In another embodiment, the user is verified by theuser verification module 16 before being able to search thedocument database 12. In yet another embodiment, the user is able to view only certain documents in thedocument database 12 without being verified by theuser verification module 16. - In one embodiment, the
electronic library system 10 includes aregistration module 28. Theregistration module 28 enables users who are not currently included in theuser database 18 to register for the service and to be added to theuser database 18. In another embodiment, the user contacts the third party maintaining theelectronic library system 10 in order to register for the service and be added to theuser database 18. The registration process will be described in more detail below in the discussion ofFIG. 7 . - In another embodiment, the
electronic library system 10 includes electronicdocument conversion computers 30 which convert documents from hard copy form into electronic form. In one such embodiment, the electronicdocument conversion computers 30 convert documents which are in an electronic form used by publishers to print hard copy documents into a form for electronic publication. The electronic form of documents used by publishers may be files from layout programs such as PageMaker, QuarkExpress, FrameMaker, or any other publishing program. The process for converting books into a standard electronic form will be described in more detail below. - Referring to
FIG. 2 , a flowchart illustrates the series of steps performed for a user to gain access to thedocument database 12. Atstep 32, a user has access to acomputer 24, such as a personal computer. The user connects thecomputer 24 to thecommunication network 22. In one embodiment, the user connects thecomputer 24 to a global computer network, such as the Internet, through a service provider. Next, instep 34 the user selects the web site address of theelectronic document library 26. Instep 36, in response to the user's selection, theweb site server 20 transmits the screen display for the home page to theuser computer 24 for display on the monitor of theuser computer 24. The home page displays themain menu 37 from which the user may select. - The
main menu 37 provides the user with a central page from which all the major functions of thesystem 10 can be reached. The main menu screen presents the user with the different functions which may be selected by pointing and clicking on an appropriate button or icon.FIG. 3 illustrates the options from which a user may select upon entering the main menu page. The options include: register 38,new documents 39, mostpopular documents 40, login 41,search documents 42,personal bookshelf 43, help 44 andfeedback 45. Each of these options will be described in detail below.FIG. 4 presents one embodiment of ascreen display 50 which may be presented to a user as a home page or main menu page. Referring back toFIG. 2 , instep 46, the user reviews the options and selects the option of interest. In response to the users selection, theweb site server 20 determines which option the user selected instep 48 and selects the appropriate page to present to the user. - If the user selects the
login option 41, theweb site server 20 proceeds to step 52 and displays a user login page. The flowchart ofFIG. 5 illustrates the steps performed by one embodiment the present invention to login a user. As described above, instep 54 the user selects thelogin option 41. Next, instep 56, theweb site server 20 transmits the screen display for the login page to theuser computer 24.FIG. 6 is one embodiment of a loginpage screen display 58. The loginpage screen display 58 queries the user for a user name and a password. The loginpage screen display 58 includes auser name field 60 and apassword field 62. Once the user has entered the user name and password into the loginpage screen display 58, the user selects the “Login”button 64 or hits a “Return/Enter” key. Upon selecting the “Login”button 64, theuser computer 24 transmits the user identification information to theweb site server 20 instep 66. Once theweb site server 20 receives the user identification information entered into thelogin screen 58, instep 68, theweb site server 20 transfers the user identification information to theuser verification module 16. Theuser verification module 16 searches theuser database 18 to determine if the user name is contained in theuser database 18 and to determine if the entered password is valid. - If the user name and associated password are valid, the
user verification module 16 grants the user access to the full contents of the documents in theelectronic document database 12 instep 70. In one embodiment, if the user name is valid, but the associated password is invalid, theuser verification module 16 sends an error message to theuser computer 24 through theweb site server 20 and requests the user to login again. If the user name is not valid, theuser verification module 16 denies the user access to theelectronic document database 12 instep 72. In another embodiment, the user is granted access to search thedocument database 12, but may not view the full contents of the electronic documents. In another embodiment, if the user is denied access instep 72, the user is given an opportunity to obtain a subscription to the electronic document library service by registering with theelectronic document library 26. In one embodiment, the user may set the user's preferences to automatic login in order to avoid entering a user name and password each time the user accesses theweb site server 20. In this embodiment, theuser verification module 16 still verifies the user name and password, each time the user logs into theweb site server 20 in order to verify that the user's subscription is still valid. - Referring again to
FIG. 2 , if the user selects theregister option 38, theweb site server 20 proceeds to step 72 and displays the registration page.FIG. 7 shows one embodiment of aregistration screen 74. Theregistration screen 74 allows a user to request a subscription to theelectronic document library 26. Theregistration screen 74 queries the user for user identification information, company information, telephone number information, e-mail address information and number of subscriptions requested. In another embodiment, theregistration screen 74 also queries the user for billing information, such as a credit card number. Theregistration screen 74 includes user name fields 76, 78, ajob title field 80, ane-mail address field 82, atelephone number field 84, acountry field 86, acompany name field 88, adepartment name field 90, adepartment size field 92 and a number of seats requestedfield 94. The number of seats requested field enables a user to purchase multiple subscriptions at the same time. For example, a company may wish to purchase subscriptions for several of its employees. In one embodiment, thecountry field 86 is a drop down menu from which a user may select a country. In another embodiment, theregistration screen 74 also includes fields for the user's address and billing information, such as a credit card number and expiration date. In another embodiment, theregistration screen 74, includes a field for the user to enter a preferred password. Once the user has entered all of the required information into theregistration screen 74, the user selects theOK button 96 or hits the “Return/Enter” key. - After the user has entered the necessary information into the
registration screen 74 and selected theOK button 96, theuser computer 24 transmits the information to theweb site server 20. Theweb site server 20 transfers the information to theregistration module 28 which determines if the user should be granted a subscription. If the user is granted a subscription, theregistration module 28 adds a user name and a password to theuser database 18. In one embodiment, theregistration module 28 uses the user name entered by the user in the user name fields 76, 78. In another embodiment theregistration module 28 selects a user name for the user. In yet another embodiment, theregistration module 28 uses the password entered by the user as a preferred password. In another embodiment, theregistration module 28 selects a password to be associated with the user name. If the user is granted a registration, theregistration module 28 contacts the user and informs the user of the user name and password to use when logging into theelectronic document library 26. In one such embodiment, theregistration module 28 sends an e-mail message to the user. The e-mail message contains the user name and password to be entered by the user when logging into theelectronic document library 26. In another embodiment, the user is given the opportunity to change the user name and password. - Referring again to
FIG. 2 , if the user selects thenew documents option 39, theweb site server 20 proceeds to step 98 and displays information regarding documents recently added to thedocument database 12. If the user selects the mostpopular documents option 40, theweb site server 20 displays information regarding documents which have been accessed by the greatest number of users or which have been accessed the greatest number of times during a predetermined period. In one embodiment, the predetermined period is the previous week. - If the user selects the
search documents option 42, theweb site server 20 proceeds to step 100 and displays a find books page screen.FIG. 8 shows one embodiment of a findbooks page screen 102. In the embodiment shown inFIG. 8 , the find books screen 102 presents the user with three options for searching thedocument database 12 for electronic documents. In one option, the user is able to browse a topic list 104. The topic list 104 ofFIG. 8 identifies topics which may be of interest to information technology and computer specialists. Other embodiments of the topic list 104 may contain topics of interest to individuals in other fields. In one embodiment, once a user selects topic from the topic list 104, the user is presented with a list of sub-topics. For example, if a user selects the “Databases”topic 106, the user is presented with thesub-topic list 108 shown inFIG. 9 . The user then has the option to return to themain topic list 110, select one of the sub-topics 108 or go torelated topics 112. - Once the user selects one of the sub-topics 108, the
search module 14 identifies the documents categorized under the selected sub-topic and the user is presented with a list of documents which are categorized under the selected sub-topic. For example, if the user selects the “Access” sub-topic 114, the user is presented with a list of documents in the form of thescreen display 116 shown inFIG. 10 . The user may then select one of the documents or return to thesub-topic list 108 or main topic list 104. - Referring again to
FIG. 8 , in another embodiment, the user is able to search for a particular document by supplying information regarding the particular document. For example, the user may search for a document by entering words in the document's title, the author's name, the publisher or the ISBN. The user enters the document information into the lookup field 118. Once the user enters the document information, the user selects the “Lookup”button 120 or hits the “Return/Enter” key. Thesearch module 14 then searches thedocument database 12 for documents which correspond to the user's query and returns a list of the relevant documents. - In another embodiment, the user is able to search for an electronic document in the
document database 12 by entering key words which might occur in the text of the document. The user enters the key words into thekey word field 122. Once the user has entered the key words, the user selects the “Search”button 124 or hits the “Return/Enter” key. Thesearch module 14 then searches thedocument database 12 for documents which contain the key words. In one embodiment, the user is able to instruct thesearch module 14 to search for documents in theentire document database 12, in a particular category from the main topic list 104 orsub-topic list 108, in a results list from a previous search or in books present in the user's personal bookshelf. - In the example shown in
FIG. 11A , the user has entered the key words “java jar files” in thekey word field 122. The user has also selected to search all documents. Thesearch module 14 searched thedocument database 12 and found 21 documents. Thesearch module 14 presents a list of the found documents to the user. In one embodiment, thesearch module 14 searches thedocument database 12 for the documents which contain information most relevant to the user's query. An embodiment of a method for determining which documents are most relevant will be described in detail below in the discussion ofFIGS. 14-16 . In one such embodiment, the search module presents the results of the search in order of relevance. In one embodiment, thesearch module 14 indicates the relevance of documents as “High”, “Medium”, or “Low”. In still another embodiment, thesearch module 14 indicates the sections of the document which are most relevant. In yet another embodiment, thesearch module 14 presents the most relevant documents and links to the most relevant sections within these documents. In the embodiment shown inFIG. 11A , thesearch module 14 indicates the relevancy of different sections by using athermometer type gauge 126. A thermometer gauge which the most filled has the highest relevancy. For example, inFIG. 11A , the section “Minimizing Applet Loading Times” 128 has a higher relevancy than the section “Using the Java Development Kit (JDK)” 130. In other embodiments, other types of level indicators are used to indicate relevancy. - From the search results list, the user may select to view an electronic document. For example, the user may select to view the book “
Mastering Java 2” by clicking on the underlined title. Theweb site server 20 then displays the book to the user. In one embodiment, theweb site server 20 displays the entire content of the book to the user, including text, graphics, tables, code listings, etc. In another embodiment, theweb site server 20 displays the contents of the book just as the user would see the contents if viewing the hard copy version. Continuing the example from above and referring toFIGS. 11B and 11C , in one embodiment theweb site server 20 displays a synopsis of thebook 132, the table of contents of thebook 134 and the information on the back cover of thebook 136. In one embodiment, theweb site server 20 also provides links to colleague comments about the book and public comments about the book. Theweb site server 20 also displays the thermometer gauges next to the most relevant sections of the book. - Referring back to
FIG. 11A , the user may also select a link that leads directly to the relevant section of a document. For example, the user may select the link “Minimizing Applet Loading Times” 128 to go directly to that section of the document.FIG. 12 shows an example of ascreen 137 which displays the content of the document section selected by the user. In one embodiment, the web site server highlights thekey words 138 entered by the user. -
FIG. 13 shows an embodiment of amenu 140 presented to the user along with the contents of the selected document. Themenu 140 identifies the portion 142 of the document the user is viewing and presents the user with links to other portions of the document. - In order to understand how this embodiment of a search is performed, it is important to understand how documents are stored in the
document database 12. Knowledge of the typical structure of a type of document is used when documents are added to thedocument database 12. When the electronic documents are stored in thedocument database 12, the documents are broken down into a hierarchy of constituent elements. The constituent elements are linked together in a relational scheme so that thedocument database 12 can identify which documents a particular element is part of. An index is created for each document which identifies the constituent elements.FIG. 15 illustrates a possible hierarchy for breaking down abook 146 into its constituent parts. Many books are composed ofparts 147, eachpart 147 being composed ofchapters 148, eachchapter 148 being composed ofsections 150 and eachsection 150 being composed ofsub-sections 152. The sub-sections may contain text, tables, graphics, code listings, etc. Other documents may be composed of other elements. -
FIG. 14 shows a flowchart illustrating the steps performed by thesearch module 14 to locate the most relevant documents in thedocument database 12 in response to the key words entered by the user in thekey word field 122. Instep 144, thesearch module 14 receives the search terms from the user. Instep 154 thesearch module 14 searches the lowest level elements of the documents. For example, if the document is abook 146, thesearch module 14 searches thesubsections 152 for the key words entered by the user. Next, instep 156, the search module ranks the relevancy of the lowest level elements located. The search module may use any of the relevancy ranking algorithms known in the art to rank the relevancy of the lowest level elements located. All of the objects within the element are included in the calculation. For example, text, graphics, tables, code listings, etc. all searched and used in the calculation. In one embodiment, thesearch module 14 calculates the number of times a search term appears in the element. After the relevancy of the lowest level elements has been ranked, instep 158 thesearch module 14 sums the relevancy rankings of the lowest level elements to determine the ranking for the next higher element. For the book example shown inFIG. 15 , the search module would rank the relevancy rankings for Part1, and then sum the relevancy rankings for theparts 147 to determine arelevancy ranking subsections 152 to determine a relevancy ranking for Section1. Next, instep 160, thesearch module 14 determines if the relevancy ranking for the highest level element has been calculated. For the book example ofFIG. 15 , thesearch module 14 determines if the relevancy ranking has been calculated for thebook 146. If the relevancy ranking for the highest level element has not been calculated, thesearch module 14 returns to step 158 and repeatssteps FIG. 15 , thesearch module 14 would sum the relevancy rankings forsections 150 to determine a relevancy ranking for Chapter1, and then sum the relevancy rankings for thechapters 148 to determine a relevancy ranking for thebook 146. Once the relevancy ranking has been determined for the highest most elements, thesearch module 14 returns a list of the documents with the highest ratings to the user instep 162. - In one embodiment, the
search module 14 assigns a relevancy ranking based on the number of hits and the quality of the hits (i.e. exact words in key word search). In another embodiment, thesearch module 14 assigns a relevancy ranking to a document according to how many times the document has been accessed. In another embodiment, thesearch module 14 assigns a relevancy ranking according to whether the user performing the search has accessed the document previously or has stored the document on the user's personal bookshelf. - In one embodiment, the
search module 14 evaluates the search results before returning the results to the user. In one such embodiment, before returning the results to the user instep 162, thesearch module 14 performs the steps illustrated in the flowchart ofFIG. 16 . Instep 164, thesearch module 14 evaluates the search results. In certain embodiments, to evaluate the search results, thesearch module 14 determines whether the exact key words entered by the user appeared in the located documents, the number of times the key words appeared in the documents, whether the key words appeared in the titles of the different elements of the document or simply within the text, and the number of elements within a document the key words appeared. Based on this evaluation, instep 166 thesearch module 14 suppresses low ranking results. In one embodiment, thesearch module 14 suppresses low ranking documents. In another embodiment, thesearch module 14 suppresses low ranking elements within a document. Next, instep 168, thesearch module 14 determines whether to expand the search parameters. Thesearch module 14 may determine to expand the search parameters if none of the documents located had a high relevancy ranking or if the number of documents located was low. - If the
search module 14 determines the search results are sufficient and the search parameters do not need to be expanded, thesearch module 14 returns to step 154 ofFIG. 14 and returns the results to the user. If thesearch module 14 determines that the search results are not sufficient and the search parameters should be expanded, thesearch module 14 proceeds to step 170 and employs fallback strategies to expand the search results. - In determining whether to implement fallback strategies, the
search module 14 must balance precision and recall. Precision refers to the accuracy of the documents located. If specific key words are important, to be precise the search results must contain those key words. Recall refers to the scope of the documents located. For example, by searching for only specific key words, a user may inadvertently omit documents which are relevant. Often users are not able to formulate a search which will return the most relevant documents. A user may not think to enter certain key words. Thesearch module 14 balances precision and recall. - In one embodiment, the
search module 14 begins by searching for highly precise results, e.g., results containing the exact key words in the exact manner entered by the user. If adequate results are not returned, thesearch module 14 works toward high recall. Thesearch module 14 simulates the type of searching performed by information librarians. The strategies for expanding the search results may include stemming the key words entered by the user, implementing “near” operators, and implementing “and” and “or” operators. If the user has entered more than one key word, thesearch module 14 implements “near” operators by searching for the key words within a certain distance within from each other rather than searching for a phrase containing the key words directly next to each other. To implement the “and” operator, thesearch module 14 looks for both key words being in the document rather than being directly next to each other. To implement the “or” operator, thesearch module 14 searches for documents containing at least one of the key words entered by the user. In different embodiments thesearch module 14 employs different strategies for expanding the search and employs the strategies in different orders. The purpose of expanding the search is to provide the user with the optimum results. - Once the
search module 14 determines a strategy for expanding the search parameters, thesearch module 14 returns to step 154 ofFIG. 14 and repeats steps 154-168 until thesearch module 14 determines not to expand the search parameters. In one embodiment, when repeatingstep 156, the ranking assigned to a document element is dependent upon the search performed. For example, a document containing an exact phrase entered by a user receives a higher relevancy ranking than a document containing each word of the phrase in separate parts of the document. - In one embodiment, along with returning the search results, the
search module 14 displays the actual search conducted to locate the search results. For example, if the user entered the key words “java jar files” and thesearch module 14 returned documents containing the words “java” or “jar” or “file”, the search module would inform the user that the search conducted was: java or jar or file. - In addition to searching for documents, a user may create a personal virtual “bookshelf” of documents. The bookshelf contains documents that the user accesses frequently or desires to be able to access without searching the
document database 12. Referring again toFIG. 11B , upon accessing an electronic document, a user is presented with the option of adding the document to the user's personal bookshelf through an “Add to Bookshelf”option 172. Upon selecting the “Add to Bookshelf”option 172, the document is added to the user's personal bookshelf. Referring toFIG. 12 , when viewing a portion of a document, the user is presented with an option of creating a virtual bookmark to that portion. - When the user selects the
personal bookshelf option 43 from the main menu, theweb site server 20 displays a list of the documents in the user's personal bookshelf and a list of the bookmarks created by the user (step 176 inFIG. 2 ).FIG. 17 shows an example of a screen displaying adocument 180 and abookmark 182 stored on the user's bookshelf. In one embodiment, when the createbookmark option 174 is selected by the user, in addition to simply adding a bookmark to the user's bookshelf, theweb site server 20 presents the user with a several options for personalizing the bookmark.FIG. 18 shows one embodiment of ascreen display 184 for allowing a user to personalize a bookmark. Thescreen 184 includes abookmark note field 186 into which the user may enter a comment about the document. In one embodiment, thescreen 184 also allows the user to keep the bookmark private or share the bookmark with others. In another embodiment, thescreen 186 allows a user to select ane-mail option 188. If the user selects thee-mail option 188, an e-mail notification is sent to the individuals identified by the user informing the individuals of the bookmark. - As mentioned above, in one embodiment, the electronic
document library system 10 includes electronicdocument conversion computers 30 for creating digital versions of existing paper-based books which are aggregated, categorized and added to thedocument database 12. Many hard copy book publishers used different desktop publishing software programs to produce books. Examples of different programs include PageMaker, QuarkExpress, FrameMaker and others. These publishing programs enable books to be published as hard copy books by printing presses. These files are not in a form capable of electronically publishing the documents. The electronicdocument conversion computers 30 execute an algorithm which converts files from different desktop publishing programs into a standard electronic format. -
Attachment 1 is a user guide which describes the process for creating an electronic book from a file obtained from a publisher. The user guide describes the steps a user, such as a person who is converting an electronic book file from a publisher's press-ready or pre-publication digital data format to a searchable electronic library format of the present invention, performs, using an embodiment of a document conversion tool to create an electronic book. -
FIG. 19 shows a screen display illustrating the manifest created by an embodiment of the tool for converting a publisher's file to an electronic document file. The manifest lists all of the files composing the document. -
FIG. 20 shows a screen display illustrating the list of tables included in a book created by an embodiment of the tool for converting a publisher's file to an electronic document file. - Thus it will be seen that the present invention provides a method and system for publishing a plurality of books for user access to information. The system includes selecting a plurality of books, converting each book from a layout or publication digital data form, e.g., by training a tool to detect characteristic features (such as layout, typeface, and hierarchical or organizational features such as chapter headings, captions, drawings and tables), and extracting text or data information of the book tagged with the features. This produces a searchable publishing database, e.g., arranged as an xml database indexed by book structure such that a user may remotely, e.g., over the internet or other network, access the database, search desired text data, and view an image of a portion of the book with the desired data. The system includes a user registration module to identify an authorized user, and may maintain a personal bookshelf for the user. Advantageously, a search engine may rank search results based on their position in the hierarchy or other factors, determining degree of relevance of text or data information located by the search engine. The other factors may include position of located search data in the hierarchy, identification of search data in the user's personal library, identification of search data in a prior search by the user or degree of match of data identified in the search.
- The search engine may be a commercially available search engine together with an interface that operates, when provided a search query by a user, to search for an exact match and to score hits for relevance, and in the event an exact match is not found, operate to expand the query and return scored search results located by the search engine together with an indication of the expanded search such that the user may ascertain a degree of likely relevance of returned text or data information. The relational database may include hyperlinks to related section heading data, such that a user accessing a page of a book may immediately view related data and context of a page. The relational database is indexed by logical subunits of the book, so searches for boolean combinations or proximity of elements span page breaks of book text to identify all instances of the desired search data. The search engine may employ a graded or adaptive search strategy, and may expand a search if all hits have low ranking, and suppress hits of low ranking when the search produces hits of high ranking. In further embodiments, the search engine may be adapted to inspect the content of tables, formulae and drawings of the converted book files to further enhance utility of the system as a technical book library. This may be done, for example, by parsing data structures in appearing in a programming language, such as C that define the tables, formulae or drawings.
-
Attachment 2 is an example of a chapter of a book that has been converted to electronic form using the program described in the user guide ofAttachment 1.Attachment 2 shows examples of the different tags applied by the program to identify different elements of the book. - One skilled in the art will appreciate further features and advantages of the invention based on the above-described embodiments. Accordingly, the invention is not to be limited by what has been particularly shown and described. All publications and references cited herein are expressly incorporated herein by reference in their entirety.
Claims (25)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/873,681 US20080065609A1 (en) | 1999-12-10 | 2007-10-17 | System and method for providing a searchable library of electronic documents to a user |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17003899P | 1999-12-10 | 1999-12-10 | |
US09/734,494 US7287214B1 (en) | 1999-12-10 | 2000-12-11 | System and method for providing a searchable library of electronic documents to a user |
US11/873,681 US20080065609A1 (en) | 1999-12-10 | 2007-10-17 | System and method for providing a searchable library of electronic documents to a user |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/734,494 Continuation US7287214B1 (en) | 1999-12-10 | 2000-12-11 | System and method for providing a searchable library of electronic documents to a user |
Publications (1)
Publication Number | Publication Date |
---|---|
US20080065609A1 true US20080065609A1 (en) | 2008-03-13 |
Family
ID=38607164
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/734,494 Expired - Fee Related US7287214B1 (en) | 1999-12-10 | 2000-12-11 | System and method for providing a searchable library of electronic documents to a user |
US11/873,681 Abandoned US20080065609A1 (en) | 1999-12-10 | 2007-10-17 | System and method for providing a searchable library of electronic documents to a user |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/734,494 Expired - Fee Related US7287214B1 (en) | 1999-12-10 | 2000-12-11 | System and method for providing a searchable library of electronic documents to a user |
Country Status (1)
Country | Link |
---|---|
US (2) | US7287214B1 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060149710A1 (en) * | 2004-12-30 | 2006-07-06 | Ross Koningstein | Associating features with entities, such as categories of web page documents, and/or weighting such features |
US20090012956A1 (en) * | 2003-01-06 | 2009-01-08 | Microsoft Corporation | Retrieval of Structured Documents |
US20100058440A1 (en) * | 2008-08-27 | 2010-03-04 | Yahoo! Inc. | Interaction with desktop and online corpus |
EP2469460A1 (en) * | 2010-12-21 | 2012-06-27 | France Telecom | Computer-displayable content enriched with selected advertisement content |
US20130232134A1 (en) * | 2012-02-17 | 2013-09-05 | Frances B. Haugen | Presenting Structured Book Search Results |
KR101414999B1 (en) * | 2012-10-23 | 2014-07-08 | 한국과학기술정보연구원 | Search result providing system and method using tag based boolean query matching |
US20140253977A1 (en) * | 2004-04-19 | 2014-09-11 | Google Inc. | Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device |
WO2014151399A3 (en) * | 2013-03-15 | 2014-11-13 | Harpercollins Christian Publishing | System, method and interface for compiled literary work |
US9043878B2 (en) * | 2012-03-06 | 2015-05-26 | International Business Machines Corporation | Method and system for multi-tiered distributed security authentication and filtering |
US10452740B2 (en) * | 2012-09-14 | 2019-10-22 | Sdl Netherlands B.V. | External content libraries |
US11287422B2 (en) | 2019-09-23 | 2022-03-29 | Element Biosciences, Inc. | Multivalent binding composition for nucleic acid analysis |
Families Citing this family (63)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8180904B1 (en) | 2001-04-26 | 2012-05-15 | Nokia Corporation | Data routing and management with routing path selectivity |
US9032097B2 (en) | 2001-04-26 | 2015-05-12 | Nokia Corporation | Data communication with remote network node |
US9143545B1 (en) | 2001-04-26 | 2015-09-22 | Nokia Corporation | Device classification for media delivery |
US20030014339A1 (en) * | 2001-07-16 | 2003-01-16 | Bush Joel L. | Cash flow management system and method |
US8041739B2 (en) * | 2001-08-31 | 2011-10-18 | Jinan Glasgow | Automated system and method for patent drafting and technology assessment |
US7721192B2 (en) * | 2002-11-27 | 2010-05-18 | Microsoft Corporation | User interface for a resource search tool |
CN1898257B (en) * | 2003-12-17 | 2010-06-23 | 普雷西斯药品公司 | Methods for synthesis of encoded libraries |
US7783638B2 (en) * | 2004-01-09 | 2010-08-24 | International Business Machines Corporation | Search and query operations in a dynamic composition of help information for an aggregation of applications |
WO2005070040A2 (en) | 2004-01-14 | 2005-08-04 | Barr Cary B | Partner protection insurance |
US20050172282A1 (en) * | 2004-01-30 | 2005-08-04 | Michael Shenfield | System and method for publishing and accessing application APIs on a generic terminal |
US7844668B2 (en) | 2004-07-30 | 2010-11-30 | Microsoft Corporation | Suggesting a discussion group based on indexing of the posts within that discussion group |
US7580837B2 (en) | 2004-08-12 | 2009-08-25 | At&T Intellectual Property I, L.P. | System and method for targeted tuning module of a speech recognition system |
US8386453B2 (en) * | 2004-09-30 | 2013-02-26 | Google Inc. | Providing search information relating to a document |
US7668889B2 (en) | 2004-10-27 | 2010-02-23 | At&T Intellectual Property I, Lp | Method and system to combine keyword and natural language search results |
US7242751B2 (en) | 2004-12-06 | 2007-07-10 | Sbc Knowledge Ventures, L.P. | System and method for speech recognition-enabled automatic call routing |
US7751551B2 (en) | 2005-01-10 | 2010-07-06 | At&T Intellectual Property I, L.P. | System and method for speech-enabled call routing |
US9400875B1 (en) | 2005-02-11 | 2016-07-26 | Nokia Corporation | Content routing with rights management |
US7631013B2 (en) * | 2005-04-06 | 2009-12-08 | Sierra Interactive Systems, Inc. | System and method for publishing, distributing, and reading electronic interactive books |
US7996391B2 (en) | 2005-06-20 | 2011-08-09 | Google Inc. | Systems and methods for providing search results |
US8595633B2 (en) * | 2005-10-31 | 2013-11-26 | Yahoo! Inc. | Method and system for displaying contextual rotating advertisements |
US9177124B2 (en) * | 2006-03-01 | 2015-11-03 | Oracle International Corporation | Flexible authentication framework |
US8868540B2 (en) | 2006-03-01 | 2014-10-21 | Oracle International Corporation | Method for suggesting web links and alternate terms for matching search queries |
US8332430B2 (en) | 2006-03-01 | 2012-12-11 | Oracle International Corporation | Secure search performance improvement |
US8214394B2 (en) | 2006-03-01 | 2012-07-03 | Oracle International Corporation | Propagating user identities in a secure federated search system |
US7941419B2 (en) * | 2006-03-01 | 2011-05-10 | Oracle International Corporation | Suggested content with attribute parameterization |
US8707451B2 (en) | 2006-03-01 | 2014-04-22 | Oracle International Corporation | Search hit URL modification for secure application integration |
US8027982B2 (en) | 2006-03-01 | 2011-09-27 | Oracle International Corporation | Self-service sources for secure search |
US8005816B2 (en) | 2006-03-01 | 2011-08-23 | Oracle International Corporation | Auto generation of suggested links in a search system |
US8875249B2 (en) | 2006-03-01 | 2014-10-28 | Oracle International Corporation | Minimum lifespan credentials for crawling data repositories |
US8433712B2 (en) | 2006-03-01 | 2013-04-30 | Oracle International Corporation | Link analysis for enterprise environment |
US20080097975A1 (en) * | 2006-05-19 | 2008-04-24 | Louise Guay | Simulation-assisted search |
US8260776B2 (en) * | 2006-08-28 | 2012-09-04 | Katherine Lynn France-Prouvoste | Method, system and apparatus for dynamic registry of books and for modeling real-time market demand for books within academic sectors |
US8126887B2 (en) * | 2006-09-29 | 2012-02-28 | Business Objects Software Ltd. | Apparatus and method for searching reports |
US7899837B2 (en) | 2006-09-29 | 2011-03-01 | Business Objects Software Ltd. | Apparatus and method for generating queries and reports |
US8204895B2 (en) * | 2006-09-29 | 2012-06-19 | Business Objects Software Ltd. | Apparatus and method for receiving a report |
US20080195574A1 (en) * | 2007-02-08 | 2008-08-14 | Morgana Stephen C | Printed document concordance searching systems and methods |
US8392454B2 (en) * | 2007-03-08 | 2013-03-05 | Xerox Corporation | Concordance searching systems and methods |
US7756860B2 (en) * | 2007-05-23 | 2010-07-13 | International Business Machines Corporation | Advanced handling of multiple form fields based on recent behavior |
US7996392B2 (en) | 2007-06-27 | 2011-08-09 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US8179915B2 (en) * | 2007-06-28 | 2012-05-15 | Lantiq Deutschland Gmbh | System and method for transmitting and retransmitting data |
US8316007B2 (en) | 2007-06-28 | 2012-11-20 | Oracle International Corporation | Automatically finding acronyms and synonyms in a corpus |
JP2009187376A (en) * | 2008-02-07 | 2009-08-20 | Canon Inc | File management device, file management method, and program |
US8447759B2 (en) * | 2008-03-13 | 2013-05-21 | Microsoft Corporation | Assets suggestion across applications |
JP5383089B2 (en) * | 2008-05-16 | 2014-01-08 | キヤノン株式会社 | Information processing apparatus, control method therefor, and control program |
US20090327210A1 (en) * | 2008-06-27 | 2009-12-31 | Microsoft Corporation | Advanced book page classification engine and index page extraction |
JP2010152481A (en) * | 2008-12-24 | 2010-07-08 | Fujitsu Ltd | Device and program for presenting information |
US20100299603A1 (en) * | 2009-05-22 | 2010-11-25 | Bernard Farkas | User-Customized Subject-Categorized Website Entertainment Database |
US8266542B2 (en) * | 2010-06-17 | 2012-09-11 | Hewlett-Packard Development Company, L.P. | Rendering and sorting book search results as three dimensional icons |
SG180039A1 (en) * | 2010-10-22 | 2012-05-30 | Creative Tech Ltd | A method, system and apparatus for efficient display and browsing of e-books |
WO2012058332A1 (en) | 2010-10-26 | 2012-05-03 | Google Inc. | Rich results relevant to user search queries for books |
US8700643B1 (en) * | 2010-11-03 | 2014-04-15 | Google Inc. | Managing electronic media collections |
US20130232236A1 (en) * | 2010-11-19 | 2013-09-05 | M. Vennimalai | Network based dynamically updating electronic flip book server device |
KR101743295B1 (en) * | 2010-11-26 | 2017-06-16 | 삼성전자주식회사 | Device and method for searching e-book in wireless terminal |
US20140324573A1 (en) * | 2011-10-31 | 2014-10-30 | Simon Raab | System and method for click fraud protection |
US9141674B2 (en) * | 2012-05-16 | 2015-09-22 | Google Inc. | Prominent display of selective results of book search queries |
US20140115436A1 (en) * | 2012-10-22 | 2014-04-24 | Apple Inc. | Annotation migration |
US10146915B2 (en) * | 2015-09-14 | 2018-12-04 | Salesforce.Com, Inc. | Publication of collaborative file to library |
US20180330372A1 (en) * | 2017-05-15 | 2018-11-15 | Shlomo Dadon | Code, methods, and systems for monitoring, authorizing and rejecting electronic financial transactions and credit inquiries |
US11762863B2 (en) * | 2017-10-13 | 2023-09-19 | Microsoft Technology Licensing, Llc | Hierarchical contextual search suggestions |
US10909191B2 (en) | 2017-11-20 | 2021-02-02 | Rovi Guides, Inc. | Systems and methods for displaying supplemental content for an electronic book |
US10909193B2 (en) * | 2017-11-20 | 2021-02-02 | Rovi Guides, Inc. | Systems and methods for filtering supplemental content for an electronic book |
US11977584B2 (en) | 2018-04-11 | 2024-05-07 | Jumptuit, Inc. | System and method of correlating multiple data points to create a new single data point |
US11971933B2 (en) * | 2018-04-11 | 2024-04-30 | Jumptuit, Inc. | System and method of AI assisted search |
Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4173408A (en) * | 1977-08-29 | 1979-11-06 | Xerox Corporation | Demand publishing royalty accounting system for an electrostatic copier |
US4179212A (en) * | 1977-09-06 | 1979-12-18 | Xerox Corporation | Demand publishing royalty accounting system |
US5532920A (en) * | 1992-04-29 | 1996-07-02 | International Business Machines Corporation | Data processing system and method to enforce payment of royalties when copying softcopy books |
US5629980A (en) * | 1994-11-23 | 1997-05-13 | Xerox Corporation | System for controlling the distribution and use of digital works |
US5634012A (en) * | 1994-11-23 | 1997-05-27 | Xerox Corporation | System for controlling the distribution and use of digital works having a fee reporting mechanism |
US5778366A (en) * | 1996-06-28 | 1998-07-07 | International Business Machines Corp. | Method and system for organizing on-line books using collections |
US5845303A (en) * | 1994-12-06 | 1998-12-01 | Netpodium, Inc. | Document processing using frame-based templates with hierarchical tagging |
US5850520A (en) * | 1996-07-01 | 1998-12-15 | Electronic Data Systems Corporation | Method and system for electronic publication distribution including return receipt |
US5892843A (en) * | 1997-01-21 | 1999-04-06 | Matsushita Electric Industrial Co., Ltd. | Title, caption and photo extraction from scanned document images |
US5893132A (en) * | 1995-12-14 | 1999-04-06 | Motorola, Inc. | Method and system for encoding a book for reading using an electronic book |
US5907845A (en) * | 1996-07-26 | 1999-05-25 | International Business Machines Corporation | Method and system for organizing on-line books using bookcases |
US5924108A (en) * | 1996-03-29 | 1999-07-13 | Microsoft Corporation | Document summarizer for word processors |
US6012053A (en) * | 1997-06-23 | 2000-01-04 | Lycos, Inc. | Computer system with user-controlled relevance ranking of search results |
US6101503A (en) * | 1998-03-02 | 2000-08-08 | International Business Machines Corp. | Active markup--a system and method for navigating through text collections |
US6138129A (en) * | 1997-12-16 | 2000-10-24 | World One Telecom, Ltd. | Method and apparatus for providing automated searching and linking of electronic documents |
US6185592B1 (en) * | 1997-11-18 | 2001-02-06 | Apple Computer, Inc. | Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds |
US6216123B1 (en) * | 1998-06-24 | 2001-04-10 | Novell, Inc. | Method and system for rapid retrieval in a full text indexing system |
US6292796B1 (en) * | 1999-02-23 | 2001-09-18 | Clinical Focus, Inc. | Method and apparatus for improving access to literature |
US6385614B1 (en) * | 1998-01-14 | 2002-05-07 | Netlibrary Inc. | Electronic bookshelf with multi-user features |
US6418429B1 (en) * | 1998-10-21 | 2002-07-09 | Apple Computer, Inc. | Portable browsing interface for information retrieval |
US6493702B1 (en) * | 1999-05-05 | 2002-12-10 | Xerox Corporation | System and method for searching and recommending documents in a collection using share bookmarks |
US6535791B1 (en) * | 2000-05-01 | 2003-03-18 | Ricoh Company, Ltd. | Electronic information vending machine and method |
US6546406B1 (en) * | 1995-11-03 | 2003-04-08 | Enigma Information Systems Ltd. | Client-server computer system for large document retrieval on networked computer system |
US6704733B2 (en) * | 2000-10-25 | 2004-03-09 | Lightning Source, Inc. | Distributing electronic books over a computer network |
US6728752B1 (en) * | 1999-01-26 | 2004-04-27 | Xerox Corporation | System and method for information browsing using multi-modal features |
US20060004746A1 (en) * | 1998-09-04 | 2006-01-05 | Kalido Limited | Data processing system |
-
2000
- 2000-12-11 US US09/734,494 patent/US7287214B1/en not_active Expired - Fee Related
-
2007
- 2007-10-17 US US11/873,681 patent/US20080065609A1/en not_active Abandoned
Patent Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4173408A (en) * | 1977-08-29 | 1979-11-06 | Xerox Corporation | Demand publishing royalty accounting system for an electrostatic copier |
US4179212A (en) * | 1977-09-06 | 1979-12-18 | Xerox Corporation | Demand publishing royalty accounting system |
US5532920A (en) * | 1992-04-29 | 1996-07-02 | International Business Machines Corporation | Data processing system and method to enforce payment of royalties when copying softcopy books |
US5629980A (en) * | 1994-11-23 | 1997-05-13 | Xerox Corporation | System for controlling the distribution and use of digital works |
US5634012A (en) * | 1994-11-23 | 1997-05-27 | Xerox Corporation | System for controlling the distribution and use of digital works having a fee reporting mechanism |
US5845303A (en) * | 1994-12-06 | 1998-12-01 | Netpodium, Inc. | Document processing using frame-based templates with hierarchical tagging |
US6546406B1 (en) * | 1995-11-03 | 2003-04-08 | Enigma Information Systems Ltd. | Client-server computer system for large document retrieval on networked computer system |
US5893132A (en) * | 1995-12-14 | 1999-04-06 | Motorola, Inc. | Method and system for encoding a book for reading using an electronic book |
US5924108A (en) * | 1996-03-29 | 1999-07-13 | Microsoft Corporation | Document summarizer for word processors |
US5778366A (en) * | 1996-06-28 | 1998-07-07 | International Business Machines Corp. | Method and system for organizing on-line books using collections |
US5850520A (en) * | 1996-07-01 | 1998-12-15 | Electronic Data Systems Corporation | Method and system for electronic publication distribution including return receipt |
US5907845A (en) * | 1996-07-26 | 1999-05-25 | International Business Machines Corporation | Method and system for organizing on-line books using bookcases |
US5892843A (en) * | 1997-01-21 | 1999-04-06 | Matsushita Electric Industrial Co., Ltd. | Title, caption and photo extraction from scanned document images |
US6012053A (en) * | 1997-06-23 | 2000-01-04 | Lycos, Inc. | Computer system with user-controlled relevance ranking of search results |
US6185592B1 (en) * | 1997-11-18 | 2001-02-06 | Apple Computer, Inc. | Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds |
US6138129A (en) * | 1997-12-16 | 2000-10-24 | World One Telecom, Ltd. | Method and apparatus for providing automated searching and linking of electronic documents |
US6385614B1 (en) * | 1998-01-14 | 2002-05-07 | Netlibrary Inc. | Electronic bookshelf with multi-user features |
US6101503A (en) * | 1998-03-02 | 2000-08-08 | International Business Machines Corp. | Active markup--a system and method for navigating through text collections |
US6216123B1 (en) * | 1998-06-24 | 2001-04-10 | Novell, Inc. | Method and system for rapid retrieval in a full text indexing system |
US20060004746A1 (en) * | 1998-09-04 | 2006-01-05 | Kalido Limited | Data processing system |
US6418429B1 (en) * | 1998-10-21 | 2002-07-09 | Apple Computer, Inc. | Portable browsing interface for information retrieval |
US6728752B1 (en) * | 1999-01-26 | 2004-04-27 | Xerox Corporation | System and method for information browsing using multi-modal features |
US6292796B1 (en) * | 1999-02-23 | 2001-09-18 | Clinical Focus, Inc. | Method and apparatus for improving access to literature |
US6493702B1 (en) * | 1999-05-05 | 2002-12-10 | Xerox Corporation | System and method for searching and recommending documents in a collection using share bookmarks |
US6535791B1 (en) * | 2000-05-01 | 2003-03-18 | Ricoh Company, Ltd. | Electronic information vending machine and method |
US6704733B2 (en) * | 2000-10-25 | 2004-03-09 | Lightning Source, Inc. | Distributing electronic books over a computer network |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090012956A1 (en) * | 2003-01-06 | 2009-01-08 | Microsoft Corporation | Retrieval of Structured Documents |
US8046370B2 (en) * | 2003-01-06 | 2011-10-25 | Microsoft Corporation | Retrieval of structured documents |
US20140253977A1 (en) * | 2004-04-19 | 2014-09-11 | Google Inc. | Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device |
US9319555B2 (en) * | 2004-04-19 | 2016-04-19 | Google Inc. | Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device |
US10769431B2 (en) | 2004-09-27 | 2020-09-08 | Google Llc | Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device |
US9852225B2 (en) | 2004-12-30 | 2017-12-26 | Google Inc. | Associating features with entities, such as categories of web page documents, and/or weighting such features |
US20060149710A1 (en) * | 2004-12-30 | 2006-07-06 | Ross Koningstein | Associating features with entities, such as categories of web page documents, and/or weighting such features |
US20100058440A1 (en) * | 2008-08-27 | 2010-03-04 | Yahoo! Inc. | Interaction with desktop and online corpus |
EP2469460A1 (en) * | 2010-12-21 | 2012-06-27 | France Telecom | Computer-displayable content enriched with selected advertisement content |
US20130232134A1 (en) * | 2012-02-17 | 2013-09-05 | Frances B. Haugen | Presenting Structured Book Search Results |
US9043878B2 (en) * | 2012-03-06 | 2015-05-26 | International Business Machines Corporation | Method and system for multi-tiered distributed security authentication and filtering |
US10452740B2 (en) * | 2012-09-14 | 2019-10-22 | Sdl Netherlands B.V. | External content libraries |
KR101414999B1 (en) * | 2012-10-23 | 2014-07-08 | 한국과학기술정보연구원 | Search result providing system and method using tag based boolean query matching |
WO2014151399A3 (en) * | 2013-03-15 | 2014-11-13 | Harpercollins Christian Publishing | System, method and interface for compiled literary work |
US11580171B2 (en) | 2013-03-15 | 2023-02-14 | HarperCollins Christian Publishing, Inc. | System, method and interface for compiled literary work |
US11287422B2 (en) | 2019-09-23 | 2022-03-29 | Element Biosciences, Inc. | Multivalent binding composition for nucleic acid analysis |
US12241891B2 (en) | 2019-09-23 | 2025-03-04 | Element Biosciences, Inc. | Multivalent binding composition for nucleic acid analysis |
Also Published As
Publication number | Publication date |
---|---|
US7287214B1 (en) | 2007-10-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7287214B1 (en) | System and method for providing a searchable library of electronic documents to a user | |
CN101739467B (en) | Personalized network searching method and system | |
US6665659B1 (en) | Methods and apparatus for distributing and using metadata via the internet | |
US10460327B2 (en) | Systems and methods for persistent context-aware guides | |
CA2546514C (en) | Legal research system | |
US8126818B2 (en) | Knowledge-management systems for law firms | |
CA2539431C (en) | Personalized searchable library with highlighting capabilities | |
US6256623B1 (en) | Network search access construct for accessing web-based search services | |
US7496560B2 (en) | Personalized searchable library with highlighting capabilities | |
KR101016683B1 (en) | System and method for providing search results | |
JP5008979B2 (en) | A system with an interactive graphical interface for delivering planning and consulting materials, surveys, and compliance information on taxes or other forms | |
US20050210042A1 (en) | Methods and apparatus to search and analyze prior art | |
AU2006275659B2 (en) | System and method for reordering a result set | |
EP1978455A2 (en) | Personalised searchable library with highlighting capabilities and access to electronic images of text based on user ownership of corresponding physical text | |
US20080313144A1 (en) | Method for enhancing search results | |
US20110238662A1 (en) | Method and system for searching a wide area network | |
US20040078388A1 (en) | Computer-implemented method and system for browsing directories and generating forms | |
US20080195495A1 (en) | Notebook system | |
JP2008542926A (en) | Pay-for-access legal research system with access to open web content | |
US8140525B2 (en) | Information processing apparatus, information processing method and computer readable information recording medium | |
WO2010124334A1 (en) | System and method for providing computer-enabled employment search services | |
US20060053105A1 (en) | Method for information retrieval | |
JP3933617B2 (en) | Shared information search method, shared information search program, and information sharing system | |
US20020107700A1 (en) | System and process for capturing, storing, maintaining and reporting information regarding databases via the internet | |
JP2006079365A (en) | Information retrieval system, information providing system, information retrieval method, program and recording medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: CREDIT SUISSE, NEW YORK Free format text: SECURITY AGREEMENT;ASSIGNOR:SKILLSOFT IRELAND LIMITED;REEL/FRAME:022240/0891 Effective date: 20090209 |
|
AS | Assignment |
Owner name: BOOKS24X7.COM, INC., MASSACHUSETTS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:JENKINS, DOUGLAS M.;CHAPPELL, GEOFF;SIDLO, JOHN;AND OTHERS;REEL/FRAME:022355/0623;SIGNING DATES FROM 20010706 TO 20011016 |
|
AS | Assignment |
Owner name: SKILLSOFT IRELAND LIMITED, IRELAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BOOKS24X7.COM, INC.;REEL/FRAME:022372/0102 Effective date: 20090205 |
|
AS | Assignment |
Owner name: SKILLSOFT IRELAND LIMITED,IRELAND Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CREDIT SUISSE;REEL/FRAME:024424/0748 Effective date: 20100518 |
|
AS | Assignment |
Owner name: MORGAN STANLEY SENIOR FUNDING, INC., AS COLLATERAL Free format text: PATENT SECURITY AGREEMENT;ASSIGNOR:SKILLSOFT IRELAND LIMITED;REEL/FRAME:025137/0920 Effective date: 20100625 |
|
AS | Assignment |
Owner name: SKILLSOFT IRELAND LIMITED, NEW HAMPSHIRE Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:MORGAN STANLEY SENIOR FUNDING, INC.;REEL/FRAME:032775/0093 Effective date: 20140428 Owner name: CBT (TECHNOLOGY) LIMITED, NEW HAMPSHIRE Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:MORGAN STANLEY SENIOR FUNDING, INC.;REEL/FRAME:032775/0093 Effective date: 20140428 Owner name: SKILLSOFT LIMITED, NEW HAMPSHIRE Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:MORGAN STANLEY SENIOR FUNDING, INC.;REEL/FRAME:032775/0093 Effective date: 20140428 |
|
AS | Assignment |
Owner name: BARCLAYS BANK PLC, AS ADMINISTRATIVE AGENT AND COL Free format text: FIRST LIEN SECURITY AGREEMENT;ASSIGNORS:SKILLSOFT IRELAND LTD;SKILLSOFT LIMITED;REEL/FRAME:032792/0820 Effective date: 20140429 |
|
AS | Assignment |
Owner name: BARCLAYS BANK PLC, AS ADMINISTRATIVE AGENT AND COL Free format text: SECOND LIEN SECURITY AGREEMENT;ASSIGNORS:SKILLSOFT IRELAND LTD;SKILLSOFT LIMITED;REEL/FRAME:032800/0213 Effective date: 20140429 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- AFTER EXAMINER'S ANSWER OR BOARD OF APPEALS DECISION |
|
AS | Assignment |
Owner name: WILMINGTON SAVINGS FUND SOCIETY, FSB, AS SUCCESSOR COLLATERAL AGENT, DELAWARE Free format text: NOTICE OF AGENCY RESIGNATION AND ASSIGNMENT OF PATENT SECURITY AGREEMENT;ASSIGNOR:BARCLAYS BANK PLC, AS RESIGNING COLLATERAL AGENT;REEL/FRAME:053236/0505 Effective date: 20200714 Owner name: WILMINGTON SAVINGS FUND SOCIETY, FSB, AS SUCCESSOR COLLATERAL AGENT, DELAWARE Free format text: NOTICE OF AGENCY RESIGNATION AND ASSIGNMENT OF PATENT SECURITY AGREEMENT;ASSIGNOR:BARCLAYS BANK PLC, AS RESIGNING COLLATERAL AGENT;REEL/FRAME:053236/0554 Effective date: 20200714 |
|
AS | Assignment |
Owner name: SKILLSOFT IRELAND LTD., IRELAND Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:WILMINGTON BANK SAVINGS FUND SOCIETY, FSB, AS COLLATERAL AGENT;REEL/FRAME:053638/0600 Effective date: 20200827 Owner name: SKILLSOFT LIMITED, IRELAND Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:WILMINGTON BANK SAVINGS FUND SOCIETY, FSB, AS COLLATERAL AGENT;REEL/FRAME:053638/0600 Effective date: 20200827 Owner name: SUMTOTAL SYSTEMS LLC, NEW HAMPSHIRE Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:WILMINGTON BANK SAVINGS FUND SOCIETY, FSB, AS COLLATERAL AGENT;REEL/FRAME:053638/0600 Effective date: 20200827 Owner name: CYBERSHIFT HOLDINGS, INC., NEW HAMPSHIRE Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:WILMINGTON BANK SAVINGS FUND SOCIETY, FSB, AS COLLATERAL AGENT;REEL/FRAME:053638/0600 Effective date: 20200827 |
|
AS | Assignment |
Owner name: SKILLSOFT IRELAND LIMITED, IRELAND Free format text: RELEASE OF FIRST LIEN PATENT SECURITY INTEREST;ASSIGNOR:WILMINGTON SAVINGS FUND SOCIETY, FSB;REEL/FRAME:053743/0249 Effective date: 20200827 Owner name: SKILLSOFT LIMITED, IRELAND Free format text: RELEASE OF FIRST LIEN PATENT SECURITY INTEREST;ASSIGNOR:WILMINGTON SAVINGS FUND SOCIETY, FSB;REEL/FRAME:053743/0249 Effective date: 20200827 Owner name: SUMTOTAL SYSTEMS LLC, NEW HAMPSHIRE Free format text: RELEASE OF FIRST LIEN PATENT SECURITY INTEREST;ASSIGNOR:WILMINGTON SAVINGS FUND SOCIETY, FSB;REEL/FRAME:053743/0249 Effective date: 20200827 Owner name: CYBERSHIFT HOLDINGS, INC., NEW HAMPSHIRE Free format text: RELEASE OF FIRST LIEN PATENT SECURITY INTEREST;ASSIGNOR:WILMINGTON SAVINGS FUND SOCIETY, FSB;REEL/FRAME:053743/0249 Effective date: 20200827 |