+

CN105095404A - Method and apparatus for processing and recommending webpage information - Google Patents

Method and apparatus for processing and recommending webpage information Download PDF

Info

Publication number
CN105095404A
CN105095404A CN201510400445.3A CN201510400445A CN105095404A CN 105095404 A CN105095404 A CN 105095404A CN 201510400445 A CN201510400445 A CN 201510400445A CN 105095404 A CN105095404 A CN 105095404A
Authority
CN
China
Prior art keywords
information
webpage
collection
generic
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510400445.3A
Other languages
Chinese (zh)
Inventor
陈庆伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anyi Hengtong Beijing Technology Co Ltd
Original Assignee
Anyi Hengtong Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anyi Hengtong Beijing Technology Co Ltd filed Critical Anyi Hengtong Beijing Technology Co Ltd
Priority to CN201510400445.3A priority Critical patent/CN105095404A/en
Publication of CN105095404A publication Critical patent/CN105095404A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9562Bookmark management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

An embodiment of the invention provides a method and an apparatus for processing and recommending webpage information, wherein the method for processing the webpage information comprises: acquiring information of favorite WebPages of a number of users; respectively analyzing information of the favorite WebPages of the users to acquire information of types of the favorite WebPages; storing the information of the favorite WebPages and the information of the types in an associated way. Through the method and the apparatus for processing and recommending the webpage information, the information of the favorite WebPages of a number of users can be automatically analyzed and classified; and the classified favorite datum are shared between users, thus improving user experience.

Description

The disposal route of info web, the recommend method of info web and device
Technical field
The present invention relates to Internet technical field, particularly relate to a kind of disposal route of info web, the recommend method of info web and device.
Background technology
Collection is the web site collection function that browser provides.User, when carrying out web page browsing by browser, by conventional, that like or need the website of mark or webpage to put into the collection of browser, can search so that follow-up, accesses.
The technology of collection data management is divided into two kinds usually.Wherein in a kind of method, manage local collection, user initiatively finds Data Source to add collection data.This kind of method has the limited defect of collection Data Source.In another approach, client carries out regular or irregular synchronous process with high in the clouds to favorites data, can realize the functions such as the remote backup of favorites data, at any time recovery.
But these two kinds of methods all only consider unique user, the collection data separate of numerous user is not got up, cannot provide more abundant Internet resources for user yet.
Summary of the invention
The object of the embodiment of the present invention is, provides a kind of disposal route of info web, the recommend method of info web and device, automatically analyzes the information of the collection webpage of multiple user, classifies, to share the collection data of classification between users.
For achieving the above object, The embodiment provides a kind of disposal route of info web, comprising: the information obtaining the collection webpage about multiple user; Respectively the information of the collection webpage of described user is analyzed, obtain the information of described collection webpage generic; Associatedly store the described information of collection webpage and the information of generic thereof.
Preferably, describedly respectively the information of the collection webpage of described user to be analyzed, the process obtaining the information of described collection webpage generic comprises: from the URL(uniform resource locator) (URL) of the acquisition of information collection webpage of described collection webpage, the content-data of described collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described collection webpage generic.
Preferably, described described content-data to be analyzed, the process obtaining the information of described collection webpage generic also comprises: analyze described content-data, obtain the information of the subclass under described collection webpage generic, the described process associatedly storing the described information of collection webpage and the information of generic thereof comprises: associatedly store the information of described collection webpage and the information of generic and subclass thereof.
Preferably, the information of described classification is the one in following classification: net purchase class, information door class, automotive-type, finance and economic, educational, game class, video display class, novel class, life kind and community's class.
Preferably, the packets of information of described collection webpage draws together web page interlinkage and web page title.
Embodiments of the invention additionally provide a kind of recommend method of info web, comprising: the information receiving the first collection webpage of the user sent from terminal device; The information of described first collection webpage is analyzed, obtains the information of described first collection webpage generic; Obtain and the described first second information of collecting webpage of collecting that webpage generic mates from collection Web page classifying database; The information of described second collection webpage is sent to described terminal device.
Preferably, the described information to described first collection webpage is analyzed, the process obtaining the information of described first collection webpage generic comprises: the URL(uniform resource locator) (URL) obtaining described first collection webpage from the information of described first collection webpage, the content-data of described first collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described first collection webpage generic.
Preferably, described described content-data to be analyzed, the process obtaining the information of described first collection webpage generic also comprises: analyze described content-data, obtaining the information of the subclass under described first collection webpage generic, describedly obtain from collection Web page classifying database and described first collect the second process of collecting the information of webpage that webpage generic mates and comprise: obtaining second information of collecting webpage of to collect with described first that webpage generic and subclass mate from collecting Web page classifying database.
Preferably, described method also comprises: screen according to the information of predetermined recommendation rules to described second collection webpage, and the described information by described second collection webpage sends to the process of described terminal device to comprise: the information of the second webpage after screening is sent to described terminal device.
Preferably, described predetermined recommendation rules comprises: will collect in the information of the second webpage that webpage generic mates with described first, user collect number of times exceed setting collection frequency threshold value the second webpage send to user, or, to collect in the information of the second webpage that webpage generic mates with described first, the second webpage that user's access times exceed setting access times threshold value sends to user.
Embodiments of the invention additionally provide a kind of recommend method of info web, comprising: the operation of collecting the first webpage in response to user, send the request of described first webpage of collection to server; The response comprising the information of collecting the second webpage that webpage generic mates with described first is received from described server; The information of described second webpage of reception is shown to user.
Preferably, described method also comprises: indicate the order of collecting described second webpage in response to user, described second webpage is added to web page storage folder.
Embodiments of the invention additionally provide a kind for the treatment of apparatus of info web, comprising: info web acquisition module, for obtaining the information of the collection webpage about multiple user; Classification information acquisition module, for analyzing the information of the collection webpage of described user respectively, obtains the information of described collection webpage generic; Information storage module, for associatedly storing the described information of collection webpage and the information of generic thereof.
Preferably, described classification information acquisition module is used for the URL(uniform resource locator) (URL) of the acquisition of information collection webpage from described collection webpage, the content-data of described collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described collection webpage generic.
Preferably, described classification information acquisition module is also for analyzing described content-data, obtain the information of the subclass under described collection webpage generic, described information storage module is used for associatedly storing the information of described collection webpage and the information of generic and subclass thereof.
Embodiments of the invention additionally provide a kind of recommendation apparatus of info web, comprising: info web receiver module, for receiving the information of the first collection webpage of the user sent from terminal device; Info web analysis module, for analyzing the information of described first collection webpage, obtains the information of described first collection webpage generic; Info web acquisition module, for obtaining from collection Web page classifying database and the described first second information of collecting webpage of collecting that webpage generic mates; Info web sending module, for sending to described terminal device by the information of described second collection webpage.
Preferably, described info web analysis module is used for the URL(uniform resource locator) (URL) obtaining described first collection webpage from the information of described first collection webpage, the content-data of described first collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described first collection webpage generic.
Preferably, described info web analysis module is also for analyzing described content-data, obtain the information of the subclass under described first collection webpage generic, described info web acquisition module is used for obtaining and the described first second information of collecting webpage of collecting that webpage generic and subclass mate from collection Web page classifying database.
Preferably, described device also comprises: info web screening module, for screening according to the information of predetermined recommendation rules to described second collection webpage, the information that described info web sending module is used for the second webpage after by screening sends to described terminal device.
Embodiments of the invention additionally provide a kind of recommendation apparatus of info web, comprising: collection request sending module, for collecting the operation of the first webpage in response to user, send the request of described first webpage of collection to server; Info web receiver module, for receiving the response comprising the information of collecting the second webpage that webpage generic mates with described first from described server; Info web display module, for showing the information of described second webpage of reception to user.
The disposal route of the info web that the embodiment of the present invention provides, the recommend method of info web and information analyze of device to the collection webpage of the multiple users got obtain the information of collecting webpage generic, again the collection information of webpage and the information of generic thereof are carried out association store, achieve and automatically the information of the collection webpage of multiple user is analyzed, classification, the convenient collection data sharing classification between users, thus recommend the information of the collection webpage of its demand for user, the time that user searches information can be reduced, enrich user network experience.
Accompanying drawing explanation
Fig. 1 is the process flow diagram of the disposal route of the info web that the embodiment of the present invention one is shown;
Fig. 2 is the process flow diagram of the recommend method of the info web that the embodiment of the present invention two is shown;
Fig. 3 is the process flow diagram of the recommend method of the info web that the embodiment of the present invention three is shown;
Fig. 4 is the logic diagram of the treating apparatus of the info web that the embodiment of the present invention four is shown;
Fig. 5 is the logic diagram of the recommendation apparatus of the info web that the embodiment of the present invention five is shown;
Fig. 6 is the logic diagram of the recommendation apparatus of the info web that the embodiment of the present invention six is shown.
Embodiment
Basic conception of the present invention is, there is provided a kind of processing mode of collecting webpage: after the information of collection webpage obtaining multiple user, the information of the collection webpage of the described user of further analysis, by analyzing the information obtaining collecting webpage generic, between the collection information of webpage and the information of generic thereof, set up corresponding relation and associatedly store, can provide more abundant based on the collection data of numerous user for user and more press close to the Internet resources of its demand thus, promote user network and experience.
Describe the disposal route of info web of the embodiment of the present invention, the recommend method of info web in detail below in conjunction with accompanying drawing and use the device of described method.
Embodiment one
Fig. 1 is the process flow diagram of the disposal route of the info web that the embodiment of the present invention one is shown.Such as can perform described method at server end device as described in Figure 4.
With reference to Fig. 1, in step S110, obtain the information of the collection webpage about multiple user.
For example, when supposing that user browses webpage " www.tmall.com ", click by right key " adding collection to ", now server can get the information of the collection webpage of this user, and wherein, the information of collection webpage can comprise, but be not limited to web page interlinkage and web page title, the web page interlinkage of above-mentioned example is " www.tmall.com ", and web page title is " sky cat tmall.com-still sky cat, has just purchased ".In like manner, when there being multiple user to add collection, server just can obtain the information of the collection webpage of multiple user.
In step S120, respectively the information of the collection webpage of described user is analyzed, obtain the information of described collection webpage generic.
It should be noted that, the information of described classification may be, but not limited to, the one in following classification: net purchase class, information door class, automotive-type, finance and economic, educational, game class, video display class, novel class, life kind and community's class.Such as, the information of webpage " www.autohome.com.cn " generic is automotive-type, and the information of webpage " www.7k7k.com " generic is game class.
After the information of collection webpage getting multiple user, according to exemplary embodiment of the present invention, step S120 comprises: from the URL(uniform resource locator) (URL) of the acquisition of information collection webpage of described collection webpage, the content-data of described collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described collection webpage generic.
Still for aforementioned webpage " www.tmall.com ", more can obtain its content-data according to the URL of this webpage, known by analysis, the information of this webpage generic is net purchase class.
In step S130, associatedly store the described information of collection webpage and the information of generic thereof.
That is, the collection information of webpage and the information of generic thereof are set up corresponding relation, thus associatedly stores, such as, the information of collection webpage and the information of generic thereof are stored in a database.
In order to improve the accuracy of classification, the information of multi-level generic can be set, be convenient to for user recommends more to press close to the information of the collection webpage of its demand, step S120 can comprise: analyze described content-data, obtains the information of the subclass under described collection webpage generic.Correspondingly, according to exemplary embodiment of the present invention, step S130 comprises: associatedly store the information of described collection webpage and the information of generic and subclass thereof.
In order to clearly understand embodiments of the invention, below in conjunction with concrete example, the present invention is described in detail.Suppose that user has collected webpage www.nuomi.com, server can get the information of the collection webpage of this user, wherein, web page interlinkage is " www.nuomi.com ", web page title is " Baidu's glutinous rice ", its content-data is obtained again according to the URL of this webpage, information through existing content analysis techniques known collection webpage generic is net purchase class, if the information of generic is set to two-stage, the subclass of so net purchase class can include but not limited to, purchase by group, brand-name integration, dress ornament etc., above-mentioned analysis also can be learnt, the information of the subclass under this collection webpage generic purchases by group, thus by " www.nuomi.com ", " Baidu's glutinous rice ", " net purchase class " and " purchasing by group " associatedly store.
The disposal route of the info web that the embodiment of the present invention provides, the information of collecting webpage generic is obtained to the information analysis of the collection webpage of the multiple users got, again the collection information of webpage and the information of generic thereof are carried out association store, automatically the information of the collection webpage of multiple user is analyzed, classified, the Information Pull of the collection webpage of numerous user is got up, the convenient collection data sharing classification between users, promote user network and experience.
Embodiment two
Fig. 2 is the process flow diagram of the recommend method of the info web that the embodiment of the present invention two is shown.Such as can perform described method at server end device as described in Figure 5.
With reference to Fig. 2, in step S210, receive the information of the first collection webpage of the user sent from terminal device.
In step S220, the information of described first collection webpage is analyzed, obtain the information of described first collection webpage generic.
Here, the information of described classification may be, but not limited to, the one in following classification: net purchase class, information door class, automotive-type, finance and economic, educational, game class, video display class, novel class, life kind and community's class.
In order to obtain the information of collecting webpage generic accurately, according to exemplary embodiment of the present invention, similar to the embodiment of step S120, step S220 comprises: the URL obtaining described first collection webpage from the information of described first collection webpage, the content-data of described first collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described first collection webpage generic.
In step S230, obtain and the described first second information of collecting webpage of collecting that webpage generic mates from collection Web page classifying database.
That is, using the information of described first collection webpage generic as coupling foundation, from the information by obtaining the second collection webpage the collection Web page classifying database of the method establishment described in embodiment one.For example, suppose that the first collection webpage is " www.tmall.com ", the information of its generic is net purchase class, matches " www.jd.com ", the information of " www.taobao.com " generic is also all net purchase classes from collection Web page classifying database.
In the application of reality, suppose that the demand of user is group buying websites, purchase by group and belong to net purchase class, in broad terms, the website that net purchase class comprises is numerous, can be purchase by group, also can be other websites such as brand-name integration, this just needs the further refinement of classification, considers, classification is subdivided into multiple subclass from the vertical field of classification.
Therefore, described in step S220, described content-data is analyzed, the process obtaining the information of described first collection webpage generic also can comprise: analyze described content-data, obtains the information of the subclass under described first collection webpage generic.Correspondingly, according to exemplary embodiment of the present invention, step S230 comprises: obtain and the described first second information of collecting webpage of collecting that webpage generic and subclass mate from collection Web page classifying database.
Suppose that the first collection webpage is " www.nuomi.com ", through the process of step S210 ~ S230, the information of its generic is net purchase class, the information of subclass purchases by group, and the information of the generic and subclass that get " www.meituan.com " from collection Web page classifying database is also net purchase class and purchases by group.
In step S240, the information of described second collection webpage is sent to described terminal device.
In previous examples, the information of the webpage " www.meituan.com " got from collection Web page classifying database is sent to user terminal as the information that second collects webpage.
Further, described method can also comprise: the information of webpage of having collected from user described in the information deletion of the second collection webpage obtained.Consider the information finally sending to the information of the user's second collection webpage likely to contain the webpage that user has collected, bad network can be brought to experience to user in order to avoid repeating to provide, just needing the content repeated to delete.It should be noted that, " information of the webpage that described user has collected " can be that client together sends to server when sending the information of the first collection webpage, and also can be user when adding collection, server sync stores.
Except above-mentioned using classification information as recommendation according to except, further, described method can also comprise: according to predetermined recommendation rules to described second collection webpage information screen.Correspondingly, according to exemplary embodiment of the present invention, step S240 comprises: the information of the second webpage after screening is sent to described terminal device.
It should be noted that, aforementioned predetermined recommendation rules can comprise, but be not limited to, such as will collect in the information of the second webpage that webpage generic mates with described first, user collect number of times exceed setting collection frequency threshold value the second webpage send to user, or will collect in the information of the second webpage that webpage generic mates with described first, the second webpage that user's access times exceed setting access times threshold value sends to user.
That is, not that the information of collection webpages all under same classification is all recommended active user, but collection webpages all under same classification is screened further, such as, the user adding up each collection webpage under a certain classification respectively collects number of times, and setting collection frequency threshold value is 10,000, and the collection webpage exceeding this collection frequency threshold value is sent to user, or sort according to collection number of times, the collection webpage being positioned at front three after sequence is sent to user.Again such as, add up user's access times of each collection webpage under a certain classification respectively, the collection webpage exceeding setting access times threshold value is sent to user, or sorts according to access times, the collection webpage being positioned at front three after sequence is sent to user.
The recommend method of the info web that the embodiment of the present invention provides, carry out the collection info web of the user received analyzing the information obtaining its generic, further with the information of the classification analyzed for coupling foundation, the information of collection webpage is obtained from collection Web page classifying database, thus collection webpage recommending classification information matched is to user, the collection data achieved based on numerous user provide the information of the collection webpage of more abundant demand of being close to the users for user, improve user network and experience.
In addition, by removing the information of the webpage that user had collected from the information of the collection webpage got, avoid as user provides the content of repetition.
Embodiment three
Fig. 3 is the process flow diagram of the recommend method of the info web that the embodiment of the present invention three is shown.Such as can perform described method on client device as shown in Figure 6.
With reference to Fig. 3, in step S310, collect the operation of the first webpage in response to user, send the request of described first webpage of collection to server.
Such as, user clicks " adding collection to " on webpage " www.nuomi.com ", triggers and performs step S310, sends the request of this webpage of collection to server.
In step S320, receive the response comprising the information of collecting the second webpage that webpage generic mates with described first from described server.
Particularly, other users matched with the generic of the webpage of requested for collection that reception server returns collect the information of webpage.Corresponding in the method flow of embodiment two according to the information step of screening of predetermined recommendation rules to described second collection webpage, according to exemplary embodiment of the present invention, step S320 comprises: the information receiving the second webpage after screening from described server.Such as, the user that reception server returns collects the collection webpage that number of times rank is positioned at the first five.
In step S330, show the information of described second webpage of reception to user.
Correspondingly, according to exemplary embodiment of the present invention, step S330 comprises: the information of the second webpage after screening is sent to described terminal device.
After showing user, user can select the information of whether collecting the webpage returned according to wish, for the information of the webpage of meeting consumers' demand, further, described method also can comprise: indicate the order of collecting described second webpage in response to user, described second webpage is added to web page storage folder.Still for aforementioned webpage " www.nuomi.com ", be the webpage of net purchase class to the known user's request of this web page analysis, the webpage so just such as " www.meituan.com ", " t.dianping.com " etc. being purchased by group class sends to this user.If user selects to collect " www.meituan.com ", correspondingly operate.
The recommend method of the info web that the embodiment of the present invention provides, send the request of collection webpage to server when collection operation occurs user, thus can receive from server the information that other users matched with the classification information of collecting webpage collect webpage, and then show the information that other users collect webpage, achieve the collection data sharing classification between users.In addition, user can select whether to collect the information that other users collect webpage as required, if instruction collection is just added in collection, facilitates user's subsequent access, improves user network and experiences.
Embodiment four
Fig. 4 is the logic diagram of the treating apparatus of the info web that the embodiment of the present invention four is shown.Can be used for performing the method step of embodiment as shown in Figure 1.
With reference to Fig. 4, the treating apparatus of described info web comprises info web acquisition module 410, classification information acquisition module 420 and information storage module 430.
Info web acquisition module 410 is for obtaining the information of the collection webpage about multiple user.
Classification information acquisition module 420, for analyzing the information of the collection webpage of described user respectively, obtains the information of described collection webpage generic.
Particularly, described classification information acquisition module 420 is for the URL(uniform resource locator) (URL) of the acquisition of information collection webpage from described collection webpage, the content-data of described collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described collection webpage generic.
Here, the information of described classification is the one in following classification: net purchase class, information door class, automotive-type, finance and economic, educational, game class, video display class, novel class, life kind and community's class.
Information storage module 430 is for associatedly storing the described information of collection webpage and the information of generic thereof.
In order to obtain the classification of collecting webpage more accurately, further, described classification information acquisition module 420 also can be used for analyzing described content-data, obtain the information of the subclass under described collection webpage generic, correspondingly, described information storage module 430 can be used for associatedly storing the information of described collection webpage and the information of generic and subclass thereof.
The treating apparatus of the info web that the embodiment of the present invention provides, the information of collecting webpage generic is obtained to the information analysis of the collection webpage of the multiple users got, again the collection information of webpage and the information of generic thereof are carried out association store, automatically the information of the collection webpage of multiple user is analyzed, classified, the Information Pull of the collection webpage of numerous user is got up, the convenient collection data sharing classification between users, promote user network and experience.
Embodiment five
Fig. 5 is the logic diagram of the recommendation apparatus of the info web that the embodiment of the present invention five is shown.Can be used for performing the method step of embodiment as shown in Figure 2.
With reference to Fig. 5, the recommendation apparatus of described info web comprises info web receiver module 510, info web analysis module 520, info web acquisition module 530 and info web sending module 540.
Info web receiver module 510 is for receiving the information of the first collection webpage of the user sent from terminal device.
Info web analysis module 520, for analyzing the information of described first collection webpage, obtains the information of described first collection webpage generic.
Particularly, described info web analysis module 520 is for obtaining the URL(uniform resource locator) (URL) of described first collection webpage in the information from described first collection webpage, the content-data of described first collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described first collection webpage generic.
Here, the information of described classification is the one in following classification: net purchase class, information door class, automotive-type, finance and economic, educational, game class, video display class, novel class, life kind and community's class.
Info web acquisition module 530 is for obtaining from collection Web page classifying database and the described first second information of collecting webpage of collecting that webpage generic mates.
Further, described info web analysis module 520 also can be used for analyzing described content-data, obtain the information of the subclass under described first collection webpage generic, correspondingly, described info web acquisition module 530 can be used for obtaining and the described first second information of collecting webpage of collecting that webpage generic and subclass mate from collection Web page classifying database.
Info web sending module 540 is for sending to described terminal device by the information of described second collection webpage.
Except above-mentioned using classification information as recommendation according to except, further, described device also can comprise: info web screening module (not shown) is used for screening according to the information of predetermined recommendation rules to described second collection webpage, correspondingly, info web sending module 540 is for sending to described terminal device by the information of the second webpage after screening.
Alternatively, described device also comprises: the information of the webpage that info web removing module (not shown) has been collected for user described in the information deletion from the second collection webpage obtained.
The recommendation apparatus of the info web that the embodiment of the present invention provides, carry out the collection info web of the user received analyzing the information obtaining its generic, further with the information of the classification analyzed for coupling foundation, the information of collection webpage is obtained from collection Web page classifying database, thus by collection webpage recommending identical for classification information to user, achieve the information that the collection data based on numerous user provide more abundant collection webpage for user, improve user network and experience.In addition, by removing the information of the webpage that user had collected from the information of the collection webpage got, avoid as user provides the content of repetition.
Embodiment six
Fig. 6 is the logic diagram of the recommendation apparatus of the info web that the embodiment of the present invention six is shown.Can be used for performing the method step of embodiment as shown in Figure 3.
With reference to Fig. 6, the recommendation apparatus of described info web comprises collection request sending module 610, info web receiver module 620 and info web display module 630.
Collection request sending module 610, for collecting the operation of the first webpage in response to user, sends the request of described first webpage of collection to server.
Info web display module 620 is for receiving the response comprising the information of collecting the second webpage that webpage generic mates with described first from described server.
Particularly, info web display module 620 is for receiving the information of the second webpage after screening from described server.
Info web display module 630 is for showing the information of described second webpage of reception to user.
Correspondingly, info web display module 630 for show screening from reception to user after the information of described second webpage.
Preferably, described device can also comprise: web page storage module (not shown) is used for indicating the order of collecting described second webpage in response to user, described second webpage is added to web page storage folder.
The recommendation apparatus of the info web that the embodiment of the present invention provides, send the request of collection webpage to server when collection operation occurs user, thus can receive from server the information that other users matched with the classification information of collecting webpage collect webpage, and then show the information that other users collect webpage, achieve the collection data sharing classification between users.In addition, user can select whether to collect the information that other users collect webpage as required, if instruction collection is just added in collection, facilitates user's subsequent access, improves user network and experiences.
In several embodiment provided by the present invention, should be understood that, disclosed apparatus and method, can realize by another way.Such as, device embodiment described above is only schematic, and such as, the division of described module, is only a kind of logic function and divides, and actual can have other dividing mode when realizing.
In addition, each functional module in each embodiment of the present invention can be integrated in a processing module, also can be that the independent physics of modules exists, also can two or more module integrations in a module.Above-mentioned integrated module both can adopt the form of hardware to realize, and the form that hardware also can be adopted to add software function module realizes.
The above-mentioned integrated module realized with the form of software function module, can be stored in a computer read/write memory medium.Above-mentioned software function module is stored in a storage medium, comprising some instructions in order to make a computer equipment (can be personal computer, server, or the network equipment etc.) or processor (processor) perform the part steps of method described in each embodiment of the present invention.And aforesaid storage medium comprises: USB flash disk, portable hard drive, ROM (read-only memory) (Read-OnlyMemory, ROM), random access memory (RandomAccessMemory, RAM), magnetic disc or CD etc. various can be program code stored medium.
The above; be only the specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, is anyly familiar with those skilled in the art in the technical scope that the present invention discloses; change can be expected easily or replace, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of described claim.

Claims (20)

1. a disposal route for info web, is characterized in that, described method comprises:
Obtain the information about the collection webpage of multiple user;
Respectively the information of the collection webpage of described user is analyzed, obtain the information of described collection webpage generic;
Associatedly store the described information of collection webpage and the information of generic thereof.
2. method according to claim 1, is characterized in that, describedly analyzes the information of the collection webpage of described user respectively, and the process obtaining the information of described collection webpage generic comprises:
From the uniform resource position mark URL of the acquisition of information collection webpage of described collection webpage,
The content-data of described collection webpage is obtained according to described URL,
Described content-data is analyzed, obtains the information of described collection webpage generic.
3. method according to claim 2, it is characterized in that, describedly analyze described content-data, the process obtaining the information of described collection webpage generic also comprises: analyze described content-data, obtain the information of the subclass under described collection webpage generic
The described process associatedly storing the described information of collection webpage and the information of generic thereof comprises: associatedly store the information of described collection webpage and the information of generic and subclass thereof.
4. the method according to any one of claims 1 to 3, it is characterized in that, the information of described classification is the one in following classification: net purchase class, information door class, automotive-type, finance and economic, educational, game class, video display class, novel class, life kind and community's class.
5. the method according to any one of claims 1 to 3, is characterized in that, the packets of information of described collection webpage draws together web page interlinkage and web page title.
6. a recommend method for info web, is characterized in that, described method comprises:
Receive the information of the first collection webpage of the user sent from terminal device;
The information of described first collection webpage is analyzed, obtains the information of described first collection webpage generic;
Obtain and the described first second information of collecting webpage of collecting that webpage generic mates from collection Web page classifying database;
The information of described second collection webpage is sent to described terminal device.
7. method according to claim 6, is characterized in that, the described information to described first collection webpage is analyzed, and the process obtaining the information of described first collection webpage generic comprises:
The uniform resource position mark URL of described first collection webpage is obtained from the information of described first collection webpage,
The content-data of described first collection webpage is obtained according to described URL,
Described content-data is analyzed, obtains the information of described first collection webpage generic.
8. method according to claim 7, it is characterized in that, described described content-data to be analyzed, the process obtaining the information of described first collection webpage generic also comprises: analyze described content-data, obtain the information of the subclass under described first collection webpage generic
Describedly obtaining from collection Web page classifying database and described first collect the second process of collecting the information of webpage that webpage generic mates and comprise: obtaining second information of collecting webpage of to collect with described first that webpage generic and subclass mate from collecting Web page classifying database.
9. method according to claim 8, is characterized in that, described method also comprises: screen according to the information of predetermined recommendation rules to described second collection webpage,
The described information by described second collection webpage sends to the process of described terminal device to comprise: the information of the second webpage after screening is sent to described terminal device.
10. method according to claim 9, is characterized in that, described predetermined recommendation rules comprises:
To collect in the information of the second webpage that webpage generic mates with described first, user collect number of times exceed setting collection frequency threshold value the second webpage send to user, or,
To collect in the information of the second webpage that webpage generic mates with described first, the second webpage that user's access times exceed setting access times threshold value sends to user.
The recommend method of 11. 1 kinds of info webs, is characterized in that, described method comprises:
Collect the operation of the first webpage in response to user, send the request of described first webpage of collection to server;
The response comprising the information of collecting the second webpage that webpage generic mates with described first is received from described server;
The information of described second webpage of reception is shown to user.
12. methods according to claim 11, is characterized in that, described method also comprises:
Indicate the order of collecting described second webpage in response to user, described second webpage is added to web page storage folder.
The treating apparatus of 13. 1 kinds of info webs, is characterized in that, described device comprises:
Info web acquisition module, for obtaining the information of the collection webpage about multiple user;
Classification information acquisition module, for analyzing the information of the collection webpage of described user respectively, obtains the information of described collection webpage generic;
Information storage module, for associatedly storing the described information of collection webpage and the information of generic thereof.
14. devices according to claim 13, it is characterized in that, described classification information acquisition module is used for the uniform resource position mark URL of the acquisition of information collection webpage from described collection webpage, the content-data of described collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described collection webpage generic.
15. devices according to claim 14, is characterized in that, described classification information acquisition module, also for analyzing described content-data, obtains the information of the subclass under described collection webpage generic,
Described information storage module is used for associatedly storing the information of described collection webpage and the information of generic and subclass thereof.
The recommendation apparatus of 16. 1 kinds of info webs, is characterized in that, described device comprises:
Info web receiver module, for receiving the information of the first collection webpage of the user sent from terminal device;
Info web analysis module, for analyzing the information of described first collection webpage, obtains the information of described first collection webpage generic;
Info web acquisition module, for obtaining from collection Web page classifying database and the described first second information of collecting webpage of collecting that webpage generic mates;
Info web sending module, for sending to described terminal device by the information of described second collection webpage.
17. devices according to claim 16, it is characterized in that, described info web analysis module is used for the uniform resource position mark URL obtaining described first collection webpage from the information of described first collection webpage, the content-data of described first collection webpage is obtained according to described URL, described content-data is analyzed, obtains the information of described first collection webpage generic.
18. devices according to claim 17, is characterized in that, described info web analysis module, also for analyzing described content-data, obtains the information of the subclass under described first collection webpage generic,
Described info web acquisition module is used for obtaining and the described first second information of collecting webpage of collecting that webpage generic and subclass mate from collection Web page classifying database.
19. devices according to claim 18, is characterized in that, described device also comprises: info web screening module, for screening according to the information of predetermined recommendation rules to described second collection webpage,
The information that described info web sending module is used for the second webpage after by screening sends to described terminal device.
The recommendation apparatus of 20. 1 kinds of info webs, is characterized in that, described device comprises:
Collection request sending module, for collecting the operation of the first webpage in response to user, sends the request of described first webpage of collection to server;
Info web receiver module, for receiving the response comprising the information of collecting the second webpage that webpage generic mates with described first from described server;
Info web display module, for showing the information of described second webpage of reception to user.
CN201510400445.3A 2015-07-09 2015-07-09 Method and apparatus for processing and recommending webpage information Pending CN105095404A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510400445.3A CN105095404A (en) 2015-07-09 2015-07-09 Method and apparatus for processing and recommending webpage information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510400445.3A CN105095404A (en) 2015-07-09 2015-07-09 Method and apparatus for processing and recommending webpage information

Publications (1)

Publication Number Publication Date
CN105095404A true CN105095404A (en) 2015-11-25

Family

ID=54575841

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510400445.3A Pending CN105095404A (en) 2015-07-09 2015-07-09 Method and apparatus for processing and recommending webpage information

Country Status (1)

Country Link
CN (1) CN105095404A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107203530A (en) * 2016-03-16 2017-09-26 北大方正集团有限公司 Information recommendation method
CN109948035A (en) * 2017-09-28 2019-06-28 广州市动景计算机科技有限公司 Information sharing method, apparatus and system
CN110020335A (en) * 2017-07-28 2019-07-16 北京搜狗科技发展有限公司 The treating method and apparatus of collection
CN115687662A (en) * 2022-09-26 2023-02-03 北京字跳网络技术有限公司 Multimedia work processing method, device, equipment and storage medium

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0643359A2 (en) * 1993-09-09 1995-03-15 Mni Interactive Method and apparatus for recommending selections based on preferences in a multi-user system
WO2004095317A1 (en) * 2003-04-24 2004-11-04 Sony Corporation Content search program, method, and device based on user preference
CN102819555A (en) * 2012-06-27 2012-12-12 北京奇虎科技有限公司 A method and device for loading recommended information in web page reading mode
CN102843586A (en) * 2011-06-21 2012-12-26 华为软件技术有限公司 Video recommendation method and terminal
CN102930057A (en) * 2012-11-21 2013-02-13 北京奇虎科技有限公司 Search implementation method and device
CN102968459A (en) * 2012-11-01 2013-03-13 北京奇虎科技有限公司 Website processing method and device
CN102982068A (en) * 2012-10-25 2013-03-20 北京奇虎科技有限公司 Method for displaying recommended data and corresponding browser
CN102982069A (en) * 2012-10-25 2013-03-20 北京奇虎科技有限公司 Method and device for recommended data displaying
US20140052587A1 (en) * 2012-08-15 2014-02-20 Zindigo, Inc. Social commerce agent store replication
CN103678555A (en) * 2013-12-06 2014-03-26 北京奇虎科技有限公司 Webpage collecting method and browser
CN104239338A (en) * 2013-06-19 2014-12-24 阿里巴巴集团控股有限公司 Information recommendation method and information recommendation device
CN104615770A (en) * 2015-02-13 2015-05-13 深圳市欧珀通信软件有限公司 Recommendation method and recommendation device for data of bookmark of mobile terminal
CN104699704A (en) * 2013-12-06 2015-06-10 腾讯科技(深圳)有限公司 Content pushing and receiving method, device and system

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0643359A2 (en) * 1993-09-09 1995-03-15 Mni Interactive Method and apparatus for recommending selections based on preferences in a multi-user system
WO2004095317A1 (en) * 2003-04-24 2004-11-04 Sony Corporation Content search program, method, and device based on user preference
CN102843586A (en) * 2011-06-21 2012-12-26 华为软件技术有限公司 Video recommendation method and terminal
CN102819555A (en) * 2012-06-27 2012-12-12 北京奇虎科技有限公司 A method and device for loading recommended information in web page reading mode
US20140052587A1 (en) * 2012-08-15 2014-02-20 Zindigo, Inc. Social commerce agent store replication
CN102982068A (en) * 2012-10-25 2013-03-20 北京奇虎科技有限公司 Method for displaying recommended data and corresponding browser
CN102982069A (en) * 2012-10-25 2013-03-20 北京奇虎科技有限公司 Method and device for recommended data displaying
CN102968459A (en) * 2012-11-01 2013-03-13 北京奇虎科技有限公司 Website processing method and device
CN102930057A (en) * 2012-11-21 2013-02-13 北京奇虎科技有限公司 Search implementation method and device
CN104239338A (en) * 2013-06-19 2014-12-24 阿里巴巴集团控股有限公司 Information recommendation method and information recommendation device
CN103678555A (en) * 2013-12-06 2014-03-26 北京奇虎科技有限公司 Webpage collecting method and browser
CN104699704A (en) * 2013-12-06 2015-06-10 腾讯科技(深圳)有限公司 Content pushing and receiving method, device and system
CN104615770A (en) * 2015-02-13 2015-05-13 深圳市欧珀通信软件有限公司 Recommendation method and recommendation device for data of bookmark of mobile terminal

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107203530A (en) * 2016-03-16 2017-09-26 北大方正集团有限公司 Information recommendation method
CN110020335A (en) * 2017-07-28 2019-07-16 北京搜狗科技发展有限公司 The treating method and apparatus of collection
CN110020335B (en) * 2017-07-28 2022-04-26 北京搜狗科技发展有限公司 Favorite processing method and device
CN109948035A (en) * 2017-09-28 2019-06-28 广州市动景计算机科技有限公司 Information sharing method, apparatus and system
CN115687662A (en) * 2022-09-26 2023-02-03 北京字跳网络技术有限公司 Multimedia work processing method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
Khder Web scraping or web crawling: State of art, techniques, approaches and application.
CN102799610B (en) Method and system for collecting network information
CN102855309B (en) A kind of information recommendation method based on user behavior association analysis and device
US20150278359A1 (en) Method and apparatus for generating a recommendation page
US9177341B2 (en) Determining search relevance from user feedback
CN104899220A (en) Application program recommendation method and system
CN105404699A (en) Method, device and server for searching articles of finance and economics
CN105589914A (en) A web page pre-reading method, device and intelligent terminal equipment
CN104462336A (en) Information pushing method and device
CN108197244A (en) It is a kind of to search for the method for pushing and device for recommending word
CN103761296A (en) Method and system for analyzing network behaviors of mobile terminal users
KR20130059738A (en) System and method for recommending application using contents analysis
CN111723273A (en) Smart cloud retrieval system and method
CN110929058B (en) Trademark picture retrieval method and device, storage medium and electronic device
CN110362737B (en) Recommended content pushing method and device and server
CN103838754A (en) Information searching device and method
CN105335423A (en) Collecting and processing method and apparatus for user feedbacks of webpage
CN103164423A (en) Method and device for confirming browser inner core type rendering web pages
JP7200069B2 (en) Information processing device, vector generation method and program
CN105718578A (en) Short link generation method and device
CN105095404A (en) Method and apparatus for processing and recommending webpage information
CN112035744A (en) Page recommendation method, device, equipment and storage medium
CN116561402A (en) Method, device and server for acquiring target content information in webpage
CN103605742A (en) Method and device for recognizing network resource entity content page
CN103425767B (en) A kind of determination method and system pointing out data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20151125

RJ01 Rejection of invention patent application after publication
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载