+

CN110297967B - Method, device and equipment for determining interest points and computer readable storage medium - Google Patents

Method, device and equipment for determining interest points and computer readable storage medium Download PDF

Info

Publication number
CN110297967B
CN110297967B CN201910398136.5A CN201910398136A CN110297967B CN 110297967 B CN110297967 B CN 110297967B CN 201910398136 A CN201910398136 A CN 201910398136A CN 110297967 B CN110297967 B CN 110297967B
Authority
CN
China
Prior art keywords
interest
point
points
determining
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910398136.5A
Other languages
Chinese (zh)
Other versions
CN110297967A (en
Inventor
何伯磊
肖欣延
吴甜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201910398136.5A priority Critical patent/CN110297967B/en
Publication of CN110297967A publication Critical patent/CN110297967A/en
Application granted granted Critical
Publication of CN110297967B publication Critical patent/CN110297967B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present disclosure provides a method, an apparatus, a device and a computer-readable storage medium for determining a point of interest, including: mining network resources through a pre-trained mining classifier to obtain interest points; determining an incidence relation between the interest points according to the obtained interest points; and determining the target interest points of the user according to the incidence relation among the interest points. The method, the device, the equipment and the computer readable storage medium provided by the disclosure can extract the interest points in time according to network resources and construct the association between the interest points, so that the target interest points of the user can be determined according to the known user information and the association relation between the interest points, and the content which is possibly interested by the user can be determined quickly under the condition that the iteration speed of the internet information is high.

Description

兴趣点确定方法、装置、设备及计算机可读存储介质Point-of-interest determination method, apparatus, device, and computer-readable storage medium

技术领域technical field

本公开涉及内容推送技术,尤其涉及一种兴趣点确定方法、装置、设备及计算机可读存储介质。The present disclosure relates to content push technology, and in particular, to a method, apparatus, device, and computer-readable storage medium for determining a point of interest.

背景技术Background technique

随着互联网的发展,对用户进行个性化推荐内容越来越普及,通过有针对性的向用户推荐内容,能够使得用户更快捷的找到感兴趣内容,并于用户浏览。With the development of the Internet, personalized content recommendation for users is becoming more and more popular. By recommending content to users in a targeted manner, users can more quickly find interesting content and browse for users.

在个性化推荐过程中,现有技术中采用的方式是基于用户历史数据确定用户的兴趣或潜在兴趣,再向用户进行推送。In the personalized recommendation process, the method adopted in the prior art is to determine the user's interest or potential interest based on the user's historical data, and then push it to the user.

但是,网络信息的迭代速度较快,若仅根据用户的历史数据向用户推送内容,容易将与历史信息相关的内容推送给用户,而无法将当前较新的内容推送给用户。因此,这种尝试性推荐的方式,无法准确的确定用户当前有可能感兴趣的内容,进而无法准确的向用户推送其感兴趣的内容。However, the iteration speed of network information is relatively fast. If content is only pushed to users based on the user's historical data, it is easy to push the content related to the historical information to the user, but it is impossible to push the current newer content to the user. Therefore, this tentative recommendation method cannot accurately determine the content that the user may be interested in at present, and thus cannot accurately push the interested content to the user.

发明内容SUMMARY OF THE INVENTION

本公开提供一种兴趣点确定方法、装置、设备及计算机可读存储介质,以解决现有技术中无法准确的确定用户当前感兴趣的内容,进而无法准确的向用户推送其感兴趣的内容的问题。The present disclosure provides a method, apparatus, device, and computer-readable storage medium for determining a point of interest, so as to solve the problem that in the prior art, the content of the user's current interest cannot be accurately determined, and thus the content of interest cannot be accurately pushed to the user. question.

本公开的第一个方面是提供一种兴趣点确定方法,包括:A first aspect of the present disclosure is to provide a method for determining a point of interest, including:

通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点;Mining network resources through pre-trained mining classifiers to obtain points of interest;

根据获取的所述兴趣点确定兴趣点间的关联关系;Determine the association relationship between the points of interest according to the acquired points of interest;

根据所述兴趣点间的关联关系确定用户的目标兴趣点。The target interest point of the user is determined according to the relationship between the interest points.

本公开的另一个方面是提供一种兴趣点确定装置,包括:Another aspect of the present disclosure is to provide an apparatus for determining a point of interest, including:

挖掘模块,用于通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点;The mining module is used to mine network resources and obtain points of interest through pre-trained mining classifiers;

关联模块,用于根据获取的所述兴趣点确定兴趣点间的关联关系;an association module, configured to determine the association relationship between the interest points according to the acquired interest points;

确定模块,用于根据所述兴趣点间的关联关系确定用户的目标兴趣点。The determining module is configured to determine the target interest point of the user according to the association relationship between the interest points.

本公开的又一个方面是提供一种兴趣点确定设备,包括:Yet another aspect of the present disclosure is to provide a point-of-interest determination device, including:

存储器;memory;

处理器;以及processor; and

计算机程序;Computer program;

其中,所述计算机程序存储在所述存储器中,并配置为由所述处理器执行以实现如上述第一方面所述的兴趣点确定方法。Wherein, the computer program is stored in the memory and configured to be executed by the processor to implement the method for determining a point of interest as described in the first aspect above.

本公开的又一个方面是提供一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行以实现如上述第一方面所述的兴趣点确定方法。Yet another aspect of the present disclosure is to provide a computer-readable storage medium on which a computer program is stored, the computer program being executed by a processor to implement the method for determining a point of interest as described in the first aspect above.

本公开提供的兴趣点确定方法、装置、设备及计算机可读存储介质的技术效果是:The technical effects of the method, apparatus, device and computer-readable storage medium for determining a point of interest provided by the present disclosure are:

本公开提供的兴趣点确定方法、装置、设备及计算机可读存储介质,包括:通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点;根据获取的兴趣点确定兴趣点间的关联关系;根据兴趣点间的关联关系确定用户的目标兴趣点。本公开提供的方法、装置、设备及计算机可读存储介质能够根据网络资源及时提取兴趣点,并构建兴趣点之间的关联,从而能够根据已知的用户信息,以及兴趣点之间的关联关系确定用户的目标兴趣点,从而再互联网资讯迭代速度较快的情况下,较快的确定用户有可能感兴趣的内容。The method, device, device and computer-readable storage medium for determining a point of interest provided by the present disclosure include: mining network resources through a pre-trained mining classifier to obtain a point of interest; and determining an association between the points of interest according to the acquired point of interest Relationship; determine the user's target POI according to the relationship between POIs. The method, apparatus, device, and computer-readable storage medium provided by the present disclosure can timely extract points of interest according to network resources, and build an association between the points of interest, so that the known user information and the association relationship between the points of interest can be used. Determine the user's target point of interest, so as to quickly determine the content that the user may be interested in when the iteration speed of Internet information is fast.

附图说明Description of drawings

图1为本发明一示例性实施例示出的兴趣点确定方法的流程图;FIG. 1 is a flowchart of a method for determining a point of interest according to an exemplary embodiment of the present invention;

图2为本发明另一示例性实施例示出的兴趣点确定方法的流程图;2 is a flowchart of a method for determining a point of interest according to another exemplary embodiment of the present invention;

图3为本发明一示例性实施例示出的兴趣点确定装置的结构图;3 is a structural diagram of an apparatus for determining a point of interest according to an exemplary embodiment of the present invention;

图4为本发明另一示例性实施例示出的兴趣点确定装置的结构图;4 is a structural diagram of an apparatus for determining a point of interest according to another exemplary embodiment of the present invention;

图5为本发明一示例性实施例示出的兴趣点确定设备的结构图。FIG. 5 is a structural diagram of a device for determining a point of interest according to an exemplary embodiment of the present invention.

具体实施方式Detailed ways

随着网络技术的发展,越来越多的用户会在网络中获取需要的信息,例如,观看一些热点新闻。为了给用户提供更加优质的服务,很多网络平台都会主动向用户推送内容,例如今天发生的新闻,用户有可能关心的热点内容。With the development of network technology, more and more users will obtain needed information on the network, for example, watching some hot news. In order to provide users with better services, many network platforms will actively push content to users, such as news that happened today, and hot content that users may care about.

现有技术中,网络平台获取用户的历史数据,并根据历史数据推测用户兴趣,并在新闻内容或热点内容中,挑选用户有可能感兴趣的内容,并推送给用户。In the prior art, the network platform obtains the user's historical data, infers the user's interest according to the historical data, selects the content that the user may be interested in from the news content or hot content, and pushes it to the user.

但是,随着网络信息迭代速度较快,仅依据用户历史数据推测其当前可能感兴趣的内容准确率较低。本发明实施例提供的方案,能够在迭代速度非常快的网络数据中挖掘兴趣点,并确定兴趣点间的关联关系,从而能够基于这一关联关系,准确的确定用户的目标兴趣点,使得基于该目标兴趣点向用户推送内容时,能够匹配到与用户更加相符的内容,提高推送效率。However, with the rapid iteration of network information, the accuracy of inferring content that may be of interest to a user based solely on historical data is low. The solution provided by the embodiment of the present invention can mine points of interest in network data with a very fast iteration speed, and determine the correlation between the points of interest, so that the target point of interest of the user can be accurately determined based on the correlation, so that based on the correlation When the target interest point pushes content to the user, it can match the content that is more in line with the user, thereby improving the push efficiency.

图1为本发明一示例性实施例示出的兴趣点确定方法的流程图。FIG. 1 is a flowchart of a method for determining a point of interest according to an exemplary embodiment of the present invention.

如图1所示,本实施例提供的兴趣点确定方法包括:As shown in FIG. 1 , the method for determining a point of interest provided by this embodiment includes:

步骤101,通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点。Step 101 , mining network resources through a pre-trained mining classifier to obtain points of interest.

本实施例提供的方法可以由具备计算能力的电子设备执行,例如计算机。该电子设备可以是一网络平台的后台服务器,可以将本实施例提供的方法封装在该服务器中,使得服务器能够执行这一方法。The method provided in this embodiment may be executed by an electronic device with computing capability, such as a computer. The electronic device may be a background server of a network platform, and the method provided in this embodiment may be encapsulated in the server, so that the server can execute the method.

其中,该电子设备可以与用户终端通过网络连接,从而向用户终端进行内容推送。例如,可以在用户终端中安装一客户端,用户可以操作该客户端浏览网络内容,电子设备可以通过该客户端的功能实现向用户终端推送内容。Wherein, the electronic device may be connected with the user terminal through a network, so as to push content to the user terminal. For example, a client terminal may be installed in the user terminal, the user may operate the client terminal to browse network contents, and the electronic device may push contents to the user terminal through the function of the client terminal.

具体的,可以在电子设备中设置预先训练完毕的挖掘分类器。该挖掘分类器可以对网络资源进行处理,提取其中可能包括的兴趣点。兴趣点是指用户可能感兴趣的内容,例如一个当下比较红的明星,例如一个突发事件等。Specifically, a pre-trained mining classifier may be set in the electronic device. The mining classifier can process network resources to extract points of interest that may be included. The point of interest refers to the content that the user may be interested in, such as a popular star at the moment, such as an unexpected event, etc.

进一步的,有一些兴趣点是已经存在的,例如在几个月甚至几年前就产生的兴趣点,这些可以作为已知兴趣点。还有一些兴趣点是正在产生或还未产生的,由于网络信息的流通速度非常快,很多突发事件会迅速在网络中传播,因此,时时刻刻都会有新的兴趣点产生。通过预先训练的挖掘分类器,能够对网络资源进行处理,进而及时的获取当前产生的兴趣点。Further, some points of interest already exist, such as points of interest generated months or even years ago, which can be regarded as known points of interest. There are also some points of interest that are being generated or have not yet been generated. Due to the very fast circulation of network information, many emergencies will quickly spread in the network. Therefore, new points of interest will be generated all the time. Through the pre-trained mining classifier, the network resources can be processed, and the current interest points can be obtained in time.

实际应用时,可以先根据已知的兴趣点及其对应的网络资源对分类器进行训练。例如,可以选定一些兴趣点及其对应的网络资源作为样本,这里可以包括一些正样本和负样本。正样本为正确的兴趣点与网络资源的组合,负样本为错误的兴趣点与网络资源的组合。In practical application, the classifier can be trained according to the known points of interest and their corresponding network resources. For example, some points of interest and their corresponding network resources can be selected as samples, which can include some positive samples and negative samples. Positive samples are the correct combination of interest points and network resources, and negative samples are the wrong combination of interest points and network resources.

还可以将一部分样本作为训练样本,另一部分样本作为测试样本。在训练样本上执行分类器算法,生成分类器。在测试样本上执行分类器,生成预测结果。根据预测结果确定评估指标,评估分类器的性能。若分类器不达标,则可以调整分类器中的参数,重新进行训练。It is also possible to use a part of the samples as training samples and another part of the samples as test samples. Execute the classifier algorithm on the training samples to generate a classifier. Execute the classifier on the test samples to generate predictions. Determine the evaluation index according to the prediction result, and evaluate the performance of the classifier. If the classifier does not meet the standard, you can adjust the parameters in the classifier and retrain.

其中,网络资源可以是网页数据,例如,可以训练一个能够在网页数据中挖掘兴趣点的分类器。网络资源还可以是搜索数据,例如,可以训练一个能够在用户的搜索数据中挖掘兴趣点的分类器。The network resource may be web page data, for example, a classifier capable of mining interest points in the web page data may be trained. The web resource can also be search data, for example, a classifier that can mine points of interest in a user's search data can be trained.

具体的,可以获取网络中的网络资源,并使用训练完毕的分类器对其进行计算,提取其中包括的兴趣点,进而能够及时获取网络中当前的兴趣点。Specifically, the network resources in the network can be obtained, and the trained classifier can be used to calculate them, and the interest points included therein can be extracted, so that the current interest points in the network can be obtained in time.

步骤102,根据获取的兴趣点确定兴趣点间的关联关系。Step 102: Determine the association relationship between the points of interest according to the acquired points of interest.

对于新获取的兴趣点,可以确定其与其他兴趣点之间的关系,从而可以根据这一关系,在一个用户的已知兴趣点基础上,推测该用户的其他兴趣点。For the newly acquired point of interest, the relationship between it and other points of interest can be determined, so that other points of interest of the user can be inferred on the basis of the known points of interest of a user according to the relationship.

其中,对于网络中的众多兴趣点来说,这些兴趣点之间具有一定的关系,例如,兴趣点A与兴趣点B成对出现,则可以认为二者具有一定的关系。再例如,有一些兴趣点包括多个子兴趣点,可以根据已有的知识体系确定这一关系,比如兴趣点清朝,属于兴趣点历史。还可以根据网络数据确定这一关系,比如一个电视节目是一个兴趣点,则参加该电视节目的嘉宾可以是该兴趣点的子兴趣点。Among them, for many interest points in the network, there is a certain relationship between these interest points. For example, if the interest point A and the interest point B appear in pairs, it can be considered that the two have a certain relationship. For another example, some points of interest include multiple sub-points of interest, and this relationship can be determined according to an existing knowledge system. For example, the point of interest in the Qing Dynasty belongs to the history of the point of interest. The relationship can also be determined according to network data. For example, a TV program is a point of interest, and the guests participating in the TV program can be a sub-point of interest of the point of interest.

具体的,构建兴趣点之间的关系,能够绘制兴趣点图谱,针对每个兴趣点,都具有其对应的关联兴趣点。Specifically, by constructing the relationship between the interest points, an interest point map can be drawn, and each interest point has its corresponding associated interest point.

步骤103,根据兴趣点间的关联关系确定用户的目标兴趣点。Step 103: Determine the target interest point of the user according to the association relationship between the interest points.

进一步的,可以先确定一个用户的兴趣点,并根据兴趣点间的关联关系,确定与该兴趣点具有关系的其他兴趣点,并可以将这些其他兴趣点确定为用户的目标兴趣点。例如,兴趣点A与兴趣点B具有关联关系,则推测用户对A感兴趣时,可以推测其也对B感兴趣。Further, a user's point of interest may be determined first, and other points of interest that have a relationship with the point of interest may be determined according to the association between the points of interest, and these other points of interest may be determined as the target point of interest of the user. For example, if the point of interest A and the point of interest B have an associated relationship, when it is presumed that the user is interested in A, it can be presumed that the user is also interested in B.

实际应用时,有一些兴趣点具有包含关系,此时,可以根据这一关系确定用户的目标兴趣点。例如用户具有一兴趣点A,那么可以认为其对兴趣点A的子兴趣点也有可能感兴趣,可以向用户发送这些子兴趣点,从而使用户能够在其中进行选择。In practical applications, some points of interest have an inclusion relationship, and at this time, the target point of interest of the user can be determined according to this relationship. For example, if the user has a point of interest A, it may be considered that he may also be interested in the sub-points of interest of the point of interest A, and these sub-points of interest may be sent to the user, so that the user can select among them.

其中,还可以直接询问用户其关心的兴趣点,进而准确的确定一已知兴趣点,再根据该已知兴趣点,以及兴趣点的关联关系,确定目标兴趣点。Among them, the user can also directly inquire about the points of interest that the user cares about, and then accurately determine a known point of interest, and then determine the target point of interest according to the known point of interest and the relationship between the points of interest.

本实施例提供的方法中,可以根据兴趣点间的关联关系确定用户的目标兴趣点,从而能够基于已有的用户信息或数据,更准确的推测用户关心的内容。In the method provided in this embodiment, the user's target interest point can be determined according to the relationship between the interest points, so that the content that the user cares about can be more accurately estimated based on the existing user information or data.

具体的,本实施例提供的方法,还可以包括:Specifically, the method provided in this embodiment may further include:

根据目标兴趣点确定与用户对应的推送内容,并向用户终端发送推送内容。The push content corresponding to the user is determined according to the target point of interest, and the push content is sent to the user terminal.

进一步的,在确定了用户可能关心的内容后,可以基于这一内容对网络内容进行筛选,例如对热点新闻进行筛选,并向用户进行推送。Further, after the content that the user may care about is determined, the network content can be screened based on the content, for example, hot news is screened and pushed to the user.

本实施例提供的方法,能够基于已有数据准确的确定用户的目标兴趣点,进而能够基于该目标兴趣点向用户推送与其更匹配的内容。The method provided by this embodiment can accurately determine the target interest point of the user based on the existing data, and then can push the content more matching with the target interest point to the user based on the target interest point.

本实施例提供的方法用于确定用户的兴趣点,该方法由设置有本实施例提供的方法的设备执行,该设备通常以硬件和/或软件的方式来实现。The method provided in this embodiment is used to determine a user's point of interest, and the method is executed by a device provided with the method provided in this embodiment, and the device is usually implemented in hardware and/or software.

本实施例提供的兴趣点确定方法,包括:通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点;根据获取的兴趣点确定兴趣点间的关联关系;根据兴趣点间的关联关系确定用户的目标兴趣点。本实施例提供的方法能够根据网络资源及时提取兴趣点,并构建兴趣点之间的关联,从而能够根据已知的用户信息,以及兴趣点之间的关联关系确定用户的目标兴趣点,从而再互联网资讯迭代速度较快的情况下,较快的确定用户有可能感兴趣的内容。The method for determining points of interest provided by this embodiment includes: mining network resources through a pre-trained mining classifier to obtain points of interest; determining the association between the points of interest according to the acquired points of interest; and according to the association between the points of interest Identify the user's target point of interest. The method provided in this embodiment can extract points of interest in time according to network resources, and build an association between the points of interest, so that the target point of interest of the user can be determined according to the known user information and the relationship between the points of interest, and then When the iteration speed of Internet information is fast, the content that users may be interested in can be determined quickly.

图2为本发明另一示例性实施例示出的兴趣点确定方法的流程图。FIG. 2 is a flowchart of a method for determining a point of interest according to another exemplary embodiment of the present invention.

如图2所示,本实施例提供的兴趣点确定方法,包括:As shown in FIG. 2 , the method for determining a point of interest provided by this embodiment includes:

步骤201,通过第一挖掘分类器提取网页数据中的网页特征,并根据网页特征确定兴趣点。Step 201: Extract webpage features in the webpage data through a first mining classifier, and determine points of interest according to the webpage characteristics.

其中,第一挖掘分类器是通过已知兴趣点及其对应的网页数据训练得到的。The first mining classifier is obtained by training the known points of interest and their corresponding webpage data.

具体的,可以预先搜集已知兴趣点及其对应的网页数据,可以将这些数据作为训练数据,进而对分类器进行训练。可以将正确的兴趣点与网页数据的组合作为正样本,将错误的兴趣点与网页数据的组合作为负样本,基于这两种样本数据能够准确的训练得到第一分类器。Specifically, known points of interest and their corresponding web page data can be collected in advance, and these data can be used as training data to train the classifier. The combination of correct interest points and webpage data can be used as positive samples, and the wrong combination of interest points and webpage data can be used as negative samples, and the first classifier can be accurately trained based on these two sample data.

进一步的,训练分类器的电子设备可以是执行本实施例提供的方法的电子设备,也可以是其他电子设备。Further, the electronic device for training the classifier may be an electronic device that executes the method provided in this embodiment, or may be other electronic devices.

将训练完毕的第一分类器存储在执行本实施例提供的方法的电子设备中。该电子设备可以扫描网络中的网页数据,并通过第一分类器对这些网页数据进行计算,得到对应的兴趣点。The trained first classifier is stored in the electronic device that executes the method provided in this embodiment. The electronic device can scan web page data in the network, and calculate the web page data through the first classifier to obtain corresponding points of interest.

网页数据可以是网页页面中包括的数据内容,例如文字内容、图片内容、链接地址等。The web page data may be data content included in the web page, such as text content, picture content, link addresses, and the like.

第一分类器在处理网页数据时,可以提取网页数据包括的网页特征,再对网页特征进行分类,从而确定兴趣点。When processing the web page data, the first classifier can extract the web page features included in the web page data, and then classify the web page features, so as to determine the point of interest.

网页特征可以包括以下至少一种:页面特征、词条热点、词条类型。针对每种网页特征,都可以提取出一个相应的内容,也可以提取多个相应的内容。The webpage features may include at least one of the following: page features, entry hotspots, and entry types. For each web page feature, one corresponding content can be extracted, and multiple corresponding contents can also be extracted.

步骤202,通过第二挖掘分类器提取网络搜索数据中的搜索特征,并根据搜索特征确定兴趣点。Step 202 , extract the search features in the network search data through the second mining classifier, and determine the points of interest according to the search features.

其中,第二挖掘分类器是通过已知兴趣点及其对应的网络搜索数据训练得到的。Among them, the second mining classifier is obtained by training the known points of interest and their corresponding network search data.

步骤201与步骤202的时序不做限制。同时,可以仅采用步骤201和202中的一种方式挖掘兴趣点,也可以采用这两种方式挖掘兴趣点。The timing of step 201 and step 202 is not limited. At the same time, only one method in steps 201 and 202 can be used to mine points of interest, or both methods can be used to mine points of interest.

具体的,第二挖掘分类器是通过已知兴趣点及其对应的网络搜索数据训练得到的。Specifically, the second mining classifier is obtained by training the known points of interest and their corresponding network search data.

进一步的,可以预先搜集已知兴趣点及其对应的网络搜索数据,可以将这些数据作为训练数据,进而对分类器进行训练。可以将正确的兴趣点与网络搜索数据的组合作为正样本,将错误的兴趣点与网络搜索数据的组合作为负样本,基于这两种样本数据能够准确的训练得到第二分类器。Further, known points of interest and their corresponding network search data can be collected in advance, and these data can be used as training data to train the classifier. The combination of the correct interest point and the network search data can be used as a positive sample, and the combination of the wrong interest point and the network search data can be used as a negative sample, and the second classifier can be accurately trained based on these two sample data.

网络搜索数据是指用户在网络中进行信息搜索产生的数据。Network search data refers to data generated by users searching for information on the network.

实际应用时,训练分类器的电子设备可以是执行本实施例提供的方法的电子设备,也可以是其他电子设备。In practical applications, the electronic device for training the classifier may be an electronic device that executes the method provided in this embodiment, or may be other electronic devices.

将训练完毕的第二分类器存储在执行本实施例提供的方法的电子设备中。该电子设备可以采集用户在网络中进行的搜索数据,并通过第二分类器对这些搜索数据进行计算,得到对应的兴趣点。The trained second classifier is stored in the electronic device that executes the method provided by this embodiment. The electronic device can collect the search data performed by the user in the network, and calculate the search data through the second classifier to obtain the corresponding points of interest.

第二分类器在处理网络搜索数据时,可以提取网络搜索数据包括的搜索特征,再对网络搜索特征进行分类,从而确定兴趣点。When processing the network search data, the second classifier may extract the search features included in the network search data, and then classify the network search features to determine the point of interest.

搜索特征包括以下至少一种:搜索信息、用户点击信息、搜索时效性信息、网页内容特征。针对每种搜索特征,都可以提取出一个相应的内容,也可以提取多个相应的内容。The search features include at least one of the following: search information, user click information, search timeliness information, and webpage content features. For each search feature, one corresponding content may be extracted, or multiple corresponding content may be extracted.

基于步骤201和/或202挖掘到兴趣点以后,可以确定该兴趣点与其他兴趣点间的关联关系。具体可以通过步骤203、205、206中的任一种方式来确定。After the point of interest is mined based on steps 201 and/or 202, the association relationship between the point of interest and other points of interest may be determined. Specifically, it can be determined by any one of steps 203 , 205 , and 206 .

步骤203,在网络资源中确定兴趣点间的共现信息,并根据共现信息确定兴趣点间的关联关系。Step 203: Determine the co-occurrence information between the POIs in the network resource, and determine the association relationship between the POIs according to the co-occurrence information.

在一种实施方式中,认为共同出现的兴趣点之间,可能存在一定的关联关系。因此,可以在网络资源中确定兴趣点间的共现信息,例如,共现次数,共现次数与单独出现次数的比值等。In one embodiment, it is considered that there may be a certain relationship between the points of interest that appear together. Therefore, the co-occurrence information among the POIs can be determined in the network resource, for example, the number of co-occurrences, the ratio of the number of co-occurrences to the number of individual occurrences, and the like.

例如,当一个电视节目与一个演员名字经常共同出现时,可以认为二者具有关联关系,此时,可以认为这个电视节目与这个演员具有非指向性的联系。For example, when a TV program and an actor's name often appear together, it can be considered that the two have an associated relationship. At this time, it can be considered that the TV program and the actor have a non-directional relationship.

其中,还可以设置权重值,用于衡量两个兴趣点间的关联强度。例如,若两个兴趣点共现次数超出一阈值次数,则认为二者的关联强度较高,因此,可以设置一个较大的权重值。Among them, a weight value can also be set to measure the strength of the association between the two interest points. For example, if the co-occurrence times of two points of interest exceeds a threshold number of times, it is considered that the correlation strength between the two points of interest is relatively high, and therefore, a larger weight value can be set.

具体的,权重值也可以通过计算得到,例如,将兴趣点间的共现次数作为权重值。Specifically, the weight value can also be obtained by calculation, for example, the number of co-occurrences between interest points is used as the weight value.

进一步的,获取一个兴趣点后,可以确定其与其他兴趣点间的关系,此处的其他兴趣点可以是基于本实施例的方法挖掘得到的,也可以是通过其他方式得到的。Further, after acquiring a point of interest, the relationship between it and other points of interest may be determined, and the other points of interest here may be obtained by mining based on the method of this embodiment, or may be obtained by other methods.

步骤204,在社交网络兴趣点中,确定社交网络用户对应的关联用户,并将关联用户的关联兴趣点。Step 204 , in the points of interest of the social network, determine the associated users corresponding to the users of the social network, and associate the associated points of interest of the users.

在另一种实施方式中,兴趣点有可能是社交网络用户,例如用户N。此时,还可以通过社交网络确定这一兴趣点的关联兴趣点。In another embodiment, the point of interest may be a social network user, such as user N. At this time, the related point of interest of this point of interest may also be determined through the social network.

在网络中存在很多网络用户,一些网络用户本身也可能是兴趣点,例如一些演员、歌手等。而社这些网络用户在社交网络中具有社交关系,因此,可以根据属于兴趣点的社交网络用户在社交网络中的关联用户,确定其关联关系。例如,可以将与属于兴趣点的社交网络用户相互关注的用户,作为该兴趣点的关联用户,即可以将其也作为一个兴趣点,并认为这两个兴趣点之间具有关联关系。There are many network users in the network, and some network users themselves may also be points of interest, such as some actors, singers, and the like. The social network users have social relationships in the social network. Therefore, the association relationship of the social network users belonging to the point of interest can be determined according to their associated users in the social network. For example, a user who is concerned with a social network user belonging to a point of interest can be regarded as an associated user of the point of interest, that is, it can also be regarded as a point of interest, and it is considered that there is an associated relationship between the two points of interest.

其中,在确定关联用户时,还可以根据两个用户之间的互动信息,确定关联用户。例如,若两个用户互动频繁,其中一个用户被认为是兴趣点,那么另一个用户可以是该兴趣点的关联兴趣点。Wherein, when determining the associated user, the associated user may also be determined according to the interaction information between the two users. For example, if two users interact frequently and one of the users is considered a point of interest, the other user may be a related point of interest for the point of interest.

步骤205,根据网络资源确定兴趣点的待定上位概念。Step 205: Determine the pending superordinate concept of the point of interest according to the network resources.

具体的,有一些兴趣点之间存在着上下位的关系,例如,兴趣点体育中可以包括多个其他的兴趣点,例如足球、篮球等。因此,还可以基于兴趣点之间的上下位关系,构建有指向关系的兴趣点关联关系。Specifically, some points of interest have an upper-lower relationship. For example, a point-of-interest sports may include multiple other points of interest, such as football, basketball, and the like. Therefore, it is also possible to construct a point-of-interest association relationship with a pointing relationship based on the upper-lower relationship between the points of interest.

进一步的,可以先确定一兴趣点的待定上位概念。Further, an undetermined superordinate concept of a point of interest may be determined first.

在一种实施方式中,可以预先根据网络资源中已有的知识体系,构建一概念体系。该概念体系中可以包括各个名词对应的上位名字及下位名词。In an implementation manner, a conceptual system may be constructed in advance according to an existing knowledge system in network resources. The concept system can include the superordinate names and subordinate nouns corresponding to each noun.

当确定一兴趣点后,若该兴趣点不属于这一概念体系,则可以在概念体系中确定一个该兴趣点的待定上位概念。After a point of interest is determined, if the point of interest does not belong to the concept system, a pending superordinate concept of the point of interest can be determined in the concept system.

在另一种实施方式中,还可以根据该兴趣点所对应的网页内容,确定一待定上位概念。或根据该兴趣点对应的搜索数据,确定待定上位概念。In another implementation manner, a to-be-located superordinate concept may also be determined according to the content of the webpage corresponding to the point of interest. Or according to the search data corresponding to the interest point, determine the to-be-determined superordinate concept.

步骤206,根据第三分类器确定待定上位概念是否为兴趣点的真实上位概念。Step 206 , according to the third classifier, determine whether the to-be-determined superordinate concept is the real superordinate concept of the point of interest.

实际应用时,还可以在电子设备中设置第三分类器,用于确定待定上位概念是否准确。In practical application, a third classifier may also be set in the electronic device to determine whether the to-be-determined superordinate concept is accurate.

其中,第三分类器可以是预先训练得到的。例如,可以将确定的概念体系作为训练数据,还可以根据已知兴趣点及其对应的上位概念,以及其组合的搜索分布数据训练第三分类器。Wherein, the third classifier may be obtained by pre-training. For example, the determined concept system can be used as training data, and the third classifier can also be trained according to the search distribution data of known points of interest and their corresponding superordinate concepts, and their combinations.

具体的,将训练好的第三分类器设置在电子设备中,用于确定该待定上位概念是否准确。可以将兴趣点对应的网页内容、搜索数据等,以及确定的待定上位概念输入第三分类器中,从而使其进行确认。Specifically, the trained third classifier is set in the electronic device to determine whether the to-be-determined superordinate concept is accurate. The content of the webpage corresponding to the point of interest, the search data, etc., and the determined undetermined superordinate concept can be input into the third classifier, so that it can be confirmed.

进一步的,在步骤205中,针对一个兴趣点还可以确定多个待定上位概念,此时,本步骤还可以对每个待定概念都进行确认。一个兴趣点可以具有多个真实的上位概念,例如《红楼梦》可以具有上位概念“文学”,还可以具有上位概念“历史”、“清朝”等。Further, in step 205, a plurality of undetermined superordinate concepts may also be determined for a point of interest, and at this time, each pending concept may be confirmed in this step. A point of interest can have multiple real superordinate concepts, for example, "A Dream of Red Mansions" can have superordinate concepts "literature", and can also have superordinate concepts "history", "Qing Dynasty" and so on.

步骤207,若是,则确定真实上位概念与兴趣点具有指向关联关系Step 207, if yes, then determine that the real superordinate concept and the point of interest have a pointing relationship

实际应用时,若确定该待定上位概念是兴趣点的真实上位概念,则确定二者具有指向关联关系,例如,兴趣点指向其所属的真实上位概念。In practical application, if it is determined that the to-be-determined superordinate concept is the real superordinate concept of the point of interest, it is determined that the two have a pointing relationship, for example, the point of interest points to the real superordinate concept to which it belongs.

其中,若确认兴趣点的每个待定上位概念均不是其真实上位概念,则可以认为该兴趣点还不具备上位概念。例如,在兴趣点关系图谱建立初期,可能其中包括的信息较少,此时,有可能无法在已有的数据中确定真实上位概念。Wherein, if it is confirmed that each undetermined superordinate concept of a point of interest is not its real superordinate concept, it can be considered that the interest point does not yet have a superordinate concept. For example, in the early stage of establishing the relationship map of interest points, it may contain less information, and at this time, it may not be possible to determine the true superordinate concept in the existing data.

对于没有上位概念的兴趣点,可以在新兴趣点加入图谱中时,再根据新添加的内容进行识别,确定其是否是这些兴趣点的上位概念。For points of interest without a superordinate concept, when a new interest point is added to the map, it can be identified according to the newly added content to determine whether it is the superordinate concept of these interest points.

步骤208,根据用户历史数据确定用户的历史兴趣点。Step 208: Determine the user's historical points of interest according to the user's historical data.

具体的,本实施例提供的方法还可以根据确定的兴趣点间关系,确定用户的目标兴趣点。Specifically, the method provided in this embodiment may further determine the target interest point of the user according to the determined relationship between the interest points.

进一步的,可以根据用户的历史数据确定用户的历史兴趣点,即用户曾经感兴趣的内容。这种确定方式可以采用现有技术中的方法。Further, the historical interest points of the user, that is, the content that the user was interested in once, can be determined according to the historical data of the user. This determination method can adopt the method in the prior art.

实际应用时,网络资源迭代速度很快,有可能用户的历史兴趣点与当前的兴趣点不相符。因此,若仅根据历史兴趣点向用户推送内容,容易向用户推送其不感兴趣的内容。In practical applications, network resources are iterated very fast, and it is possible that the user's historical interest points do not match the current interest points. Therefore, if content is only pushed to users based on historical points of interest, it is easy to push content that is not of interest to users.

步骤209,根据兴趣点间的关联关系确定与历史兴趣点具有关联的目标兴趣点。Step 209: Determine the target interest point associated with the historical interest point according to the association relationship between the interest points.

其中,可以基于构建的兴趣点间关联关系,在其中确定与历史兴趣点具有关联的目标兴趣点。Wherein, the target interest point associated with the historical interest point may be determined based on the constructed association relationship between the interest points.

例如,若构建了无向关联关系,则直接获取与历史兴趣点具有关系的兴趣点,可以直接将这些兴趣点作为目标兴趣点;还可以根据这些兴趣点与历史兴趣点之间的权重值,筛选一些关联性更强的兴趣点作为目标兴趣点,比如筛选权重值较高的预设数量个兴趣点作为目标兴趣点。For example, if an undirected association relationship is constructed, the interest points that have a relationship with the historical interest points can be directly obtained, and these interest points can be directly used as the target interest points; according to the weight value between these interest points and the historical interest points, Some more relevant interest points are screened as target interest points, for example, a preset number of interest points with higher weight values are screened as target interest points.

再例如,若构建了有向关联关系,则可以获取与历史兴趣点的上位兴趣点,可以将其作为一个目标兴趣点。还可以更精准的确定用户的兴趣。比如可以结合无向关联关系,将即属于该上位兴趣点,又与历史兴趣点具有无向关联关系的兴趣点作为目标兴趣点。例如,历史兴趣点为A1,其上位兴趣点为A,则可以获取其他属于A的兴趣点如A2、A3,假设A2与A1具有无向关联关系,A3与A1不具备无向关联关系,则认为A2是一目标兴趣点。For another example, if a directed association relationship is established, the upper-level interest point with the historical interest point can be obtained, which can be used as a target interest point. It is also possible to more accurately determine the interests of users. For example, the undirected association relationship can be combined, and the interest point that belongs to the upper interest point and has an undirected association relationship with the historical interest point can be used as the target interest point. For example, if the historical interest point is A 1 , and its upper interest point is A, other interest points belonging to A can be obtained, such as A 2 and A 3 , assuming that A 2 and A 1 have an undirected relationship, and A 3 and A 1 do not If there is an undirected relationship, A2 is considered to be a target point of interest.

步骤209是一种由电子设备确定目标兴趣点的方式,此外,还可以通过与用户进行交互的方式,确定目标兴趣点。Step 209 is a way of determining the target point of interest by the electronic device. In addition, the target point of interest may also be determined by interacting with the user.

步骤210,向用户的用户终端发送包括第一兴趣点询问信息。Step 210: Send inquiry information including the first point of interest to the user terminal of the user.

其中,若构建了兴趣点间的有向关联关系,则电子设备可以基于该有向关系与用户终端进行交互,从而确定目标兴趣点。Wherein, if a directional association relationship between the points of interest is established, the electronic device can interact with the user terminal based on the directional relationship to determine the target point of interest.

具体的,可以先确定一个第一兴趣点,例如,可以是如上所述的历史兴趣点,还可以根据用户当前浏览内容确定第一兴趣点,还可以随机确定第一兴趣点。Specifically, a first point of interest may be determined first, for example, a historical point of interest as described above, the first point of interest may also be determined according to the user's current browsing content, or the first point of interest may be determined randomly.

进一步的,电子设备可以向用户终端发送一询问信息,用于询问用户是否对第一兴趣点感兴趣。Further, the electronic device may send an inquiry message to the user terminal for inquiring whether the user is interested in the first point of interest.

实际应用时,用户可以操作用户终端进行回复,例如回复是或否,喜欢或不喜欢等信息。In practical application, the user can operate the user terminal to reply, such as replying yes or no, like or dislike and other information.

步骤211,接收用户终端返回的兴趣结果。Step 211: Receive the interest result returned by the user terminal.

其中,用户操作用户终端进行回复后,电子设备可以接收到用户终端反馈的兴趣结果。结果具体可以是是或否。Wherein, after the user operates the user terminal to reply, the electronic device may receive the interest result fed back by the user terminal. The result can be either yes or no.

步骤212,根据兴趣结果确定一实际兴趣点,并在关联关系中,根据实际兴趣点的指向关系确定目标兴趣点。Step 212: Determine an actual point of interest according to the interest result, and in the association relationship, determine the target point of interest according to the pointing relationship of the actual point of interest.

具体的,可以根据用户回复的结果确定实际兴趣点,例如,用户若回复的是,则可以认为第一兴趣点使用户的实际兴趣点,若用户回复的是否,则可以继续执行步骤210,对用户进行询问。Specifically, the actual point of interest can be determined according to the result of the user's reply. For example, if the user replies yes, it can be considered that the first point of interest is the user's actual point of interest. User asks.

进一步的,确定出实际兴趣点后,可以在有向的关联关系中,根据实际兴趣点的指向关系确定目标兴趣点。例如,若用户对足球感兴趣,则电子设备可以确定用户对体育感兴趣,进而将体育作为一目标兴趣点。Further, after the actual point of interest is determined, the target point of interest may be determined according to the pointing relationship of the actual point of interest in the directed association relationship. For example, if the user is interested in football, the electronic device may determine that the user is interested in sports, and then take sports as a target interest point.

实际应用时,若能够对用户进行有针对性的内容推送,需要确定更加激化的目标兴趣点,因此,还可以在有向的兴趣点间关联关系中,确定属于实际兴趣点的子兴趣点,和/或在有向的兴趣点间关联关系中,确定实际兴趣点的父兴趣点。例如,若实际兴趣点是体育,则可以获取体育的子兴趣点,如篮球、足球、体操、游泳等。若实际兴趣点是篮球,还可以获取其对应的父兴趣点,如体育。In practical application, if targeted content can be pushed to users, more intensified target POIs need to be determined. Therefore, sub-POIs belonging to actual POIs can also be determined in the association relationship between directional POIs. And/or in a directed relationship between POIs, the parent POI of the actual POI is determined. For example, if the actual point of interest is sports, sub-points of interest in sports, such as basketball, football, gymnastics, swimming, etc., can be obtained. If the actual POI is basketball, the corresponding parent POI, such as sports, can also be obtained.

其中,还可以基于实际兴趣点的情况,选择获取子兴趣点还是父兴趣点,例如,若实际兴趣点仅具有父兴趣点,则可以获取该父兴趣点,若实际兴趣点仅具有子兴趣点,则可以获取该子兴趣点。此处的父兴趣点是其子兴趣点的上位概念。Among them, it is also possible to choose whether to obtain the child POI or the parent POI based on the actual POI. For example, if the actual POI only has the parent POI, the parent POI can be obtained, and if the actual POI only has the child POI , the sub-point of interest can be obtained. A parent POI here is the superordinate concept of its child POIs.

根据子兴趣点和/或父兴趣点向用户的用户终端发送包括子兴趣点和/或父兴趣点的询问信息。The query information including the child POI and/or the parent POI is sent to the user terminal of the user according to the child POI and/or the parent POI.

具体的,获取了与实际兴趣点关联的兴趣点后,可以进一步的询问用户是否对该兴趣点感兴趣,因此,可以向用户终端发送包括确定的兴趣点的询问信息。Specifically, after acquiring the POI associated with the actual POI, the user can be further inquired whether the POI is interested. Therefore, query information including the determined POI can be sent to the user terminal.

用户看到询问信息后,可以操作用户终端,向电子设备进行反馈。After seeing the inquiry information, the user can operate the user terminal to give feedback to the electronic device.

接收用户终端返回的第二兴趣结果。A second interest result returned by the user terminal is received.

用户操作用户终端确认是否对当前的兴趣点感兴趣后,用户终端可以将其发送给电子设备,进而接收第二兴趣结果。After the user operates the user terminal to confirm whether he is interested in the current point of interest, the user terminal may send it to the electronic device, and then receive the second interest result.

假设该第二兴趣结果为,用户对当前确定的兴趣点感兴趣,则可以将其作为一目标兴趣点。可以结束确定目标兴趣点的流程,还可以继续执行确定属于实际兴趣点的其他子兴趣点,和/或在有向的兴趣点间关联关系中,确定实际兴趣点的其他父兴趣点的步骤。Assuming that the second interest result is that the user is interested in the currently determined interest point, it can be used as a target interest point. The process of determining the target POI can be ended, and the steps of determining other child POIs belonging to the actual POI, and/or other parent POIs of the actual POI in the directional relationship between POIs can be continued.

假设根据第二兴趣结果确定用户对子兴趣点和/或父兴趣点不感兴趣,则继续在关联关系中确定属于实际兴趣点的其他子兴趣点,和/或在关联关系中,确定实际兴趣点的其他父兴趣点的步骤。Assuming that it is determined according to the second interest result that the user is not interested in the child POI and/or the parent POI, continue to determine other child POIs belonging to the actual POI in the association relationship, and/or determine the actual POI in the association relationship steps for other parent POIs.

若用户对当前确定的兴趣点不感兴趣,则可以继续确定其他与实际兴趣点关联的兴趣点,并基于确定的兴趣点与用户进行交互。通过与用户交互的方式,能够更加直接、准确的确定目标兴趣点。If the user is not interested in the currently determined point of interest, the user may continue to determine other points of interest associated with the actual point of interest, and interact with the user based on the determined point of interest. By interacting with the user, the target point of interest can be determined more directly and accurately.

例如,电子设备所在系统与用户的交互过程可以是:For example, the interaction process between the system where the electronic device is located and the user can be:

系统:您好,您喜欢体育新闻吗?System: Hello, do you like sports news?

用户:喜欢的;user: like;

系统:您喜欢NBA吗?System: Do you like NBA?

用户:不感兴趣;User: not interested;

系统:您喜欢足球吗?System: Do you like football?

用户:喜欢;user: like;

系统:您喜欢贝克汉姆吗?System: Do you like Beckham?

用户:喜欢;user: like;

系统:我们会根据您的兴趣跟您进行推荐。System: We will recommend to you based on your interests.

图3为本发明一示例性实施例示出的兴趣点确定装置的结构图。FIG. 3 is a structural diagram of an apparatus for determining a point of interest according to an exemplary embodiment of the present invention.

如图3所示,本实施例提供的兴趣点确定装置,包括:As shown in FIG. 3 , the device for determining a point of interest provided by this embodiment includes:

挖掘模块31,用于通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点;The mining module 31 is used for mining network resources through a pre-trained mining classifier to obtain points of interest;

关联模块32,用于根据获取的所述兴趣点确定兴趣点间的关联关系;an association module 32, configured to determine an association relationship between the points of interest according to the acquired points of interest;

确定模块33,用于根据所述兴趣点间的关联关系确定用户的目标兴趣点。The determining module 33 is configured to determine the target interest point of the user according to the association relationship between the interest points.

本实施例提供的兴趣点确定装置,包括挖掘模块,用于通过预先训练的挖掘分类器,对网络资源进行挖掘,获取兴趣点;关联模块,用于根据获取的兴趣点确定兴趣点间的关联关系;确定模块,用于根据兴趣点间的关联关系确定用户的目标兴趣点。本实施例提供的装置能够根据网络资源及时提取兴趣点,并构建兴趣点之间的关联,从而能够根据已知的用户信息,以及兴趣点之间的关联关系确定用户的目标兴趣点,从而再互联网资讯迭代速度较快的情况下,较快的确定用户有可能感兴趣的内容。The device for determining a point of interest provided in this embodiment includes a mining module for mining network resources through a pre-trained mining classifier to acquire a point of interest; an association module for determining the association between the points of interest according to the acquired points of interest Relationship; a determination module, used for determining the target interest point of the user according to the association relationship between the interest points. The device provided in this embodiment can extract points of interest in time according to network resources, and build an association between the points of interest, so that the target point of interest of the user can be determined according to the known user information and the relationship between the points of interest, and then the user's target point of interest can be determined. When the iteration speed of Internet information is fast, the content that users may be interested in can be determined quickly.

本实施例提供的兴趣点确定装置的具体原理和实现方式均与图1所示的实施例类似,此处不再赘述。The specific principle and implementation manner of the apparatus for determining a point of interest provided in this embodiment are similar to the embodiment shown in FIG. 1 , and details are not described herein again.

图4为本发明另一示例性实施例示出的兴趣点确定装置的结构图。FIG. 4 is a structural diagram of an apparatus for determining a point of interest according to another exemplary embodiment of the present invention.

如图4所示,在上述实施例的基础上,本实施例提供的兴趣点确定装置,所述挖掘模块31,包括第一挖掘单元311,用于:As shown in FIG. 4 , on the basis of the above embodiment, in the device for determining a point of interest provided by this embodiment, the mining module 31 includes a first mining unit 311 , which is used for:

通过第一挖掘分类器提取网页数据中的网页特征,并根据所述网页特征确定所述兴趣点;Extracting webpage features in webpage data through a first mining classifier, and determining the interest point according to the webpage characteristics;

其中,所述第一挖掘分类器是通过已知兴趣点及其对应的网页数据训练得到的。Wherein, the first mining classifier is obtained by training the known points of interest and their corresponding webpage data.

可选的,所述挖掘模块31包括第二挖掘单元312,用于:Optionally, the digging module 31 includes a second digging unit 312 for:

通过第二挖掘分类器提取网络搜索数据中的搜索特征,并根据所述搜索特征确定所述兴趣点;Extracting search features in the network search data by a second mining classifier, and determining the point of interest according to the search features;

其中,所述第二挖掘分类器是通过已知兴趣点及其对应的网络搜索数据训练得到的。Wherein, the second mining classifier is obtained by training the known points of interest and their corresponding network search data.

可选的,所述关联模块32包括第一关联单元321,用于:Optionally, the association module 32 includes a first association unit 321 for:

在所述网络资源中确定所述兴趣点间的共现信息,并根据所述共现信息确定所述兴趣点间的关联关系。Co-occurrence information between the interest points is determined in the network resource, and an association relationship between the interest points is determined according to the co-occurrence information.

可选的,所述兴趣点是社交网络用户;Optionally, the point of interest is a social network user;

所述关联模块32包括第二关联单元322,用于:The association module 32 includes a second association unit 322 for:

在社交网络兴趣点中,确定所述社交网络用户对应的关联用户,并将所述关联用户的关联兴趣点。In the point of interest of the social network, the associated user corresponding to the social network user is determined, and the associated point of interest of the associated user is determined.

可选的,所述关联模块32包括第三关联单元323,用于:Optionally, the association module 32 includes a third association unit 323 for:

根据网络资源确定所述兴趣点的待定上位概念;determining the pending superordinate concept of the point of interest according to network resources;

根据第三分类器确定所述待定上位概念是否为所述兴趣点的真实上位概念;Determine whether the to-be-determined superordinate concept is the real superordinate concept of the point of interest according to the third classifier;

若是,则确定所述真实上位概念与所述兴趣点具有指向关联关系。If so, it is determined that the real superordinate concept has a pointing relationship with the point of interest.

可选的,所述确定模块33包括第一确定单元331用于:Optionally, the determining module 33 includes a first determining unit 331 for:

根据用户历史数据确定所述用户的历史兴趣点;Determine the historical points of interest of the user according to the user's historical data;

根据所述兴趣点间的关联关系确定与所述历史兴趣点具有关联的目标兴趣点。A target POI associated with the historical POI is determined according to the association relationship between the POIs.

可选的,所述确定模块33包括第二确定单元332,用于:Optionally, the determining module 33 includes a second determining unit 332 for:

向所述用户的用户终端发送包括第一兴趣点询问信息;sending inquiry information including the first point of interest to the user terminal of the user;

接收所述用户终端返回的兴趣结果;receiving the interest result returned by the user terminal;

根据所述兴趣结果确定一实际兴趣点,并在所述关联关系中,根据所述实际兴趣点的指向关系确定所述目标兴趣点。An actual point of interest is determined according to the interest result, and in the association relationship, the target point of interest is determined according to the pointing relationship of the actual point of interest.

可选的,所述第二确定单元332具体用于:Optionally, the second determining unit 332 is specifically configured to:

在所述关联关系中确定属于所述实际兴趣点的子兴趣点,和/或在所述关联关系中,确定所述实际兴趣点的父兴趣点;determining a child POI belonging to the actual POI in the association relationship, and/or determining a parent POI of the actual POI in the association relationship;

根据所述子兴趣点和/或所述父兴趣点向所述用户的用户终端发送包括所述子兴趣点和/或所述父兴趣点的询问信息;Send inquiry information including the child POI and/or the parent POI to the user terminal of the user according to the child POI and/or the parent POI;

接收所述用户终端返回的第二兴趣结果;receiving the second interest result returned by the user terminal;

若根据所述第二兴趣结果确定所述用户对所述子兴趣点和/或所述父兴趣点不感兴趣,则继续在所述关联关系中确定属于所述实际兴趣点的其他子兴趣点,和/或在所述关联关系中,确定所述实际兴趣点的其他父兴趣点的步骤。If it is determined according to the second interest result that the user is not interested in the child POI and/or the parent POI, continue to determine other child POIs belonging to the actual POI in the association relationship, And/or in the association relationship, the step of determining other parent POIs of the actual POI.

本实施例提供的兴趣点确定装置的具体原理和实现方式均与图2所示的实施例类似,此处不再赘述。The specific principle and implementation manner of the apparatus for determining a point of interest provided in this embodiment are similar to the embodiment shown in FIG. 2 , and details are not described herein again.

图5为本发明一示例性实施例示出的兴趣点确定设备的结构图。FIG. 5 is a structural diagram of a device for determining a point of interest according to an exemplary embodiment of the present invention.

如图5所示,本实施例提供的兴趣点确定设备包括:As shown in FIG. 5 , the device for determining a point of interest provided by this embodiment includes:

存储器51;memory 51;

处理器52;以及processor 52; and

计算机程序;Computer program;

其中,所述计算机程序存储在所述存储器51中,并配置为由所述处理器52执行以实现如上所述的任一种兴趣点确定方法。Wherein, the computer program is stored in the memory 51 and configured to be executed by the processor 52 to implement any of the above-mentioned methods for determining a point of interest.

本实施例还提供一种计算机可读存储介质,其上存储有计算机程序,This embodiment also provides a computer-readable storage medium on which a computer program is stored,

所述计算机程序被处理器执行以实现如上所述的任一种兴趣点确定方法。The computer program is executed by a processor to implement any of the point-of-interest determination methods described above.

本领域普通技术人员可以理解:实现上述各方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成。前述的程序可以存储于一计算机可读取存储介质中。该程序在执行时,执行包括上述各方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps of implementing the above method embodiments may be completed by program instructions related to hardware. The aforementioned program can be stored in a computer-readable storage medium. When the program is executed, the steps including the above method embodiments are executed; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other media that can store program codes.

最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The technical solutions described in the foregoing embodiments can still be modified, or some or all of the technical features thereof can be equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the embodiments of the present invention. scope.

Claims (8)

1. A method for point of interest determination, comprising:
mining network resources through a pre-trained mining classifier to obtain interest points, wherein the interest points are contents in which users are interested;
determining an incidence relation between the interest points according to the obtained interest points;
sending query information including a first point of interest to a user terminal of the user;
receiving a first interest result returned by the user terminal;
determining an actual interest point according to the first interest result, and determining a first child interest point and/or a first parent interest point of the actual interest point in the incidence relation;
sending query information including the first child point of interest and/or the first parent point of interest to the user terminal;
determining whether the user is interested in the first child interest point and/or the first parent interest point according to a second interest result returned by the user terminal;
if the interest is found, the first child interest point and/or the first parent interest point are/is used as a target interest point;
if the interest is not in the interest, continuing to determine other child interest points belonging to the actual interest point in the incidence relation, and/or determining other parent interest points of the actual interest point in the incidence relation;
if the interest points do not belong to a pre-constructed concept system, determining the association relationship between the interest points according to the obtained interest points comprises the following steps:
determining a pending upper concept of the interest point in the concept system; the concept system comprises upper names and lower names corresponding to a plurality of nouns, and is constructed according to an existing knowledge system in network resources;
determining whether the undetermined upper concept is a real upper concept of the interest point according to a third classifier;
if so, determining that the real upper concept has a pointing association relation with the interest point;
if not, determining that the interest point is at the initial stage of establishing the interest point relation map, and the interest point does not have a superordinate concept; and when a new interest point is added into the relation graph, continuously determining whether the new interest point is a superior concept of the interest point.
2. The method of claim 1, wherein mining network resources to obtain points of interest through a pre-trained mining classifier comprises:
extracting webpage features in webpage data through a first mining classifier, and determining the interest points according to the webpage features;
the first mining classifier is obtained by training known interest points and corresponding webpage data.
3. The method of claim 1, wherein mining network resources to obtain points of interest through a pre-trained mining classifier comprises:
extracting search features in network search data through a second mining classifier, and determining the interest points according to the search features;
and the second mining classifier is obtained by training known interest points and network search data corresponding to the known interest points.
4. An apparatus for point of interest determination, comprising:
the mining module is used for mining network resources through a pre-trained mining classifier to obtain interest points, wherein the interest points are contents in which users are interested;
the association module is used for determining the association relationship among the interest points according to the obtained interest points;
the determining module is used for determining the target interest points of the user according to the incidence relation among the interest points;
the determining module comprises a second determining unit configured to: sending first interest point inquiry information to a user terminal of a user; determining an actual interest point according to an interest result returned by the user terminal; determining a first child interest point and/or a first parent interest point of the actual interest points in the incidence relation; sending query information including the first child point of interest and/or the first parent point of interest to the user terminal; determining whether the user is interested in the first child interest point and/or the first parent interest point according to a second interest result returned by the user terminal; if the interest is found, the first child interest point and/or the first parent interest point are/is used as a target interest point; if the interest is not in the interest, continuing to determine other child interest points belonging to the actual interest point in the incidence relation, and/or determining other parent interest points of the actual interest point in the incidence relation;
the association module comprises a third association unit, and if the interest point does not belong to a pre-constructed concept system, the third association unit is used for: determining a pending upper concept of the interest point in the concept system; the concept system comprises upper names and lower names corresponding to a plurality of nouns, and is constructed according to an existing knowledge system in network resources; determining whether the undetermined upper concept is a real upper concept of the interest point according to a third classifier; if so, determining that the real upper concept has a pointing association relation with the interest point; if not, determining that the interest point is at the initial stage of establishing the interest point relation map, and the interest point does not have a superordinate concept; and when a new interest point is added into the relation graph, continuously determining whether the new interest point is a superior concept of the interest point.
5. The apparatus of claim 4, wherein the excavation module comprises a first excavation unit to:
extracting webpage features in webpage data through a first mining classifier, and determining the interest points according to the webpage features;
the first mining classifier is obtained by training known interest points and corresponding webpage data.
6. The apparatus of claim 4, wherein the excavation module comprises a second excavation unit to:
extracting search features in network search data through a second mining classifier, and determining the interest points according to the search features;
and the second mining classifier is obtained by training known interest points and network search data corresponding to the known interest points.
7. A point of interest determination device, comprising:
a memory;
a processor; and
a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement the method of any of claims 1-3.
8. A computer-readable storage medium, having stored thereon a computer program,
the computer program is executed by a processor to implement the method according to any one of claims 1-3.
CN201910398136.5A 2019-05-14 2019-05-14 Method, device and equipment for determining interest points and computer readable storage medium Active CN110297967B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910398136.5A CN110297967B (en) 2019-05-14 2019-05-14 Method, device and equipment for determining interest points and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910398136.5A CN110297967B (en) 2019-05-14 2019-05-14 Method, device and equipment for determining interest points and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN110297967A CN110297967A (en) 2019-10-01
CN110297967B true CN110297967B (en) 2022-04-12

Family

ID=68026849

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910398136.5A Active CN110297967B (en) 2019-05-14 2019-05-14 Method, device and equipment for determining interest points and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110297967B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110726418B (en) * 2019-10-10 2021-08-03 北京百度网讯科技有限公司 Method, Apparatus, Device and Storage Medium for Determining Point of Interest Area
CN111160471B (en) * 2019-12-30 2023-04-07 腾讯云计算(北京)有限责任公司 Interest point data processing method and device, electronic equipment and storage medium
CN114417170B (en) * 2022-01-26 2025-06-10 杭州卓健信息科技股份有限公司 Healthy consultation information pushing system and method based on interest points
CN115002675A (en) * 2022-05-23 2022-09-02 北京字节跳动科技有限公司 Data matching method and device, readable medium and electronic equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102542489A (en) * 2011-12-27 2012-07-04 纽海信息技术(上海)有限公司 Recommendation method based on user interest association
CN102857471A (en) * 2011-06-27 2013-01-02 深圳深讯和科技有限公司 Multimedia interacting method and system
CN103166918A (en) * 2011-12-12 2013-06-19 阿里巴巴集团控股有限公司 Data recommendation method and device
CN105069125A (en) * 2015-08-13 2015-11-18 上海斐讯数据通信技术有限公司 Social network recommending method and social network recommending system
CN105512334A (en) * 2015-12-29 2016-04-20 成都陌云科技有限公司 Data mining method based on search words
CN105528459A (en) * 2016-01-08 2016-04-27 腾讯科技(深圳)有限公司 Information processing method, server and terminal
CN108108465A (en) * 2017-12-29 2018-06-01 北京奇宝科技有限公司 A kind of method and apparatus for pushing recommendation
CN108287864A (en) * 2017-12-06 2018-07-17 深圳市腾讯计算机系统有限公司 A kind of interest group division methods, device, medium and computing device
CN108345702A (en) * 2018-04-10 2018-07-31 北京百度网讯科技有限公司 Entity recommends method and apparatus

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103914536B (en) * 2014-03-31 2017-11-07 北京百度网讯科技有限公司 A kind of point of interest for electronic map recommends method and system
CN108875007B (en) * 2018-06-15 2019-12-17 腾讯科技(深圳)有限公司 method and device for determining interest point, storage medium and electronic device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102857471A (en) * 2011-06-27 2013-01-02 深圳深讯和科技有限公司 Multimedia interacting method and system
CN103166918A (en) * 2011-12-12 2013-06-19 阿里巴巴集团控股有限公司 Data recommendation method and device
CN102542489A (en) * 2011-12-27 2012-07-04 纽海信息技术(上海)有限公司 Recommendation method based on user interest association
CN105069125A (en) * 2015-08-13 2015-11-18 上海斐讯数据通信技术有限公司 Social network recommending method and social network recommending system
CN105512334A (en) * 2015-12-29 2016-04-20 成都陌云科技有限公司 Data mining method based on search words
CN105528459A (en) * 2016-01-08 2016-04-27 腾讯科技(深圳)有限公司 Information processing method, server and terminal
CN108287864A (en) * 2017-12-06 2018-07-17 深圳市腾讯计算机系统有限公司 A kind of interest group division methods, device, medium and computing device
CN108108465A (en) * 2017-12-29 2018-06-01 北京奇宝科技有限公司 A kind of method and apparatus for pushing recommendation
CN108345702A (en) * 2018-04-10 2018-07-31 北京百度网讯科技有限公司 Entity recommends method and apparatus

Also Published As

Publication number Publication date
CN110297967A (en) 2019-10-01

Similar Documents

Publication Publication Date Title
CN110297967B (en) Method, device and equipment for determining interest points and computer readable storage medium
US20210329094A1 (en) Discovering signature of electronic social networks
Gao et al. Self-paced network embedding
Buntain et al. Identifying social roles in reddit using network structure
Lee et al. Uncovering social spammers: social honeypots+ machine learning
US9483580B2 (en) Estimation of closeness of topics based on graph analytics
CN104717124B (en) A friend recommendation method, device and server
CN104281882B (en) The method and system of prediction social network information stream row degree based on user characteristics
US11010687B2 (en) Detecting abusive language using character N-gram features
WO2018192491A1 (en) Information pushing method and device
CN107908789A (en) Method and apparatus for generating information
CN103974097A (en) Personalized user-generated video prefetching method and system based on popularity and social networks
CN103995823A (en) Information recommending method based on social network
CN106993030A (en) Information-pushing method and device based on artificial intelligence
JP5961320B2 (en) Method of classifying users in social media, computer program, and computer
US11336596B2 (en) Personalized low latency communication
CN113365090B (en) Object recommendation method, object recommendation device, electronic equipment and readable storage medium
KR101929649B1 (en) System and method for recommendation of open chat room through chat log keyword extraction
KR20150046431A (en) Auto-learning system and method for derive effective marketing
CN109299351B (en) Content recommendation method and device, electronic equipment and computer readable medium
Lu et al. Identification of key nodes in microblog networks
CN114547439A (en) Service optimization method based on big data and artificial intelligence and electronic commerce AI system
CN105589916A (en) Method for extracting explicit and implicit interest knowledge
US10242106B2 (en) Enhance search assist system's freshness by extracting phrases from news articles
CN112052995A (en) Social network user influence prediction method based on fusion emotional tendency theme

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载