CN118827774A - An AIGC-driven automatic personalized content push system and method - Google Patents
An AIGC-driven automatic personalized content push system and method Download PDFInfo
- Publication number
- CN118827774A CN118827774A CN202411277910.4A CN202411277910A CN118827774A CN 118827774 A CN118827774 A CN 118827774A CN 202411277910 A CN202411277910 A CN 202411277910A CN 118827774 A CN118827774 A CN 118827774A
- Authority
- CN
- China
- Prior art keywords
- browsing
- push
- content
- personalized
- list
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/55—Push-based network services
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/27—Regression, e.g. linear or logistic regression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/2866—Architectures; Arrangements
- H04L67/30—Profiles
- H04L67/306—User profiles
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域Technical Field
本发明属于数据推送技术领域,具体涉及一种AIGC驱动的内容自动个性化推送系统及方法。The present invention belongs to the technical field of data push, and in particular relates to an AIGC-driven automatic personalized content push system and method.
背景技术Background Art
随着互联网技术的飞速发展,人们获取信息的方式已经从传统的媒体转向了数字化、网络化的信息平台。这种转变带来了信息的爆炸式增长,用户在面对海量信息时往往感到无从下手,难以快速找到自己真正感兴趣的内容,因此,如何从庞大的数据中筛选出对用户有价值的信息,并将其有效地推送给用户,成为了当前信息技术领域的一个重要课题,为了解决这一问题,个性化推荐系统应运而生,个性化推荐系统通过分析用户的历史行为、偏好以及社交网络等信息,为用户推荐其可能感兴趣的内容。With the rapid development of Internet technology, the way people obtain information has shifted from traditional media to digital and networked information platforms. This shift has brought about an explosive growth of information. Users often feel overwhelmed when faced with massive amounts of information and find it difficult to quickly find content that they are really interested in. Therefore, how to filter out valuable information from huge amounts of data and effectively push it to users has become an important topic in the current field of information technology. In order to solve this problem, personalized recommendation systems have emerged. Personalized recommendation systems recommend content that users may be interested in by analyzing users' historical behaviors, preferences, and social network information.
然而,现有的个性化推荐系统仍存在一些不足之处,目前的推荐方式中大多是基于用户的偏好,来生成推荐列表,但是大多的推荐算法在处理大规模数据时,往往会忽略推荐内容之间的关联性,其结果过于程序化,导致推荐结果只考虑用户偏好,但是实际上,推荐列表中包含了较多的内容,部分内容之间是存在一定的关联性,因此不考虑这些关联性时,往往会使得推荐结果过于单一,缺乏相应的灵活性,基于此,本方案提供了一种AIGC驱动的内容自动个性化推送方法,以解决上述问题。However, the existing personalized recommendation systems still have some shortcomings. Most of the current recommendation methods are based on user preferences to generate recommendation lists. However, most recommendation algorithms tend to ignore the correlation between recommended content when processing large-scale data. The results are too procedural, resulting in recommendation results that only consider user preferences. But in fact, the recommendation list contains more content, and there is a certain correlation between some of the content. Therefore, if these correlations are not considered, the recommendation results will often be too single and lack corresponding flexibility. Based on this, this solution provides an AIGC-driven content automatic personalized push method to solve the above problems.
发明内容Summary of the invention
本发明的目的是提供一种AIGC驱动的内容自动个性化推送系统及方法,能够在用户浏览过程中提供更加丰富和灵活的个性化内容推送,从而提高用户满意度和系统推荐的准确性。The purpose of the present invention is to provide an AIGC-driven automatic personalized content push system and method, which can provide richer and more flexible personalized content push during user browsing, thereby improving user satisfaction and the accuracy of system recommendations.
本发明采取的技术方案具体如下:The technical solution adopted by the present invention is as follows:
一种AIGC驱动的内容自动个性化推送方法,包括:An AIGC-driven automatic personalized content push method, comprising:
基于获取的用户行为数据,进行分类处理,得到多个用户偏好标签;Based on the acquired user behavior data, classification processing is performed to obtain multiple user preference tags;
采集各个所述用户偏好标签下用户的浏览信息,并依据所述浏览信息对用户偏好标签进行排序处理,得到初始化推送列表;Collecting browsing information of users under each of the user preference tags, and sorting the user preference tags according to the browsing information to obtain an initialization push list;
依据所述初始化推送列表匹配推荐内容,并对各个所述推荐内容进行相关性分析,得到关联推荐内容和非关联推荐内容;Matching recommended content according to the initialization push list, and performing correlation analysis on each of the recommended content to obtain associated recommended content and non-associated recommended content;
依据所述关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,得到个性化推送列表,其中,依据该个性化推送列表推送时,可选择跳过与当前浏览内容相关的关联推荐内容;Reordering the initial push list according to the related recommended content and the non-related recommended content to obtain a personalized push list, wherein when pushing according to the personalized push list, it is possible to choose to skip the related recommended content related to the currently browsed content;
基于所述个性化推送列表向用户执行个性化推送,且依据个性化推送下的用户反馈信息对所述个性化推送列表进行优化,得到更新推送列表,并依据所述更新推送列表继续执行个性化推送。Based on the personalized push list, personalized push is performed to the user, and according to the user feedback information under the personalized push, the personalized push list is optimized to obtain an updated push list, and personalized push is continued to be performed according to the updated push list.
在一种优选方案中,所述基于获取的用户行为数据,进行分类处理,得到多个用户偏好标签的步骤,包括:In a preferred solution, the step of performing classification processing based on the acquired user behavior data to obtain multiple user preference tags includes:
构建历史监测时段,并采集所述历史监测时段内的用户行为数据,所述用户行为数据包括历史浏览内容和历史搜索记录;Establishing a historical monitoring period and collecting user behavior data within the historical monitoring period, wherein the user behavior data includes historical browsing content and historical search records;
对所述历史浏览内容和历史搜索记录进行预处理,所述预处理包括去除无效数据和异常值;Preprocessing the historical browsing content and historical search records, wherein the preprocessing includes removing invalid data and abnormal values;
提取预处理后的历史浏览内容和历史搜索记录的特征信息,并为同一所述特征信息下的历史浏览内容和历史搜索记录赋予用户偏好标签;Extracting feature information of the preprocessed historical browsing content and historical search records, and assigning user preference tags to the historical browsing content and historical search records under the same feature information;
其中,所有所述用户偏好标签均具有唯一性。Among them, all the user preference tags are unique.
在一种优选方案中,所述采集各个所述用户偏好标签下用户的浏览信息,并依据所述浏览信息对用户偏好标签进行排序处理,得到初始化推送列表的步骤,包括:In a preferred solution, the step of collecting browsing information of users under each of the user preference tags, and sorting the user preference tags according to the browsing information to obtain an initial push list includes:
获取所述历史监测时段内,所述用户偏好标签下的浏览信息,所述浏览信息包括用户对历史浏览内容的浏览完成度和浏览次数;Obtaining browsing information under the user preference tag during the historical monitoring period, wherein the browsing information includes the user's browsing completion degree and browsing times for historical browsing content;
依据所述浏览完成度和浏览次数对各个用户偏好标签进行加权评分,得到各个所述用户偏好标签的推送评分;Performing a weighted score on each user preference tag according to the browsing completion degree and browsing times to obtain a push score for each user preference tag;
对所述推送评分按照由大至小的顺序进行排序处理,且依据所述推送评分的排序结果逐一输出各个用户偏好标签的推送顺序,并同步汇总为初始化推送列表。The push scores are sorted in descending order, and the push order of each user preference tag is output one by one according to the sorting result of the push scores, and simultaneously summarized into an initialization push list.
在一种优选方案中,所述依据所述浏览完成度和浏览次数对各个用户偏好标签进行加权评分,得到各个所述用户偏好标签的推送评分的步骤,包括:In a preferred solution, the step of weighting and scoring each user preference tag according to the browsing completion degree and the browsing times to obtain the push score of each user preference tag includes:
在所述历史监测时段内设置多个等时间间隔的样本时段;Setting a plurality of sample periods with equal time intervals within the historical monitoring period;
采集各个所述样本时段内,所有所述历史浏览内容的浏览量,以及各个所述历史浏览内容的浏览时间;Collecting the page views of all the historical browsing contents and the page view time of each of the historical browsing contents within each of the sample time periods;
依据所述历史浏览内容的浏览量和浏览时间确定所述浏览完成度和浏览次数的分配权重;Determine the distribution weights of the browsing completion degree and the number of browsing times according to the browsing volume and browsing time of the historical browsing content;
获取加权函数,并将所述浏览完成度和浏览次数,以及对应的分配权重输入至加权函数中进行加权运算,得到各个所述用户偏好标签的推送评分。A weighted function is obtained, and the browsing completion degree and the number of browsing times, as well as the corresponding allocation weights are input into the weighted function for weighted calculation to obtain a push score for each of the user preference tags.
在一种优选方案中,所述依据所述历史浏览内容的浏览量和浏览时间确定所述浏览完成度和浏览次数的分配权重的步骤,包括:In a preferred solution, the step of determining the distribution weights of the browsing completion degree and the number of browsing times according to the browsing volume and browsing time of the historical browsing content includes:
根据各个所述样本时段内历史浏览内容的浏览量和浏览时间,计算各个所述样本时段下的单位浏览量;Calculate the unit page views in each sample period according to the page views and page views time of the historical page views in each sample period;
将各个所述样本时段下的单位浏览量按照发生时序进行排列;Arrange the unit page views in each sample period according to the chronological order of occurrence;
获取标准偏移量,并依据所述标准偏移量对各个所述单位浏览量进行偏移处理,得到多个聚类区间;Obtaining a standard offset, and performing an offset process on each of the unit views according to the standard offset to obtain a plurality of clustering intervals;
统计各个所述聚类区间下的单位浏览量的数量,并记录为筛选条件参数,且将取值最大的所述筛选条件参数下的聚类区间标定为标准区间;Counting the number of unit views in each clustering interval and recording it as a screening condition parameter, and marking the clustering interval under the screening condition parameter with the largest value as a standard interval;
对所述标准区间下的单位浏览量进行求和以及取平均值运算,得到所述历史监测时段下的标准单位浏览量;The unit page views in the standard interval are summed and averaged to obtain the standard unit page views in the historical monitoring period;
获取权重分配函数,并将所述标准单位浏览量输入至权重分配函数中,且将所述权重分配函数的输出结果记录为所述浏览完成度和浏览次数的分配权重。A weight allocation function is obtained, and the standard unit browsing volume is input into the weight allocation function, and the output result of the weight allocation function is recorded as the allocation weight of the browsing completion degree and the browsing number.
在一种优选方案中,所述对各个所述推荐内容进行相关性分析,得到关联推荐内容和非关联推荐内容的步骤,包括:In a preferred solution, the step of performing correlation analysis on each of the recommended contents to obtain related recommended contents and non-related recommended contents includes:
获取所有所述推荐内容,并进行特征提取,得到各个所述推荐内容的关键特征;Acquire all the recommended contents, and perform feature extraction to obtain key features of each of the recommended contents;
对各个所述关键特征进行向量转换,得到各个所述推荐内容的关键特征向量;Performing vector conversion on each of the key features to obtain a key feature vector of each of the recommended contents;
对所述推荐内容逐一进行两两匹配,并将两两匹配后的所述推荐内容的关键特征向量输入至预设的关联测算函数中,且将所述关联测算函数的输出结果记录为关联评分;Matching the recommended contents one by one, inputting the key feature vectors of the recommended contents after the pairwise matching into a preset association calculation function, and recording the output result of the association calculation function as an association score;
获取关联评价阈值,并将所述关联评分与关联评价阈值进行比较;Obtaining a correlation evaluation threshold, and comparing the correlation score with the correlation evaluation threshold;
当所述关联评分大于关联评价阈值时,则表明两两匹配后的所述推荐内容之间存在相关性,并同步记录为关联推荐内容;When the correlation score is greater than the correlation evaluation threshold, it indicates that there is a correlation between the recommended contents after pairwise matching, and they are simultaneously recorded as related recommended contents;
当所述关联评分小于或等于关联评价阈值时,则表明两两匹配后的所述推荐内容之间未存在相关性,并同步记录为非关联推荐内容。When the correlation score is less than or equal to the correlation evaluation threshold, it indicates that there is no correlation between the recommended contents after pairwise matching, and they are simultaneously recorded as non-correlated recommended contents.
在一种优选方案中,所述依据所述关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,得到个性化推送列表的步骤,包括:In a preferred solution, the step of reordering the initialized push list according to the associated recommended content and the non-associated recommended content to obtain a personalized push list includes:
获取所述初始化推送列表,并将所述初始化推送列表中的相邻推荐内容分别记录为前置待处理数据和后置待处理数据;Acquire the initialization push list, and record adjacent recommended contents in the initialization push list as pre-processed data and post-processed data respectively;
采集所述前置待处理数据的关联推荐内容与前置待处理数据之间的关联评分,并将所述关联评分转换至与推送评分为同一量纲,且将记录为条件参数;Collecting the correlation score between the associated recommended content of the pre-processed data and the pre-processed data, converting the correlation score to the same dimension as the push score, and recording it as a condition parameter;
将所述条件参数与前置待处理数据的关联推荐内容的推送评分进行加权求和处理,得到关联推送评分;Performing weighted sum processing on the condition parameter and the push score of the associated recommended content of the previous data to be processed to obtain the associated push score;
将所述关联推送评分与后置待处理数据的推送评分进行比较,且在所述关联推送评分大于后置待处理数据的推送评分时,将与关联推送评分对应的关联推荐内容调整至后置待处理数据的前方,反之,则继续保持原有的顺序;Compare the associated push score with the push score of the subsequent data to be processed, and when the associated push score is greater than the push score of the subsequent data to be processed, adjust the associated recommended content corresponding to the associated push score to the front of the subsequent data to be processed, otherwise, continue to maintain the original order;
重复对所述初始化推送列表中的所有相邻推荐内容进行处理,直至完成整个初始化推送列表的重新排序,并输出为个性化推送列表。All adjacent recommended contents in the initialization push list are processed repeatedly until the entire initialization push list is reordered and output as a personalized push list.
在一种优选方案中,所述依据个性化推送下的用户反馈信息对所述个性化推送列表进行优化,得到更新推送列表的步骤,包括:In a preferred solution, the step of optimizing the personalized push list according to user feedback information under personalized push to obtain an updated push list includes:
获取用户对个性化推送下的各个推荐内容的反馈满意度;Obtain user feedback and satisfaction with each recommended content under personalized push;
将所述反馈满意度转换为各个推荐内容对应推送评分的优化权重;Convert the feedback satisfaction into an optimized weight of a push score corresponding to each recommended content;
获取优化函数,并将各个推荐内容对应的推送评分以及对应的优化权重输入至优化函数中进行运算,得到更新推送评分;Obtain an optimization function, and input the push score and the corresponding optimization weight corresponding to each recommended content into the optimization function for calculation to obtain an updated push score;
依据所述更新推送评分对个性化推送列表进行重新排序,得到更新推送列表,且应用所述更新推送列表执行各个推荐内容的推送处理。The personalized push list is reordered according to the update push score to obtain an update push list, and the update push list is applied to execute push processing of each recommended content.
本发明还提供了,一种AIGC驱动的内容自动个性化推送系统,使用上述的AIGC驱动的内容自动个性化推送方法,包括:The present invention also provides an AIGC-driven automatic personalized content push system, using the above-mentioned AIGC-driven automatic personalized content push method, including:
数据分类模块,所述数据分类模块用于基于获取的用户行为数据,进行分类处理,得到多个用户偏好标签;A data classification module, wherein the data classification module is used to perform classification processing based on the acquired user behavior data to obtain multiple user preference tags;
初始化模块,所述初始化模块用于采集各个所述用户偏好标签下用户的浏览信息,并依据所述浏览信息对用户偏好标签进行排序处理,得到初始化推送列表;An initialization module, the initialization module is used to collect browsing information of users under each of the user preference tags, and sort the user preference tags according to the browsing information to obtain an initialization push list;
相关性分析模块,所述相关性分析模块用于依据所述初始化推送列表匹配推荐内容,并对各个所述推荐内容进行相关性分析,得到关联推荐内容和非关联推荐内容;A correlation analysis module, the correlation analysis module is used to match the recommended content according to the initialization push list, and perform correlation analysis on each of the recommended content to obtain related recommended content and non-related recommended content;
个性化推送模块,所述个性化推送模块用于依据所述关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,得到个性化推送列表;A personalized push module, the personalized push module is used to reorder the initialized push list according to the associated recommended content and the non-associated recommended content to obtain a personalized push list;
列表更新模块,所述列表更新模块用于基于所述个性化推送列表向用户执行个性化推送,且依据个性化推送下的用户反馈信息对所述个性化推送列表进行优化,得到更新推送列表,并依据所述更新推送列表继续执行个性化推送。A list update module, wherein the list update module is used to perform personalized push to users based on the personalized push list, and optimize the personalized push list according to user feedback information under the personalized push to obtain an updated push list, and continue to perform personalized push according to the updated push list.
以及,一种电子设备,所述电子设备包括:And, an electronic device, the electronic device comprising:
至少一个处理器;at least one processor;
以及与所述至少一个处理器通信连接的存储器;and a memory communicatively coupled to the at least one processor;
其中,所述存储器存储有可被所述至少一个处理器执行的计算机程序,所述计算机程序被所述至少一个处理器执行,以使所述至少一个处理器能够执行上述的AIGC驱动的内容自动个性化推送方法。The memory stores a computer program that can be executed by the at least one processor, and the computer program is executed by the at least one processor so that the at least one processor can execute the above-mentioned AIGC-driven automatic personalized content push method.
本发明取得的技术效果为:The technical effects achieved by the present invention are:
本发明通过分析用户行为数据,实现个性化内容推送,提高用户满意度和参与度,具体通过相关性分析,能够确保推荐内容的相关性和多样性,丰富用户的浏览内容,避免用户疲劳,提高用户对推送内容的满意度,同时还会根据用户反馈,对个性化推送列表进行相应的动态调整,持续提升推荐内容的推送效果,自动学习和适应用户偏好变化,实现高效的内容个性化推送。The present invention realizes personalized content push and improves user satisfaction and participation by analyzing user behavior data. Specifically, through correlation analysis, it can ensure the relevance and diversity of recommended content, enrich users' browsing content, avoid user fatigue, and improve user satisfaction with pushed content. At the same time, according to user feedback, the personalized push list can be dynamically adjusted accordingly, the push effect of recommended content can be continuously improved, and automatic learning and adaptation to changes in user preferences can be achieved to realize efficient personalized content push.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本发明的方法流程示意图;Fig. 1 is a schematic flow chart of the method of the present invention;
图2是本发明的系统模块示意图;FIG2 is a schematic diagram of a system module of the present invention;
图3是本发明的电子设备结构示意图。FIG. 3 is a schematic diagram of the structure of an electronic device of the present invention.
具体实施方式DETAILED DESCRIPTION
为使本发明的上述目的、特征和优点能够更加明显易懂,下面结合说明书附图对本发明的具体实施方式做详细的说明。In order to make the above-mentioned objects, features and advantages of the present invention more obvious and easy to understand, the specific implementation methods of the present invention are described in detail below in conjunction with the accompanying drawings.
在下面的描述中阐述了很多具体细节以便于充分理解本发明,但是本发明还可以采用其他不同于在此描述的其它方式来实施,本领域技术人员可以在不违背本发明内涵的情况下做类似推广,因此本发明不受下面公开的具体实施例的限制。In the following description, many specific details are set forth to facilitate a full understanding of the present invention, but the present invention may also be implemented in other ways different from those described herein, and those skilled in the art may make similar generalizations without violating the connotation of the present invention. Therefore, the present invention is not limited to the specific embodiments disclosed below.
其次,此处所称的“一个实施例”或“实施例”是指可包含于本发明至少一个实现方式中的特定特征、结构或特性。在本说明书中不同地方出现的“在一个较佳的实施方式中”并非均指同一个实施例,也不是单独的或选择性的与其他实施例互相排斥的实施例。Secondly, the term "one embodiment" or "embodiment" as used herein refers to a specific feature, structure or characteristic that may be included in at least one implementation of the present invention. The phrase "in a preferred embodiment" that appears in different places in this specification does not refer to the same embodiment, nor is it a separate or selective embodiment that is mutually exclusive with other embodiments.
请参阅图1所示,本发明提供了一种AIGC驱动的内容自动个性化推送方法,包括:Referring to FIG. 1 , the present invention provides an AIGC-driven automatic personalized content push method, including:
S1、基于获取的用户行为数据,进行分类处理,得到多个用户偏好标签;S1. Based on the acquired user behavior data, classification processing is performed to obtain multiple user preference tags;
S2、采集各个用户偏好标签下用户的浏览信息,并依据浏览信息对用户偏好标签进行排序处理,得到初始化推送列表;S2. Collect browsing information of users under each user preference tag, and sort the user preference tags according to the browsing information to obtain an initial push list;
S3、依据初始化推送列表匹配推荐内容,并对各个推荐内容进行相关性分析,得到关联推荐内容和非关联推荐内容;S3, matching recommended content according to the initialization push list, and performing correlation analysis on each recommended content to obtain related recommended content and non-related recommended content;
S4、依据关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,得到个性化推送列表,其中,依据该个性化推送列表推送时,可选择跳过与当前浏览内容相关的关联推荐内容;S4, reordering the initial push list according to the associated recommended content and the non-associated recommended content to obtain a personalized push list, wherein when pushing according to the personalized push list, it is possible to choose to skip the associated recommended content related to the currently browsed content;
S5、基于个性化推送列表向用户执行个性化推送,且依据个性化推送下的用户反馈信息对个性化推送列表进行优化,得到更新推送列表,并依据更新推送列表继续执行个性化推送。S5. Execute personalized push to the user based on the personalized push list, optimize the personalized push list according to the user feedback information under the personalized push, obtain an updated push list, and continue to execute personalized push according to the updated push list.
如上述步骤S1-S5所述,随着大数据和人工智能技术的发展,个性化推送能够更加精准地满足用户的需求,提高用户体验和满意度,本实施例中,首先会收集和分析用户的在线行为数据,通过对用户行为数据进行分类处理,能够识别出用户的多个偏好标签,以此能够反映出用户的兴趣和需求,接下来会针对每个用户偏好标签,采集对应用户的浏览信息,还会根据浏览信息对用户偏好标签进行排序处理,从而生成初始化推送列表,初始化推送列表是根据用户的历史行为和偏好进行初步排序的,目的是为了更好地满足用户的初始需求,然后会依据初始化推送列表匹配推荐内容,推荐内容可以是文章、视频、图片等多种形式,然后会对各个推荐内容进行相关性分析,以此能够区分出推荐内容中的关联推荐内容和非关联推荐内容,再之后,依据关联推荐内容和非关联推荐内容,会对初始化推送列表进行重新排序,通过此方式,能够将关联推荐内容优先展示给用户,从而提高用户对推送内容的满意度和点击率,重新排序后得到的个性化推送列表,能够更好地满足用户的个性化需求,最后,会基于个性化推送列表向用户执行个性化推送,用户在接收到推送内容后,会根据自己的兴趣和需求给出反馈信息(如评分制),依据反馈信息能够对个性化推送列表进行优化,得到更新推送列表,更新推送列表会不断进行优化,确保能够满足用户的个性化需求,实现内容与用户需求的精准匹配。As described in the above steps S1-S5, with the development of big data and artificial intelligence technology, personalized push can more accurately meet the needs of users and improve user experience and satisfaction. In this embodiment, the user's online behavior data will be collected and analyzed first. By classifying the user behavior data, multiple preference tags of the user can be identified to reflect the user's interests and needs. Next, the browsing information of the corresponding user will be collected for each user preference tag, and the user preference tags will be sorted according to the browsing information to generate an initialization push list. The initialization push list is preliminarily sorted according to the user's historical behavior and preferences in order to better meet the user's initial needs. Then, the recommended content will be matched according to the initialization push list. The recommended content can be in various forms such as articles, videos, pictures, etc., and then each push list will be sorted. The recommended content is analyzed for relevance, so that the related recommended content and the non-related recommended content can be distinguished. After that, the initialized push list will be reordered according to the related recommended content and the non-related recommended content. In this way, the related recommended content can be displayed to the user first, thereby improving the user's satisfaction with the pushed content and the click-through rate. The personalized push list obtained after the reordering can better meet the user's personalized needs. Finally, personalized push will be performed to the user based on the personalized push list. After receiving the pushed content, the user will give feedback information (such as a rating system) according to his or her interests and needs. The personalized push list can be optimized based on the feedback information to obtain an updated push list. The updated push list will be continuously optimized to ensure that the user's personalized needs can be met and the content can be accurately matched with the user's needs.
在一个较佳的实施方式中,基于获取的用户行为数据,进行分类处理,得到多个用户偏好标签的步骤,包括:In a preferred embodiment, the step of performing classification processing based on the acquired user behavior data to obtain multiple user preference tags includes:
S101、构建历史监测时段,并采集历史监测时段内的用户行为数据,用户行为数据包括历史浏览内容和历史搜索记录;S101, constructing a historical monitoring period, and collecting user behavior data within the historical monitoring period, where the user behavior data includes historical browsing content and historical search records;
S102、对历史浏览内容和历史搜索记录进行预处理,预处理包括去除无效数据和异常值;S102, preprocessing the historical browsing content and historical search records, the preprocessing including removing invalid data and abnormal values;
S103、提取预处理后的历史浏览内容和历史搜索记录的特征信息,并为同一特征信息下的历史浏览内容和历史搜索记录赋予用户偏好标签;S103, extracting characteristic information of the preprocessed historical browsing content and historical search records, and assigning user preference tags to the historical browsing content and historical search records under the same characteristic information;
其中,所有用户偏好标签均具有唯一性。Among them, all user preference tags are unique.
如上述步骤S101-S103,在输出用户偏好标签时,首先构建一个历史监测时段,历史监测时段为衔接当前时刻的时间段,能够反映出用户短期行为,在历史监测时段内,将会对用户行为数据进行采集,用户行为数据主要包括用户的历史浏览内容和历史搜索记录,通过用户的历史浏览内容和历史搜索记录,可以了解用户的兴趣和需求,然后将对历史浏览内容和历史搜索记录进行预处理,预处理的目的是确保数据的质量,以便进行更准确的分析,预处理步骤包括去除无效数据和异常值,以排除可能干扰分析结果的不相关或错误的信息,一次能够确保后续分析的准确性和可靠性,在预处理完成后,将提取历史浏览内容和历史搜索记录中的特征信息,这些特征信息是用户行为的关键指标,能够反映出用户的偏好和兴趣点,为更好地整合特征信息,为同一特征信息下的历史浏览内容和历史搜索记录赋予相应的用户偏好标签,同时还需要确保所有用户偏好标签的唯一性,使得每个用户偏好标签都代表一个独特的偏好,避免重复或混淆,确保分类的准确性和后续分析的有效性。As in the above steps S101-S103, when outputting the user preference label, a historical monitoring period is first constructed. The historical monitoring period is a time period connecting the current moment and can reflect the user's short-term behavior. During the historical monitoring period, user behavior data will be collected. The user behavior data mainly includes the user's historical browsing content and historical search records. Through the user's historical browsing content and historical search records, the user's interests and needs can be understood. Then the historical browsing content and historical search records will be preprocessed. The purpose of preprocessing is to ensure the quality of the data in order to conduct a more accurate analysis. The preprocessing step includes removing invalid data and outliers to exclude irrelevant or erroneous information that may interfere with the analysis results, which can ensure the accuracy and reliability of subsequent analysis. After the preprocessing is completed, the feature information in the historical browsing content and historical search records will be extracted. These feature information are key indicators of user behavior and can reflect the user's preferences and interests. In order to better integrate the feature information, the historical browsing content and historical search records under the same feature information are assigned corresponding user preference labels. At the same time, it is also necessary to ensure the uniqueness of all user preference labels, so that each user preference label represents a unique preference to avoid duplication or confusion, and ensure the accuracy of classification and the effectiveness of subsequent analysis.
在一个较佳的实施方式中,采集各个用户偏好标签下用户的浏览信息,并依据浏览信息对用户偏好标签进行排序处理,得到初始化推送列表的步骤,包括:In a preferred embodiment, the steps of collecting browsing information of users under each user preference tag, sorting the user preference tags according to the browsing information, and obtaining an initial push list include:
S201、获取历史监测时段内,用户偏好标签下的浏览信息,浏览信息包括用户对历史浏览内容的浏览完成度和浏览次数;S201, obtaining browsing information under a user preference tag during a historical monitoring period, where the browsing information includes the browsing completion degree and browsing times of the user's historical browsing content;
S202、依据浏览完成度和浏览次数对各个用户偏好标签进行加权评分,得到各个用户偏好标签的推送评分;S202, weighted scoring is performed on each user preference tag according to the browsing completion degree and the browsing times, so as to obtain a push score for each user preference tag;
S203、对推送评分按照由大至小的顺序进行排序处理,且依据推送评分的排序结果逐一输出各个用户偏好标签的推送顺序,并同步汇总为初始化推送列表。S203: sorting the push scores in descending order, and outputting the push order of each user preference tag one by one according to the sorting result of the push scores, and synchronously summarizing them into an initialization push list.
如上述S201-S203所述,为了更好地满足用户的需求,在用户偏好标签确定后,首先需要采集各个用户在不同用户偏好标签下的浏览信息,浏览信息包括用户在历史监测时段内对各种推送的内容的浏览完成度和浏览次数,以此来反映用户对不同内容的兴趣程度和偏好,然后依据用户对历史浏览内容的浏览完成度和浏览次数对各个用户偏好标签进行加权评分,以此评估各个用户偏好标签的推送评分,推送评分输出之后,会对各个用户偏好标签的推送评分进行排序处理,排序的依据是推送评分取值的大小,具体是按照由大至小的顺序进行排列,再依据推送评分的排序结果逐一输出各个用户偏好标签的推送顺序,并同步汇总为初始化推送列表即可。为后续个性化推送列表的生成提供相应的基础。As described in S201-S203 above, in order to better meet the needs of users, after the user preference tags are determined, it is first necessary to collect the browsing information of each user under different user preference tags. The browsing information includes the browsing completion and browsing times of various pushed contents by the user during the historical monitoring period, so as to reflect the user's interest and preference for different contents. Then, each user preference tag is weighted and scored according to the browsing completion and browsing times of the historical browsing contents by the user, so as to evaluate the push score of each user preference tag. After the push score is output, the push score of each user preference tag will be sorted. The sorting is based on the size of the push score value, specifically, it is arranged in order from large to small. Then, according to the sorting result of the push score, the push order of each user preference tag is output one by one, and synchronously summarized into an initialized push list. Provide a corresponding basis for the subsequent generation of personalized push lists.
在一个较佳的实施方式中,依据浏览完成度和浏览次数对各个用户偏好标签进行加权评分,得到各个用户偏好标签的推送评分的步骤,包括:In a preferred embodiment, the step of weighting and scoring each user preference tag according to the browsing completion degree and the browsing times to obtain the push score of each user preference tag includes:
在历史监测时段内设置多个等时间间隔的样本时段;Set up multiple sample periods with equal time intervals within the historical monitoring period;
采集各个样本时段内,所有历史浏览内容的浏览量,以及各个历史浏览内容的浏览时间;Collect the page views of all historical browsing contents and the browsing time of each historical browsing content in each sample period;
依据历史浏览内容的浏览量和浏览时间确定浏览完成度和浏览次数的分配权重;Determine the distribution weight of browsing completion and browsing times based on the browsing volume and browsing time of historical browsing content;
获取加权函数,并将浏览完成度和浏览次数,以及对应的分配权重输入至加权函数中进行加权运算,得到各个用户偏好标签的推送评分。A weighted function is obtained, and the browsing completion degree and the number of browsing times, as well as the corresponding distribution weights are input into the weighted function for weighted calculation to obtain the push score of each user preference tag.
上述,在计算推送评分时,首先在历史监测时段内设定多个等时间间隔的样本时段,以此能够更好地捕捉用户在不同时间段内的浏览行为,接下来会采集样本时段内,所有历史浏览内容的浏览量,以及各个历史浏览内容的浏览时间,浏览量可以反映出某个内容的受欢迎程度,而浏览时间则可以揭示用户对内容的深入程度,然后,将依据历史浏览内容的浏览量和浏览时间来确定浏览完成度和浏览次数的分配权重,例如,如果一个用户在某个历史浏览内容上花费了更多的时间,那么这个内容的浏览完成度将会被赋予更高的权重,同样,浏览量大的历史浏览内容,其浏览次数也会被赋予更高的权重,最后引入预设的加权函数,将浏览完成度和浏览次数,以及对应的分配权重输入至加权函数中进行加权运算,便能够得到各个用户偏好标签的推送评分,加权函数的表达式为:As mentioned above, when calculating the push score, firstly, multiple sample periods with equal time intervals are set within the historical monitoring period, so as to better capture the browsing behavior of users in different time periods. Next, the page views of all historical browsing contents and the browsing time of each historical browsing content within the sample period will be collected. The page views can reflect the popularity of a certain content, and the browsing time can reveal the user's depth of the content. Then, the distribution weights of browsing completion and number of views will be determined based on the page views and browsing time of the historical browsing content. For example, if a user spends more time on a certain historical browsing content, then the browsing completion of this content will be given a higher weight. Similarly, the number of views of historical browsing content with a large number of views will also be given a higher weight. Finally, a preset weighting function is introduced, and the browsing completion and number of views, as well as the corresponding distribution weights are input into the weighting function for weighted calculation, so that the push score of each user preference tag can be obtained. The expression of the weighting function is:
; ;
式中,表示推送评分,和分别表示浏览完成度和浏览次数的分配权重,表示浏览完成度,表示浏览次数,在推送评分输出之后,便可进行初始化推送列表的生成。In the formula, Indicates push rating. and They represent the distribution weights of browsing completion and browsing times, respectively. Indicates browsing completion. Indicates the number of views. After the push score is output, the push list can be initialized.
在一个较佳的实施方式中,依据历史浏览内容的浏览量和浏览时间确定浏览完成度和浏览次数的分配权重的步骤,包括:In a preferred embodiment, the step of determining the distribution weights of browsing completion and browsing times according to the browsing volume and browsing time of historical browsing content includes:
Step1、根据各个样本时段内历史浏览内容的浏览量和浏览时间,计算各个样本时段下的单位浏览量;Step 1. Calculate the unit page views in each sample period based on the page views and page views time of the historical page views in each sample period.
Step2、将各个样本时段下的单位浏览量按照发生时序进行排列;Step 2: Arrange the unit page views in each sample period according to the chronological order of occurrence;
Step3、获取标准偏移量,并依据标准偏移量对各个单位浏览量进行偏移处理,得到多个聚类区间;Step 3, obtain the standard offset, and perform offset processing on each unit of browsing volume according to the standard offset to obtain multiple clustering intervals;
Step4、统计各个聚类区间下的单位浏览量的数量,并记录为筛选条件参数,且将取值最大的筛选条件参数下的聚类区间标定为标准区间;Step 4. Count the number of unit views in each clustering interval and record it as the filtering condition parameter, and mark the clustering interval under the filtering condition parameter with the largest value as the standard interval;
Step5、对标准区间下的单位浏览量进行求和以及取平均值运算,得到历史监测时段下的标准单位浏览量;Step 5, sum and average the unit page views in the standard interval to obtain the standard unit page views in the historical monitoring period;
Step6、获取权重分配函数,并将标准单位浏览量输入至权重分配函数中,且将权重分配函数的输出结果记录为浏览完成度和浏览次数的分配权重。Step 6. Obtain a weight distribution function, input the standard unit page views into the weight distribution function, and record the output of the weight distribution function as the distribution weights of the page view completion and page view count.
如上述步骤Step1-Step6所述,为了确定浏览完成度和浏览次数的分配权重,需要依据历史浏览内容的浏览量和浏览时间来进行相应的分析,首先需要收集各个样本时段内历史浏览内容的浏览量和浏览时间,进而可以计算出各个样本时段下的单位浏览量,单位浏览量是指在浏览时间下与对应浏览量的比值,能够反映出在样本时段内用户对历史浏览内容的偏好程度,然后将各个样本时段下的单位浏览量按照发生的时间顺序进行排列,并引入预设的标准偏移量,标准偏移量是根据历史数据的统计特性确定的,用于对各个单位浏览量进行偏移处理,具体可以将单位浏览量向右或向左移动一定的数值,以形成多个聚类区间,在得到多个聚类区间后,需要统计每个聚类区间下的单位浏览量的数量,每个聚类区间下的单位浏览量的数量将被记录为筛选条件参数,之后找出取值最大的筛选条件参数,将对应的聚类区间标定为标准区间,标准区间代表了用户的习惯浏览行为,再之后对标准区间下的单位浏览量进行求和以及取平均值运算,通过对标准区间下的单位浏览量进行求和以及取平均值运算,可以得到历史监测时段下的标准单位浏览量,然后引入权重分配函数,且将标准单位浏览量输入至权重分配函数中,其中,权重分配函数的表达式为:As described in the above steps Step 1-Step 6, in order to determine the distribution weights of browsing completion and browsing times, it is necessary to perform corresponding analysis based on the browsing volume and browsing time of historical browsing content. First, it is necessary to collect the browsing volume and browsing time of historical browsing content in each sample period, and then calculate the unit browsing volume in each sample period. The unit browsing volume refers to the ratio of the browsing volume to the corresponding browsing volume in the browsing time, which can reflect the user's preference for the historical browsing content in the sample period. Then, the unit browsing volume in each sample period is arranged in the order of occurrence, and a preset standard offset is introduced. The standard offset is determined based on the statistical characteristics of historical data and is used to offset each unit browsing volume. Specifically, the unit browsing volume can be The page views are moved to the right or left by a certain value to form multiple clustering intervals. After obtaining multiple clustering intervals, it is necessary to count the number of unit page views in each clustering interval. The number of unit page views in each clustering interval will be recorded as the filtering condition parameter, and then the filtering condition parameter with the largest value is found, and the corresponding clustering interval is calibrated as the standard interval. The standard interval represents the user's habitual browsing behavior, and then the unit page views in the standard interval are summed and averaged. By summing and averaging the unit page views in the standard interval, the standard unit page views in the historical monitoring period can be obtained, and then the weight distribution function is introduced, and the standard unit page views are input into the weight distribution function, where the expression of the weight distribution function is:
; ;
式中,表示浏览完成度和浏览次数的分配权重,=1,2,,表示与浏览完成度相关的常数参数,,表示与浏览次数相关的常数参数,表示标准单位浏览量,通过权重分配函数的运算,便可以得到浏览完成度和浏览次数的分配权重。In the formula, Indicates the distribution weight of browsing completion and browsing times, =1,2, , Represents constant parameters related to browsing completion. , Represents constant parameters related to the number of views, It represents the standard unit of page views. By calculating the weight distribution function, the distribution weights of page view completion and page view times can be obtained.
在一个较佳的实施方式中,对各个推荐内容进行相关性分析,得到关联推荐内容和非关联推荐内容的步骤,包括:In a preferred embodiment, the step of performing correlation analysis on each recommended content to obtain related recommended content and non-related recommended content includes:
S301、获取所有推荐内容,并进行特征提取,得到各个推荐内容的关键特征;S301, obtaining all recommended contents, and performing feature extraction to obtain key features of each recommended content;
S302、对各个关键特征进行向量转换,得到各个推荐内容的关键特征向量;S302, performing vector conversion on each key feature to obtain a key feature vector of each recommended content;
S303、对推荐内容逐一进行两两匹配,并将两两匹配后的推荐内容的关键特征向量输入至预设的关联测算函数中,且将关联测算函数的输出结果记录为关联评分;S303, matching the recommended contents one by one, and inputting the key feature vectors of the recommended contents after the pairwise matching into a preset association calculation function, and recording the output result of the association calculation function as the association score;
S304、获取关联评价阈值,并将关联评分与关联评价阈值进行比较;S304, obtaining a correlation evaluation threshold, and comparing the correlation score with the correlation evaluation threshold;
当关联评分大于关联评价阈值时,则表明两两匹配后的推荐内容之间存在相关性,并同步记录为关联推荐内容;When the correlation score is greater than the correlation evaluation threshold, it indicates that there is a correlation between the recommended contents after pairwise matching, and they are simultaneously recorded as related recommended contents;
当关联评分小于或等于关联评价阈值时,则表明两两匹配后的推荐内容之间未存在相关性,并同步记录为非关联推荐内容。When the correlation score is less than or equal to the correlation evaluation threshold, it indicates that there is no correlation between the recommended contents after pairwise matching, and they are simultaneously recorded as non-correlated recommended contents.
如上述步骤S301-S304所述,在确定关联推荐内容和非关联推荐内容时,首先需要获取所有的推荐内容,并进行相应的特征提取,识别出各个推荐内容的核心特征,即关键特征,具体可通过文本挖掘、自然语言处理等技术手段实现,然后,将关键特征进行向量转换,以便于进行后续的数学计算,向量转换可以采用诸如TF-IDF、Word2Vec等方法,接下来,对推荐内容逐一进行两两匹配,比较不同推荐内容之间的相似性和差异性,具体是将每一对匹配后的推荐内容的关键特征向量输入到预设的关联测算函数中,关联测算函数将根据特征向量的相似度计算出一个关联评分,其中,关联测算函数的表达式为:As described in the above steps S301-S304, when determining the associated recommended content and the non-associated recommended content, it is first necessary to obtain all the recommended content and perform corresponding feature extraction to identify the core features of each recommended content, that is, the key features, which can be achieved through technical means such as text mining and natural language processing. Then, the key features are converted into vectors to facilitate subsequent mathematical calculations. The vector conversion can use methods such as TF-IDF and Word2Vec. Next, the recommended contents are matched one by one, and the similarities and differences between different recommended contents are compared. Specifically, the key feature vectors of each pair of matched recommended contents are input into the preset association measurement function. The association measurement function will calculate an association score based on the similarity of the feature vectors. The expression of the association measurement function is:
; ;
式中,表示关联评分,,均为常数参数,表示关键特征向量的特征点数量,,表示两两匹配后的推荐内容的关键特征向量,表示两两匹配后的推荐内容的同时出现次数,表示推荐内容的总出现次数,关联评分输出之后,引入预设的关联评价阈值,关联评价阈值是预先设定的标准,用于判断关联评分是否足够高以表明两个推荐内容之间存在相关性,具体来说,当关联评分大于或等于这个关联评价阈值时,表明两两匹配后的推荐内容之间存在一定的相关性,此种情况下,会将这对推荐内容记录为关联推荐内容,相反,如果关联评分小于或等于关联评价阈值,那么就认为这两两匹配后的推荐内容之间不存在显著的相关性,此情况下会将这对推荐内容记录为非关联推荐内容。In the formula, represents the association score, , are constant parameters, represents the number of feature points of the key feature vector, , Represents the key feature vector of the recommended content after pairwise matching, Indicates the number of times the recommended content appears at the same time after pairwise matching. Represents the total number of occurrences of the recommended content. After the correlation score is output, a preset correlation evaluation threshold is introduced. The correlation evaluation threshold is a pre-set standard used to determine whether the correlation score is high enough to indicate that there is a correlation between the two recommended contents. Specifically, when the correlation score is greater than or equal to the correlation evaluation threshold, it indicates that there is a certain correlation between the recommended contents after the two are matched. In this case, the pair of recommended contents will be recorded as related recommended contents. On the contrary, if the correlation score is less than or equal to the correlation evaluation threshold, it is considered that there is no significant correlation between the recommended contents after the two are matched. In this case, the pair of recommended contents will be recorded as unrelated recommended contents.
在一个较佳的实施方式中,依据关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,得到个性化推送列表的步骤,包括:In a preferred embodiment, the step of reordering the initial push list according to the associated recommended content and the non-associated recommended content to obtain a personalized push list includes:
S401、获取初始化推送列表,并将初始化推送列表中的相邻推荐内容分别记录为前置待处理数据和后置待处理数据;S401, obtaining an initialization push list, and recording adjacent recommended contents in the initialization push list as pre-processed data and post-processed data respectively;
S402、采集前置待处理数据的关联推荐内容与前置待处理数据之间的关联评分,并将关联评分转换至与推送评分为同一量纲,且将记录为条件参数;S402, collecting the correlation score between the related recommended content of the previous data to be processed and the previous data to be processed, converting the correlation score to the same dimension as the push score, and recording it as a condition parameter;
S403、将条件参数与前置待处理数据的关联推荐内容的推送评分进行加权求和处理,得到关联推送评分;S403, performing weighted sum processing on the condition parameter and the push score of the associated recommended content of the previous data to be processed to obtain the associated push score;
S404、将关联推送评分与后置待处理数据的推送评分进行比较,且在关联推送评分大于后置待处理数据的推送评分时,将与关联推送评分对应的关联推荐内容调整至后置待处理数据的前方,反之,则继续保持原有的顺序;S404, comparing the associated push score with the push score of the subsequent data to be processed, and when the associated push score is greater than the push score of the subsequent data to be processed, adjusting the associated recommended content corresponding to the associated push score to the front of the subsequent data to be processed, otherwise, continue to maintain the original order;
S405、重复对初始化推送列表中的所有相邻推荐内容进行处理,直至完成整个初始化推送列表的重新排序,并输出为个性化推送列表。S405: Repeat processing of all adjacent recommended contents in the initialization push list until the entire initialization push list is reordered and output as a personalized push list.
如上述步骤S401-S405所述,在关联推荐内容和非关联推荐内容输出之后,首先将初始化推送列表中的相邻推荐内容分别标记为前置待处理数据和后置待处理数据,然后采集前置待处理数据的关联推荐内容,以及前置待处理数据的关联推荐内容与前置待处理数据之间的关联评分,同时为了确保关联评分与推送评分的一致性,需要将关联评分转换到与推送评分相同的量纲,并记录为条件参数进行后续的处理,然后将条件参数与前置待处理数据的关联推荐内容的推送评分进行加权求和处理,从而能够得到一个综合的关联推送评分,再然后将关联推送评分与后置待处理数据的推送评分进行比较,如果关联推送评分高于后置待处理数据的推送评分,那么就将与关联推送评分对应的关联推荐内容调整至后置待处理数据的前方,反之,如果关联推送评分较低,将保持原有的顺序不变,以此类推,重复上述步骤,对初始化推送列表中的所有相邻推荐内容进行处理,直到整个列表完成重新排序,完成排序后,输出为最终的个性化推送列表,从而为用户提供更加精准和个性化的推荐。在于用户在查看推荐内容时,能够具有一定的连续性,例如推荐的内容为a类信息时,可以紧接着推荐与a类信息有关的信息,比如在刷视频时,根据用户偏好推荐了某视频,然后在当前浏览到该视频时,会优先推荐与该视频具有关联度的后续,无需额外去搜索,由于被推荐的内容本身即为用户偏好推送的,因此该方式使得推荐内容的连续性更好。As described in the above steps S401-S405, after the associated recommended content and the non-associated recommended content are output, the adjacent recommended content in the initialization push list is first marked as the preceding data to be processed and the following data to be processed, respectively, and then the associated recommended content of the preceding data to be processed and the associated score between the associated recommended content of the preceding data to be processed and the preceding data to be processed are collected. At the same time, in order to ensure the consistency of the associated score and the push score, the associated score needs to be converted to the same dimension as the push score and recorded as a conditional parameter for subsequent processing, and then the conditional parameter and the push score of the associated recommended content of the preceding data to be processed are weighted and summed. Processing, so as to obtain a comprehensive correlation push score, and then compare the correlation push score with the push score of the post-processed data. If the correlation push score is higher than the push score of the post-processed data, then the correlation recommended content corresponding to the correlation push score is adjusted to the front of the post-processed data. On the contrary, if the correlation push score is lower, the original order will be kept unchanged. Repeat the above steps to process all adjacent recommended contents in the initialization push list until the entire list is re-sorted. After sorting, the final personalized push list is output, thereby providing users with more accurate and personalized recommendations. When users view recommended content, they can have a certain degree of continuity. For example, when the recommended content is information of category a, information related to information of category a can be recommended immediately. For example, when browsing videos, a certain video is recommended according to user preferences. Then, when browsing the video, the subsequent content with correlation with the video will be recommended first without additional search. Since the recommended content itself is pushed by user preferences, this method makes the continuity of recommended content better.
其中,依据该个性化推送列表推送时,可选择跳过与当前浏览内容相关的关联推荐内容,基于关联推送评分重新确定后的个性化推送列表,除却保证推荐内容的连续性,该方法还应当具备报跳过功能,通过该跳过功能,在依据个性化推送列表推送的当前浏览内容不符合或存在主观意识的观看单一时,亦可通过跳过功能选择直接跳过与当前浏览内容相关联的推荐内容。该方式使得个性化推送列表在符合用户需求的前提下。Among them, when pushing according to the personalized push list, you can choose to skip the related recommended content related to the current browsing content. The personalized push list re-determined based on the related push score, in addition to ensuring the continuity of the recommended content, this method should also have a report skip function. Through this skip function, when the current browsing content pushed according to the personalized push list does not meet or has subjective viewing singleness, you can also choose to directly skip the recommended content associated with the current browsing content through the skip function. This method makes the personalized push list meet the user's needs.
在一个较佳的实施方式中,依据个性化推送下的用户反馈信息对个性化推送列表进行优化,得到更新推送列表的步骤,包括:In a preferred embodiment, the personalized push list is optimized according to the user feedback information under the personalized push, and the step of obtaining the updated push list includes:
S501、获取用户对个性化推送下的各个推荐内容的反馈满意度;S501, obtaining user feedback satisfaction for each recommended content under personalized push;
S502、将反馈满意度转换为各个推荐内容对应推送评分的优化权重;S502, converting the feedback satisfaction into an optimization weight of a push score corresponding to each recommended content;
S503、获取优化函数,并将各个推荐内容对应的推送评分以及对应的优化权重输入至优化函数中进行运算,得到更新推送评分;S503, obtaining an optimization function, and inputting the push score and the corresponding optimization weight corresponding to each recommended content into the optimization function for calculation to obtain an updated push score;
S504、依据更新推送评分对个性化推送列表进行重新排序,得到更新推送列表,且应用更新推送列表执行各个推荐内容的推送处理。S504: re-sort the personalized push list according to the update push score to obtain an update push list, and apply the update push list to execute push processing of each recommended content.
如上述步骤S501-S504所述,为进一步提升用户体验,需要根据用户在个性化推送中的反馈信息来优化推送列表,首先需要收集用户对个性化推送中各个推荐内容的反馈满意度数据,本实施方式通过用户评分来实现,以此了解用户对各个推荐内容的满意度,其次,将收集到的反馈满意度数据转换为各个推荐内容对应推送评分的优化权重,目的是将用户的主观满意度转化为可以量化的优化指标,例如,可以设定一个评分标准,将用户的满意度分为几个等级,并为每个等级分配不同的权重值,然后引入优化函数,将各个推荐内容对应的推送评分以及对应的优化权重输入至优化函数中进行运算,优化函数的表达式为:更新推送评分=优化权重×推送评分,最后,依据更新推送评分对个性化推送列表进行重新排序,将推荐内容按照用户满意度从高到低进行排列,以此便可得到更新推送列表,且将更新推送列表应用到实际的个性化推送处理中,确保用户接收到的推荐内容更加符合他们的兴趣和需求。As described in the above steps S501-S504, in order to further improve the user experience, it is necessary to optimize the push list according to the user's feedback information in the personalized push. First, it is necessary to collect the user's feedback satisfaction data on each recommended content in the personalized push. This embodiment is achieved through user rating, so as to understand the user's satisfaction with each recommended content. Secondly, the collected feedback satisfaction data is converted into the optimization weight of the push score corresponding to each recommended content. The purpose is to convert the user's subjective satisfaction into a quantifiable optimization indicator. For example, a scoring standard can be set to divide the user's satisfaction into several levels, and different weight values are assigned to each level. Then, an optimization function is introduced, and the push score corresponding to each recommended content and the corresponding optimization weight are input into the optimization function for calculation. The expression of the optimization function is: update push score = optimization weight × push score. Finally, the personalized push list is re-sorted according to the update push score, and the recommended content is arranged from high to low according to the user satisfaction. In this way, an update push list can be obtained, and the update push list is applied to the actual personalized push processing to ensure that the recommended content received by the user is more in line with their interests and needs.
请参阅图2,一种AIGC驱动的内容自动个性化推送系统,使用上述的AIGC驱动的内容自动个性化推送方法,包括:Please refer to FIG. 2 , an AIGC-driven automatic personalized content push system, using the above-mentioned AIGC-driven automatic personalized content push method, including:
数据分类模块,数据分类模块用于基于获取的用户行为数据,进行分类处理,得到多个用户偏好标签;A data classification module is used to perform classification processing based on the acquired user behavior data to obtain multiple user preference labels;
初始化模块,初始化模块用于采集各个用户偏好标签下用户的浏览信息,并依据浏览信息对用户偏好标签进行排序处理,得到初始化推送列表;Initialization module, which is used to collect browsing information of users under each user preference tag, and sort the user preference tags according to the browsing information to obtain an initialization push list;
相关性分析模块,相关性分析模块用于依据初始化推送列表匹配推荐内容,并对各个推荐内容进行相关性分析,得到关联推荐内容和非关联推荐内容;A correlation analysis module is used to match recommended content according to the initial push list, and to perform correlation analysis on each recommended content to obtain related recommended content and non-related recommended content;
个性化推送模块,个性化推送模块用于依据关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,得到个性化推送列表;A personalized push module, which is used to reorder the initial push list according to the associated recommended content and the non-associated recommended content to obtain a personalized push list;
列表更新模块,列表更新模块用于基于个性化推送列表向用户执行个性化推送,且依据个性化推送下的用户反馈信息对个性化推送列表进行优化,得到更新推送列表,并依据更新推送列表继续执行个性化推送。The list update module is used to perform personalized push to users based on the personalized push list, and optimize the personalized push list according to user feedback information under the personalized push, obtain an updated push list, and continue to perform personalized push according to the updated push list.
上述中,该系统包括数据分类模块、初始化模块、相关性分析模块、个性化推送模块和列表更新模块,数据分类模块负责根据收集到的用户行为数据进行细致的分类处理,通过这一过程,系统能够识别并提取出多个用户偏好标签,这些标签反映了用户对不同领域的兴趣和偏好,初始化模块的作用是采集各个用户偏好标签下用户的浏览信息,通过对这些浏览信息的深入分析和处理,系统能够对用户偏好标签进行有效的排序,从而生成一个初始化推送列表,为后续的个性化推送奠定了基础,相关性分析模块依据初始化推送列表匹配推荐内容,并对各个推荐内容进行详细的相关性分析,通过这一过程,系统能够区分关联推荐内容而后非关联推荐内容,个性化推送模块利用关联推荐内容和非关联推荐内容对初始化推送列表进行重新排序,从而生成一个更加精准的个性化推送列表,个性化推送列表能够更好地满足用户的个性化需求,提高用户满意度,最后,列表更新模块负责将个性化推送列表应用到实际的用户推送过程中,它会根据用户对个性化推送内容的反馈信息进行分析,对个性化推送列表进行优化,从而生成一个更新的推送列表,系统会持续使用这个更新后的推送列表进行个性化推送,形成一个良性循环,不断提升推送内容的质量和用户满意度,为用户带来更加精准和贴心的服务体验In the above, the system includes a data classification module, an initialization module, a correlation analysis module, a personalized push module and a list update module. The data classification module is responsible for performing detailed classification processing based on the collected user behavior data. Through this process, the system can identify and extract multiple user preference tags, which reflect the user's interests and preferences in different fields. The function of the initialization module is to collect the browsing information of users under each user preference tag. Through in-depth analysis and processing of these browsing information, the system can effectively sort the user preference tags, thereby generating an initialization push list, which lays the foundation for subsequent personalized push. The correlation analysis module matches the recommended content based on the initialization push list, and performs detailed correlation analysis on each recommended content. Through this process The system can distinguish between related recommended content and non-related recommended content. The personalized push module uses related recommended content and non-related recommended content to reorder the initialized push list, thereby generating a more accurate personalized push list. The personalized push list can better meet the personalized needs of users and improve user satisfaction. Finally, the list update module is responsible for applying the personalized push list to the actual user push process. It will analyze the user's feedback on the personalized push content and optimize the personalized push list to generate an updated push list. The system will continue to use this updated push list for personalized push, forming a virtuous circle, continuously improving the quality of push content and user satisfaction, and bringing users a more accurate and considerate service experience.
请参阅图3,一种电子设备,电子设备包括:Please refer to FIG3 , an electronic device, the electronic device comprising:
至少一个处理器;at least one processor;
以及与至少一个处理器通信连接的存储器;and a memory communicatively coupled to the at least one processor;
其中,存储器存储有可被至少一个处理器执行的计算机程序,计算机程序被至少一个处理器执行,以使至少一个处理器能够上述的AIGC驱动的内容自动个性化推送方法。The memory stores a computer program that can be executed by at least one processor, and the computer program is executed by at least one processor so that the at least one processor can implement the above-mentioned AIGC-driven automatic personalized content push method.
上电子设备的处理器可以为多种类型,例如中央处理器(CPU)、图形处理器(GPU)或专用的AI处理器,这些处理器能够高效地执行复杂的算法和数据处理任务,以实现个性化推送系统的功能,电子设备的存储器可以包括随机存取存储器(RAM)、只读存储器(ROM)以及非易失性存储器,如固态硬盘(SSD)或硬盘驱动器(HDD),存储器中存储的计算机程序包括数据分类模块、初始化模块、相关性分析模块、个性化推送模块和列表更新模块的代码,这些代码在处理器的执行下能够实现整个个性化推送流程,此外,电子设备还可以包括输入输出设备和运算器,如触摸屏、键盘、鼠标和显示器,以便用户能够与设备进行交互,运算器则负责执行更复杂的数学运算和数据处理任务,确保个性化推送系统的高效运行,设备还可以具备网络连接功能,如无线局域网(WLAN)或蜂窝网络模块,以实现与服务器或其他设备的数据交换,在实际应用中,电子设备可以是智能手机、平板电脑、个人电脑或服务器,服务器可以部署在云端,为多个用户同时提供个性化推送服务,通过分布式计算和大数据处理技术,服务器能够处理海量用户数据,实现高效、实时的个性化推送。The processor of the electronic device can be of various types, such as a central processing unit (CPU), a graphics processing unit (GPU) or a dedicated AI processor. These processors can efficiently execute complex algorithms and data processing tasks to realize the functions of the personalized push system. The memory of the electronic device may include a random access memory (RAM), a read-only memory (ROM) and a non-volatile memory, such as a solid-state drive (SSD) or a hard disk drive (HDD). The computer program stored in the memory includes codes of a data classification module, an initialization module, a correlation analysis module, a personalized push module and a list update module. These codes can realize the entire personalized push process under the execution of the processor. In addition, the electronic device The device may also include input and output devices and an operator, such as a touch screen, keyboard, mouse and display, so that users can interact with the device. The operator is responsible for performing more complex mathematical operations and data processing tasks to ensure the efficient operation of the personalized push system. The device may also have a network connection function, such as a wireless local area network (WLAN) or a cellular network module to achieve data exchange with a server or other devices. In actual applications, the electronic device may be a smart phone, tablet computer, personal computer or server. The server can be deployed in the cloud to provide personalized push services to multiple users at the same time. Through distributed computing and big data processing technology, the server can process massive user data and achieve efficient and real-time personalized push.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其它变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、装置、物品或者方法不仅包括那些要素,而且还包括没有明确列出的其它要素,或者是还包括为这种过程、装置、物品或者方法所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、装置、物品或者方法中还存在另外的相同要素。It should be noted that, in this article, the terms "include", "comprises" or any other variations thereof are intended to cover non-exclusive inclusion, so that a process, device, article or method including a series of elements includes not only those elements, but also includes other elements not explicitly listed, or also includes elements inherent to such process, device, article or method. In the absence of further restrictions, an element defined by the sentence "includes a ..." does not exclude the presence of other identical elements in the process, device, article or method including the element.
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以作出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。本发明中未具体描述和解释说明的结构、装置以及操作方法,如无特别说明和限定,均按照本领域的常规手段进行实施。The above is only a preferred embodiment of the present invention. It should be noted that, for those skilled in the art, several improvements and modifications can be made without departing from the principles of the present invention, and these improvements and modifications should also be considered as the protection scope of the present invention. The structures, devices and operating methods not specifically described and explained in the present invention shall be implemented according to the conventional means in the art unless otherwise specified and limited.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411277910.4A CN118827774B (en) | 2024-09-12 | 2024-09-12 | AIGC-driven content automatic personalized pushing system and method |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411277910.4A CN118827774B (en) | 2024-09-12 | 2024-09-12 | AIGC-driven content automatic personalized pushing system and method |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN118827774A true CN118827774A (en) | 2024-10-22 |
| CN118827774B CN118827774B (en) | 2024-11-29 |
Family
ID=93068683
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202411277910.4A Active CN118827774B (en) | 2024-09-12 | 2024-09-12 | AIGC-driven content automatic personalized pushing system and method |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118827774B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119807538A (en) * | 2025-03-11 | 2025-04-11 | 海看网络科技(山东)股份有限公司 | AIGC-based personalized content automatic generation and pushing method and AIGC-based personalized content automatic generation and pushing system |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080201287A1 (en) * | 2007-02-21 | 2008-08-21 | Hitachi, Ltd. | Dissimilar item recommendation method, device, and program thereof |
| US20160212494A1 (en) * | 2013-09-06 | 2016-07-21 | Beijing Qihoo Technology Company Limited | Video recommendation method and device |
| CN112214685A (en) * | 2020-09-27 | 2021-01-12 | 电子科技大学 | A Personalized Recommendation Method Based on Knowledge Graph |
| CN112989209A (en) * | 2021-05-10 | 2021-06-18 | 腾讯科技(深圳)有限公司 | Content recommendation method, device and storage medium |
| US20210248198A1 (en) * | 2018-10-31 | 2021-08-12 | Shenzhen Heytap Technology Corp., Ltd. | Content Recommendation Method and Apparatus, Mobile Terminal, and Server |
| CN113643070A (en) * | 2021-08-20 | 2021-11-12 | 林秀珍 | Intelligent information pushing method and system based on big data |
| CN115455280A (en) * | 2022-08-19 | 2022-12-09 | 青岛聚看云科技有限公司 | Recommendation list determining method and server |
| CN117440182A (en) * | 2023-10-25 | 2024-01-23 | 北京华星酷娱文化传媒有限公司 | Intelligent recommendation method and system based on video content analysis and user labels |
| CN117972257A (en) * | 2024-01-30 | 2024-05-03 | 江苏苏宝网络科技有限公司 | A visual website construction promotion method and promotion system |
-
2024
- 2024-09-12 CN CN202411277910.4A patent/CN118827774B/en active Active
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080201287A1 (en) * | 2007-02-21 | 2008-08-21 | Hitachi, Ltd. | Dissimilar item recommendation method, device, and program thereof |
| US20160212494A1 (en) * | 2013-09-06 | 2016-07-21 | Beijing Qihoo Technology Company Limited | Video recommendation method and device |
| US20210248198A1 (en) * | 2018-10-31 | 2021-08-12 | Shenzhen Heytap Technology Corp., Ltd. | Content Recommendation Method and Apparatus, Mobile Terminal, and Server |
| CN112214685A (en) * | 2020-09-27 | 2021-01-12 | 电子科技大学 | A Personalized Recommendation Method Based on Knowledge Graph |
| CN112989209A (en) * | 2021-05-10 | 2021-06-18 | 腾讯科技(深圳)有限公司 | Content recommendation method, device and storage medium |
| CN113643070A (en) * | 2021-08-20 | 2021-11-12 | 林秀珍 | Intelligent information pushing method and system based on big data |
| CN115455280A (en) * | 2022-08-19 | 2022-12-09 | 青岛聚看云科技有限公司 | Recommendation list determining method and server |
| CN117440182A (en) * | 2023-10-25 | 2024-01-23 | 北京华星酷娱文化传媒有限公司 | Intelligent recommendation method and system based on video content analysis and user labels |
| CN117972257A (en) * | 2024-01-30 | 2024-05-03 | 江苏苏宝网络科技有限公司 | A visual website construction promotion method and promotion system |
Non-Patent Citations (2)
| Title |
|---|
| KEYAN CAO ET AL.: "Points-of-Interest Recommendation Algorithm Based on LBSN in Edge Computing Environment", IEEE ACCESS, 10 March 2020 (2020-03-10) * |
| 曾子明;张振;: "社会化标注系统中基于社区标签云的个性化推荐研究", 情报杂志, no. 10, 31 October 2011 (2011-10-31) * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119807538A (en) * | 2025-03-11 | 2025-04-11 | 海看网络科技(山东)股份有限公司 | AIGC-based personalized content automatic generation and pushing method and AIGC-based personalized content automatic generation and pushing system |
Also Published As
| Publication number | Publication date |
|---|---|
| CN118827774B (en) | 2024-11-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110442790B (en) | Method, device, server and storage medium for recommending multimedia data | |
| CN111597446B (en) | Content pushing method and device based on artificial intelligence, server and storage medium | |
| CN110163647B (en) | Data processing method and device | |
| CN109033408B (en) | Information pushing method and device, computer readable storage medium and electronic equipment | |
| CN113392329B (en) | Content recommendation method, device, electronic device and storage medium | |
| CN111859149A (en) | Information recommendation method, device, electronic device and storage medium | |
| CN111967914B (en) | Recommendation method, device, computer equipment and storage medium based on user portrait | |
| CN113065062A (en) | A news recommendation method and system based on user reading time behavior | |
| CN111242310A (en) | Feature validity evaluation method, device, electronic device and storage medium | |
| WO2018157625A1 (en) | Reinforcement learning-based method for learning to rank and server | |
| US9245035B2 (en) | Information processing system, information processing method, program, and non-transitory information storage medium | |
| CN108846097A (en) | The interest tags representation method of user, article recommended method and device, equipment | |
| CN111639485B (en) | Course recommendation method and related equipment based on text similarity | |
| CN111723260A (en) | Method and device for acquiring recommended content, electronic equipment and readable storage medium | |
| CN118827774A (en) | An AIGC-driven automatic personalized content push system and method | |
| JP2022035314A (en) | Information processing unit and program | |
| CN117194779A (en) | Marketing system optimization method, device and equipment based on artificial intelligence | |
| CN115017200A (en) | Search result sorting method and device, electronic equipment and storage medium | |
| CN108595411B (en) | Method for acquiring multiple text abstracts in same subject text set | |
| CN103262079B (en) | Search device and search method | |
| CN119377484A (en) | A method and system for recommending remote education resources | |
| CN113204703B (en) | Material recommendation method, device, equipment and storage medium | |
| CN118379115B (en) | Dynamic commodity display method and system based on multiple scenes | |
| CN112632275B (en) | Crowd clustering data processing method, device and equipment based on personal text information | |
| CN111651643B (en) | Candidate content processing method and related equipment |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |