CN118820705A

CN118820705A - Data processing method, device, storage medium, equipment and program product

Info

Publication number: CN118820705A
Application number: CN202410757287.6A
Authority: CN
Inventors: 章宇威; 杜思良
Original assignee: Douyin Vision Co Ltd
Current assignee: Douyin Vision Co Ltd
Priority date: 2024-06-12
Filing date: 2024-06-12
Publication date: 2024-10-22

Abstract

The application discloses a data processing method, a device, a storage medium, equipment and a program product, wherein the method comprises the following steps: acquiring initial input characteristics of data to be processed, wherein the initial input characteristics comprise object characteristics of a target object, resource characteristics of a plurality of resources to be recommended and scene characteristics, and the scene characteristics comprise scene identifications; performing feature compression processing on the initial input features to obtain low-dimensional input features; selecting a target sub-scene network matched with the scene identification from a plurality of sub-scene networks according to the scene identification in the scene characteristics, wherein each sub-scene network has independent scene parameters; performing feature processing on the low-dimensional input features based on the target scene-dividing network to obtain scene embedding features of the data to be processed; and recommending resources to the target object based on the initial input characteristics and the scene embedding characteristics. The method and the device can improve the calculation efficiency and optimize the scene adaptability, and improve the accuracy and individuation level of recommendation under different scenes.

Description

Data processing method, device, storage medium, equipment and program product

技术领域Technical Field

本申请涉及计算机技术领域，具体涉及一种数据处理方法、装置、存储介质、设备及程序产品。The present application relates to the field of computer technology, and in particular to a data processing method, apparatus, storage medium, device and program product.

背景技术Background Art

在当前的推荐系统技术中，处理不同场景下的推荐需求是一大挑战。由于不同场景具有不同的样式和内容，用户的使用习惯和偏好也各不相同，因此需要针对不同场景进行特定的建模。传统的分场景建模方法包括为每个场景训练单独的模型，训练一个统一的模型来处理所有场景数据，以及在一个通用框架下通过辅助结构调整场景参数。单独训练多个模型成本高且维护困难，而统一模型则容易受大流量场景数据影响，导致小流量场景下的推荐效果不佳。而辅助结构调整方法仍无法完全消除不同场景数据之间的互相影响。In the current recommendation system technology, handling recommendation needs in different scenarios is a major challenge. Since different scenarios have different styles and contents, and users' usage habits and preferences are also different, specific modeling is required for different scenarios. Traditional scenario-based modeling methods include training a separate model for each scenario, training a unified model to process all scenario data, and adjusting scenario parameters through auxiliary structures under a common framework. Training multiple models separately is costly and difficult to maintain, while the unified model is easily affected by high-traffic scenario data, resulting in poor recommendation results in low-traffic scenarios. The auxiliary structure adjustment method still cannot completely eliminate the mutual influence between different scenario data.

发明内容Summary of the invention

本申请实施例提供一种数据处理方法、装置、存储介质、设备及程序产品，能够提升计算效率和优化场景适应性，并提升了不同场景下推荐的准确性和个性化水平。The embodiments of the present application provide a data processing method, apparatus, storage medium, device, and program product, which can improve computing efficiency and optimize scene adaptability, and improve the accuracy and personalization level of recommendations in different scenarios.

一方面，本申请实施例提供一种数据处理方法，所述方法包括：On the one hand, an embodiment of the present application provides a data processing method, the method comprising:

获取待处理数据的初始输入特征，其中，所述初始输入特征包括目标对象的对象特征、多个待推荐资源的资源特征和场景特征，所述场景特征包括场景标识；Acquire initial input features of the data to be processed, wherein the initial input features include object features of a target object, resource features of a plurality of resources to be recommended, and scene features, wherein the scene features include a scene identifier;

对所述初始输入特征进行特征压缩处理，得到低维输入特征；Performing feature compression processing on the initial input features to obtain low-dimensional input features;

根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，各个所述分场景网络具有独立的场景参数；According to the scene identifier in the scene feature, a target sub-scene network matching the scene identifier is selected from a plurality of sub-scene networks, each of the sub-scene networks having independent scene parameters;

基于所述目标分场景网络对所述低维输入特征进行特征处理，得到所述待处理数据的场景嵌入特征；Performing feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed;

基于所述初始输入特征和所述场景嵌入特征，向所述目标对象进行资源推荐。Based on the initial input features and the scene embedding features, resources are recommended to the target object.

另一方面，本申请实施例提供一种数据处理装置，所述装置包括：On the other hand, an embodiment of the present application provides a data processing device, the device comprising:

获取单元，用于获取待处理数据的初始输入特征，其中，所述初始输入特征包括目标对象的对象特征、多个待推荐资源的资源特征和场景特征，所述场景特征包括场景标识；An acquisition unit, configured to acquire initial input features of the data to be processed, wherein the initial input features include object features of a target object, resource features of a plurality of resources to be recommended, and scene features, wherein the scene features include scene identifiers;

压缩单元，用于对所述初始输入特征进行特征压缩处理，得到低维输入特征；A compression unit, used for performing feature compression processing on the initial input features to obtain low-dimensional input features;

选择单元，用于根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，各个所述分场景网络具有独立的场景参数；A selection unit, configured to select, according to the scene identifier in the scene feature, a target sub-scene network matching the scene identifier from a plurality of sub-scene networks, each of the sub-scene networks having an independent scene parameter;

处理单元，用于基于所述目标分场景网络对所述低维输入特征进行特征处理，得到所述待处理数据的场景嵌入特征；A processing unit, configured to perform feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed;

推荐单元，用于基于所述初始输入特征和所述场景嵌入特征，向所述目标对象进行资源推荐。A recommendation unit is used to recommend resources to the target object based on the initial input features and the scene embedding features.

另一方面，本申请实施例一种计算机可读存储介质，所述计算机可读存储介质存储有计算机程序，所述计算机程序适于处理器进行加载，以执行如上任一实施例所述的数据处理方法。On the other hand, an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program is suitable for being loaded by a processor to execute the data processing method described in any of the above embodiments.

另一方面，本申请实施例一种计算机设备，所述计算机设备包括处理器和存储器，所述存储器中存储有计算机程序，所述处理器通过调用所述存储器中存储的所述计算机程序，用于执行如上任一实施例所述的数据处理方法。On the other hand, an embodiment of the present application provides a computer device, which includes a processor and a memory, wherein a computer program is stored in the memory, and the processor is used to execute the data processing method described in any of the above embodiments by calling the computer program stored in the memory.

另一方面，本申请实施例一种计算机程序产品，包括计算机指令，所述计算机指令被处理器执行时实现如上任一实施例所述的数据处理方法。On the other hand, an embodiment of the present application provides a computer program product, comprising computer instructions, which, when executed by a processor, implement the data processing method described in any of the above embodiments.

本申请实施例通过获取待处理数据的初始输入特征，其中，初始输入特征包括目标对象的对象特征、多个待推荐资源的资源特征和场景特征，场景特征包括场景标识；对初始输入特征进行特征压缩处理，得到低维输入特征；根据场景特征中的场景标识，从多个分场景网络中选出与场景标识相匹配的目标分场景网络，各个分场景网络具有独立的场景参数；基于目标分场景网络对低维输入特征进行特征处理，得到待处理数据的场景嵌入特征；基于初始输入特征和场景嵌入特征，向目标对象进行资源推荐。本申请实施例通过特征压缩处理将初始输入特征转换为低维输入特征，减少计算复杂度，从而提高整体计算效率。根据场景特征中的场景标识，从多个分场景网络中选择匹配的目标分场景网络，每个网络具有独立的场景参数，使得推荐系统能够针对不同场景进行优化，提高场景适应性，场景标识驱动的网络选择策略，确保了模型能够动态适应多样化的场景需求，提高了推荐系统的灵活性和针对性。结合初始输入特征和场景嵌入特征进行资源推荐，确保推荐结果不仅考虑了目标对象和资源的属性，还考虑了场景上下文，从而提高推荐的准确性和个性化水平。The embodiment of the present application obtains the initial input features of the data to be processed, wherein the initial input features include the object features of the target object, the resource features of multiple resources to be recommended, and the scene features, wherein the scene features include scene identifiers; the initial input features are subjected to feature compression processing to obtain low-dimensional input features; according to the scene identifier in the scene features, a target sub-scene network matching the scene identifier is selected from multiple sub-scene networks, and each sub-scene network has independent scene parameters; based on the target sub-scene network, the low-dimensional input features are subjected to feature processing to obtain the scene embedding features of the data to be processed; based on the initial input features and the scene embedding features, resources are recommended to the target object. The embodiment of the present application converts the initial input features into low-dimensional input features through feature compression processing, thereby reducing the computational complexity and improving the overall computational efficiency. According to the scene identifier in the scene features, a matching target sub-scene network is selected from multiple sub-scene networks, and each network has independent scene parameters, so that the recommendation system can be optimized for different scenarios and improve the scene adaptability. The network selection strategy driven by the scene identifier ensures that the model can dynamically adapt to the diverse scene requirements and improves the flexibility and pertinence of the recommendation system. Combining the initial input features and scene embedding features for resource recommendation ensures that the recommendation results not only consider the attributes of the target objects and resources, but also the scene context, thereby improving the accuracy and personalization level of the recommendation.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本申请实施例提供的数据处理方法的流程示意图。FIG1 is a flow chart of a data processing method provided in an embodiment of the present application.

图2为本申请实施例提供的数据处理方法的应用场景示意图。FIG. 2 is a schematic diagram of an application scenario of the data processing method provided in an embodiment of the present application.

图3为本申请实施例提供的数据处理装置的结构示意图。FIG3 is a schematic diagram of the structure of a data processing device provided in an embodiment of the present application.

图4为本申请实施例提供的计算机设备的结构示意图。FIG4 is a schematic diagram of the structure of a computer device provided in an embodiment of the present application.

具体实施方式DETAILED DESCRIPTION

下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本申请保护的范围。The following will be combined with the drawings in the embodiments of the present application to clearly and completely describe the technical solutions in the embodiments of the present application. Obviously, the described embodiments are only part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those skilled in the art without creative work are within the scope of protection of this application.

本申请实施例提供一种数据处理方法、装置、存储介质、设备及程序产品。示例性地，本申请实施例的数据处理方法可以由计算机设备执行，其中，该计算机设备可以为终端或者服务器等设备。该终端可以为智能手机、平板电脑、笔记本电脑、台式计算机、智能电视、智能音箱、穿戴式智能设备、个人计算机(Personal Computer，PC)、智能车载终端等设备，终端还可以包括客户端，该客户端可以是视频客户端、购物应用程序客户端、阅读应用程序客户端、浏览器客户端或即时通信客户端等。服务器可以是独立的物理服务器，也可以是多个物理服务器构成的服务器集群或者分布式系统，还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、内容分发网络(Content Delivery Network，CDN)、以及大数据和人工智能平台等基础云计算服务的云服务器。The embodiments of the present application provide a data processing method, apparatus, storage medium, device and program product. Exemplarily, the data processing method of the embodiments of the present application can be executed by a computer device, wherein the computer device can be a terminal or a server. The terminal can be a smart phone, a tablet computer, a laptop computer, a desktop computer, a smart TV, a smart speaker, a wearable smart device, a personal computer (PC), a smart car terminal and other devices, and the terminal can also include a client, which can be a video client, a shopping application client, a reading application client, a browser client or an instant messaging client. The server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or a cloud server that provides cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, content delivery networks (CDN), and basic cloud computing services such as big data and artificial intelligence platforms.

本申请实施例可应用于人工智能、机器学习、数据处理、资源推荐等应用场景。The embodiments of the present application can be applied to application scenarios such as artificial intelligence, machine learning, data processing, and resource recommendation.

本申请实施例提供的方案涉及人工智能的数据处理等技术，具体通过如下实施例进行说明。以下分别进行详细说明。需说明的是，以下实施例的描述顺序不作为对实施例优先顺序的限定。The solutions provided in the embodiments of the present application involve technologies such as artificial intelligence data processing, which are specifically described by the following embodiments. The following are described in detail. It should be noted that the order of description of the following embodiments is not intended to limit the priority order of the embodiments.

本申请实施例提供一种数据处理方法，该方法可以由终端或服务器执行，也可以由终端和服务器共同执行；本申请实施例以数据处理方法由服务器执行为例来进行说明。An embodiment of the present application provides a data processing method, which can be executed by a terminal or a server, or by a terminal and a server together. The embodiment of the present application is described by taking the data processing method executed by a server as an example.

请参阅图1至图2，图1为本申请实施例提供的数据处理方法的流程示意图，图2为本申请实施例提供的数据处理方法的应用场景示意图。该方法可以包括以下步骤110至步骤150：Please refer to Figures 1 and 2, Figure 1 is a flow chart of the data processing method provided in the embodiment of the present application, and Figure 2 is a schematic diagram of the application scenario of the data processing method provided in the embodiment of the present application. The method may include the following steps 110 to 150:

步骤110，获取待处理数据的初始输入特征，其中，所述初始输入特征包括目标对象的对象特征、多个待推荐资源的资源特征和场景特征，所述场景特征包括场景标识。Step 110, obtaining initial input features of the data to be processed, wherein the initial input features include object features of the target object, resource features of a plurality of resources to be recommended, and scene features, wherein the scene features include scene identifiers.

在一些实施例中，所述获取待处理数据的初始输入特征，包括：In some embodiments, the obtaining of initial input features of the data to be processed includes:

获取待处理数据，所述待处理数据包括目标对象的对象关联信息、多个待推荐资源的资源关联信息、以及场景信息；Acquire data to be processed, wherein the data to be processed includes object association information of a target object, resource association information of a plurality of resources to be recommended, and scene information;

对所述对象关联信息、所述资源关联信息与所述场景信息进行特征提取处理，得到所述目标对象的所述对象特征、各个所述待推荐资源的资源特征和各个所述待推荐资源对应的场景特征。Feature extraction processing is performed on the object association information, the resource association information and the scene information to obtain the object feature of the target object, the resource feature of each of the resources to be recommended and the scene feature corresponding to each of the resources to be recommended.

在这一步，首先需要从各种来源获取待处理数据。这些数据包括目标对象的对象关联信息和多个待推荐资源的资源关联信息。这些关联信息可以通过各种方式获得，例如从数据库查询、用户输入或其他外部系统接收等。这些数据通常包括目标对象的对象关联信息，多个待推荐资源的资源关联信息，以及场景信息等。In this step, we first need to obtain the data to be processed from various sources. This data includes the object association information of the target object and the resource association information of multiple resources to be recommended. This association information can be obtained in various ways, such as from database query, user input or other external systems. This data usually includes the object association information of the target object, the resource association information of multiple resources to be recommended, and scene information.

例如，目标对象的对象关联信息，涉及到用户或目标对象的详细信息，包括但不限于：用户标识ID(唯一标识用户的字符串或数字)、用户画像信息(如性别、年龄、教育水平)、地理位置(城市、国家、时区)、用户历史行为数据(如浏览历史、点击率、播放、购买记录、评分记录、收藏记录等)、用户偏好(根据用户的历史行为数据分析得出的兴趣爱好)。For example, the object association information of the target object involves detailed information about the user or the target object, including but not limited to: user identification ID (a string or number that uniquely identifies the user), user portrait information (such as gender, age, education level), geographic location (city, country, time zone), user historical behavior data (such as browsing history, click-through rate, playback, purchase history, rating history, collection history, etc.), and user preferences (interests and hobbies derived from analysis of the user's historical behavior data).

待推荐资源的资源关联信息，这包括所有可能被推荐给目标对象的资源的信息，包括但不限于：资源ID(唯一标识资源的字符串或数字)、资源类型(如视频、游戏、产品等)、资源描述(如标题、简介、分类、标签等)、资源属性(如价格、尺寸、颜色、品牌、评分、评论数量等)、资源使用情况(如销售排名、流行度指数等)。Resource association information of the resource to be recommended, which includes information about all resources that may be recommended to the target object, including but not limited to: resource ID (a string or number that uniquely identifies a resource), resource type (such as video, game, product, etc.), resource description (such as title, introduction, category, tag, etc.), resource attributes (such as price, size, color, brand, rating, number of comments, etc.), resource usage (such as sales ranking, popularity index, etc.).

场景信息，包括但不限于：设备类型(如手机、平板、电脑等)、推荐途径(如社交媒体、电子邮件、应用内推荐等)、展示方式(如列表、网格、滑动卡片、动态画面等)、时间和地点(如用户活跃的时间、地理位置等)、环境因素(如天气、节假日等)。Scenario information includes, but is not limited to: device type (such as mobile phones, tablets, computers, etc.), recommendation channels (such as social media, email, in-app recommendations, etc.), display methods (such as lists, grids, sliding cards, dynamic images, etc.), time and place (such as user active time, geographical location, etc.), and environmental factors (such as weather, holidays, etc.).

可以理解的是，在本申请的具体实施方式中，涉及到待处理数据等相关的数据，当本申请以上实施例运用到具体产品或技术中时，需要获得用户许可或者同意，且相关数据的收集、使用和处理需要遵守相关国家和地区的相关法律法规和标准。It is understandable that in the specific implementation of the present application, related data such as data to be processed is involved. When the above embodiments of the present application are applied to specific products or technologies, user permission or consent is required, and the collection, use and processing of relevant data need to comply with relevant laws, regulations and standards of relevant countries and regions.

然后，对这些待处理数据进行特征提取处理。这通常涉及使用各种特征工程技术，如文本挖掘、图像识别、时间序列分析等，以从原始数据中提取出有意义的特征，能够有效地降低数据的维度，同时保留其主要的信息。这些特征可以包括目标对象的对象特征(如用户标识ID、年龄、性别、城市、兴趣偏好等)、待推荐资源的资源特征(如资源ID、资源类型、资源属性、价格、评价等)，以及场景特征(如场景标识ID、时间戳等)。Then, the data to be processed are subjected to feature extraction. This usually involves the use of various feature engineering techniques, such as text mining, image recognition, time series analysis, etc., to extract meaningful features from the raw data, which can effectively reduce the dimension of the data while retaining its main information. These features can include object features of the target object (such as user identification ID, age, gender, city, interest preferences, etc.), resource features of the resources to be recommended (such as resource ID, resource type, resource attributes, price, evaluation, etc.), and scene features (such as scene identification ID, timestamp, etc.).

其中，该目标对象可以为接收目标推荐资源的相关用户。其中，待推荐资源可以包括但不限于视频、游戏、影视、产品、文学作品、应用程序等。The target object may be a user related to the target recommended resource. The resources to be recommended may include but are not limited to videos, games, movies, products, literary works, applications, etc.

例如，特征提取是将待处理数据转换为可用于机器学习模型的格式。这一过程可能包括：For example, feature extraction is the process of converting the data to a format that can be used in a machine learning model. This process may include:

文本挖掘：从待处理数据中的用户的评论或资源描述中提取关键词和主题。Text mining: Extract keywords and topics from user comments or resource descriptions in the data to be processed.

图像识别：如果资源关联信息包括图像或视频，使用图像识别技术提取视觉特征。Image recognition: If the resource-associated information includes images or videos, use image recognition technology to extract visual features.

时间序列分析：分析对象关联信息中用户行为随时间的变化趋势。Time series analysis: Analyze the changing trends of user behavior in object association information over time.

统计分析：计算用户历史行为数据对应的统计数据，如平均评分、购买频率、点击率、播放次数等。Statistical analysis: Calculate statistical data corresponding to user historical behavior data, such as average rating, purchase frequency, click-through rate, number of plays, etc.

例如，在特征提取之前，原始的待处理数据通常需要经过清洗和标准化处理，以确保数据质量。比如通过去除噪声处理，识别并清除数据中的异常值和错误。比如通过数据填充处理，对于缺失的数据采用适当的方法进行填充。比如通过标准化处理，将数据转换到统一的尺度，以便于比较和处理。For example, before feature extraction, the original data to be processed usually needs to be cleaned and standardized to ensure data quality. For example, through noise removal, outliers and errors in the data are identified and removed. For example, through data filling, missing data is filled with appropriate methods. For example, through standardization, data is converted to a unified scale for easy comparison and processing.

例如，将提取的特征整合到一起，形成初始输入特征集。比如整合用户相关的所有特征，形成一个对象特征向量。比如整合每个待推荐资源的相关特征，形成一个或多个资源特征向量。比如整合所有场景相关的特征，形成一个场景特征向量。For example, the extracted features are integrated together to form an initial input feature set. For example, all user-related features are integrated to form an object feature vector. For example, the relevant features of each resource to be recommended are integrated to form one or more resource feature vectors. For example, all scene-related features are integrated to form a scene feature vector.

其中，场景特征包括场景标识(ID)，用于区分不同推荐场景的标签。场景标识是区分不同推荐场景的关键，它允许推荐系统根据不同的场景调整其推荐策略。例如，如果场景标识表明用户正在使用移动设备，推荐系统可能会优先推荐适合移动观看的资源。Among them, scene features include scene identifiers (IDs), which are labels used to distinguish different recommendation scenes. Scene identifiers are the key to distinguishing different recommendation scenes, allowing the recommendation system to adjust its recommendation strategy according to different scenes. For example, if the scene identifier indicates that the user is using a mobile device, the recommendation system may prioritize resources suitable for mobile viewing.

步骤120，对所述初始输入特征进行特征压缩处理，得到低维输入特征。Step 120: perform feature compression processing on the initial input features to obtain low-dimensional input features.

高维度的特征空间会导致计算复杂度显著增加，尤其是在深度学习模型中，大量的特征意味着更多的模型参数，这会占用更多的内存资源和计算时间。通过特征压缩，可以有效减少模型的大小，加速训练和推理过程。High-dimensional feature space will lead to a significant increase in computational complexity, especially in deep learning models, where a large number of features means more model parameters, which will take up more memory resources and computing time. Feature compression can effectively reduce the size of the model and speed up the training and reasoning process.

例如，如图2所示的推荐系统，可以通过底层压缩网络对初始输入特征进行压缩，以将高纬度的初始输入特征映射到更低维度的低维输入特征，减少了后续进入分场景网络的输入，减少了计算成本。例如，底层压缩网络可以是一个深度神经网络(DNN)，该DNN网络可以是一个全连接网络，通过让全连接网络输出一个低纬度的embed来压缩初始输入特征。全连接网络的参数会随着其他网络参数一起参与网络训练，不需要特殊的设计。For example, the recommendation system shown in Figure 2 can compress the initial input features through the underlying compression network to map the high-dimensional initial input features to low-dimensional input features of lower dimensions, thereby reducing the subsequent inputs entering the sub-scenario network and reducing the computational cost. For example, the underlying compression network can be a deep neural network (DNN), and the DNN network can be a fully connected network, which compresses the initial input features by allowing the fully connected network to output a low-dimensional embed. The parameters of the fully connected network will participate in network training along with other network parameters, and no special design is required.

底层压缩网络可以是一个深度神经网络(DNN)，具体来说，可以是一个全连接网络，用于将初始输入特征压缩成低维嵌入。全连接网络由多个全连接层组成，每层包含多个神经元，全连接层中的每个神经元都与前一层的所有神经元相连，这使得网络能够学习输入数据的全局特征。初始输入特征首先通过这些全连接层，每层都会对数据进行转换和压缩。全连接网络的权重和偏置是可学习的参数。在训练过程中，这些参数会通过反向传播算法进行更新，以最小化预测输出和真实标签之间的差异。在网络的最后，一个特定的嵌入层负责生成低维嵌入。该嵌入层的输出就是压缩后的特征表示。在训练过程中，需要定义一个损失函数来衡量模型的预测与真实标签之间的差异。损失函数的选择取决于推荐系统的具体任务(如点击率预测、转化率预测等)。The underlying compression network can be a deep neural network (DNN), specifically, a fully connected network, which is used to compress the initial input features into low-dimensional embeddings. A fully connected network consists of multiple fully connected layers, each containing multiple neurons, and each neuron in a fully connected layer is connected to all neurons in the previous layer, which enables the network to learn the global features of the input data. The initial input features first pass through these fully connected layers, and each layer transforms and compresses the data. The weights and biases of the fully connected network are learnable parameters. During the training process, these parameters are updated through the back-propagation algorithm to minimize the difference between the predicted output and the true label. At the end of the network, a specific embedding layer is responsible for generating low-dimensional embeddings. The output of this embedding layer is the compressed feature representation. During the training process, a loss function needs to be defined to measure the difference between the model's prediction and the true label. The choice of loss function depends on the specific task of the recommendation system (such as click-through rate prediction, conversion rate prediction, etc.).

压缩后得到的低维输入特征将作为后续步骤的输入，例如进入分场景网络进行进一步的特征处理和资源推荐。由于这些特征具有更低的维度，因此可以减少后续计算的成本，同时保留了原始数据中的关键信息。The low-dimensional input features obtained after compression will be used as input for subsequent steps, such as entering the scene-based network for further feature processing and resource recommendation. Since these features have lower dimensions, the cost of subsequent calculations can be reduced while retaining the key information in the original data.

在图2所示的推荐系统，底层压缩网络可以先对每个场景的特征(如用户特征、资源特征、场景特征)分别进行压缩，然后将压缩后的低维特征融合后送入后续的分场景网络。这种分阶段处理不仅优化了计算效率，也为每个场景定制化推荐提供了灵活的基础。In the recommendation system shown in Figure 2, the underlying compression network can first compress the features of each scene (such as user features, resource features, and scene features) separately, and then fuse the compressed low-dimensional features and send them to the subsequent scene-based network. This staged processing not only optimizes computational efficiency, but also provides a flexible basis for customized recommendations for each scene.

步骤130，根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，各个所述分场景网络具有独立的场景参数。Step 130: According to the scene identifier in the scene feature, a target sub-scene network matching the scene identifier is selected from a plurality of sub-scene networks, each of the sub-scene networks having independent scene parameters.

在步骤130中，会根据场景特征中的场景标识，从预先训练好的多个分场景网络中选出一个或多个与目标场景相匹配的分场景网络。这些分场景网络是针对不同的场景或上下文环境进行训练的，因此它们具有不同的场景参数和模型结构。通过选择与目标场景最匹配的分场景网络，可以提高后续资源推荐的准确性和相关性。In step 130, one or more sub-scene networks matching the target scene are selected from the pre-trained multiple sub-scene networks according to the scene identifier in the scene feature. These sub-scene networks are trained for different scenes or contexts, so they have different scene parameters and model structures. By selecting the sub-scene network that best matches the target scene, the accuracy and relevance of subsequent resource recommendations can be improved.

在一些实施例中，所述场景特征包括M个场景标识，M为大于0的自然数；所述根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，包括：In some embodiments, the scene feature includes M scene identifiers, where M is a natural number greater than 0; and selecting a target sub-scene network matching the scene identifier from a plurality of sub-scene networks according to the scene identifier in the scene feature includes:

将所述M个场景标识中的各个场景标识分别与N个分场景网络中的各个分场景网络的预设场景标识进行匹配，得到与所述M个场景标识相匹配的M个目标分场景网络，N大于或等于M。Each scene identifier in the M scene identifiers is matched with the preset scene identifier of each sub-scene network in the N sub-scene networks to obtain M target sub-scene networks matching the M scene identifiers, where N is greater than or equal to M.

如图2所示的推荐系统，为每一个场景建立一个独立的分场景网络。各个分场景网络具有独立的场景参数，独立的场景参数，防止不同场景间的数据互相干扰。这意味着每个分场景网络都能够独立调整其权重和偏置，以最好地适应其对应场景的数据模式。这种设计使得每个分场景网络都能深入理解其场景下的用户需求和行为特征。每个分场景网络可以采用不同的模型结构，例如深度神经网络(DNN)，卷积神经网络(CNN)或循环神经网络(RNN)，具体取决于场景数据的特点。例如，对于视觉内容推荐，CNN可能更为适用；而对于序列数据，如用户行为日志，RNN可能更加有效。As shown in Figure 2, the recommendation system establishes an independent sub-scenario network for each scene. Each sub-scenario network has independent scene parameters to prevent data from different scenes from interfering with each other. This means that each sub-scenario network can independently adjust its weights and biases to best adapt to the data pattern of its corresponding scene. This design enables each sub-scenario network to deeply understand the user needs and behavior characteristics in its scene. Each sub-scenario network can adopt a different model structure, such as a deep neural network (DNN), a convolutional neural network (CNN), or a recurrent neural network (RNN), depending on the characteristics of the scene data. For example, for visual content recommendations, CNN may be more applicable; while for sequence data, such as user behavior logs, RNN may be more effective.

场景选择器会根据当前场景标识(ID)，自动选择对应的分场景网络的输出作为最终的输出。The scene selector will automatically select the output of the corresponding sub-scene network as the final output based on the current scene identifier (ID).

每个业务场景都有对应的场景标识(ID)，该场景标识(ID)和训练特征一起输入如图2所示的推荐系统，场景标识(ID)也可以认为是训练特征的一部分。在实际预测阶段，该场景标识(ID)，也是初始输入特征的一部分。Each business scenario has a corresponding scenario ID, which is input into the recommendation system shown in Figure 2 together with the training features. The scenario ID can also be considered as part of the training features. In the actual prediction stage, the scenario ID is also part of the initial input features.

其中，每个分场景网络包含了对应的场景参数，每个分场景网络对应哪个预设场景标识是模型训练之前定义好的。场景选择器将不属于当前场景的输出进行抑制。Each sub-scenario network contains corresponding scene parameters, and the preset scene identifier corresponding to each sub-scenario network is defined before model training. The scene selector suppresses the output that does not belong to the current scene.

例如，场景选择器会把当前场景标识(ID)与每一个分场景网络的预设当前场景标识(ID)进行对比，如果两者相同就保持分场景网络的输出不变，如果两者不相同则输出则被抑制(设置为0)。最后将所有的分场景输出相加，就是最终的场景嵌入特征(embedding)。这样，推荐系统能有效集中计算资源处理当前场景，同时减少无关信息的干扰。For example, the scene selector will compare the current scene ID with the preset current scene ID of each sub-scene network. If the two are the same, the output of the sub-scene network will remain unchanged. If the two are different, the output will be suppressed (set to 0). Finally, all the sub-scene outputs are added together to form the final scene embedding feature. In this way, the recommendation system can effectively concentrate computing resources to process the current scene while reducing the interference of irrelevant information.

由底层压缩网络、多个分场景网络和场景选择器组成的模型结构是一个可嵌入到其他深度推荐模型的结构，该模型结构的输出(场景嵌入特征)可以成为其他深度推荐模型(如图2所示的目标推荐模型)的输入，加强对应模型分场景的服务能力。The model structure composed of the underlying compression network, multiple sub-scene networks and scene selectors is a structure that can be embedded in other deep recommendation models. The output of this model structure (scene embedding features) can become the input of other deep recommendation models (the target recommendation model as shown in Figure 2), thereby enhancing the service capabilities of the corresponding model in different scenes.

在多场景的推荐系统中，每个场景(如A推荐频道，B推荐频道，C推荐频道，D推荐频道，E短视频无限流，F短视频内流等)都被视为独特的上下文环境，拥有自己的特征和用户行为模式。为了精确捕获这些差异，推荐系统构建了多个专门的分场景网络，每个分场景网络专精于理解并预测对应场景下的用户偏好。这些分场景网络通过预设场景标识与实际发生的场景标识进行匹配，确保在任何给定时刻，推荐系统能调用最适合当前上下文的目标分场景网络。In a multi-scenario recommendation system, each scenario (such as A recommendation channel, B recommendation channel, C recommendation channel, D recommendation channel, E short video unlimited stream, F short video internal stream, etc.) is regarded as a unique context environment with its own characteristics and user behavior patterns. In order to accurately capture these differences, the recommendation system builds multiple specialized sub-scenario networks, each of which specializes in understanding and predicting user preferences in the corresponding scenario. These sub-scenario networks match the preset scenario identifiers with the actual scenario identifiers to ensure that at any given moment, the recommendation system can call the target sub-scenario network that best suits the current context.

例如，当前场景为F短视频内流，对应分场景网络3。在预测阶段，每个分场景网络都会进行预测，并输出一个嵌入表示(embed)。假设各个分场景网络的输出分别为embed1,embed2，……，embedN-1，embedN。在场景选择器中，除了分场景网络3，其余的分场景网络输出的embed会乘以0，最后加和得到的就是embed3。For example, the current scene is the F short video instream, corresponding to sub-scene network 3. In the prediction stage, each sub-scene network will make a prediction and output an embedding representation (embed). Assume that the outputs of each sub-scene network are embed1, embed2, ..., embedN-1, embedN. In the scene selector, except for sub-scene network 3, the embeds output by the remaining sub-scene networks will be multiplied by 0, and the final sum is embed3.

在训练阶段，这种分场景网络结构通过反向传播调整各网络参数，尤其是那些在预测阶段频繁激活的网络，以不断优化其对特定场景的响应能力。对于不适用的场景，即输出被屏蔽(mask)掉的网络，由于其梯度为0，不会影响参数更新，从而保证了模型的高效学习和适应性。During the training phase, this scenario-based network structure adjusts the parameters of each network through back propagation, especially those networks that are frequently activated in the prediction phase, to continuously optimize their responsiveness to specific scenarios. For inapplicable scenarios, that is, networks whose outputs are masked, since their gradients are 0, they will not affect parameter updates, thus ensuring efficient learning and adaptability of the model.

对于新场景或低频场景，系统可以采取策略将其归类为已有的相似场景类别，减少网络复杂度同时保证一定程度的个性化推荐质量。这种方法体现了系统的灵活性和扩展性，使得即使是新兴或小众的场景需求也能得到一定程度的满足。For new or low-frequency scenarios, the system can adopt strategies to classify them into existing similar scenario categories, reducing network complexity while ensuring a certain degree of personalized recommendation quality. This approach reflects the flexibility and scalability of the system, so that even emerging or niche scenario needs can be met to a certain extent.

此外，该分场景网络结构设计为一个模块，可以无缝嵌入到更大的推荐系统框架中。这意味着，除了通过分场景网络处理的明确场景外，整个推荐系统还可能包含其他组件，如基础特征提取层、用户画像建模、内容特征编码等，这些组件共同协作，处理那些未被分场景网络直接覆盖的场景或信息，从而实现全链路的个性化推荐优化。In addition, the sub-scenario network structure is designed as a module that can be seamlessly embedded into a larger recommendation system framework. This means that in addition to the explicit scenarios processed by the sub-scenario network, the entire recommendation system may also contain other components, such as basic feature extraction layer, user portrait modeling, content feature encoding, etc. These components work together to process scenarios or information that are not directly covered by the sub-scenario network, thereby achieving full-link personalized recommendation optimization.

在一些实施例中，所述方法还包括：In some embodiments, the method further comprises:

若原有的所述多个分场景网络中不存在与所述场景标识相匹配的目标分场景网络，则创建所述场景标识对应的新分场景网络。If there is no target sub-scenario network matching the scene identifier in the original multiple sub-scenario networks, a new sub-scenario network corresponding to the scene identifier is created.

在一些实施例中，所述若原有的所述多个分场景网络中不存在与所述场景标识相匹配的目标分场景网络，则创建所述场景标识对应的新分场景网络，包括：In some embodiments, if there is no target sub-scenario network matching the scene identifier in the original multiple sub-scenario networks, creating a new sub-scenario network corresponding to the scene identifier includes:

若原有的所述多个分场景网络中不存在与所述场景标识相匹配的目标分场景网络，则检测所述场景标识对应的数据流量；If there is no target sub-scenario network matching the scenario identifier among the original multiple sub-scenario networks, detecting the data flow corresponding to the scenario identifier;

当所述场景标识对应的数据流量达到预设流量阈值时，创建所述场景标识对应的新分场景网络；When the data traffic corresponding to the scenario identifier reaches a preset traffic threshold, a new scenario network corresponding to the scenario identifier is created;

当所述新分场景网络的创建时间达到预设时间阈值时，将所述新分场景网络新增至原有的所述多个分场景网络中。When the creation time of the new sub-scenario network reaches a preset time threshold, the new sub-scenario network is added to the original multiple sub-scenario networks.

在某些应用场景中，推荐系统需要具备高度的自适应性和扩展性，以应对不断变化的用户需求和市场环境。当现有分场景网络无法覆盖所有出现的场景时，系统需要有机制来识别这些新场景，并适时创建与之匹配的分场景网络。In some application scenarios, the recommendation system needs to be highly adaptable and scalable to cope with the ever-changing user needs and market environment. When the existing scenario-based network cannot cover all emerging scenarios, the system needs to have a mechanism to identify these new scenarios and create a matching scenario-based network in a timely manner.

例如，系统在接收到场景标识后，首先检查现有的多个分场景网络中是否有与之完全匹配的目标分场景网络。如果发现没有适合的网络能够精准地服务于该新出现的场景，系统进入下一个决策环节。For example, after receiving the scene identification, the system first checks whether there is a target scene network that fully matches it among the existing multiple scene networks. If it is found that there is no suitable network that can accurately serve the newly emerged scene, the system enters the next decision-making stage.

例如，为了确保资源的有效利用，系统不会立即为每个新出现的场景创建分场景网络。而是开始监控该新场景标识所对应的数据流量。这一步骤旨在验证新场景是否具有足够的用户参与度和商业价值，值得为之投入额外的计算和存储资源。该数据流量可以包括辛场景标识对应的用户数量、请求频率以及与其他场景的交互情况。For example, to ensure efficient use of resources, the system does not immediately create a sub-scenario network for each new scenario. Instead, it starts monitoring the data traffic corresponding to the new scenario ID. This step is to verify whether the new scenario has sufficient user engagement and commercial value to be worth investing additional computing and storage resources. The data traffic can include the number of users corresponding to the scenario ID, request frequency, and interaction with other scenarios.

其中，当监测到新场景的数据流量达到预设流量阈值时，表明该场景已经吸引了一定量的用户关注和互动，这时系统会认为创建专门的分场景网络是有价值的。流量阈值的设定是一种平衡策略，旨在筛选出真正有潜力和需求的场景，避免因短暂或偶然的流量波动而做出不必要的资源分配。该预设流量阈值是提前根据业务需求和系统容量设定的，用于判断一个新场景是否具有足够的活跃度和影响力，从而值得为其单独建立和训练一个分场景网络。Among them, when the data traffic of a new scene reaches the preset traffic threshold, it indicates that the scene has attracted a certain amount of user attention and interaction. At this time, the system will consider it valuable to create a dedicated sub-scene network. The setting of the traffic threshold is a balancing strategy that aims to screen out scenes with real potential and demand, and avoid unnecessary resource allocation due to short-term or accidental traffic fluctuations. The preset traffic threshold is set in advance based on business needs and system capacity to determine whether a new scene has sufficient activity and influence, so that it is worth establishing and training a separate sub-scene network for it.

其中，在决定创建新分场景网络后，系统将根据该场景的特性，使用历史数据或实时数据对该新分场景网络进行初始化训练。以确保新分场景网络能够快速理解和预测新场景下的用户行为。After deciding to create a new sub-scenario network, the system will use historical data or real-time data to initialize and train the new sub-scenario network based on the characteristics of the scenario, to ensure that the new sub-scenario network can quickly understand and predict user behavior in the new scenario.

其中，新创建的新分场景网络不会立即加入到服务中，而是会有一个“观察期”。当该新分场景网络经过一段时间(预设时间阈值，比如5天)的持续训练和优化，其性能达到稳定并且预测效果得到验证后，才会正式整合进原有的分场景网络体系中，参与实际的推荐服务。The newly created sub-scenario network will not be added to the service immediately, but will have an "observation period". After a period of continuous training and optimization (preset time threshold, such as 5 days), when its performance is stable and the prediction effect is verified, the new sub-scenario network will be formally integrated into the original sub-scenario network system and participate in the actual recommendation service.

例如，如果一个电商平台推出了一个新的促销活动，该活动可能会带来一个新的用户交互场景。通过上述机制，系统可以检测到这一新场景，评估其流量，并在必要时创建一个新的分场景网络来优化这一特定场景下的推荐。For example, if an e-commerce platform launches a new promotion, the promotion may bring about a new user interaction scenario. Through the above mechanism, the system can detect this new scenario, evaluate its traffic, and, if necessary, create a new scenario-based network to optimize the recommendation in this specific scenario.

这一系列步骤的设计，既保证了推荐系统的灵活性，使其能够迅速适应新场景的出现，又通过流量和时间双重阈值的设置，确保了资源的高效利用，避免了过度投资于短期或低价值的场景。此外，这一机制还鼓励了对新机会的探索和挖掘，有助于提升整体推荐系统的长期竞争力和用户满意度。The design of this series of steps not only ensures the flexibility of the recommendation system, enabling it to quickly adapt to the emergence of new scenarios, but also ensures the efficient use of resources by setting dual thresholds of traffic and time, avoiding over-investment in short-term or low-value scenarios. In addition, this mechanism also encourages the exploration and mining of new opportunities, which helps to improve the long-term competitiveness and user satisfaction of the overall recommendation system.

例如，以具体的场景为例来解释上述实施例，以直观地理解这一过程是如何在不同的推荐频道和短视频流场景中的应用。For example, the above embodiment is explained by taking a specific scenario as an example to intuitively understand how this process is applied in different recommendation channels and short video streaming scenarios.

假设有一个新的内容推荐频道：A推荐频道，专注于科技新闻。目前，推荐系统中已存在B、C、D等多个成熟的推荐频道网络，但没有专门针对科技新闻的分场景网络。首先，系统监测A推荐频道的用户数据流量，包括浏览量、点击率、用户停留时间等指标。如果初期流量较低，系统可能不会立即为其创建专属的分场景网络。随着时间推移，如果A推荐频道的用户参与度逐渐提高，其数据流量达到了预先设定的流量阈值(比如每日100万次浏览)，这表明该频道有足够多的用户活动来支持独立的模型训练。在确认A推荐频道满足流量条件后，系统自动触发创建一个全新的分场景网络，该新分场景网络将针对科技新闻的特性和用户偏好进行初始化配置和训练。为了确保新分场景网络的稳定性和准确性，系统设定一个预设时间阈值(比如经过一周的持续训练和测试)，在这段时间内不断优化模型性能。当新分场景网络经过充分训练并验证其效果后，最终将其加入到现有的多个分场景网络中，成为正式的一员，负责输出A推荐频道的场景嵌入特征，以应用到A推荐频道下的个性化内容推荐。Suppose there is a new content recommendation channel: A recommendation channel, which focuses on technology news. At present, there are multiple mature recommendation channel networks such as B, C, and D in the recommendation system, but there is no sub-scenario network specifically for technology news. First, the system monitors the user data traffic of A recommendation channel, including indicators such as page views, click-through rate, and user stay time. If the initial traffic is low, the system may not immediately create a dedicated sub-scenario network for it. Over time, if the user engagement of A recommendation channel gradually increases, its data traffic reaches a pre-set traffic threshold (such as 1 million views per day), which indicates that the channel has enough user activities to support independent model training. After confirming that A recommendation channel meets the traffic conditions, the system automatically triggers the creation of a new sub-scenario network, which will be initialized and trained for the characteristics of technology news and user preferences. In order to ensure the stability and accuracy of the new sub-scenario network, the system sets a preset time threshold (such as after a week of continuous training and testing) to continuously optimize the model performance during this period. After the new sub-scene network has been fully trained and its effectiveness has been verified, it will eventually be added to the existing multiple sub-scene networks and become a formal member, responsible for outputting the scene embedding features of recommendation channel A for application to personalized content recommendations under recommendation channel A.

同样的逻辑可以应用于B推荐频道(如聚焦于娱乐新闻)、C推荐频道(如侧重于财经资讯)、D推荐频道(如侧重于体育直播)、E短视频无限流(如探索类视频推荐)、F短视频内流(如用户关注作者的内容流)等其他场景。每个新场景或频道在达到特定流量阈值并且经过一定时间的训练和验证后，都会生成新分场景网络并整合到分场景网络集合中，以实现更精准的内容推送。The same logic can be applied to other scenarios such as B recommendation channel (such as focusing on entertainment news), C recommendation channel (such as focusing on financial information), D recommendation channel (such as focusing on live sports), E short video unlimited flow (such as exploration video recommendation), F short video internal flow (such as content flow of users following authors), etc. After reaching a specific traffic threshold and undergoing a certain period of training and verification, each new scenario or channel will generate a new sub-scenario network and integrate it into the sub-scenario network set to achieve more accurate content push.

在一些实施例中，所述创建所述场景标识对应的新分场景网络，包括：In some embodiments, the creating a new sub-scenario network corresponding to the scenario identifier includes:

当原有的所述多个分场景网络中存在与所述场景标识的相似度达到相似度阈值的第一预设场景标识时，从所述第一预设场景标识对应的第一分场景网络中迁移部分参数作为所述新分场景网络的初始化场景参数；When there is a first preset scene identifier in the original multiple scene identifiers whose similarity with the scene identifier reaches a similarity threshold, migrating some parameters from the first scene identifier corresponding to the first preset scene identifier as initialization scene parameters of the new scene identifier;

根据所述场景标识对应的初始输入特征，对所述新分场景网络的初始化场景参数进行微调，得到所述场景标识对应的场景参数；According to the initial input features corresponding to the scene identifier, fine-tuning the initialization scene parameters of the new scene classification network to obtain the scene parameters corresponding to the scene identifier;

基于所述场景标识对应的场景参数，生成所述场景标识对应的新分场景网络。Based on the scene parameters corresponding to the scene identifier, a new sub-scene network corresponding to the scene identifier is generated.

其中，在实际应用中，新场景的出现往往不是完全独立的，它可能与已有的某些场景存在相似之处。因此，在创建新分场景网络时，先检查是否存在相似场景，并考虑利用这些相似场景的参数进行迁移，可以大大提高网络构建的效率和性能。In actual applications, the emergence of new scenarios is often not completely independent, and it may be similar to some existing scenarios. Therefore, when creating a new sub-scenario network, first check whether there are similar scenarios and consider using the parameters of these similar scenarios for migration, which can greatly improve the efficiency and performance of network construction.

首先，系统会比较新的场景标识与已有的多个分场景网络中的场景标识之间的相似度。这通常通过计算标识之间的某种距离或相似度度量(如余弦相似度等)来完成。First, the system compares the similarity between the new scene identifier and the scene identifiers in multiple existing scene networks. This is usually done by calculating some distance or similarity measure (such as cosine similarity) between the identifiers.

当计算出的相似度达到预设的相似度阈值(如80％)时，系统认为找到了与新场景相似的已有场景，即第一预设场景标识。When the calculated similarity reaches a preset similarity threshold (eg, 80%), the system considers that an existing scene similar to the new scene is found, ie, the first preset scene identifier.

然后，系统会从第一预设场景标识对应的第一分场景网络中迁移部分参数。这些参数可能包括网络权重、偏置项、结构配置等。迁移的目的是利用已有的学习成果作为新网络的强有力起点，减少从头训练的时间和数据需求。Then, the system will migrate some parameters from the first sub-scenario network corresponding to the first preset scenario identifier. These parameters may include network weights, bias terms, structural configurations, etc. The purpose of migration is to use the existing learning results as a strong starting point for the new network, reducing the time and data requirements for training from scratch.

虽然利用了相似场景的参数作为基础，但每个场景都有其独特性，因此接下来的步骤是对迁移来的参数进行微调。这一步依据新场景标识对应的初始输入特征，通过反向传播等优化算法对初始化场景参数进行调整。微调的过程旨在让模型更好地适应新场景特有的用户偏好和行为模式。Although the parameters of similar scenes are used as a basis, each scene is unique, so the next step is to fine-tune the migrated parameters. This step adjusts the initial scene parameters through optimization algorithms such as back propagation based on the initial input features corresponding to the new scene identifier. The fine-tuning process aims to make the model better adapt to the user preferences and behavior patterns unique to the new scene.

然后，基于经过微调得到的场景参数，系统构建出与场景标识完全匹配的新分场景网络。该网络不仅继承了相似场景的通用知识，也融入了针对特定场景的个性化理解，从而能更准确地为用户提供定制化的推荐内容或服务。Then, based on the fine-tuned scene parameters, the system constructs a new scene-by-scene network that fully matches the scene identifier. This network not only inherits the general knowledge of similar scenes, but also incorporates personalized understanding of specific scenes, so that it can provide users with customized recommended content or services more accurately.

然后，系统会将新分场景网络保存到适当的存储介质中(如硬盘、数据库等)，以便后续调用和使用。同时，系统还会更新相关的索引和配置信息，确保新分场景网络能够被正确地识别和使用。Then, the system will save the new sub-scenario network to an appropriate storage medium (such as a hard disk, database, etc.) for subsequent call and use. At the same time, the system will also update the relevant index and configuration information to ensure that the new sub-scenario network can be correctly identified and used.

例如，假设正在为一个新的科技新闻对应的A推荐频道设计个性化的分场景网络应用到推荐系统中。现有的推荐系统已经包含几个成熟的频道网络，如B推荐频道(如聚焦于娱乐新闻)与C推荐频道(如侧重于财经资讯)。尽管A推荐频道的内容主题与现有频道不同，但与B推荐频道在用户行为模式和某些内容属性上存在一定的相似性，两者的场景标识相似度经过计算达到了预设的相似度阈值。系统首先分析A推荐频道的场景标识，并发现它与B推荐频道的场景标识相似度高，符合迁移条件。由于B推荐频道已有一套成熟且表现良好的场景参数，系统决定从B推荐频道的分场景网络中迁移一部分参数作为A推荐频道的新分场景网络的初始化参数。考虑到科技新闻与娱乐新闻的差异，系统需要根据A推荐频道特有的初始输入特征(如用户对科技关键词的偏好、科技新闻的时效性等)对迁移来的参数进行微调。微调过程可能涉及使用少量初期数据对模型进行训练，以适应新场景下的用户行为和内容特性。经过微调得到的场景参数更加贴合A推荐频道的需求。此时，系统基于这些调整后的参数生成A推荐频的全新分场景网络。该网络不仅继承了B推荐频道的分场景网络的部分高效特性，也融入了针对科技新闻推荐的定制化优化，从而使得目标推荐模型能够更准确地向用户推荐科技相关内容。For example, suppose that a personalized sub-scenario network is being designed for a new recommendation channel A corresponding to technology news and applied to the recommendation system. The existing recommendation system already contains several mature channel networks, such as recommendation channel B (for example, focusing on entertainment news) and recommendation channel C (for example, focusing on financial information). Although the content theme of recommendation channel A is different from that of existing channels, it has certain similarities with recommendation channel B in terms of user behavior patterns and certain content attributes. The similarity of the scene identifiers of the two reaches the preset similarity threshold after calculation. The system first analyzes the scene identifier of recommendation channel A and finds that it has a high similarity with the scene identifier of recommendation channel B, which meets the migration conditions. Since recommendation channel B already has a set of mature and well-performing scene parameters, the system decides to migrate some parameters from the sub-scenario network of recommendation channel B as the initialization parameters of the new sub-scenario network of recommendation channel A. Considering the differences between technology news and entertainment news, the system needs to fine-tune the migrated parameters according to the initial input features unique to recommendation channel A (such as user preference for technology keywords, timeliness of technology news, etc.). The fine-tuning process may involve training the model with a small amount of initial data to adapt to user behavior and content characteristics in the new scenario. The fine-tuned scene parameters are more in line with the needs of the A recommendation channel. At this point, the system generates a new scene-based network for the A recommendation channel based on these adjusted parameters. This network not only inherits some of the high-efficiency characteristics of the scene-based network of the B recommendation channel, but also incorporates customized optimization for science and technology news recommendations, so that the target recommendation model can more accurately recommend science and technology-related content to users.

通过这种基于相似场景的参数迁移与微调策略，新频道"A"能够更快地建立高效、定制化的推荐服务，减少从零开始构建模型所需的时间和资源，同时保证了推荐的精准度和用户体验。类似的方法也可应用于E短视频无限流、F短视频内流等场景中，根据各自的特性找到合适的预设场景进行参数迁移与优化。Through this parameter migration and fine-tuning strategy based on similar scenarios, the new channel "A" can establish an efficient and customized recommendation service more quickly, reducing the time and resources required to build a model from scratch, while ensuring the accuracy of recommendations and user experience. Similar methods can also be applied to scenarios such as E short video infinite streaming and F short video internal streaming, and find suitable preset scenarios for parameter migration and optimization based on their respective characteristics.

当原有的所述多个分场景网络中不存在与所述场景标识的相似度达到相似度阈值的第一预设场景标识时，获取所述新分场景网络的随机初始化场景参数；When there is no first preset scene identifier whose similarity with the scene identifier reaches a similarity threshold in the original multiple sub-scene networks, obtaining a randomly initialized scene parameter of the new sub-scene network;

根据所述场景标识对应的初始输入特征，对所述新分场景网络的随机初始化场景参数进行微调，得到所述场景标识对应的场景参数；According to the initial input features corresponding to the scene identifier, fine-tuning the randomly initialized scene parameters of the new scene segmentation network to obtain the scene parameters corresponding to the scene identifier;

例如，可以基于E短视频无限流场景来具体说明这一过程，因为这一场景相比传统的推荐频道更具动态性和多样性，非常适合展示如何从无到有创建一个全新的分场景网络。For example, this process can be specifically illustrated based on the E short video infinite streaming scenario, because this scenario is more dynamic and diverse than the traditional recommendation channel, and is very suitable for demonstrating how to create a brand new scenario-based network from scratch.

例如，要为一个专注于“户外探险”内容的E短视频无限流创建一个新分场景网络用于个性化推荐系统中。该领域在现有的短视频分类中较为独特，没有直接与之相似度达到预设阈值的预设场景标识(比如已有的可能是搞笑、美食、时尚等)，因此无法直接从现有分场景网络迁移参数。For example, we need to create a new scene-based network for a short video infinite stream focusing on "outdoor adventure" content for use in a personalized recommendation system. This field is relatively unique in the existing short video classification, and there is no preset scene identifier with a similarity threshold (for example, there may be comedy, food, fashion, etc.), so it is impossible to directly migrate parameters from the existing scene-based network.

既然没有现成的相似场景可供借鉴，系统首先会为“户外探险”这一新场景网络随机初始化参数。接下来，系统利用针对“户外探险”这一特定场景收集的初始输入特征来微调这些随机初始化的参数。这些特征可能包括用户在浏览探险相关视频时的停留时间、点赞、评论、分享行为，以及视频本身的标签(如山地攀爬、潜水探险、野外生存技巧等)。通过机器学习算法，系统逐步调整参数以更好地反映用户对这类内容的偏好和互动习惯。Since there are no similar scenarios to draw on, the system will first randomly initialize the parameters for the new scenario network "outdoor adventure". Next, the system uses the initial input features collected for the specific scenario of "outdoor adventure" to fine-tune these randomly initialized parameters. These features may include the user's dwell time, likes, comments, sharing behaviors when browsing adventure-related videos, and the labels of the video itself (such as mountain climbing, diving adventures, wilderness survival skills, etc.). Through machine learning algorithms, the system gradually adjusts the parameters to better reflect the user's preferences and interaction habits for this type of content.

微调完成后，系统根据得到的场景参数来生成“户外探险”短视频无限流的专属的新分场景网络。该新分场景网络现在具备了初步理解并预测用户在该特定内容领域内行为的能力，能够为用户提供连续、个性化且吸引人的探险视频流。After fine-tuning, the system generates a new scene-based network for the infinite stream of "outdoor adventure" short videos based on the obtained scene parameters. The new scene-based network now has the ability to initially understand and predict user behavior in this specific content area, and can provide users with a continuous, personalized and attractive adventure video stream.

在新分场景网络上线后，系统会持续收集反馈数据，如点击率、观看时长、转化率等，进一步迭代优化模型参数，确保推荐越来越精准，提升用户体验和参与度。随着用户数据的积累，“户外探险”场景的推荐网络会逐渐成熟，最终形成一个高度定制化且高效的推荐系统，满足用户对户外探险内容的探索需求。After the new scenario network is launched, the system will continue to collect feedback data, such as click-through rate, viewing time, conversion rate, etc., and further iterate and optimize model parameters to ensure more accurate recommendations and improve user experience and engagement. As user data accumulates, the recommendation network for the "outdoor adventure" scenario will gradually mature, and eventually form a highly customized and efficient recommendation system to meet users' exploration needs for outdoor adventure content.

在一些实施例中，所述场景特征包括场景标识与所述场景标识对应的场景级别；In some embodiments, the scene feature includes a scene identifier and a scene level corresponding to the scene identifier;

所述根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，包括：The step of selecting a target sub-scenario network matching the scene identifier from a plurality of sub-scenario networks according to the scene identifier in the scene feature includes:

根据所述场景标识与所述场景级别，从多个分场景网络中选出与所述场景标识和所述场景级别相匹配的目标分场景网络。According to the scene identifier and the scene level, a target sub-scene network matching the scene identifier and the scene level is selected from a plurality of sub-scene networks.

例如，场景特征不仅包括场景标识，还包括与场景标识对应的场景级别。场景级别提供了一个衡量场景复杂性或重要性的指标，有助于更精确地匹配和处理不同的场景。因此，步骤130中的选择目标分场景网络的过程可以基于场景标识和场景级别来进行，以实现更细致和准确的场景匹配。For example, the scene feature includes not only the scene identifier, but also the scene level corresponding to the scene identifier. The scene level provides an indicator to measure the complexity or importance of the scene, which helps to more accurately match and process different scenes. Therefore, the process of selecting the target sub-scene network in step 130 can be performed based on the scene identifier and the scene level to achieve more detailed and accurate scene matching.

例如，构建一个场景-级别映射表，该表记录了每个场景标识及其对应的场景级别。场景级别作为新增的考量维度，反映了场景的复杂度或重要性，其数值范围和划分标准可根据具体业务需求定制。For example, a scenario-level mapping table is constructed, which records each scenario identifier and its corresponding scenario level. As a newly added consideration dimension, the scenario level reflects the complexity or importance of the scenario, and its value range and classification criteria can be customized according to specific business needs.

然后，还可以维护一个分级的分场景网络架构库，其中不同复杂度和资源消耗的分场景网络分别对应不同的场景级别。例如，低级别场景可能对应轻量级的分场景网络，专注于快速推荐；而高级别场景则匹配深度复杂的分场景网络，追求高度个性化与精准推荐。Then, we can also maintain a hierarchical scenario-based network architecture library, where scenario-based networks of different complexity and resource consumption correspond to different scenario levels. For example, low-level scenarios may correspond to lightweight scenario-based networks that focus on fast recommendations, while high-level scenarios match deep and complex scenario-based networks that pursue highly personalized and accurate recommendations.

在接收到包含场景标识和场景级别的低维输入特征后，首先查询场景-级别映射表，确认当前场景的具体级别。然后依据场景标识及确认的场景级别，在分级的分场景网络架构库中查找最适合的目标分场景网络。匹配过程中，系统会优先考虑那些既符合场景标识又能满足场景级别要求的分场景网络模型。After receiving the low-dimensional input features containing the scene identifier and the scene level, the scene-level mapping table is first queried to confirm the specific level of the current scene. Then, based on the scene identifier and the confirmed scene level, the most suitable target sub-scene network is searched in the hierarchical sub-scene network architecture library. During the matching process, the system will give priority to those sub-scene network models that meet both the scene identifier and the scene level requirements.

例如，在某些情况下，如果资源有限(例如，系统处于高负载状态)，系统可能会降级选择一个稍低于当前场景级别但仍然能有效工作的分场景网络作为目标分场景网络，以确保推荐服务的稳定性和时效性。For example, in some cases, if resources are limited (for example, the system is under high load), the system may downgrade and select a sub-scenario network that is slightly lower than the current scenario level but still works effectively as the target sub-scenario network to ensure the stability and timeliness of the recommendation service.

其中，目标分场景网络的选择过程并非静态不变，而是可以根据实时系统资源状况、推荐系统中各个网络模型的在线性能反馈以及用户满意度等多因素动态调整。这意味着，对于同一场景标识但不同时间点或不同环境下，系统可能选择不同级别的分场景网络以达到最优效果。The selection process of the target scenario network is not static, but can be dynamically adjusted based on multiple factors such as real-time system resource status, online performance feedback of each network model in the recommendation system, and user satisfaction. This means that for the same scenario identifier but at different time points or in different environments, the system may select different levels of scenario networks to achieve the best effect.

下面通过一个电子商务平台的个性化商品推荐的应用场景来具体说明这一方案。The following is an application scenario of personalized product recommendation on an e-commerce platform to illustrate this solution.

例如，有一个电子商务平台，用户在浏览商品时，系统需根据用户的行为和当前环境提供个性化的商品推荐。平台定义了多种场景标识，如“日常浏览”、“节假日促销”、“夜间购物”等，并为每个场景标识设定了不同级别，以反映场景的重要性和复杂度。For example, there is an e-commerce platform where when users browse products, the system needs to provide personalized product recommendations based on the user's behavior and current environment. The platform defines a variety of scene identifiers, such as "daily browsing", "holiday promotions", "night shopping", etc., and sets different levels for each scene identifier to reflect the importance and complexity of the scene.

例如，用户a在晚上10点打开了电商平台，准备购买一些家居用品。此时，场景标识为“夜间购物”，系统根据用户的历史行为和时间点判断此场景级别为“中级”。系统首先对对象特征、商品特征和场景特征进行处理，得到低维输入特征。然后系统查阅场景-级别映射表，确认“夜间购物”场景的级别为“中级”。然后根据“夜间购物”标识和“中级”级别，系统在分级的分场景网络架构库中查找匹配的目标分场景网络。对于“中级”场景，原本应该选择一个中等复杂度的目标分场景网络，目标推荐模型能够根据初始输入特征和该目标分场景网络输出的场景嵌入特征，综合分析用户过去的购物习惯、时间敏感优惠信息以及用户对家居产品的偏好来生成推荐列表。For example, user a opens an e-commerce platform at 10 pm and prepares to buy some household items. At this time, the scene is identified as "night shopping", and the system determines that the level of this scene is "intermediate" based on the user's historical behavior and time point. The system first processes the object features, product features, and scene features to obtain low-dimensional input features. Then the system consults the scene-level mapping table and confirms that the level of the "night shopping" scene is "intermediate". Then, based on the "night shopping" identification and the "intermediate" level, the system searches for a matching target sub-scene network in the hierarchical sub-scene network architecture library. For the "intermediate" scene, a target sub-scene network of medium complexity should be selected. The target recommendation model can generate a recommendation list based on the initial input features and the scene embedding features output by the target sub-scene network, comprehensively analyzing the user's past shopping habits, time-sensitive discount information, and user preferences for home products.

但假设此时系统正面临高负载，资源紧张。为保证服务稳定性，系统决定动态调整，从“夜间购物”标识对应的“中级”的分场景网络降级选择一个“夜间购物”标识对应的“初级”的分场景网络。初级的分场景网络可能不那么依赖复杂的用户行为模式分析，更多基于用户的基本偏好和近期热门商品来生成场景嵌入特征，尽管个性化程度有所降低，但能迅速响应，确保用户最终能够获得推荐列表，维持服务的流畅性。But suppose the system is facing high load and resource constraints at this time. To ensure service stability, the system decides to dynamically adjust and downgrade from the "intermediate" sub-scenario network corresponding to the "night shopping" logo to a "primary" sub-scenario network corresponding to the "night shopping" logo. The primary sub-scenario network may not rely so much on complex user behavior pattern analysis, and is more based on the user's basic preferences and recent popular products to generate scene embedding features. Although the degree of personalization is reduced, it can respond quickly to ensure that users can eventually get the recommendation list and maintain the smoothness of the service.

接下来，系统会持续监控资源状况、推荐系统中各个网络模型的在线性能(如推荐点击率、转化率)以及用户反馈。如果资源状况改善或发现初级分场景网络的在推荐系统中的应用导致用户满意度下降，系统会适时切换回更高级别的分场景网络。这种动态调整机制意味着，即使在不同时间点或资源状况改变时，针对相同的“夜间购物”场景，系统也能根据实际情况选择最合适的推荐策略，确保推荐服务的质量与效率。Next, the system will continue to monitor resource conditions, the online performance of each network model in the recommendation system (such as recommendation click-through rate, conversion rate), and user feedback. If the resource situation improves or it is found that the application of the primary sub-scenario network in the recommendation system has caused a decrease in user satisfaction, the system will switch back to a higher-level sub-scenario network in a timely manner. This dynamic adjustment mechanism means that even at different time points or when resource conditions change, for the same "night shopping" scenario, the system can select the most appropriate recommendation strategy based on actual conditions to ensure the quality and efficiency of the recommendation service.

在步骤130中，不仅可以基于场景标识匹配分场景网络，而且通过整合场景级别信息，实现了更精细化、资源敏感型的网络选择策略，进一步提升了推荐系统的灵活性、效率及个性化水平。这有助于在各种复杂度和重要性的场景下，都能提供最合适的推荐服务。In step 130, not only can the scenario-based networks be matched based on the scenario identifiers, but also a more refined and resource-sensitive network selection strategy can be implemented by integrating the scenario-level information, further improving the flexibility, efficiency and personalization level of the recommendation system. This helps to provide the most appropriate recommendation service in scenarios of various complexity and importance.

步骤140，基于所述目标分场景网络对所述低维输入特征进行特征处理，得到所述待处理数据的场景嵌入特征。Step 140: perform feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed.

如图2所示，在选择了目标分场景网络后，会将低维输入特征输入到该目标分场景网络中，进行进一步的特征处理。该过程可能包括多层神经网络的前向传播、注意力机制的应用等，以生成更高级别的特征表示，即场景嵌入特征。这些特征能够更好地捕捉目标对象在特定场景下的行为和偏好。As shown in Figure 2, after the target sub-scene network is selected, the low-dimensional input features are input into the target sub-scene network for further feature processing. This process may include forward propagation of multi-layer neural networks, application of attention mechanisms, etc. to generate higher-level feature representations, namely scene embedding features. These features can better capture the behavior and preferences of the target object in a specific scene.

在一些实施例中，所述基于所述目标分场景网络对所述低维输入特征进行特征处理，得到所述待处理数据的场景嵌入特征，包括：In some embodiments, the step of performing feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed includes:

将所述低维输入特征分别输入所述M个目标分场景网络进行特征处理，得到各个所述目标分场景网络的分场景输出特征；Inputting the low-dimensional input features into the M target sub-scene networks respectively for feature processing to obtain sub-scene output features of each of the target sub-scene networks;

将各个所述目标分场景网络的分场景输出特征进行聚合，得到所述待处理数据的场景嵌入特征。Aggregate the sub-scene output features of each of the target sub-scene networks to obtain the scene embedding features of the data to be processed.

其中，在步骤140中，系统会将低维输入特征分别输入到M个目标分场景网络中。这里的M表示选择的目标分场景网络的数量，这些目标分场景网络是根据当前的场景标识(或者，当前的场景标识和场景级别)选择出来的，每个网络都针对特定的场景进行了优化。In step 140, the system inputs the low-dimensional input features into M target sub-scene networks respectively. Here, M represents the number of selected target sub-scene networks, which are selected based on the current scene identifier (or the current scene identifier and scene level), and each network is optimized for a specific scene.

例如，在每个目标分场景网络中，低维输入特征会经过一系列的神经网络层，可以包括卷积层、循环层、全连接层等，以及可能的注意力机制等高级特征处理技术。这些处理过程旨在提取和学习特征中的更高级表示，即分场景输出特征，这些特征更加关注于捕捉与特定场景相关的细节和特性。For example, in each target scene-by-scene network, low-dimensional input features go through a series of neural network layers, including convolutional layers, recurrent layers, fully connected layers, and possible advanced feature processing techniques such as attention mechanisms. These processes are designed to extract and learn higher-level representations of features, namely, scene-by-scene output features, which are more focused on capturing details and features related to specific scenes.

然后，系统会将来自各个目标分场景网络的分场景输出特征进行聚合。例如，这种聚合可以通过简单的相加操作来实现，即将所有分场景输出特征直接相加，以得到一个综合的场景嵌入特征。该场景嵌入特征综合了来自不同分场景网络的信息，因此能够更全面地反映待处理数据在不同场景下的特质。Then, the system aggregates the sub-scene output features from each target sub-scene network. For example, this aggregation can be achieved by a simple addition operation, that is, directly adding all sub-scene output features to obtain a comprehensive scene embedding feature. The scene embedding feature integrates information from different sub-scene networks, so it can more comprehensively reflect the characteristics of the data to be processed in different scenes.

然后，通过聚合得到的综合特征即为待处理数据的场景嵌入特征。该场景嵌入特征不仅包含了原始低维输入特征的信息，还融入了针对不同场景的深入学习和理解，使得其对待处理数据的描述更为丰富和准确。Then, the comprehensive features obtained by aggregation are the scene embedding features of the data to be processed. This scene embedding feature not only contains the information of the original low-dimensional input features, but also incorporates in-depth learning and understanding of different scenarios, making its description of the data to be processed richer and more accurate.

通过这样的过程，系统能够根据不同的场景标识(或者，不同的场景标识和场景级别)，选择最合适的目标分场景网络进行特征处理，并通过聚合这些目标分场景网络的输出来生成一个综合的场景嵌入特征。这种方法充分利用了不同分场景网络的特性和优势，提高了场景嵌入特征的表达能力，从而有助于提升后续资源推荐的准确率和效果。Through this process, the system can select the most appropriate target sub-scene network for feature processing according to different scene identifiers (or different scene identifiers and scene levels), and generate a comprehensive scene embedding feature by aggregating the outputs of these target sub-scene networks. This method makes full use of the characteristics and advantages of different sub-scene networks, improves the expressiveness of scene embedding features, and thus helps to improve the accuracy and effectiveness of subsequent resource recommendations.

步骤150，基于所述初始输入特征和所述场景嵌入特征，向所述目标对象进行资源推荐。Step 150: Recommend resources to the target object based on the initial input features and the scene embedding features.

如图2所示，这一步是利用初始输入特征和场景嵌入特征来执行资源推荐任务。这通常涉及使用目标推荐模型来预测目标对象对每个待推荐资源的预测任务(如点击率预测、转化率预测)。然后，根据预测结果对资源进行排序或筛选，选择出最符合目标对象需求和偏好的目标推荐资源作为目标推荐模型的输出结果进行推荐。这确保了推荐不仅基于个体和资源本身的匹配度，也充分考虑了当前场景的影响，提升了推荐的个性化和效果。As shown in Figure 2, this step is to use the initial input features and scene embedding features to perform the resource recommendation task. This usually involves using a target recommendation model to predict the target object's prediction task for each resource to be recommended (such as click-through rate prediction, conversion rate prediction). Then, the resources are sorted or filtered according to the prediction results, and the target recommended resources that best meet the needs and preferences of the target object are selected as the output results of the target recommendation model for recommendation. This ensures that the recommendation is not only based on the match between the individual and the resource itself, but also fully considers the impact of the current scene, thereby improving the personalization and effectiveness of the recommendation.

在一些实施例中，所述基于所述初始输入特征和所述场景嵌入特征，向所述目标对象进行资源推荐，包括：In some embodiments, the recommending resources to the target object based on the initial input feature and the scene embedding feature includes:

基于所述目标对象的对象特征、多个待推荐资源的资源特征、所述场景特征与所述场景嵌入特征执行预测任务，得到所述目标对象针对各个所述待推荐资源的任务预测值，所述预测任务为点击率预测任务与转化率预测任务中的一种；Performing a prediction task based on the object feature of the target object, the resource features of multiple resources to be recommended, the scene feature and the scene embedding feature to obtain a task prediction value of the target object for each resource to be recommended, wherein the prediction task is one of a click rate prediction task and a conversion rate prediction task;

根据所述任务预测值从所述多个待推荐资源中确定出所述目标对象对应的目标推荐资源；Determining a target recommended resource corresponding to the target object from the multiple resources to be recommended according to the task prediction value;

向所述目标对象推荐所述目标推荐资源。The target recommendation resource is recommended to the target object.

在步骤150中，首先基于目标对象的对象特征(如用户标识、用户画像特征、历史行为特征、用户偏好特征等)、多个待推荐资源的资源特征(如资源内容特征、资源类型、资源标签、资源属性等)、场景特征(如时间、地点、上下文环境信息、设备类型、展示方式等)以及之前步骤中生成的场景嵌入特征，执行预测任务。预测任务通常包括点击率预测任务、转化率预测任务等，这些任务用于估计目标对象对特定资源的兴趣或潜在价值。In step 150, firstly, a prediction task is performed based on the object features of the target object (such as user identification, user portrait features, historical behavior features, user preference features, etc.), resource features of multiple resources to be recommended (such as resource content features, resource types, resource tags, resource attributes, etc.), scene features (such as time, location, context information, device type, display method, etc.), and the scene embedding features generated in the previous steps. Prediction tasks usually include click-through rate prediction tasks, conversion rate prediction tasks, etc., which are used to estimate the interest or potential value of the target object in a specific resource.

预测任务的执行通常依赖于一个已经训练好的目标推荐模型。该目标推荐模型能够捕捉用户、资源和场景之间的复杂关系，并基于这些关系预测用户对资源的潜在兴趣。在预测过程中，初始输入特征和场景嵌入特征被作为模型的输入，经过模型内部的一系列计算，最终输出用户对每个待推荐资源的预测值。The execution of the prediction task usually relies on a trained target recommendation model. The target recommendation model can capture the complex relationships between users, resources, and scenarios, and predict the user's potential interest in resources based on these relationships. In the prediction process, the initial input features and scene embedding features are used as inputs to the model. After a series of calculations within the model, the model finally outputs the user's predicted value for each resource to be recommended.

得到预测值后，系统会根据这些预测值从多个待推荐资源中确定出目标对象对应的目标推荐资源。这通常涉及到对预测值的排序、筛选或组合等操作，以确保推荐给用户的资源不仅符合其个人兴趣和偏好，还符合当前场景的需求。例如，系统可以根据预测值的高低对资源进行排序，然后选择排名靠前的资源作为推荐结果。或者，系统可以根据预设的阈值对预测值进行筛选，只选择预测值超过阈值的资源进行推荐。此外，系统还可以根据特定的策略对资源进行组合推荐，以满足用户多样化的需求。After obtaining the predicted values, the system will determine the target recommended resources corresponding to the target object from multiple resources to be recommended based on these predicted values. This usually involves operations such as sorting, filtering, or combining predicted values to ensure that the resources recommended to users not only meet their personal interests and preferences, but also meet the needs of the current scenario. For example, the system can sort resources according to the predicted values, and then select the top-ranked resources as the recommendation results. Alternatively, the system can filter the predicted values according to a preset threshold and only select resources with predicted values exceeding the threshold for recommendation. In addition, the system can also recommend resources in combination according to specific strategies to meet the diverse needs of users.

确定目标推荐资源后，系统会将这些资源以适当的方式推荐给目标对象。推荐的方式可以多样化，如通过电子邮件、短信、应用内通知、个性化页面展示等方式向用户展示推荐资源。同时，系统还可以根据用户的反馈和行为数据对推荐结果进行持续优化，以提高推荐的准确性和效果。After determining the target recommended resources, the system will recommend these resources to the target object in an appropriate manner. The recommendation methods can be diversified, such as displaying recommended resources to users through email, SMS, in-app notifications, personalized page display, etc. At the same time, the system can also continuously optimize the recommendation results based on user feedback and behavior data to improve the accuracy and effectiveness of the recommendation.

在步骤150中，场景嵌入特征发挥了至关重要的作用。它融合了目标对象在特定场景下的行为和偏好信息，使得推荐结果更加符合当前场景的需求。通过利用场景嵌入特征，系统能够更准确地捕捉用户在不同场景下的兴趣变化，从而提供更加个性化的推荐服务。In step 150, the scene embedding feature plays a crucial role. It integrates the behavior and preference information of the target object in a specific scene, making the recommendation result more in line with the needs of the current scene. By using the scene embedding feature, the system can more accurately capture the changes in user interests in different scenes, thereby providing more personalized recommendation services.

步骤150通过结合初始输入特征和场景嵌入特征，实现了对目标对象的个性化资源推荐。这一步骤不仅提高了推荐的准确性和效果，还充分考虑了当前场景的影响，使得推荐结果更加符合用户的实际需求和偏好。在实际应用中，通过不断优化和改进推荐算法和模型，可以进一步提升资源推荐系统的性能和用户体验。Step 150 realizes personalized resource recommendation for the target object by combining the initial input features and the scene embedding features. This step not only improves the accuracy and effect of the recommendation, but also fully considers the influence of the current scene, so that the recommendation results are more in line with the actual needs and preferences of the user. In practical applications, the performance and user experience of the resource recommendation system can be further improved by continuously optimizing and improving the recommendation algorithm and model.

例如，在推荐系统中，当面对多场景的应用时，每个场景可能会有其独特的推荐需求和用户行为模式。下面将结合A推荐频道、B推荐频道、C推荐频道、D推荐频道、E短视频无限流和F短视频内流等场景，对上述的步骤110至步骤150进行举例说明：For example, in a recommendation system, when faced with multi-scenario applications, each scenario may have its own unique recommendation requirements and user behavior patterns. The following will illustrate the above steps 110 to 150 by combining scenarios such as A recommendation channel, B recommendation channel, C recommendation channel, D recommendation channel, E short video unlimited stream and F short video internal stream:

在步骤110中，系统获取与目标对象(用户)和待推荐资源相关的初始输入特征。这些初始输入特征包括用户的对象特征(如年龄、性别、历史行为等)、多个待推荐资源的资源特征(如资源类型、标签、评分等)，以及场景特征(如A推荐频道、B推荐频道等场景标识)。In step 110, the system obtains initial input features related to the target object (user) and the resources to be recommended. These initial input features include the user's object features (such as age, gender, historical behavior, etc.), resource features of multiple resources to be recommended (such as resource type, label, rating, etc.), and scene features (such as scene identification such as A recommended channel, B recommended channel, etc.).

在步骤120中，为了降低计算复杂度和提高模型效率，系统可以通过如图2所示的底层压缩网络对初始输入特征进行压缩，以将高纬度的初始输入特征映射到更低维度的低维输入特征。In step 120, in order to reduce computational complexity and improve model efficiency, the system may compress the initial input features through the underlying compression network as shown in FIG. 2 to map the high-dimensional initial input features to lower-dimensional low-dimensional input features.

在步骤130中，系统根据场景特征中的场景标识(如A、B、C、D、E、F等)，从预先训练好的多个分场景网络中选出与当前场景标识相匹配的目标分场景网络。每个分场景网络都针对特定场景进行了优化，具有独立的场景参数，以捕捉该场景下的用户行为模式和资源特性。In step 130, the system selects a target sub-scenario network that matches the current scene identifier from a plurality of pre-trained sub-scenario networks according to the scene identifier (such as A, B, C, D, E, F, etc.) in the scene feature. Each sub-scenario network is optimized for a specific scene and has independent scene parameters to capture the user behavior patterns and resource characteristics in the scene.

在步骤140中，将低维输入特征输入到选出的目标分场景网络中，通过该网络的多层处理和变换，得到待处理数据的场景嵌入特征。场景嵌入特征包含了在当前场景下，用户和待推荐资源之间的潜在关系和特征表示。In step 140, the low-dimensional input features are input into the selected target sub-scene network, and the scene embedding features of the data to be processed are obtained through multi-layer processing and transformation of the network. The scene embedding features contain the potential relationship and feature representation between the user and the resource to be recommended in the current scene.

在步骤150中，系统结合初始输入特征和场景嵌入特征，利用目标推荐模型进行资源推荐。In step 150, the system combines the initial input features and the scene embedding features and uses the target recommendation model to make resource recommendations.

例如，针对A推荐频道的场景，系统可能更侧重于推荐与用户历史行为和兴趣高度相关的资源。因此，在模型预测时，会更多地考虑用户的对象特征和场景嵌入特征中的长期兴趣部分。For example, for the scenario of channel A recommendation, the system may focus more on recommending resources that are highly relevant to the user's historical behavior and interests. Therefore, when predicting the model, more consideration will be given to the user's object features and the long-term interest part in the scene embedding features.

例如，针对B推荐频道的场景，该频道可能更注重推荐热门或新上线的资源。因此，在模型预测时，会更多地考虑资源特征和场景嵌入特征中的流行度和新鲜度部分。For example, in the scenario of B recommending channel, the channel may focus more on recommending popular or newly launched resources. Therefore, when predicting the model, more consideration will be given to the popularity and freshness of resource features and scene embedding features.

例如，针对C推荐频道的场景，该频道可能是针对特定领域或主题的推荐频道，如科技、娱乐等。在该场景下，系统会根据用户的领域偏好和场景嵌入特征中的领域相关性进行推荐。For example, in the scenario of C recommending a channel, the channel may be a recommendation channel for a specific field or theme, such as technology, entertainment, etc. In this scenario, the system will make recommendations based on the user's field preferences and the field relevance in the scene embedding features.

例如，针对D推荐频道的场景，该频道可能是个性化程度非常高的频道，系统会根据用户的实时行为和场景嵌入特征进行动态推荐。For example, in the scenario of D recommending channel, the channel may be a highly personalized channel, and the system will make dynamic recommendations based on the user's real-time behavior and scenario embedding features.

例如，针对E短视频无限流的场景，在该场景下，系统需要快速且连续地推荐短视频给用户。因此，会使用高效的模型结构，并结合场景嵌入特征中的用户即时兴趣进行推荐。For example, in the scenario of infinite streaming of E short videos, the system needs to recommend short videos to users quickly and continuously. Therefore, an efficient model structure will be used, combined with the user's immediate interests in the scene embedding features for recommendation.

例如，针对F短视频内流的场景，当用户正在观看某个短视频时，系统会根据该视频的内容和用户的观看行为，以及场景嵌入特征中的上下文信息，推荐相关的短视频。For example, for the F short video in-stream scenario, when a user is watching a short video, the system will recommend related short videos based on the content of the video and the user's viewing behavior, as well as the contextual information in the scene embedded features.

在推荐过程中，系统会根据预测任务(如点击率预测、转化率预测等)的结果，对资源进行排序或筛选，最终选择出最符合用户需求和偏好的目标推荐资源进行推荐。同时，系统还会根据用户的反馈和行为数据对推荐结果进行持续优化，以提高推荐的准确性和效果。During the recommendation process, the system will sort or filter resources based on the results of prediction tasks (such as click-through rate prediction, conversion rate prediction, etc.), and ultimately select the target recommended resources that best meet user needs and preferences for recommendation. At the same time, the system will continue to optimize the recommendation results based on user feedback and behavior data to improve the accuracy and effectiveness of the recommendation.

上述所有的技术方案，可以采用任意结合形成本申请的可选实施例，在此不再一一赘述。All of the above technical solutions can be arbitrarily combined to form optional embodiments of the present application, which will not be described in detail here.

为便于更好的实施本申请实施例的数据处理方法，本申请实施例还提供一种数据处理装置。请参阅图3，图3为本申请实施例提供的数据处理装置的结构示意图。其中，该数据处理装置200可以包括：In order to better implement the data processing method of the embodiment of the present application, the embodiment of the present application also provides a data processing device. Please refer to Figure 3, which is a schematic diagram of the structure of the data processing device provided by the embodiment of the present application. The data processing device 200 may include:

获取单元210，用于获取待处理数据的初始输入特征，其中，所述初始输入特征包括目标对象的对象特征、多个待推荐资源的资源特征和场景特征，所述场景特征包括场景标识；An acquisition unit 210 is used to acquire initial input features of the data to be processed, wherein the initial input features include object features of a target object, resource features of a plurality of resources to be recommended, and scene features, wherein the scene features include scene identifiers;

压缩单元220，用于对所述初始输入特征进行特征压缩处理，得到低维输入特征；A compression unit 220, configured to perform feature compression processing on the initial input features to obtain low-dimensional input features;

选择单元230，用于根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，各个所述分场景网络具有独立的场景参数；A selection unit 230, configured to select a target sub-scenario network matching the scene identifier from a plurality of sub-scenario networks according to the scene identifier in the scene feature, each of the sub-scenario networks having independent scene parameters;

处理单元240，用于基于所述目标分场景网络对所述低维输入特征进行特征处理，得到所述待处理数据的场景嵌入特征；A processing unit 240 is used to perform feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed;

推荐单元250，用于基于所述初始输入特征和所述场景嵌入特征，向所述目标对象进行资源推荐。The recommendation unit 250 is used to recommend resources to the target object based on the initial input features and the scene embedding features.

在一些实施例中，所述场景特征包括M个场景标识，M为大于0的自然数；所述选择单元230，可以用于：将所述M个场景标识中的各个场景标识分别与N个分场景网络中的各个分场景网络的预设场景标识进行匹配，得到与所述M个场景标识相匹配的M个目标分场景网络，N大于或等于M。In some embodiments, the scene features include M scene identifiers, where M is a natural number greater than 0; the selection unit 230 can be used to: match each of the M scene identifiers with a preset scene identifier of each sub-scene network in N sub-scene networks, respectively, to obtain M target sub-scene networks that match the M scene identifiers, where N is greater than or equal to M.

在一些实施例中，所述处理单元240，可以用于将所述低维输入特征分别输入所述M个目标分场景网络进行特征处理，得到各个所述目标分场景网络的分场景输出特征；将各个所述目标分场景网络的分场景输出特征进行聚合，得到所述待处理数据的场景嵌入特征。In some embodiments, the processing unit 240 can be used to input the low-dimensional input features into the M target sub-scene networks for feature processing respectively to obtain the sub-scene output features of each of the target sub-scene networks; aggregate the sub-scene output features of each of the target sub-scene networks to obtain the scene embedding features of the data to be processed.

在一些实施例中，所述数据处理装置200还包括：In some embodiments, the data processing device 200 further includes:

创建单元，用于若原有的所述多个分场景网络中不存在与所述场景标识相匹配的目标分场景网络，则创建所述场景标识对应的新分场景网络。A creating unit is used to create a new sub-scenario network corresponding to the scene identifier if there is no target sub-scenario network matching the scene identifier in the original multiple sub-scenario networks.

在一些实施例中，所述创建单元，可以用于：若原有的所述多个分场景网络中不存在与所述场景标识相匹配的目标分场景网络，则检测所述场景标识对应的数据流量；当所述场景标识对应的数据流量达到预设流量阈值时，创建所述场景标识对应的新分场景网络；当所述新分场景网络的创建时间达到预设时间阈值时，将所述新分场景网络新增至原有的所述多个分场景网络中。In some embodiments, the creation unit can be used to: if there is no target sub-scenario network matching the scene identifier in the original multiple sub-scenario networks, then detect the data traffic corresponding to the scene identifier; when the data traffic corresponding to the scene identifier reaches a preset traffic threshold, create a new sub-scenario network corresponding to the scene identifier; when the creation time of the new sub-scenario network reaches a preset time threshold, add the new sub-scenario network to the original multiple sub-scenario networks.

在一些实施例中，所述创建单元在创建所述场景标识对应的新分场景网络时，可以用于：当原有的所述多个分场景网络中存在与所述场景标识的相似度达到相似度阈值的第一预设场景标识时，从所述第一预设场景标识对应的第一分场景网络中迁移部分参数作为所述新分场景网络的初始化场景参数；In some embodiments, when creating a new sub-scenario network corresponding to the scene identifier, the creation unit may be used to: when there is a first preset scene identifier in the original multiple sub-scenario networks whose similarity with the scene identifier reaches a similarity threshold, migrate some parameters from the first sub-scenario network corresponding to the first preset scene identifier as initialization scene parameters of the new sub-scenario network;

根据所述场景标识对应的初始输入特征，对所述新分场景网络的初始化场景参数进行微调，得到所述场景标识对应的场景参数；基于所述场景标识对应的场景参数，生成所述场景标识对应的新分场景网络。According to the initial input features corresponding to the scene identifier, the initialization scene parameters of the new sub-scene network are fine-tuned to obtain the scene parameters corresponding to the scene identifier; based on the scene parameters corresponding to the scene identifier, a new sub-scene network corresponding to the scene identifier is generated.

在一些实施例中，所述创建单元在创建所述场景标识对应的新分场景网络时，可以用于：当原有的所述多个分场景网络中不存在与所述场景标识的相似度达到相似度阈值的第一预设场景标识时，获取所述新分场景网络的随机初始化场景参数；根据所述场景标识对应的初始输入特征，对所述新分场景网络的随机初始化场景参数进行微调，得到所述场景标识对应的场景参数；基于所述场景标识对应的场景参数，生成所述场景标识对应的新分场景网络。In some embodiments, when creating a new sub-scene network corresponding to the scene identifier, the creation unit can be used to: when there is no first preset scene identifier in the original multiple sub-scene networks whose similarity with the scene identifier reaches a similarity threshold, obtain the randomly initialized scene parameters of the new sub-scene network; according to the initial input features corresponding to the scene identifier, fine-tune the randomly initialized scene parameters of the new sub-scene network to obtain the scene parameters corresponding to the scene identifier; based on the scene parameters corresponding to the scene identifier, generate a new sub-scene network corresponding to the scene identifier.

在一些实施例中，所述场景特征包括场景标识与所述场景标识对应的场景级别；所述选择单元230，可以用于：根据所述场景标识与所述场景级别，从多个分场景网络中选出与所述场景标识和所述场景级别相匹配的目标分场景网络。In some embodiments, the scene features include a scene identifier and a scene level corresponding to the scene identifier; the selection unit 230 can be used to: select a target sub-scene network that matches the scene identifier and the scene level from multiple sub-scene networks based on the scene identifier and the scene level.

在一些实施例中，所述推荐单元250，可以用于：基于所述目标对象的对象特征、多个待推荐资源的资源特征、所述场景特征与所述场景嵌入特征执行预测任务，得到所述目标对象针对各个所述待推荐资源的任务预测值，所述预测任务为点击率预测任务与转化率预测任务中的一种；根据所述任务预测值从所述多个待推荐资源中确定出所述目标对象对应的目标推荐资源；向所述目标对象推荐所述目标推荐资源。In some embodiments, the recommendation unit 250 can be used to: perform a prediction task based on the object characteristics of the target object, the resource characteristics of multiple resources to be recommended, the scene characteristics and the scene embedded characteristics to obtain a task prediction value of the target object for each of the resources to be recommended, wherein the prediction task is one of a click-through rate prediction task and a conversion rate prediction task; determine the target recommended resource corresponding to the target object from the multiple resources to be recommended according to the task prediction value; and recommend the target recommended resource to the target object.

在一些实施例中，所述获取单元210，可以用于：获取待处理数据，所述待处理数据包括目标对象的对象关联信息、多个待推荐资源的资源关联信息、以及场景信息；对所述对象关联信息、所述资源关联信息与所述场景信息进行特征提取处理，得到所述目标对象的所述对象特征、各个所述待推荐资源的资源特征和各个所述待推荐资源对应的场景特征。In some embodiments, the acquisition unit 210 can be used to: acquire data to be processed, the data to be processed including object association information of a target object, resource association information of multiple resources to be recommended, and scene information; perform feature extraction processing on the object association information, the resource association information and the scene information to obtain the object features of the target object, the resource features of each of the resources to be recommended and the scene features corresponding to each of the resources to be recommended.

需要说明的是，本申请实施例中的数据处理装置200中各模块的功能可对应参考上述各方法实施例中任意实施例的具体实现方式，这里不再赘述。It should be noted that the functions of each module in the data processing device 200 in the embodiment of the present application can correspond to the specific implementation method of any embodiment in the above-mentioned method embodiments, and will not be repeated here.

上述装置中的各个单元可全部或部分通过软件、硬件及其组合来实现。上述各个单元可以以硬件形式内嵌于或独立于计算机设备中的处理器中，也可以以软件形式存储于计算机设备中的存储器中，以便于处理器调用执行上述各个单元对应的操作。Each unit in the above device can be implemented in whole or in part by software, hardware or a combination thereof. Each unit can be embedded in or independent of a processor in a computer device in the form of hardware, or can be stored in a memory in a computer device in the form of software, so that the processor can call and execute the operations corresponding to each unit.

例如，数据处理装置200可以集成在具备储存器并安装有处理器而具有运算能力的终端或服务器中，或者该数据处理装置200为该终端或服务器。For example, the data processing device 200 may be integrated in a terminal or a server that has a storage device and a processor installed therein and has computing capabilities, or the data processing device 200 may be the terminal or the server.

在一些实施例中，本申请还提供了一种计算机设备，包括存储器和处理器，存储器中存储有计算机程序，该处理器执行计算机程序时实现上述各方法实施例中的步骤。In some embodiments, the present application further provides a computer device, including a memory and a processor, wherein a computer program is stored in the memory, and the processor implements the steps in the above-mentioned method embodiments when executing the computer program.

图4为本申请实施例提供的计算机设备的示意性结构图，如图4所示，计算机设备300可以包括：通信接口301，存储器302，处理器303和通信总线304。通信接口301，存储器302，处理器303通过通信总线304实现相互间的通信。通信接口301用于装置300与外部设备进行数据通信。存储器302可用于存储软件程序以及模块，处理器303通过运行存储在存储器302的软件程序以及模块，例如前述方法实施例中的相应操作的软件程序。FIG4 is a schematic structural diagram of a computer device provided in an embodiment of the present application. As shown in FIG4 , a computer device 300 may include: a communication interface 301, a memory 302, a processor 303, and a communication bus 304. The communication interface 301, the memory 302, and the processor 303 communicate with each other through the communication bus 304. The communication interface 301 is used for the device 300 to communicate data with an external device. The memory 302 can be used to store software programs and modules, and the processor 303 runs the software programs and modules stored in the memory 302, such as the software programs of the corresponding operations in the aforementioned method embodiment.

在一些实施例中，该处理器303可以调用存储在存储器302的软件程序以及模块执行如下操作：In some embodiments, the processor 303 may call software programs and modules stored in the memory 302 to perform the following operations:

获取待处理数据的初始输入特征，其中，所述初始输入特征包括目标对象的对象特征、多个待推荐资源的资源特征和场景特征，所述场景特征包括场景标识；对所述初始输入特征进行特征压缩处理，得到低维输入特征；根据所述场景特征中的所述场景标识，从多个分场景网络中选出与所述场景标识相匹配的目标分场景网络，各个所述分场景网络具有独立的场景参数；基于所述目标分场景网络对所述低维输入特征进行特征处理，得到所述待处理数据的场景嵌入特征；基于所述初始输入特征和所述场景嵌入特征，向所述目标对象进行资源推荐。Acquire initial input features of the data to be processed, wherein the initial input features include object features of a target object, resource features of multiple resources to be recommended, and scene features, wherein the scene features include scene identifiers; perform feature compression processing on the initial input features to obtain low-dimensional input features; based on the scene identifier in the scene features, select a target sub-scene network that matches the scene identifier from multiple sub-scene networks, each of the sub-scene networks having independent scene parameters; perform feature processing on the low-dimensional input features based on the target sub-scene network to obtain scene embedding features of the data to be processed; and recommend resources to the target object based on the initial input features and the scene embedding features.

在一些实施例中，计算机设备300例如可以集成在具备储存器并安装有处理器而具有运算能力的终端或服务器中，或者该计算机设备300为该终端或服务器。该终端可以为智能手机、平板电脑、笔记本电脑、智能电视、智能音箱、穿戴式智能设备、个人计算机等设备。该服务器可以是独立的物理服务器，也可以是多个物理服务器构成的服务器集群或者分布式系统，还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN、以及大数据和人工智能平台等基础云计算服务的云服务器。In some embodiments, the computer device 300 may be integrated in a terminal or server having a storage device and a processor installed therein and having computing capabilities, or the computer device 300 may be the terminal or server. The terminal may be a smart phone, a tablet computer, a laptop computer, a smart TV, a smart speaker, a wearable smart device, a personal computer, or other devices. The server may be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or a cloud server that provides basic cloud computing services such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms.

本申请还提供了一种计算机可读存储介质，用于存储计算机程序。该计算机可读存储介质可应用于计算机设备，并且该计算机程序使得计算机设备执行本申请实施例中的上述各方法中的相应流程，为了简洁，在此不再赘述。The present application also provides a computer-readable storage medium for storing a computer program. The computer-readable storage medium can be applied to a computer device, and the computer program enables the computer device to execute the corresponding processes in the above-mentioned methods in the embodiments of the present application, which will not be described in detail for the sake of brevity.

本申请还提供了一种计算机程序产品，该计算机程序产品包括计算机指令，该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令，处理器执行该计算机指令，使得计算机设备执行本申请实施例中的上述各方法中的相应流程，为了简洁，在此不再赘述。The present application also provides a computer program product, which includes computer instructions, which are stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the corresponding processes in the above-mentioned methods in the embodiments of the present application, which will not be described here for the sake of brevity.

本申请还提供了一种计算机程序，该计算机程序包括计算机指令，计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令，处理器执行该计算机指令，使得计算机设备执行本申请实施例中的上述各方法中的相应流程，为了简洁，在此不再赘述。The present application also provides a computer program, which includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device executes the corresponding processes in the above-mentioned methods in the embodiments of the present application, which will not be described here for the sake of brevity.

应理解，本申请实施例的处理器可能是一种集成电路芯片，具有信号的处理能力。在实现过程中，上述方法实施例的各步骤可以通过处理器中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器可以是通用处理器、数字信号处理器(Digital SignalProcessor，DSP)、专用集成电路(Application Specific Integrated Circuit，ASIC)、现成可编程门阵列(Field Programmable Gate Array，FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成，或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器，闪存、只读存储器，可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器，处理器读取存储器中的信息，结合其硬件完成上述方法的步骤。It should be understood that the processor of the embodiment of the present application may be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above method embodiment can be completed by the hardware integrated logic circuit or software instructions in the processor. The above processor can be a general processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field programmable gate array (Field Programmable Gate Array, FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components. The methods, steps and logic block diagrams disclosed in the embodiments of the present application can be implemented or executed. The general processor can be a microprocessor or the processor can also be any conventional processor. The steps of the method disclosed in the embodiment of the present application can be directly embodied as a hardware decoding processor to perform, or the hardware and software modules in the decoding processor are combined and performed. The software module can be located in a random access memory, a flash memory, a read-only memory, a programmable read-only memory or an electrically erasable programmable memory, a register, and other mature storage media in the art. The storage medium is located in the memory, and the processor reads the information in the memory and completes the steps of the above method in combination with its hardware.

可以理解，本申请实施例中的存储器可以是易失性存储器或非易失性存储器，或可包括易失性和非易失性存储器两者。其中，非易失性存储器可以是只读存储器(Read-Only Memory，ROM)、可编程只读存储器(Programmable ROM，PROM)、可擦除可编程只读存储器(Erasable PROM，EPROM)、电可擦除可编程只读存储器(Electrically EPROM，EEPROM)或闪存。易失性存储器可以是随机存取存储器(Random Access Memory，RAM)，其用作外部高速缓存。通过示例性但不是限制性说明，许多形式的RAM可用，例如静态随机存取存储器(Static RAM，SRAM)、动态随机存取存储器(Dynamic RAM，DRAM)、同步动态随机存取存储器(Synchronous DRAM，SDRAM)、双倍数据速率同步动态随机存取存储器(Double Data RateSDRAM，DDR SDRAM)、增强型同步动态随机存取存储器(Enhanced SDRAM，ESDRAM)、同步连接动态随机存取存储器(Synchlink DRAM，SLDRAM)和直接内存总线随机存取存储器(DirectRambus RAM，DR RAM)。应注意，本文描述的系统和方法的存储器旨在包括但不限于这些和任意其它适合类型的存储器。It can be understood that the memory in the embodiments of the present application can be a volatile memory or a non-volatile memory, or can include both volatile and non-volatile memories. Among them, the non-volatile memory can be a read-only memory (ROM), a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), or a flash memory. The volatile memory can be a random access memory (RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronous dynamic random access memory (DDR SDRAM), enhanced synchronous dynamic random access memory (ESDRAM), synchronous link dynamic random access memory (SLDRAM), and direct RAM bus random access memory (DR RAM). It should be noted that the memory of the systems and methods described herein is intended to include, but is not limited to, these and any other suitable types of memory.

本领域普通技术人员可以意识到，结合本文中所公开的实施例描述的各示例的单元及算法步骤，能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行，取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能，但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art will appreciate that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Professional and technical personnel can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of this application.

所属领域的技术人员可以清楚地了解到，为描述的方便和简洁，上述描述的系统、装置和单元的具体工作过程，可以参考前述方法实施例中的对应过程，在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working processes of the systems, devices and units described above can refer to the corresponding processes in the aforementioned method embodiments and will not be repeated here.

在本申请所提供的几个实施例中，应该理解到，所揭露的系统、装置和方法，可以通过其它的方式实现。例如，以上所描述的装置实施例仅仅是示意性的，例如，所述单元的划分，仅仅为一种逻辑功能划分，实际实现时可以有另外的划分方式，例如多个单元或组件可以结合或者可以集成到另一个系统，或一些特征可以忽略，或不执行。另一点，所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口，装置或单元的间接耦合或通信连接，可以是电性，机械或其它的形式。In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the device embodiments described above are only schematic. For example, the division of the units is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of devices or units, which can be electrical, mechanical or other forms.

本申请实施例中，术语“模块”或“单元”是指有预定功能的计算机程序或计算机程序的一部分，并与其他相关部分一起工作以实现预定目标，并且可以通过使用软件、硬件(如处理电路或存储器)或其组合来全部或部分实现。同样的，一个处理器(或多个处理器或存储器)可以用来实现一个或多个模块或单元。此外，每个模块或单元都可以是包含该模块或单元功能的整体模块或单元的一部分。In the embodiments of the present application, the term "module" or "unit" refers to a computer program or a part of a computer program with a predetermined function, and works together with other related parts to achieve a predetermined goal, and can be implemented in whole or in part by using software, hardware (such as processing circuits or memories) or a combination thereof. Similarly, a processor (or multiple processors or memories) can be used to implement one or more modules or units. In addition, each module or unit can be part of an overall module or unit that includes the function of the module or unit.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的，作为单元显示的部件可以是或者也可以不是物理单元，即可以位于一个地方，或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

另外，在本申请实施例中的各功能单元可以集成在一个处理单元中，也可以是各个单元单独物理存在，也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in the embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.

所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。If the functions are implemented in the form of software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application can be essentially or partly embodied in the form of a software product that contributes to the prior art. The computer software product is stored in a storage medium and includes several instructions for a computer device (which can be a personal computer or a server) to perform all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage media include various media that can store program codes, such as USB flash drives, mobile hard drives, ROM, RAM, magnetic disks, or optical disks.

以上所述，仅为本申请的具体实施方式，但本申请的保护范围并不局限于此，任何熟悉本技术领域的技术人员在本申请揭露的技术范围内，可轻易想到变化或替换，都应涵盖在本申请的保护范围之内。因此，本申请的保护范围应所述以权利要求的保护范围为准。The above is only a specific implementation of the present application, but the protection scope of the present application is not limited thereto. Any technician familiar with the technical field can easily think of changes or substitutions within the technical scope disclosed in the present application, which should be included in the protection scope of the present application. Therefore, the protection scope of the present application should be based on the protection scope of the claims.

Claims

1. A data processing method, characterized in that the method comprises:

Acquire initial input features of the data to be processed, wherein the initial input features include object features of a target object, resource features of a plurality of resources to be recommended, and scene features, wherein the scene features include a scene identifier;

Performing feature compression processing on the initial input features to obtain low-dimensional input features;

According to the scene identifier in the scene feature, a target sub-scene network matching the scene identifier is selected from a plurality of sub-scene networks, each of the sub-scene networks having independent scene parameters;

Performing feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed;

Based on the initial input features and the scene embedding features, resources are recommended to the target object.

2. The data processing method according to claim 1, wherein the scene feature comprises M scene identifiers, where M is a natural number greater than 0;

The step of selecting a target sub-scenario network matching the scene identifier from a plurality of sub-scenario networks according to the scene identifier in the scene feature includes:

Each scene identifier in the M scene identifiers is matched with the preset scene identifier of each sub-scene network in the N sub-scene networks to obtain M target sub-scene networks matching the M scene identifiers, where N is greater than or equal to M.

3. The data processing method according to claim 2, characterized in that the step of performing feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed comprises:

Inputting the low-dimensional input features into the M target sub-scene networks respectively for feature processing to obtain sub-scene output features of each of the target sub-scene networks;

Aggregate the sub-scene output features of each of the target sub-scene networks to obtain the scene embedding features of the data to be processed.

4. The data processing method according to claim 1, characterized in that the method further comprises:

If there is no target sub-scenario network matching the scene identifier in the original multiple sub-scenario networks, a new sub-scenario network corresponding to the scene identifier is created.

5. The data processing method according to claim 4, characterized in that if there is no target sub-scenario network matching the scene identifier in the original multiple sub-scenario networks, creating a new sub-scenario network corresponding to the scene identifier comprises:

If there is no target sub-scenario network matching the scenario identifier among the original multiple sub-scenario networks, detecting the data flow corresponding to the scenario identifier;

When the data traffic corresponding to the scenario identifier reaches a preset traffic threshold, a new scenario network corresponding to the scenario identifier is created;

When the creation time of the new sub-scenario network reaches a preset time threshold, the new sub-scenario network is added to the original multiple sub-scenario networks.

6. The data processing method according to claim 5, wherein the step of creating a new sub-scenario network corresponding to the scenario identifier comprises:

When there is a first preset scene identifier in the original multiple scene identifiers whose similarity with the scene identifier reaches a similarity threshold, migrating some parameters from the first scene identifier corresponding to the first preset scene identifier as initialization scene parameters of the new scene identifier;

According to the initial input features corresponding to the scene identifier, fine-tuning the initialization scene parameters of the new scene classification network to obtain the scene parameters corresponding to the scene identifier;

Based on the scene parameters corresponding to the scene identifier, a new sub-scene network corresponding to the scene identifier is generated.

7. The data processing method according to claim 5, wherein the step of creating a new sub-scenario network corresponding to the scenario identifier comprises:

When there is no first preset scene identifier whose similarity with the scene identifier reaches a similarity threshold in the original multiple sub-scene networks, obtaining a randomly initialized scene parameter of the new sub-scene network;

According to the initial input features corresponding to the scene identifier, fine-tuning the randomly initialized scene parameters of the new scene segmentation network to obtain the scene parameters corresponding to the scene identifier;

8. The data processing method according to claim 1, wherein the scene feature comprises a scene identifier and a scene level corresponding to the scene identifier;

According to the scene identifier and the scene level, a target sub-scene network matching the scene identifier and the scene level is selected from a plurality of sub-scene networks.

9. The data processing method according to any one of claims 1 to 8, characterized in that the recommending resources to the target object based on the initial input features and the scene embedding features comprises:

Performing a prediction task based on the object feature of the target object, the resource features of multiple resources to be recommended, the scene feature and the scene embedding feature to obtain a task prediction value of the target object for each resource to be recommended, wherein the prediction task is one of a click rate prediction task and a conversion rate prediction task;

Determining a target recommended resource corresponding to the target object from the multiple resources to be recommended according to the task prediction value;

The target recommendation resource is recommended to the target object.

10. The data processing method according to any one of claims 1 to 8, wherein the step of obtaining initial input features of the data to be processed comprises:

Acquire data to be processed, wherein the data to be processed includes object association information of a target object, resource association information of a plurality of resources to be recommended, and scene information;

Feature extraction processing is performed on the object association information, the resource association information and the scene information to obtain the object feature of the target object, the resource feature of each of the resources to be recommended and the scene feature corresponding to each of the resources to be recommended.

11. A data processing device, characterized in that the device comprises:

An acquisition unit, configured to acquire initial input features of the data to be processed, wherein the initial input features include object features of a target object, resource features of a plurality of resources to be recommended, and scene features, wherein the scene features include scene identifiers;

A compression unit, used for performing feature compression processing on the initial input features to obtain low-dimensional input features;

A selection unit, configured to select, according to the scene identifier in the scene feature, a target sub-scene network matching the scene identifier from a plurality of sub-scene networks, each of the sub-scene networks having independent scene parameters;

A processing unit, configured to perform feature processing on the low-dimensional input features based on the target scene-based network to obtain scene embedding features of the data to be processed;

A recommendation unit is used to recommend resources to the target object based on the initial input features and the scene embedding features.

12. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program, and the computer program is suitable for being loaded by a processor to execute the data processing method according to any one of claims 1 to 10.

13. A computer device, characterized in that the computer device comprises a processor and a memory, the memory stores a computer program, and the processor executes the data processing method according to any one of claims 1 to 10 by calling the computer program stored in the memory.

14. A computer program product, comprising computer instructions, characterized in that when the computer instructions are executed by a processor, the data processing method according to any one of claims 1 to 10 is implemented.