CN115002513B

CN115002513B - Audio and video scheduling method and device, electronic equipment and computer readable storage medium

Info

Publication number: CN115002513B
Application number: CN202210580614.6A
Authority: CN
Inventors: 饶明佺; 王林鑫; 李琳; 顾明; 刘倍余
Original assignee: China Mobile Communications Group Co Ltd; MIGU Culture Technology Co Ltd
Current assignee: China Mobile Communications Group Co Ltd; MIGU Culture Technology Co Ltd
Priority date: 2022-05-25
Filing date: 2022-05-25
Publication date: 2023-10-20
Anticipated expiration: 2042-05-25
Also published as: CN115002513A

Abstract

The application provides an audio and video scheduling method, an audio and video scheduling device, electronic equipment and a computer readable storage medium, and relates to the technical field of audio and video. The method comprises the following steps: acquiring a first user experience index value corresponding to a target moment; acquiring a first link portrait corresponding to a first preset time period before a target moment under the condition that the first user experience index value is lower than a first threshold value; inputting the first link portrait into a user experience model to obtain a target user experience index value corresponding to a second time period; and adjusting the link and/or the code rate of the audio/video playing corresponding to the second time period according to the target user experience index value and the user experience model. According to the scheme provided by the embodiment of the application, the link and/or the code rate of the audio/video playing can be dynamically scheduled according to the user experience index value of the audio playing, the influence of the public network in a weak network environment on the audio/video playing is avoided, and the efficient operation of the network link is ensured.

Description

Audio and video scheduling methods, devices, electronic equipment and computer-readable storage media

技术领域Technical field

本发明涉及音视频技术领域，特别是指一种音视频调度方法、装置、电子设备及计算机可读存储介质。The present invention relates to the field of audio and video technology, and in particular, to an audio and video scheduling method, device, electronic equipment and computer-readable storage medium.

背景技术Background technique

随着覆盖全球的零接触、远程办公的理念逐渐流行，如何在公网环境为客户提供高可用、高品质、低延时的实时音视频服务，保证网络链路高效运行也越来越重要。目前，为了保障实时音视频在公网环境下的低延时和高质量，在检测到网络处于弱网场景时，通过关键帧重传，码率控制，音视频前向纠错(Forward Error Correction，FEC)的手段重试或降低带宽来保障音视频通信质量，降低弱网场景对音视频播放的影响。但是，若网络一直处于弱网场景，无下限的重传或降低带宽会导致重传风暴或者码率过低，从而影响用户体验。As the concept of zero-contact and remote working becomes more and more popular across the world, how to provide customers with highly available, high-quality, low-latency real-time audio and video services in a public network environment and ensure the efficient operation of network links is becoming increasingly important. Currently, in order to ensure low latency and high quality of real-time audio and video in a public network environment, when it is detected that the network is in a weak network scenario, key frame retransmission, bit rate control, and audio and video forward error correction (Forward Error Correction) are used. , FEC) means to retry or reduce the bandwidth to ensure the quality of audio and video communication and reduce the impact of weak network scenarios on audio and video playback. However, if the network is always in a weak network scenario, unrestricted retransmission or bandwidth reduction will lead to retransmission storms or low bit rates, thus affecting user experience.

发明内容Contents of the invention

本发明的目的是提供一种音视频调度方法、装置、电子设备及计算机可读存储介质，用以解决现有技术中若网络一直处于弱网场景，无下限的重传或降低带宽会导致重传风暴或者码率过低的问题。The purpose of the present invention is to provide an audio and video scheduling method, device, electronic equipment and computer-readable storage medium to solve the problem in the existing technology that if the network is always in a weak network scenario, unrestricted retransmission or reduced bandwidth will cause retransmission. Problems with transmission storm or low code rate.

第一方面，本发明的实施例提供了一种音视频调度方法，包括：In a first aspect, embodiments of the present invention provide an audio and video scheduling method, including:

获取与目标时刻对应的第一用户体验指标值；Obtain the first user experience indicator value corresponding to the target moment;

在所述第一用户体验指标值低于第一阈值的情况下，获取目标时刻之前第一预设时间段对应的第一链路画像；所述链路画像用于指示音视频播放链路的相关信息；When the first user experience index value is lower than the first threshold, obtain the first link portrait corresponding to the first preset time period before the target time; the link portrait is used to indicate the audio and video playback link Related Information;

将所述第一链路画像输入用户体验模型，得到第二时间段对应的目标用户体验指标值；所述第二时间段包括目标时刻之后的第二预设时间段；所述用户体验模型用于表征在目标音视频播放码率下用户视频播放体验指标值与链路画像之间的对应关系；Input the first link portrait into the user experience model to obtain the target user experience index value corresponding to the second time period; the second time period includes the second preset time period after the target time; the user experience model uses It is used to represent the correspondence between the user video playback experience index value and the link portrait under the target audio and video playback bit rate;

根据所述目标用户体验指标值以及所述用户体验模型，调整所述第二时间段对应的音视频播放的链路和/或码率。According to the target user experience indicator value and the user experience model, adjust the link and/or code rate of the audio and video playback corresponding to the second time period.

可选地，所述用户体验指标包括以下至少一项：视频分辨率、视频帧率、视频卡顿率、音频卡顿率、秒开率以及丢包率。Optionally, the user experience indicator includes at least one of the following: video resolution, video frame rate, video freezing rate, audio freezing rate, frame rate per second, and packet loss rate.

可选地，所述获取与目标时刻对应的第一用户体验指标值，包括：Optionally, the obtaining the first user experience indicator value corresponding to the target moment includes:

获取目标时刻对应的第一链路画像，并根据所述第一链路画像确定链路画像特征值；Obtain the first link portrait corresponding to the target time, and determine the link portrait feature value based on the first link portrait;

根据所述链路画像特征值，确定所述用户体验指标值。The user experience index value is determined based on the link portrait characteristic value.

可选地，所述获取目标时刻所述对应的第一链路画像，包括：Optionally, the corresponding first link image at the target moment of acquisition includes:

接收用户在所述目标时刻上报的音视频链路数据；Receive the audio and video link data reported by the user at the target time;

对所述音视频链路数据进行解析处理，生成所述第一链路画像。Analyze and process the audio and video link data to generate the first link portrait.

可选地，所述音视频链路数据包括以下至少一项：时间戳、边缘服务器、终端机型、操作系统、网络类型、丢包率、视频分辨率，帧率、发送码率、接收码率；Optionally, the audio and video link data includes at least one of the following: timestamp, edge server, terminal model, operating system, network type, packet loss rate, video resolution, frame rate, sending code rate, receiving code Rate;

所述第一链路画像包括以下至少一项：时间属性、终端属性、网络属性、地域属性、边缘服务器属性、体验属性；The first link portrait includes at least one of the following: time attributes, terminal attributes, network attributes, regional attributes, edge server attributes, and experience attributes;

所述链路画像特征值包括以下至少一项：时间要素、终端要素、网络要素、地域要素、边缘服务器要素、体验要素。The link portrait feature value includes at least one of the following: time element, terminal element, network element, regional element, edge server element, and experience element.

可选地，所述用户体验模型为通过对第一模型进行模型训练得到的模型：Optionally, the user experience model is a model obtained by performing model training on the first model:

所述第一模型为：The first model is:

其中，f(V|θ)表示用户体验指标值；V为链路画像特征值；Mi为所述链路画像特征值对应的第i个链路画像体验指标值，δi是第i个用户体验指标的权重，θ为音视频播放码率；n为大于或者等于1的整数；Among them, f(V|θ) represents the user experience index value; V is the link portrait characteristic value; Mi is the i-th link portrait experience index value corresponding to the link portrait characteristic value, and δi is the i-th user experience The weight of the indicator, θ is the audio and video playback bit rate; n is an integer greater than or equal to 1;

所述链路画像体验指标值包括以下至少一项：视频分辨率值、视频帧率值、视频卡顿率倒数值、音频卡顿率倒数值、秒开率值。The link portrait experience index value includes at least one of the following: video resolution value, video frame rate value, video freezing rate reciprocal value, audio freezing rate reciprocal value, and second opening rate value.

可选地，对第一模型进行模型训练得到所述用户体验模型，包括：Optionally, perform model training on the first model to obtain the user experience model, including:

获取历史链路画像集；所述历史链路画像集包括当前时刻之前的链路画像；Obtain a historical link portrait set; the historical link portrait set includes link portraits before the current moment;

根据所述历史链路画像集，通过逻辑斯特回归算法和梯度下降方法对所述第一模型进行模型训练，得到每个用户体验指标的权重；According to the historical link portrait set, perform model training on the first model through logistic regression algorithm and gradient descent method to obtain the weight of each user experience indicator;

根据所述第一模型以及每个所述用户体验指标的权重，确定所述体验模型。The experience model is determined according to the first model and the weight of each user experience indicator.

可选地，所述根据所述链路画像特征值，确定所述用户体验指标值，包括：Optionally, determining the user experience index value based on the link portrait characteristic value includes:

根据所述链路画像特征值，通过所述用户体验模型计算所述用户体验指标值。According to the link portrait characteristic value, the user experience index value is calculated through the user experience model.

可选地，所述根据所述目标用户体验指标值以及所述用户体验模型，调整所述第二时间段对应的音视频播放的链路和/或码率，包括：Optionally, adjusting the link and/or code rate of audio and video playback corresponding to the second time period according to the target user experience indicator value and the user experience model includes:

在所述目标用户体验指标值小于第二阈值的情况下，调整用户体验模型对应的链路和/或码率，直至所述用户体验模型对应的目标用户体验指标值大于或者等于所述第二阈值；When the target user experience index value is less than the second threshold, adjust the link and/or code rate corresponding to the user experience model until the target user experience index value corresponding to the user experience model is greater than or equal to the second threshold. threshold;

将目标用户体验指标值大于或者等于第二阈值时，所述用户体验模型对应的链路和码率，作为调整后所述第二时间段对应的音视频播放的链路和码率。When the target user experience indicator value is greater than or equal to the second threshold, the link and code rate corresponding to the user experience model are used as the adjusted link and code rate for audio and video playback corresponding to the second time period.

第二方面，本发明的实施例提供了一种音视频调度装置，包括：In the second aspect, embodiments of the present invention provide an audio and video scheduling device, including:

第一获取模块，用于获取与目标时刻对应的第一用户体验指标值；The first acquisition module is used to acquire the first user experience indicator value corresponding to the target moment;

第二获取模块，用于在所述第一用户体验指标值低于第一阈值的情况下，获取目标时刻之前第一预设时间段对应的第一链路画像；所述链路画像用于指示音视频播放链路的相关信息；The second acquisition module is used to acquire the first link portrait corresponding to the first preset time period before the target time when the first user experience index value is lower than the first threshold; the link portrait is used to Indicates relevant information of audio and video playback links;

处理模块，用于将所述第一链路画像输入用户体验模型，得到第二时间段对应的目标用户体验指标值；所述第二时间段包括目标时刻之后的第二预设时间段；所述用户体验模型用于表征在目标音视频播放码率下用户视频播放体验指标值与链路画像之间的对应关系；A processing module configured to input the first link portrait into the user experience model to obtain the target user experience index value corresponding to the second time period; the second time period includes a second preset time period after the target moment; The user experience model described above is used to characterize the correspondence between user video playback experience index values and link portraits under the target audio and video playback bit rate;

调整模块，用于根据所述目标用户体验指标值以及所述用户体验模型，调整所述第二时间段对应的音视频播放的链路和/或码率。An adjustment module, configured to adjust the link and/or code rate of the audio and video playback corresponding to the second time period according to the target user experience indicator value and the user experience model.

第三方面，本发明的实施例提供了一种电子设备，包括：收发器、处理器、存储器及存储在所述存储器上并可在所述处理器上运行的程序或指令；所述处理器执行所述程序或指令时实现如上所述的音视频调度方法。In a third aspect, embodiments of the present invention provide an electronic device, including: a transceiver, a processor, a memory, and a program or instructions stored on the memory and executable on the processor; the processor When the program or instruction is executed, the audio and video scheduling method as described above is implemented.

第四方面，本发明的实施例提供了一种计算机可读存储介质，其上存储有程序或指令，所述程序或指令被处理器执行时实现如上所述的音视频调度方法中的步骤。In a fourth aspect, embodiments of the present invention provide a computer-readable storage medium on which a program or instructions are stored. When the program or instructions are executed by a processor, the steps in the audio and video scheduling method as described above are implemented.

本发明实施例的音视频调度方法，通过获取与目标时刻对应的第一用户体验指标值，能够监控音视频播放的用户体验，在检测到所述第一用户体验指标值低于第一阈值的情况下，根据所述第一链路画像，并通过用户体验模型，对所述目标用户体验指标值进行预测；根据预测的目标用户体验指标值对第二时间段对应的音视频播放的链路和/或码率进行调整。本发明的方案，能够根据音频播放的用户体验指标值动态调度音视频播放的链路和/或码率，避免因公网处于弱网络环境为音视频播放的影响，保证了网络链路的高效运行，解决了现有技术中若网络一直处于弱网场景，无限的重传导致重传风暴或无限降低带宽导致码率过低的问题。The audio and video scheduling method of the embodiment of the present invention can monitor the user experience of audio and video playback by obtaining the first user experience index value corresponding to the target time. When it is detected that the first user experience index value is lower than the first threshold, In this case, the target user experience index value is predicted based on the first link portrait and the user experience model; and the link corresponding to the audio and video playback in the second time period is predicted based on the predicted target user experience index value. and/or bitrate adjustment. The solution of the present invention can dynamically schedule the link and/or code rate of audio and video playback according to the user experience index value of audio playback, avoid the impact of audio and video playback due to the weak network environment of the public network, and ensure the high efficiency of the network link. Operation solves the problem in the existing technology that if the network is always in a weak network scenario, infinite retransmissions lead to retransmission storms or infinite bandwidth reduction leads to too low a code rate.

附图说明Description of the drawings

图1为本发明实施例的音视频调度方法的流程示意图；Figure 1 is a schematic flow chart of an audio and video scheduling method according to an embodiment of the present invention;

图2为本发明实施例的第一链路画像要素的示意图；Figure 2 is a schematic diagram of the first link portrait elements according to the embodiment of the present invention;

图3为本发明实施例的音视频调度装置的模块示意图；Figure 3 is a schematic module diagram of an audio and video scheduling device according to an embodiment of the present invention;

图4为本发明实施例的音视频调度装置的实体装置的示意图；Figure 4 is a schematic diagram of the physical device of the audio and video scheduling device according to the embodiment of the present invention;

图5为本发明实施例的电子设备的实施结构示意图。FIG. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明要解决的技术问题、技术方案和优点更加清楚，下面将结合附图及具体实施例进行详细描述。在下面描述中，提供诸如具体的配置和组件的特定细节仅仅是为了帮助全面理解本发明的实施例。因此本领域技术人员应该清楚，可以对这里描述的实施例进行各种改变和修改为不脱离本发明的范围和精神。另外，为了清除和简洁，省略了对抑制功能和构造的描述。In order to make the technical problems, technical solutions and advantages to be solved by the present invention clearer, a detailed description will be given below with reference to the accompanying drawings and specific embodiments. In the following description, specific details, such as specific configurations and components, are provided solely to assist in a comprehensive understanding of embodiments of the invention. It will therefore be apparent to those skilled in the art that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the invention. Additionally, a description of inhibitory function and construction has been omitted for clarity and brevity.

应理解，说明书通篇中提到的“一个实施例”或“一实施例”意味着与实施例有关的特定特征、结构或特性包括在本发明的至少一个实施例中。因此，在整个说明书各处出现的“在一个实施例中”或“在一实施例中”未必一定指相同的实施例。此外，这些特定的特征、结构或特性可以任意适合的方式结合在一个或多个实施例中。It will be understood that reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic associated with the embodiment is included in at least one embodiment of the invention. Thus, the appearances of "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily referring to the same embodiment. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner in one or more embodiments.

在本发明的各种实施例中，应理解，下述各过程的序号的大小并不意味着执行顺序的先后，各过程的执行顺序应以其功能和内在逻辑确定，而不应对本发明实施例的实施过程构成任何限定。In various embodiments of the present invention, it should be understood that the size of the sequence numbers of the following processes does not mean the order of execution. The execution order of each process should be determined by its functions and internal logic, and should not be implemented in the present invention. The implementation of the examples does not constitute any limitations.

另外，本文中术语“系统”和“网络”在本文中常可互换使用。Additionally, the terms "system" and "network" are often used interchangeably in this article.

在本申请所提供的实施例中，应理解，“与A相应的B”表示B与A相关联，根据A可以确定B。但还应理解，根据A确定B并不意味着仅仅根据A确定B，还可以根据A和/或其它信息确定B。In the embodiments provided in this application, it should be understood that "B corresponding to A" means that B is associated with A, and B can be determined based on A. However, it should also be understood that determining B based on A does not mean determining B only based on A. B can also be determined based on A and/or other information.

如图1所示，本发明实施例的一种音视频调度方法，包括如下步骤：As shown in Figure 1, an audio and video scheduling method according to the embodiment of the present invention includes the following steps:

步骤101，获取与目标时刻对应的第一用户体验指标值。Step 101: Obtain the first user experience index value corresponding to the target time.

本发明实施例的音视频调度方法，通过获取与目标时刻对应的第一用户体验指标值，能够实时监控音频播放的用户体验指标值。The audio and video scheduling method according to the embodiment of the present invention can monitor the user experience index value of audio playback in real time by obtaining the first user experience index value corresponding to the target time.

步骤102，在所述第一用户体验指标值低于第一阈值的情况下，获取目标时刻之前第一预设时间段对应的第一链路画像；所述链路画像用于指示音视频播放链路的相关信息。Step 102: When the first user experience index value is lower than the first threshold, obtain the first link portrait corresponding to the first preset time period before the target time; the link portrait is used to instruct audio and video playback Information about the link.

需要说明的是，所述第一用户体验指标值低于第一阈值即公网处于弱网络环境，根据所述第一用户体验指标值判断公网的环境。It should be noted that if the first user experience index value is lower than the first threshold, that is, the public network is in a weak network environment, and the environment of the public network is determined based on the first user experience index value.

本发明一实施例中，链路画像可以包括：时间属性、终端属性、网络属性、地域属性、服务器属性、体验属性。通过链路画像表征音视频播放链路的相关信息，能够极大降低链路数据的量级，提高计算所述目标用户体验指标值的速度。In an embodiment of the present invention, the link portrait may include: time attributes, terminal attributes, network attributes, regional attributes, server attributes, and experience attributes. Representing the relevant information of audio and video playback links through link portraits can greatly reduce the magnitude of link data and increase the speed of calculating the target user experience index value.

步骤103，将所述第一链路画像输入用户体验模型，得到第二时间段对应的目标用户体验指标值；所述第二时间段包括目标时刻之后的第二预设时间段；所述用户体验模型用于表征在目标音视频播放码率下用户视频播放体验指标值与链路画像之间的对应关系。Step 103: Enter the first link portrait into the user experience model to obtain the target user experience index value corresponding to the second time period; the second time period includes the second preset time period after the target time; the user The experience model is used to characterize the correspondence between user video playback experience indicator values and link portraits under the target audio and video playback bit rate.

需要说明的是，所述用户体验模型是经过学习训练后的模型，根据所述第一链路画像，通过所述用户体验模型对所述第二时间段对应的目标用户体验指标值进行预测。It should be noted that the user experience model is a model after learning and training. According to the first link portrait, the target user experience index value corresponding to the second time period is predicted through the user experience model.

步骤104，根据所述目标用户体验指标值以及所述用户体验模型，调整所述第二时间段对应的音视频播放的链路和/或码率。Step 104: Adjust the link and/or code rate of the audio and video playback corresponding to the second time period according to the target user experience indicator value and the user experience model.

本发明一实施例中，链路调整主要是指调整边缘服务器，使得在所述第二时间段用户的音视频体验最佳。In one embodiment of the present invention, link adjustment mainly refers to adjusting the edge server so that the user's audio and video experience is optimal during the second time period.

本发明的方案，能够在检测到音视频播放的用户体验指标值低于所述第一阈值时触发对链路和/或码率的动态调整。避免因公网处于弱网络环境为音视频播放的影响，保证了网络链路的高效运行，解决了现有技术中若网络一直处于弱网场景，无限的重传导致重传风暴或无限降低带宽导致码率过低的问题。The solution of the present invention can trigger dynamic adjustment of the link and/or code rate when it is detected that the user experience index value of audio and video playback is lower than the first threshold. It avoids the impact of audio and video playback caused by the weak network environment of the public network, ensures the efficient operation of network links, and solves the problem in the existing technology that if the network is always in a weak network scenario, infinite retransmissions lead to retransmission storms or infinite bandwidth reduction. Causes the problem of too low code rate.

本发明一实施例中，所述视频分辨率对应的所述第一阈值可以为800*480或者1280*720；所述视频帧率对应的所述第一阈值可以为30fps；所述视频卡顿率对应的所述第一阈值可以为6％；所述音频卡顿率对应的所述第一阈值可以为2％。In an embodiment of the present invention, the first threshold corresponding to the video resolution may be 800*480 or 1280*720; the first threshold corresponding to the video frame rate may be 30fps; the video freezes The first threshold corresponding to the audio freezing rate may be 6%; the first threshold corresponding to the audio freezing rate may be 2%.

本发明的实施例，通过所述用户体验指标值判断用户公网是否处于弱网状态。In an embodiment of the present invention, the user experience index value is used to determine whether the user's public network is in a weak network state.

本发明一实施例中，所述根据所述第一链路画像确定链路画像特征值，包括：In an embodiment of the present invention, determining the link portrait characteristic value based on the first link portrait includes:

通过哈希函数对所述第一链路画像进行处理，得到所述链路画像特征值。The first link portrait is processed through a hash function to obtain the link portrait feature value.

通过链路画像的数据格式，能够有效降低链路数据的量级，从而降低确定所述用户体验指标值的计算量。Through the data format of the link portrait, the magnitude of the link data can be effectively reduced, thereby reducing the amount of calculation required to determine the user experience index value.

链路画像是影响链路通信质量的重要因素。本发明一实施例中，用户每隔5秒钟上报音视频链路数据，将接收到的所述音视频链路数据进行保存，得到音视频链路数据集；对所述音视频链路数据集进行解析处理，生成链路画像。Link portrait is an important factor affecting the quality of link communication. In one embodiment of the present invention, the user reports audio and video link data every 5 seconds, and saves the received audio and video link data to obtain an audio and video link data set; The set is parsed and processed to generate a link portrait.

如图2所示，所述第一链路画像包括以下至少一项：时间属性、终端属性、网络属性、地域属性、边缘服务器属性、体验属性；As shown in Figure 2, the first link portrait includes at least one of the following: time attributes, terminal attributes, network attributes, regional attributes, edge server attributes, and experience attributes;

本发明实施例的方案，通过对终端机型、音视频服务质量指标、服务端要素等建立链路画像，能够更全面描述和度量链路。避免了对公网链路通讯质量的调整仅依据服务端，而未考虑时间段、终端(机型)、服务端负载、码率、丢包率、分辨率等质量因素。The solution of the embodiment of the present invention can more comprehensively describe and measure the link by establishing a link portrait based on the terminal model, audio and video service quality indicators, server elements, etc. It avoids adjusting the communication quality of the public network link only based on the server, without considering quality factors such as time period, terminal (model), server load, bit rate, packet loss rate, resolution, etc.

所述第一模型为：The first model is:

本发明一实施例中，链路画像特征值V在码率θ条件下，所述用户体验指标值通过所述第一模型计算。In an embodiment of the present invention, the link portrait characteristic value V is under the condition of code rate θ, and the user experience index value is calculated by the first model.

所述历史链路画像集通过历史时刻接收到的音视频链路数据进行解析处理并保存后确定。The historical link portrait set is determined by parsing and storing the audio and video link data received at historical moments.

通过所述历史链路画像集对所述第一模型进行学习训练，确定每个用户体验指标的权重，进而得到所述用户体验模型，使得用户体验模型符合机型性能好、网络状态好、边缘服务器负载低，在码率θ一定的条件下，用户体验得分越高的期望。The first model is learned and trained through the historical link portrait set to determine the weight of each user experience indicator, and then obtain the user experience model, so that the user experience model meets the requirements of good model performance, good network status, and edge The server load is low, and under the condition of a certain code rate θ, the user experience score is expected to be higher.

本发明实施例的方案，根据所述历史链路画像集对所述第一模型进行模型训练，通过保存有大量历史数据的历史链路画像集对所述第一模型进行学习训练，能够保证训练过后得到所述用户体验模型的准确性；而对所述目标用户体验指标值的计算只采用所述第一预设时间段对应的第一链路画像，节省了计算资源占用，提高了计算效率。According to the solution of the embodiment of the present invention, the first model is trained according to the historical link portrait set, and the first model is learned and trained through the historical link portrait set that stores a large amount of historical data, which can ensure training The accuracy of the user experience model is then obtained; and the calculation of the target user experience index value only uses the first link portrait corresponding to the first preset time period, which saves computing resources and improves computing efficiency. .

可选地，所述根据所述链路画像特征值，确定所述用户体验指标值，包括：Optionally, determining the user experience index value based on the link portrait feature value includes:

本发明实的方案，通过所述用户体验指标值判断用户使用的公网的网络环境，在所述用户体验指标值低于第一阈值的情况下，认为公网的网络运行环境不良，会影响用户音视频播放的体验，为音视频播放的链路和/或码率的动态调整提供一个触发机制。A practical solution of the present invention is to determine the network environment of the public network used by the user through the user experience index value. When the user experience index value is lower than the first threshold, it is considered that the network operating environment of the public network is poor and will affect The user's audio and video playback experience provides a trigger mechanism for dynamic adjustment of audio and video playback links and/or bit rates.

本发明实的一实施例中，在计算所述目标用户体验指标值时，根据所述用户体验模型，所述第一链路画像为输入，所述目标用户体验指标值为输出；在对所述第二时间段对应的音视频播放的链路和/或码率进行调整时，所述目标用户体验指标值为输入，例如第二阈值为输入，根据所述用户体验模型，所述第二时间段对应的音视频播放的链路和/或码率为输出。In an embodiment of the present invention, when calculating the target user experience index value, according to the user experience model, the first link portrait is the input, and the target user experience index value is the output; for all When the audio and video playback link and/or code rate corresponding to the second time period are adjusted, the target user experience indicator value is the input, for example, the second threshold is the input. According to the user experience model, the second The link and/or bitrate of the audio and video playback corresponding to the time period is output.

本发明实的方案，通过第二阈值，并根据所述用户体验模型对所述第二时间段对应的音视频播放的链路和码率进行调整，使得音视频播放保持良好的链路和码率，以获得较好的用户音视频体验。The practical solution of the present invention passes the second threshold and adjusts the link and code rate of the audio and video playback corresponding to the second time period according to the user experience model, so that the audio and video playback maintains good links and code rates. rate to obtain a better user audio and video experience.

如图3所示，本发明的实施例提供了一种音视频调度装置300，包括：As shown in Figure 3, an embodiment of the present invention provides an audio and video scheduling device 300, which includes:

第一获取模块301，用于获取与目标时刻对应的第一用户体验指标值；The first acquisition module 301 is used to acquire the first user experience index value corresponding to the target time;

第二获取模块302，用于在所述第一用户体验指标值低于第一阈值的情况下，获取目标时刻之前第一预设时间段对应的第一链路画像；所述链路画像用于指示音视频播放链路的相关信息；The second acquisition module 302 is configured to acquire the first link portrait corresponding to the first preset time period before the target time when the first user experience index value is lower than the first threshold; the link portrait is Information related to indicating audio and video playback links;

处理模块303，用于将所述第一链路画像输入用户体验模型，得到第二时间段对应的目标用户体验指标值；所述第二时间段包括目标时刻之后的第二预设时间段；所述用户体验模型用于表征在目标音视频播放码率下用户视频播放体验指标值与链路画像之间的对应关系；The processing module 303 is configured to input the first link portrait into the user experience model to obtain the target user experience index value corresponding to the second time period; the second time period includes a second preset time period after the target moment; The user experience model is used to characterize the correspondence between the user video playback experience index value and the link portrait under the target audio and video playback bit rate;

调整模块304，用于根据所述目标用户体验指标值以及所述用户体验模型，调整所述第二时间段对应的音视频播放的链路和/或码率。The adjustment module 304 is configured to adjust the link and/or code rate of the audio and video playback corresponding to the second time period according to the target user experience indicator value and the user experience model.

本发明实施例的音视频调度装置，通过获取与目标时刻对应的第一用户体验指标值，能够监控音视频播放的用户体验，在检测到所述第一用户体验指标值低于第一阈值的情况下，根据所述第一链路画像，并通过用户体验模型，对所述目标用户体验指标值进行预测；根据预测的目标用户体验指标值对第二时间段对应的音视频播放的链路和/或码率进行调整。本发明的方案，能够根据音频播放的用户体验指标值动态调度音视频播放的链路和/或码率，避免因公网处于弱网络环境为音视频播放的影响，保证了网络链路的高效运行，解决了现有技术中若网络一直处于弱网场景，无限的重传导致重传风暴或无限降低带宽导致码率过低的问题。The audio and video scheduling device in the embodiment of the present invention can monitor the user experience of audio and video playback by obtaining the first user experience index value corresponding to the target time. When it is detected that the first user experience index value is lower than the first threshold, In this case, the target user experience index value is predicted based on the first link portrait and the user experience model; and the link corresponding to the audio and video playback in the second time period is predicted based on the predicted target user experience index value. and/or bitrate adjustment. The solution of the present invention can dynamically schedule the link and/or code rate of audio and video playback according to the user experience index value of audio playback, avoid the impact of audio and video playback due to the weak network environment of the public network, and ensure the high efficiency of the network link. Operation solves the problem in the existing technology that if the network is always in a weak network scenario, infinite retransmissions lead to retransmission storms or infinite bandwidth reduction leads to too low a code rate.

可选地，所述第一获取模块还包括：Optionally, the first acquisition module also includes:

获取子单元，用于获取目标时刻对应的第一链路画像，并根据所述第一链路画像确定链路画像特征值；The acquisition subunit is used to obtain the first link portrait corresponding to the target time, and determine the link portrait characteristic value according to the first link portrait;

确定子单元，用于根据所述链路画像特征值，确定所述用户体验指标值。Determining subunit, configured to determine the user experience index value according to the link portrait characteristic value.

可选地，所述获取子单元还用于：Optionally, the acquisition subunit is also used to:

可选地，所述确定子单元还用于：Optionally, the determining subunit is also used to:

可选地，所述调整模块还用于：Optionally, the adjustment module is also used to:

如图4所示，本发明一实施例中，音视频调度装置的实体装置可以对应为数据采集模块、质量数据模块、链路画像模块、机器学习模块、计算模块、指令模块以及动态调度模块。As shown in Figure 4, in one embodiment of the present invention, the physical device of the audio and video scheduling device may correspond to a data collection module, a quality data module, a link portrait module, a machine learning module, a computing module, an instruction module, and a dynamic scheduling module.

可选地，所述数据采集模块用于：Optionally, the data collection module is used for:

获取与目标时刻对应的第一用户体验指标值；以及，Obtain the first user experience indicator value corresponding to the target moment; and,

在所述第一用户体验指标值低于第一阈值的情况下，获取目标时刻之前第一预设时间段对应的第一链路画像。When the first user experience index value is lower than the first threshold, the first link portrait corresponding to the first preset time period before the target time is obtained.

可选地，所述计算模块用于：Optionally, the computing module is used for:

将所述第一链路画像输入用户体验模型，得到第二时间段对应的目标用户体验指标值。The first link portrait is input into the user experience model to obtain the target user experience index value corresponding to the second time period.

可选地，所述指令模块用于根据目标用户体验指标值生成调度指令。Optionally, the instruction module is configured to generate scheduling instructions according to the target user experience indicator value.

可选地，所述动态调度模块，用于：Optionally, the dynamic scheduling module is used for:

可选地，所述链路画像模块，用于：Optionally, the link portrait module is used for:

获取目标时刻对应的第一链路画像，并根据所述第一链路画像确定链路画像特征值。Obtain the first link portrait corresponding to the target time, and determine the link portrait feature value based on the first link portrait.

可选地，所述计算模块，还用于：Optionally, the computing module is also used to:

可选地，所述数据采集模块，还用于：Optionally, the data collection module is also used to:

获取历史链路画像集；所述历史链路画像集包括当前时刻之前的链路画像。Obtain a historical link portrait set; the historical link portrait set includes link portraits before the current moment.

可选地，所述机器学习模块，用于：Optionally, the machine learning module is used for:

可选地，所述计算模块，用于：Optionally, the computing module is used for:

可选地，所述动态调度模块，还用于：Optionally, the dynamic scheduling module is also used to:

如图5所示，本发明的实施例提供了一种电子设备，包括收发器510、处理器500、存储器520及存储在所述存储器520上并可在所述处理器500上运行的程序或指令；所述处理器500执行所述程序或指令时实现上述应用于如上所述的音视频调度方法。As shown in Figure 5, an embodiment of the present invention provides an electronic device, including a transceiver 510, a processor 500, a memory 520, and a program stored on the memory 520 and executable on the processor 500 or Instructions; when the processor 500 executes the program or instructions, the above audio and video scheduling method is applied to the above.

所述收发器510，用于在处理器500的控制下接收和发送数据。The transceiver 510 is used to receive and send data under the control of the processor 500.

其中，在图5中，总线架构可以包括任意数量的互联的总线和桥，具体由处理器500代表的一个或多个处理器和存储器520代表的存储器的各种电路链接在一起。总线架构还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路链接在一起，这些都是本领域所公知的，因此，本文不再对其进行进一步描述。总线接口提供接口。收发器510可以是多个元件，即包括发送机和接收机，提供用于在传输介质上与各种其他装置通信的单元。针对不同的用户设备，用户接口530还可以是能够外接内接需要设备的接口，连接的设备包括但不限于小键盘、显示器、扬声器、麦克风、操纵杆等。In FIG. 5 , the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by processor 500 and various circuits of the memory represented by memory 520 are linked together. The bus architecture can also link together various other circuits such as peripherals, voltage regulators, and power management circuits, which are all well known in the art and therefore will not be described further herein. The bus interface provides the interface. Transceiver 510 may be a plurality of elements, including a transmitter and a receiver, providing a unit for communicating with various other devices over a transmission medium. For different user equipment, the user interface 530 can also be an interface that can connect external and internal required equipment. The connected equipment includes but is not limited to a keypad, a display, a speaker, a microphone, a joystick, etc.

处理器500负责管理总线架构和通常的处理，存储器520可以存储处理器500在执行操作时所使用的数据。The processor 500 is responsible for managing the bus architecture and general processing, and the memory 520 can store data used by the processor 500 when performing operations.

可选地，所述收发器510用于获取与目标时刻对应的第一用户体验指标值；Optionally, the transceiver 510 is used to obtain the first user experience indicator value corresponding to the target moment;

在所述第一用户体验指标值低于第一阈值的情况下，获取目标时刻之前第一预设时间段对应的第一链路画像；所述链路画像用于指示音视频播放链路的相关信息。When the first user experience index value is lower than the first threshold, obtain the first link portrait corresponding to the first preset time period before the target time; the link portrait is used to indicate the audio and video playback link Related Information.

所述处理器500用于将所述第一链路画像输入用户体验模型，得到第二时间段对应的目标用户体验指标值；所述第二时间段包括目标时刻之后的第二预设时间段；所述用户体验模型用于表征在目标音视频播放码率下用户视频播放体验指标值与链路画像之间的对应关系；The processor 500 is configured to input the first link portrait into a user experience model to obtain a target user experience index value corresponding to a second time period; the second time period includes a second preset time period after the target time. ; The user experience model is used to characterize the correspondence between the user video playback experience index value and the link portrait under the target audio and video playback code rate;

可选地，所述收发器510还用于：Optionally, the transceiver 510 is also used for:

可选地，所述收发器510还用于Optionally, the transceiver 510 is also used to

可选地，所述处理器500还用于：Optionally, the processor 500 is also used to:

其中，所述第一模型为：Wherein, the first model is:

本发明的实施例提供了一种计算机可读存储介质，其上存储有程序或指令，所述程序或指令被处理器执行时实现如上所述的音视频调度方法中的步骤。Embodiments of the present invention provide a computer-readable storage medium on which programs or instructions are stored. When the programs or instructions are executed by a processor, the steps in the audio and video scheduling method as described above are implemented.

其中，所述处理器为上述实施例中所述的电子设备中的处理器。所述可读存储介质，包括计算机可读存储介质，如计算机只读存储器(Read-Only Memory，简称ROM)、随机存取存储器(Random Access Memory，简称RAM)、磁碟或者光盘等。Wherein, the processor is the processor in the electronic device described in the above embodiment. The readable storage media includes computer-readable storage media, such as computer read-only memory (ROM), random access memory (Random Access Memory, RAM), magnetic disks or optical disks.

进一步需要说明的是，此说明书中所描述的终端包括但不限于智能手机、平板电脑等，且所描述的许多功能部件都被称为模块，以便更加特别地强调其实现方式的独立性。It should be further noted that the terminals described in this specification include but are not limited to smartphones, tablet computers, etc., and many of the functional components described are called modules to more specifically emphasize the independence of their implementation.

本发明实施例中，模块可以用软件实现，以便由各种类型的处理器执行。举例来说，一个标识的可执行代码模块可以包括计算机指令的一个或多个物理或者逻辑块，举例来说，其可以被构建为对象、过程或函数。尽管如此，所标识模块的可执行代码无需物理地位于一起，而是可以包括存储在不同位里上的不同的指令，当这些指令逻辑上结合在一起时，其构成模块并且实现该模块的规定目的。In the embodiment of the present invention, the module can be implemented in software so as to be executed by various types of processors. For example, an identified module of executable code may include one or more physical or logical blocks of computer instructions, which may be structured, for example, as an object, procedure, or function. Nonetheless, the executable code of an identified module need not be physically located together, but may include different instructions stored on different bits that, when logically combined, constitute the module and implement the provisions of the module Purpose.

实际上，可执行代码模块可以是单条指令或者是许多条指令，并且甚至可以分布在多个不同的代码段上，分布在不同程序当中，以及跨越多个存储器设备分布。同样地，操作数据可以在模块内被识别，并且可以依照任何适当的形式实现并且被组织在任何适当类型的数据结构内。所述操作数据可以作为单个数据集被收集，或者可以分布在不同位置上(包括在不同存储设备上)，并且至少部分地可以仅作为电子信号存在于系统或网络上。In fact, an executable code module can be a single instruction or many instructions, and can even be distributed over multiple different code segments, distributed among different programs, and distributed across multiple memory devices. Likewise, operational data may be identified within modules and may be implemented in any suitable form and organized within any suitable type of data structure. The operational data may be collected as a single data set, or may be distributed over different locations (including on different storage devices), and may exist, at least in part, solely as electronic signals on a system or network.

在模块可以利用软件实现时，考虑到现有硬件工艺的水平，所以可以以软件实现的模块，在不考虑成本的情况下，本领域技术人员都可以搭建对应的硬件电路来实现对应的功能，所述硬件电路包括常规的超大规模集成(VLSI)电路或者门阵列以及诸如逻辑芯片、晶体管之类的现有半导体或者是其它分立的元件。模块还可以用可编程硬件设备，诸如现场可编程门阵列、可编程阵列逻辑、可编程逻辑设备等实现。When the module can be implemented using software, taking into account the level of existing hardware technology, those skilled in the art can build corresponding hardware circuits to implement the corresponding functions without considering the cost. The hardware circuits include conventional very large scale integration (VLSI) circuits or gate arrays as well as existing semiconductors such as logic chips, transistors, or other discrete components. Modules can also be implemented using programmable hardware devices, such as field programmable gate arrays, programmable array logic, programmable logic devices, etc.

上述范例性实施例是参考该些附图来描述的，许多不同的形式和实施例是可行而不偏离本发明精神及教示，因此，本发明不应被建构成为在此所提出范例性实施例的限制。更确切地说，这些范例性实施例被提供以使得本发明会是完善又完整，且会将本发明范围传达给那些熟知此项技术的人士。在该些图式中，组件尺寸及相对尺寸也许基于清晰起见而被夸大。在此所使用的术语只是基于描述特定范例性实施例目的，并无意成为限制用。如在此所使用地，除非该内文清楚地另有所指，否则该单数形式“一”、“一个”和“该”是意欲将该些多个形式也纳入。会进一步了解到该些术语“包含”及/或“包括”在使用于本说明书时，表示所述特征、整数、步骤、操作、构件及/或组件的存在，但不排除一或更多其它特征、整数、步骤、操作、构件、组件及/或其族群的存在或增加。除非另有所示，陈述时，一值范围包含该范围的上下限及其间的任何子范围。The above exemplary embodiments have been described with reference to the accompanying drawings. Many different forms and embodiments are possible without departing from the spirit and teachings of the invention. Therefore, the invention should not be construed as the exemplary embodiments set forth herein. limits. Rather, these exemplary embodiments are provided so that this disclosure will be thorough and complete, and will convey the scope of the invention to those skilled in the art. In the drawings, component sizes and relative sizes may be exaggerated for clarity. The terminology used herein is for the purpose of describing particular exemplary embodiments only and is not intended to be limiting. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It will be further understood that the terms "comprising" and/or "including" when used in this specification indicate the presence of stated features, integers, steps, operations, components and/or components, but do not exclude the presence of one or more other The existence or addition of features, integers, steps, operations, components, components and/or families thereof. Unless otherwise indicated, when stated, a range of values includes the upper and lower limits of the range and any subranges therebetween.

以上所述是本发明的优选实施方式，应当指出，对于本技术领域的普通技术人员来说，在不脱离本发明所述原理的前提下，还可以作出若干改进和润饰，这些改进和润饰也应视为本发明的保护范围。The above is the preferred embodiment of the present invention. It should be pointed out that for those of ordinary skill in the art, several improvements and modifications can be made without departing from the principles of the present invention. These improvements and modifications can also be made. should be regarded as the protection scope of the present invention.

Claims

1. An audio/video scheduling method is characterized by comprising the following steps:

acquiring a first user experience index value corresponding to a target moment;

acquiring a first link portrait corresponding to a first preset time period before a target moment under the condition that the first user experience index value is lower than a first threshold value; the link portrait is used for indicating related information of an audio/video playing link;

inputting the first link portrait into a user experience model to obtain a target user experience index value corresponding to a second time period; the second time period comprises a second preset time period after the target time; the user experience model is used for representing the corresponding relation between the user video playing experience index value and the link portrait under the target audio and video playing code rate;

according to the target user experience index value and the user experience model, adjusting a link and/or a code rate of audio and video playing corresponding to the second time period;

the user experience model is a model obtained by model training of the first model:

the first model is as follows:

wherein f (v|θ) represents a user experience index value; v is the link portrait characteristic value; mi is the ith link image experience index value corresponding to the link image characteristic value, δi is the weight of the ith user experience index, and θ is the audio/video playing code rate; n is an integer greater than or equal to 1;

the link image experience index value comprises at least one of the following: video resolution value, video frame value, video clip rate reciprocal value, audio clip rate reciprocal value, second opening value.

2. The audio-video scheduling method according to claim 1, wherein the user experience index comprises at least one of: video resolution, video frame rate, video clip rate, audio clip rate, second open rate, and packet loss rate.

3. The audio/video scheduling method according to claim 1, wherein the obtaining a first user experience index value corresponding to a target time includes:

acquiring a first link portrait corresponding to a target moment, and determining a link portrait characteristic value according to the first link portrait;

and determining the user experience index value according to the link portrait characteristic value.

4. The audio/video scheduling method according to claim 3, wherein the obtaining the corresponding first link representation at the target time includes:

receiving audio and video link data reported by a user at the target time;

and analyzing the audio and video link data to generate the first link portrait.

5. The audio-video scheduling method of claim 4, wherein the audio-video link data comprises at least one of: timestamp, edge server, terminal model, operating system, network type, packet loss rate, video resolution, frame rate, sending code rate, receiving code rate;

the first link representation includes at least one of: time attribute, terminal attribute, network attribute, region attribute, edge server attribute, experience attribute;

the link portrait characteristic value includes at least one of the following: time element, terminal element, network element, region element, edge server element, and experience element.

6. The audio-video scheduling method according to claim 1, wherein model training the first model to obtain the user experience model comprises:

acquiring a historical link portrait set; the historical link portrait set comprises link portraits before the current moment;

according to the historical link portrait set, performing model training on the first model through a logistic regression algorithm and a gradient descent method to obtain the weight of each user experience index;

and determining the experience model according to the first model and the weight of each user experience index.

7. The audio/video scheduling method according to claim 3, wherein the determining the user experience index value according to the link portrait characteristic value includes:

and calculating the user experience index value through the user experience model according to the link portrait characteristic value.

8. The audio/video scheduling method according to claim 1, wherein the adjusting the link and/or the code rate of the audio/video playing corresponding to the second time period according to the target user experience index value and the user experience model includes:

under the condition that the target user experience index value is smaller than a second threshold value, adjusting a link and/or a code rate corresponding to a user experience model until the target user experience index value corresponding to the user experience model is larger than or equal to the second threshold value;

and when the target user experience index value is greater than or equal to a second threshold value, the link and the code rate corresponding to the user experience model are used as the link and the code rate of the audio and video playing corresponding to the second time period after adjustment.

9. An audio/video scheduling apparatus, comprising:

the first acquisition module is used for acquiring a first user experience index value corresponding to the target moment;

the second acquisition module is used for acquiring a first link portrait corresponding to a first preset time period before the target moment under the condition that the first user experience index value is lower than a first threshold value; the link portrait is used for indicating related information of an audio/video playing link;

the processing module is used for inputting the first link portrait into the user experience model to obtain a target user experience index value corresponding to the second time period; the second time period comprises a second preset time period after the target time; the user experience model is used for representing the corresponding relation between the user video playing experience index value and the link portrait under the target audio and video playing code rate;

the adjusting module is used for adjusting the link and/or the code rate of the audio/video playing corresponding to the second time period according to the target user experience index value and the user experience model;

the first model is as follows:

10. An electronic device, comprising: a transceiver, a processor, a memory, and a program or instructions stored on the memory and executable on the processor; an audio-video scheduling method according to any one of claims 1-8, wherein the program or instructions, when executed by the processor, implement the method.

11. A computer readable storage medium having stored thereon a program or instructions, which when executed by a processor, implements the steps of the audio video scheduling method of any one of claims 1-8.