+

CN117216662A - Risk identification method, system, device, equipment and storage medium - Google Patents

Risk identification method, system, device, equipment and storage medium Download PDF

Info

Publication number
CN117216662A
CN117216662A CN202311198929.5A CN202311198929A CN117216662A CN 117216662 A CN117216662 A CN 117216662A CN 202311198929 A CN202311198929 A CN 202311198929A CN 117216662 A CN117216662 A CN 117216662A
Authority
CN
China
Prior art keywords
text
dialogue
target
risk
potential risk
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311198929.5A
Other languages
Chinese (zh)
Inventor
彭飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing 58 Information Technology Co Ltd
Original Assignee
Beijing 58 Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing 58 Information Technology Co Ltd filed Critical Beijing 58 Information Technology Co Ltd
Priority to CN202311198929.5A priority Critical patent/CN117216662A/en
Publication of CN117216662A publication Critical patent/CN117216662A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

The embodiment of the invention provides a risk identification method, a system, a device, equipment and a storage medium, wherein the method comprises the following steps: determining a target position of the potential risk text in the dialogue text, taking the target position as a starting point, and intercepting target text with preset length from the dialogue text, wherein other dialogue texts in the target text are generated before the potential risk text. Then, the target text containing the potential risk text and the risk identification text are used as prompt information, and a generated model is input to further determine whether the potential risk text is at risk or not by the generated model. I.e. the risk recognition means is for further risk recognition of the risk potential text. And during the recognition process, the device can further extract the text of the potential risk based on the target position of the text of the potential risk. The generated model can carry out risk recognition on the potentially-risk text again by means of the potentially-risk text and the potentially-risk text, so that accuracy of text risk recognition is improved.

Description

风险识别方法、系统、装置、设备和存储介质Risk identification methods, systems, devices, equipment and storage media

技术领域Technical field

本发明涉及人工智能技术领域,尤其涉及一种风险识别方法、系统、装置、设备和存储介质。The present invention relates to the field of artificial intelligence technology, and in particular, to a risk identification method, system, device, equipment and storage medium.

背景技术Background technique

即时通信(Instant Messaging,简称IM)工具是一种在用户之间传递即时消息的应用程序,是最流行的互联网通讯工具之一。在实际中,用户使用IM工具与他人聊天时产生的对话文本中可能包括与违规风险相关的信息,比如不合规的收费信息、虚假信息以及欺诈信息等等。因此,如何提高对话文本风险识别的准确性就成为一个亟待解决的技术问题。Instant Messaging (IM) tool is an application for transmitting instant messages between users and is one of the most popular Internet communication tools. In practice, the conversation text generated when users use IM tools to chat with others may include information related to violation risks, such as non-compliant charging information, false information, fraudulent information, etc. Therefore, how to improve the accuracy of risk identification in dialogue text has become an urgent technical problem to be solved.

发明内容Contents of the invention

有鉴于此,本发明实施例提供一种风险识别方法、系统、装置、设备和存储介质,用以提高对话文本风险识别的准确性。In view of this, embodiments of the present invention provide a risk identification method, system, device, equipment and storage medium to improve the accuracy of risk identification in dialogue text.

第一方面,本发明实施例提供一种风险识别方法,包括:In a first aspect, embodiments of the present invention provide a risk identification method, including:

确定潜在风险文本在对话文本中的目标位置;Determine where potentially risky text should be targeted within the conversation text;

以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;Taking the target position as a starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text;

将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The target text and the risk identification text containing the potential risk text are input into the generative model as prompt information, so that the generative model determines whether the potential risk text contains risks.

第二方面,本发明实施例提供一种风险识别系统,包括:客户端和部署有生成式模型的服务端;In a second aspect, embodiments of the present invention provide a risk identification system, including: a client and a server deployed with a generative model;

所述客户端,用于确定潜在风险文本在对话文本中的目标位置;The client is used to determine the target position of the potential risk text in the conversation text;

以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;发送所述目标文本至所述服务端;Taking the target position as the starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text; send the target text to the server ;

所述服务端,用于将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入所述生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The server is configured to input the target text and risk identification text containing the potential risk text as prompt information into the generative model, so that the generative model determines whether the potential risk text exists. risk.

第三方面,本发明实施例提供一种风险识别系统,包括:客户端和部署有生成式模型的服务端;In a third aspect, embodiments of the present invention provide a risk identification system, including: a client and a server deployed with a generative model;

所述客户端,用于发送用户输入的潜在风险文本;The client is used to send potentially risky text input by the user;

所述服务端,用于接收所述的潜在风险文本,以由所述潜在风险文本和在所述潜在风险文本之前产生的对话文本构成对话文本;The server is configured to receive the potential risk text to constitute a dialogue text consisting of the potential risk text and the dialogue text generated before the potential risk text;

确定潜在风险文本在对话文本中的目标位置;Determine where potentially risky text should be targeted within the conversation text;

以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;Taking the target position as a starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text;

将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入所述生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The target text and risk identification text containing the potential risk text are input into the generative model as prompt information, so that the generative model determines whether the potential risk text contains risks.

第四方面,本发明实施例提供一种风险识别装置,包括:In a fourth aspect, embodiments of the present invention provide a risk identification device, including:

位置确定模块,用于确定潜在风险文本在对话文本中的目标位置;The position determination module is used to determine the target position of potentially risky text in the dialogue text;

目标文本确定模块,用于以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;A target text determination module, configured to take the target position as a starting point and intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text;

风险确定模块,用于将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。A risk determination module is configured to input the target text and risk identification text containing the potential risk text as prompt information into a generative model, so that the generative model determines whether the potential risk text contains risks.

第五方面,本发明实施例提供一种电子设备,包括处理器和存储器,所述存储器用于存储一条或多条计算机指令,其中,所述一条或多条计算机指令被所述处理器执行时实现上述第一方面中的风险识别方法。该电子设备还可以包括通信接口,用于与其他设备或通信系统通信。In a fifth aspect, embodiments of the present invention provide an electronic device, including a processor and a memory, the memory being used to store one or more computer instructions, wherein when the one or more computer instructions are executed by the processor Implement the risk identification method in the first aspect above. The electronic device may also include a communication interface for communicating with other devices or communication systems.

第六方面,本发明实施例提供了一种非暂时性机器可读存储介质,所述非暂时性机器可读存储介质上存储有可执行代码,当所述可执行代码被电子设备的处理器执行时,使所述处理器至少可以实现如上述第一方面中的风险识别方法。In a sixth aspect, embodiments of the present invention provide a non-transitory machine-readable storage medium. The non-transitory machine-readable storage medium stores executable code. When the executable code is processed by a processor of an electronic device, When executed, the processor is enabled to at least implement the risk identification method in the above first aspect.

本发明实施例提供的风险识别方法中,风险识别装置在获取到对话文本后,可以先确定对话文本中是否存在潜在风险文本,若存在,则进一步确定潜在风险文本在对话文本中的目标位置。然后,以该目标位置为起点,从对话文本中截取预设长度的目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生。最终,将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型进一步确定潜在风险文本是否存在风险。In the risk identification method provided by the embodiment of the present invention, after obtaining the dialogue text, the risk identification device can first determine whether there is a potential risk text in the dialogue text, and if so, further determine the target position of the potential risk text in the dialogue text. Then, using the target position as a starting point, a preset length of target text is intercepted from the conversation text. Among them, other dialogue texts in the target text are generated before the potential risk text. Finally, the target text containing the potential risk text and the risk identification text are used as prompt information and input into the generative model, so that the generative model further determines whether the potential risk text contains risks.

可见,上述方法中,风险识别装置是对潜在风险文本即有可能存在风险的文本进行进一步风险识别。并且在识别过程中,基于潜在风险文本所处的目标位置,该装置还能够进一步提取出潜在风险文本的上文。则生成式模型可以借助潜在风险文本及其上文对潜在风险文本再次进行风险识别,也即是通过联系上文语境对潜在风险文本进行风险识别,以提高文本风险识别的准确性。It can be seen that in the above method, the risk identification device performs further risk identification on potential risk texts, that is, texts that may be risky. And during the recognition process, based on the target location of the potentially risky text, the device can further extract the upper context of the potentially risky text. Then the generative model can use the potential risk text and its context to identify the risk of the potential risk text again, that is, by contacting the context above to identify the risk of the potential risk text, so as to improve the accuracy of text risk identification.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without exerting creative efforts.

图1为本发明实施例提供的一种风险识别方法的流程图;Figure 1 is a flow chart of a risk identification method provided by an embodiment of the present invention;

图2为本发明实施例提供的另一种风险识别方法的流程图;Figure 2 is a flow chart of another risk identification method provided by an embodiment of the present invention;

图3为本发明实施例提供的又一种风险识别方法的流程图;Figure 3 is a flow chart of yet another risk identification method provided by an embodiment of the present invention;

图4为本发明实施例提供的一种模型训练方法的流程图;Figure 4 is a flow chart of a model training method provided by an embodiment of the present invention;

图5为本发明实施例提供的一种风险识别系统的结构示意图;Figure 5 is a schematic structural diagram of a risk identification system provided by an embodiment of the present invention;

图6为本发明实施例提供的一种企业招聘场景下的风险识别过程的示意图;Figure 6 is a schematic diagram of a risk identification process in an enterprise recruitment scenario provided by an embodiment of the present invention;

图7为本发明实施例提供的一种风险识别装置的结构示意图;Figure 7 is a schematic structural diagram of a risk identification device provided by an embodiment of the present invention;

图8为与图7所示实施例提供的风险识别装置对应的电子设备的结构示意图。FIG. 8 is a schematic structural diagram of an electronic device corresponding to the risk identification device provided by the embodiment shown in FIG. 7 .

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, rather than all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without making creative efforts fall within the scope of protection of the present invention.

在本发明实施例中使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本发明。在本发明实施例和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义,“多种”一般包含至少两种,但是不排除包含至少一种的情况。The terminology used in the embodiments of the present invention is only for the purpose of describing specific embodiments and is not intended to limit the present invention. As used in this embodiment and the appended claims, the singular forms "a," "the" and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. Generally, at least two are included, but at least one is not excluded.

应当理解,本文中使用的术语“和/或”仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。It should be understood that the term "and/or" used in this article is only an association relationship describing related objects, indicating that there can be three relationships, for example, A and/or B, which can mean: A alone exists, and A and A exist simultaneously. B, there are three situations of B alone. In addition, the character "/" in this article generally indicates that the related objects are an "or" relationship.

取决于语境,如在此所使用的词语“如果”、“若”可以被解释成为“在……时”或“当……时”或“响应于确定”或“响应于识别”。类似地,取决于语境,短语“如果确定”或“如果识别(陈述的条件或事件)”可以被解释成为“当确定时”或“响应于确定”或“当识别(陈述的条件或事件)时”或“响应于识别(陈述的条件或事件)”。Depending on the context, the words "if" or "if" as used herein may be interpreted as "when" or "when" or "in response to a determination" or "in response to an identification." Similarly, depending on the context, the phrase "if determined" or "if (stated condition or event) is identified" may be interpreted as "when determined" or "in response to determination" or "when (stated condition or event) is identified )" or "in response to identifying (a stated condition or event)."

需要说明的有,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的商品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种商品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的商品或者系统中还存在另外的相同要素。It should be noted that the terms "include", "includes" or any other variation thereof are intended to cover a non-exclusive inclusion, such that a good or system that includes a list of elements includes not only those elements, but also those not expressly listed. Other elements may also include elements inherent to the product or system. Without further limitation, an element defined by the statement "comprises a..." does not exclude the presence of other identical elements in the goods or systems that include the stated element.

需要说明的还有,本发明所涉及的用户信息(包括但不限于用户设备信息、用户个人信息等)和数据(包括但不限于用于分析的数据、存储的数据、展示的数据等),均为经用户授权或者经过各方充分授权的信息和数据,并且相关数据的收集、使用和处理需要遵守相关国家和地区的相关法律法规和标准,并提供有相应的操作入口,供用户选择授权或者拒绝。It should be noted that the user information (including but not limited to user equipment information, user personal information, etc.) and data (including but not limited to data used for analysis, stored data, displayed data, etc.) involved in the present invention, All are information and data authorized by users or fully authorized by all parties, and the collection, use and processing of relevant data need to comply with relevant laws, regulations and standards of relevant countries and regions, and corresponding operation portals are provided for users to choose and authorize. Or refuse.

在对本发明各实施例提供的风险识别方法进行详细描述之前,还可以对风险识别的应用场景进行示意性描述:Before describing the risk identification methods provided by various embodiments of the present invention in detail, the application scenarios of risk identification can also be schematically described:

正如背景技术中介绍的,在用户实际使用即时通信工具与他人聊天时产生的对话文本中有可能包括违规信息,比如不合规的收费信息、虚假信息以及欺诈信息等等。As introduced in the background art, the conversation text generated when a user actually uses an instant messaging tool to chat with others may include illegal information, such as non-compliant charging information, false information, fraudulent information, etc.

以企业招聘场景为例,企业的招聘者与应聘者之间的对话如下:Taking the corporate recruitment scenario as an example, the dialogue between the corporate recruiter and the applicant is as follows:

招聘者:“距离我们公司不到一百米的地方有一家银行,我们公司和该银行有合作关系,公司员工可以随时在银行里办理取钱,存钱以及转账等业务。”;Recruiter: "There is a bank less than 100 meters away from our company. Our company has a cooperative relationship with the bank. Company employees can withdraw money, deposit money, transfer money and other services at the bank at any time.";

应聘者:“那还挺方便的。”;Applicant: "That's quite convenient.";

招聘者:“公司规定新员工入职前需要去该银行办理一个业务。”;Recruiter: "The company stipulates that new employees need to go to the bank to handle a business before joining the company.";

应聘者:“具体什么业务?”;Applicant: "What specific business is it?";

招聘者:“入职培训费用,这个是正规费用”。Recruiter: "Onboarding training fee, this is a formal fee."

在上述招聘者与应聘者之间的对话文本中,若对每一条文本单独进行分析,则可以确定出每一条文本都是正常的,不存在违规的情况。然而,联系整段对话的上下文语境可以得知,招聘者要求新员工入职前去指定银行缴纳入职培训费用,即限制了缴纳入职培训费用的银行,此种情况下招聘者提出收取入职培训费的可信度是比较低的。因此,招聘者产生的文本“入职培训费用,这个是正规费用”可以看作是潜在风险文本,该文本内容可能属于不合规收费的范畴,需要更进一步进行风险识别。In the above-mentioned dialogue text between the recruiter and the applicant, if each text is analyzed separately, it can be determined that each text is normal and there are no violations. However, based on the context of the entire conversation, it can be seen that the recruiter requires new employees to go to a designated bank to pay the induction training fee before joining, which means that the banks that pay the induction training fee are restricted. In this case, the recruiter proposes to charge the induction training fee. The credibility is relatively low. Therefore, the text "Induction training fee, this is a formal fee" generated by the recruiter can be regarded as a potential risk text. The content of this text may fall into the category of non-compliant charges, and further risk identification is required.

此时,则可以使用本发明下述各实施例提供的风险识别方法,来进一步判断上述对话中的潜在风险文本是否真正有风险。并且可选地,风险识别装置可以在对话实时产生的过程中进行风险识别,也可以在对话结束后进行风险识别。At this time, the risk identification method provided by the following embodiments of the present invention can be used to further determine whether the potentially risky text in the above conversation is truly risky. And optionally, the risk identification device can perform risk identification during the real-time generation of the dialogue, or can also perform risk identification after the dialogue ends.

可选地,对话文本可以如上述举例是用户双方通过即时通信工具产生的全部对话文本,也可以是用户与智能设备之间产生的对话文本。可选地,智能设备可以包括:具有对话功能的服务机器人、具有对话功能的移动终端以及具有对话功能的智能穿戴设备中的任一种。Optionally, the conversation text may be all the conversation text generated by both users through instant messaging tools as in the above example, or it may be the conversation text generated between the user and the smart device. Optionally, the smart device may include: any one of a service robot with a conversation function, a mobile terminal with a conversation function, and an intelligent wearable device with a conversation function.

基于上述描述,下面结合附图对本发明的一些实施方式作详细说明。在各实施例之间不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。另外,下述各方法实施例中的步骤时序仅为一种举例,而非严格限定。Based on the above description, some embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The following embodiments and features in the embodiments may be combined with each other as long as there is no conflict between the embodiments. In addition, the sequence of steps in the following method embodiments is only an example and is not strictly limited.

图1为本发明实施例提供的一种风险识别方法的流程图。本发明实施例提供的风险识别方法可以由风险识别装置执行。可以理解的是,该风险识别装置可以实现为软件、或者软件和硬件的组合。如图1所示,该方法可以包括如下步骤:Figure 1 is a flow chart of a risk identification method provided by an embodiment of the present invention. The risk identification method provided by the embodiment of the present invention can be executed by a risk identification device. It can be understood that the risk identification device can be implemented as software, or a combination of software and hardware. As shown in Figure 1, the method may include the following steps:

S101,确定潜在风险文本在对话文本中的目标位置。S101. Determine the target position of the potential risk text in the dialogue text.

正如上述实施例中的举例,用户双方可以利用即时通信工具实现至少一轮对话,以产生对话文本。Just like the examples in the above embodiments, both users can use the instant messaging tool to implement at least one round of dialogue to generate dialogue text.

一种情况,风险识别装置可以在对话实时产生的过程中进行风险识别。此种情况下,当用户双方的任一方即任一用户产生一条最新的文本时,该最新的文本可以被风险识别装置都获取到,风险识别装置可以立即对该最新的文本进行初步的风险识别。若风险识别装置确定该最新的文本具有潜在风险时,则该最新的文本即为潜在风险文本。进一步地,风险识别装置可以确定此潜在风险文本在当前产生的对话文本中的目标位置。In one case, the risk identification device can perform risk identification during the real-time generation of the dialogue. In this case, when either user generates a latest text, the latest text can be obtained by the risk identification device, and the risk identification device can immediately perform preliminary risk identification on the latest text. . If the risk identification device determines that the latest text has a potential risk, the latest text is a potential risk text. Further, the risk identification device may determine the target position of the potential risk text in the currently generated dialogue text.

另一种情况,风险识别装置也可以在整段对话结束后进行风险识别。此种情况下,当用户双方对话完成后,风险识别装置可以获取用户双方产生的全部对话文本,并对全部对话文本中的各条文本进行初步的风险识别。若风险识别装置确定某一条对话文本具有潜在风险时,则将该条文本确定为潜在风险文本。进一步地,风险识别装置可以确定此潜在风险文本在当前产生的对话文本中的目标位置。Alternatively, the risk identification device can also perform risk identification after the entire conversation has ended. In this case, after the dialogue between the two users is completed, the risk identification device can obtain all dialogue texts generated by both users, and perform preliminary risk identification on each text in all dialogue texts. If the risk identification device determines that a certain dialogue text has a potential risk, the text is determined to be a potential risk text. Further, the risk identification device may determine the target position of the potential risk text in the currently generated dialogue text.

在上述两种情况下,对于潜在风险文本的确定,一种可选地方式,风险识别装置可以从风险关键词库中搜索,以确定需要初步风险识别的文本中是否存在包含任一风险关键词的文本,并将包含风险关键词的文本作为潜在风险文本。In the above two cases, for the determination of potential risk texts, in an optional way, the risk identification device can search from the risk keyword library to determine whether any text containing any risk keyword needs to be preliminary risk identification. text, and treat text containing risk keywords as potential risk text.

可选地,潜在风险文本可以是上述提及的与收取入职培训费用相关的文本,也可以是与发布虚假不实信息相关的文本,比如不存在的职位信息或者不真实的房源信息等等。Optionally, the potential risk text can be the above-mentioned text related to charging induction training fees, or it can be text related to publishing false and untrue information, such as non-existent job information or untrue housing information, etc. .

当风险识别装置可以在整段对话结束后进行风险识别时,若用户双方之间产生的对话文本数量较多时,则为了提高确定目标位置的定位效率,可选地,风险识别装置还可以对全部对话文本进行筛选,即筛选出某一时段内用户双方产生的部分对话文本,并从该部分对话文本中确定潜在风险文本对应的目标位置。When the risk identification device can perform risk identification after the entire conversation is over, if the number of conversation texts generated between the two users is large, in order to improve the positioning efficiency of determining the target location, optionally, the risk identification device can also perform risk identification on all the dialogue texts. Filter the dialogue text, that is, filter out part of the dialogue text generated by both users within a certain period of time, and determine the target location corresponding to the potentially risky text from this part of the dialogue text.

S102,以目标位置为起点,从对话文本中截取预设长度的目标文本,目标文本中的其他对话文本在潜在风险文本之前产生。S102, starting from the target position, intercept the target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text.

S103,将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型确定潜在风险文本是否存在风险。S103. Enter the target text containing the potential risk text and the risk identification text as prompt information into the generative model, so that the generative model determines whether the potential risk text contains risks.

基于步骤S101中确定出的目标位置,风险识别装置可以先以目标位置为起点,从对话文本中截取预设长度的目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生,即目标文本由潜在风险文本及其上文构成。然后,风险识别装置可以再将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型确定潜在风险文本是否存在风险。Based on the target location determined in step S101, the risk identification device can first use the target location as a starting point to intercept a target text of a preset length from the conversation text. Among them, other dialogue texts in the target text are generated before the potential risk text, that is, the target text consists of the potential risk text and its upper text. Then, the risk identification device can input the target text containing the potential risk text and the risk identification text as prompt information into the generative model, so that the generative model determines whether the potential risk text contains risks.

可选地,预设长度可以为生成式模型所能处理的最大文本长度阈值,也可以为任一小于该文本长度阈值的值。可选地,生成式模型可以包括文本生成能力较强、模型体积较大的广义线性模型(Generalized Linear Model,简称GLM)或者通用预训练转换器(Generative Pre-training Transformer,简称GPT)模型。Optionally, the preset length can be the maximum text length threshold that the generative model can process, or it can be any value smaller than the text length threshold. Optionally, the generative model may include a Generalized Linear Model (GLM) or a Generative Pre-training Transformer (GPT) model that has strong text generation capabilities and a large model volume.

可选地,风险识别文本用于指导生成式模型进行风险识别,比如可以是“检测下面对话内容中的‘入职培训费用,这个是正规费用’是否涉及欺骗”。其中,“检测下面对话内容中的xx是否涉及欺骗”为风险识别文本中的检测命令,用以指明检测方向;‘入职培训费用,这个是正规费用’为潜在风险文本即待检测文本。Optionally, the risk identification text is used to guide the generative model to perform risk identification, for example, it can be "Detect whether the "onboarding training fee, this is a formal fee" in the following dialogue content involves deception." Among them, "Check whether xx in the following dialogue content involves deception" is the detection command in the risk identification text, which is used to indicate the direction of detection; "Induction training fee, this is a formal fee" is the potential risk text, that is, the text to be detected.

可选地,风险识别文本比如还可以是“检测下面对话内容中的‘入职培训费用,这个是正规费用’是否涉及欺骗,直接回答是或者否”。其中,“直接回答是或者否”为回答格式,用以限定生成式模型输出的风险识别结果的格式。Optionally, the risk identification text can also be, for example, "Check whether the 'onboarding training fee, this is a formal fee' in the dialogue below involves deception, and answer directly yes or no." Among them, "direct answer yes or no" is the answer format, which is used to limit the format of the risk identification results output by the generative model.

本发明实施例提供的风险识别方法中,风险识别装置在获取到对话文本中后,可以先确定对话文本中是否存在潜在风险文本,若存在,则进一步确定潜在风险文本在对话文本中的目标位置。然后,以该目标位置为起点,从对话文本中截取预设长度的目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生。最终,将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型进一步确定潜在风险文本是否存在风险。In the risk identification method provided by the embodiment of the present invention, after obtaining the dialogue text, the risk identification device can first determine whether there is a potential risk text in the dialogue text, and if so, further determine the target position of the potential risk text in the dialogue text. . Then, using the target position as a starting point, a preset length of target text is intercepted from the conversation text. Among them, other dialogue texts in the target text are generated before the potential risk text. Finally, the target text containing the potential risk text and the risk identification text are used as prompt information and input into the generative model, so that the generative model further determines whether the potential risk text contains risks.

可见,上述方法中,风险识别装置是对潜在风险文本即有可能存在风险的文本进行进一步风险识别。并且在识别过程中,基于潜在风险文本所处的目标位置,该装置还能够进一步提取出潜在风险文本的上文。则生成式模型可以借助潜在风险文本及其上文对潜在风险文本再次进行风险识别,也即是通过联系上文语境对潜在风险文本进行风险识别,以提高文本风险识别的准确性。It can be seen that in the above method, the risk identification device performs further risk identification on potential risk texts, that is, texts that may be risky. And during the recognition process, based on the target location of the potentially risky text, the device can further extract the upper context of the potentially risky text. Then the generative model can use the potential risk text and its context to identify the risk of the potential risk text again, that is, by contacting the context above to identify the risk of the potential risk text, so as to improve the accuracy of text risk identification.

另外,在生成式模型确定出该潜在风险文本存在风险后,风险识别装置还可以将此风险识别结果展示在用户终端的显示界面上,以为用户进行风险提示。In addition, after the generative model determines that the potential risk text is risky, the risk identification device can also display the risk identification result on the display interface of the user terminal to provide a risk reminder to the user.

承接上述企业招聘场景举例,假设招聘者和应聘者通过即时通信工具共产生5条对话文本,并且风险识别装置确定出“入职培训费用,这个是正规费用”是潜在风险文本,且该潜在风险文本为第5条对话文本。Taking the example of the above corporate recruitment scenario, assume that the recruiter and the applicant generate a total of 5 conversation texts through instant messaging tools, and the risk identification device determines that "entry training expenses, this is a regular fee" is a potential risk text, and this potential risk text This is the dialogue text of Article 5.

假设预设长度为4条对话文本对应的长度,则可以将第5条对话文本作为起点,往前数4条对话文本,即最终截取出第2条对话文本至第5条对话文本。然后,将第2条对话文本至第5条对话文本和风险识别文本作为提示信息,输入至生成式模型,以由生成式模型输出“入职培训费用,这个是正规费用”是存在风险。最终,将该输出结果展示在用户终端的显示界面上,以提示用户规避风险,提示形式可以参见图6。Assuming that the default length is the length corresponding to four dialogue texts, the fifth dialogue text can be used as the starting point, and four dialogue texts can be counted forward, that is, the second dialogue text to the fifth dialogue text can finally be intercepted. Then, the 2nd to 5th dialogue text and the risk identification text are used as prompt information and input into the generative model, so that the generative model outputs "onboarding training expenses, this is a formal expense", which is a risk. Finally, the output result is displayed on the display interface of the user terminal to prompt the user to avoid risks. The prompt form can be seen in Figure 6.

图1所示实施例中已经提及一种根据风险关键词确定潜在风险文本的目标位置的方法。可选地,风险识别装置还可以通过分析特征向量的方式确定目标位置。A method of determining the target location of potential risk text based on risk keywords has been mentioned in the embodiment shown in Figure 1 . Optionally, the risk identification device can also determine the target location by analyzing the feature vector.

具体地,风险识别装置对用户产生的对话文本中的各条文本分别进行特征提取,以得到具有较低维度的各条文本各自的特征向量。然后,风险识别装置可以根据各条文本各自的特征向量,确定对话文本中是否存在潜在风险文本。若风险识别装置确定出对话文本中存在潜在风险文本,则可以进一步得到潜在风险文本的目标位置。Specifically, the risk identification device performs feature extraction on each text in the dialogue text generated by the user to obtain a feature vector of each text with a lower dimension. Then, the risk identification device can determine whether there is potential risk text in the dialogue text based on the respective feature vectors of each text. If the risk identification device determines that there is a potential risk text in the dialogue text, the target location of the potential risk text can be further obtained.

并且上述特征提取和潜在风险文本定位的过程可以由同一模型即第一模型执行,也可以由不同模型执行。可选地,第一模型可以包括卷积神经网络(Convolutional NeuralNetworks,简称CNN)、转换器的双向编码器表示(Bidirectional Encoder Representationfrom Transformers,简称Bert)模型、循环神经网络(Recurrent Neural Network,简称RNN)以及长短期记忆网络(Long Short-Term Memory,简称LSTM)中的任一种。And the above process of feature extraction and potential risk text positioning can be performed by the same model, that is, the first model, or can be performed by different models. Optionally, the first model may include a convolutional neural network (Convolutional NeuralNetworks, CNN for short), a Bidirectional Encoder Representation from Transformers (Bert for short) model, and a recurrent neural network (Recurrent Neural Network, RNN for short). And any of the Long Short-Term Memory (LSTM) networks.

图1所示实施例已经公开了风险识别装置可以将目标位置作为起点,截取目标文本的过程。可选地,在截取目标文本之前,风险识别装置还可以先划分对话文本,并根据划分结果进行目标文本的截取。可选地,风险识别装置可以按照对话主题对用户产生的对话文本进行划分,也可以按照对话时长对用户产生的对话文本进行划分。The embodiment shown in Figure 1 has disclosed a process in which the risk identification device can use the target position as a starting point to intercept the target text. Optionally, before intercepting the target text, the risk identification device may also divide the conversation text first, and intercept the target text based on the division results. Optionally, the risk identification device may divide the dialogue text generated by the user according to the dialogue topic, or may divide the dialogue text generated by the user according to the dialogue duration.

当按照对话主题对用户产生的对话文本进行划分时,图1所示的方法可以进一步细化为图2。图2为本发明实施例提供的另一种风险识别方法的流程图。如图2所示,该方法可以包括以下步骤:When the user-generated dialogue text is divided according to dialogue topics, the method shown in Figure 1 can be further refined into Figure 2. Figure 2 is a flow chart of another risk identification method provided by an embodiment of the present invention. As shown in Figure 2, the method may include the following steps:

S201,确定潜在风险文本在对话文本中的目标位置。S201: Determine the target position of the potential risk text in the dialogue text.

上述步骤S201的具体实现过程可以参见图1所示实施例中相关步骤的具体描述,在此不再赘述。For the specific implementation process of the above step S201, please refer to the specific description of the relevant steps in the embodiment shown in Figure 1, and will not be described again here.

S202,按照对话主题将对话文本划分为至少一个对话段。S202: Divide the dialogue text into at least one dialogue segment according to the dialogue topic.

S203,以目标位置为起点,按照预设长度从潜在风险文本所属的目标对话段中截取目标文本,目标文本中的其他对话文本在潜在风险文本之前产生。S203, starting from the target position, intercept the target text from the target dialogue segment to which the potential risk text belongs according to a preset length, and other dialogue texts in the target text are generated before the potential risk text.

风险识别装置可以按照对话主题,将对话文本划分为至少一个对话段,并以目标位置为起点,按照预设长度从潜在风险文本所属的目标对话段中截取目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生,即目标文本由潜在风险文本及其上文构成。The risk identification device can divide the dialogue text into at least one dialogue segment according to the dialogue topic, and use the target position as a starting point to intercept the target text from the target dialogue segment to which the potential risk text belongs according to a preset length. Among them, other dialogue texts in the target text are generated before the potential risk text, that is, the target text consists of the potential risk text and its upper text.

对于对话主题的划分,可选地,风险识别装置中的语义分析模块可以根据能够明确表征用户主要意图的关键词,进行对话主题的识别分析,以得到不同对话主题对应的对话段。可选地,语义分析模块可以根据能够明确表征对话主题切换的关键词,比如:“聊聊下一个事”或者“昨天说的租金今天再聊聊”,以得到不同对话主题对应的对话段。For the division of dialogue topics, optionally, the semantic analysis module in the risk identification device can perform identification and analysis of dialogue topics based on keywords that can clearly characterize the user's main intention to obtain dialogue segments corresponding to different dialogue topics. Optionally, the semantic analysis module can obtain dialogue segments corresponding to different dialogue topics based on keywords that can clearly represent the switching of dialogue topics, such as: "Let's talk about the next thing" or "Let's talk about the rent yesterday."

在得到至少一个对话段后,风险识别装置可以从至少一个对话段中找到潜在风险文本所属的目标对话段,并以潜在风险文本的目标位置为起点,从目标对话段中截取预设长度的目标文本,即截取出的目标文本是同一对话主题下的并且符合预设长度的文本。After obtaining at least one dialogue segment, the risk identification device can find the target dialogue segment to which the potential risk text belongs from the at least one dialogue segment, and use the target position of the potential risk text as a starting point to intercept a target of a preset length from the target dialogue segment. Text, that is, the intercepted target text is the text under the same conversation topic and meets the preset length.

可选地,若目标对话段的长度小于预设长度,则无需再对目标对话段进行截取,可以直接将目标对话段确定为目标文本。即目标文本是优先将对话主题作为依据进行截取的,该种方式能够保证目标文本的语义完整程度,并且能够防止属于其他主题的对话文本被划分到目标文本中,对后续风险识别造成干扰。Optionally, if the length of the target dialogue segment is less than the preset length, there is no need to intercept the target dialogue segment, and the target dialogue segment can be directly determined as the target text. That is, the target text is intercepted based on the dialogue topic first. This method can ensure the semantic integrity of the target text and prevent dialogue texts belonging to other topics from being divided into the target text, causing interference to subsequent risk identification.

可选地,若目标对话段的长度大于预设长度,则第二模型可以删除目标对话段中的常识文本或者与该对话主题没有直接关联的通用话术文本,将目标对话段中的剩余文本确定为目标文本。并且若目标对话段删除部分文本后的长度仍大于预设长度,则还可以以时间为维度,进一步删除与目标位置对应的时间相差较远的文本。Optionally, if the length of the target conversation segment is greater than the preset length, the second model can delete the common sense text in the target conversation segment or the common discourse text that is not directly related to the conversation topic, and replace the remaining text in the target conversation segment with Identified as the target text. And if the length of the target dialogue segment after deleting part of the text is still greater than the preset length, time can also be used as the dimension to further delete text that is far away from the time corresponding to the target position.

S204,将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型确定潜在风险文本是否存在风险。S204, input the target text containing the potential risk text and the risk identification text as prompt information into the generative model, so that the generative model determines whether the potential risk text has risks.

上述步骤S204的具体实现过程可以参见图1所示实施例中相关步骤的具体描述,在此不再赘述。For the specific implementation process of the above step S204, please refer to the specific description of the relevant steps in the embodiment shown in FIG. 1, which will not be described again here.

本实施例中,风险识别装置可以按照对话主题将对话文本划分为至少一个对话段,即每个对话段的语义都是完整的。此时,风险识别装置以目标位置为起点从潜在风险文本所属的目标对话段中截取目标文本,即截取出的目标文本与潜在风险文本具有相同对话主题。并且通过划分对话主题的方式还能够防止属于其他主题的对话文本被划分到该目标文本中,也即是防止其他话题的对话文本对后续风险识别造成干扰,从而提高后续文本风险识别的准确性。In this embodiment, the risk identification device can divide the dialogue text into at least one dialogue segment according to the dialogue topic, that is, the semantics of each dialogue segment is complete. At this time, the risk identification device uses the target position as a starting point to intercept the target text from the target dialogue segment to which the potential risk text belongs, that is, the intercepted target text and the potential risk text have the same dialogue topic. Moreover, by dividing the dialogue topics, it can also prevent dialogue texts belonging to other topics from being divided into the target text, that is, preventing dialogue texts on other topics from interfering with subsequent risk identification, thereby improving the accuracy of subsequent text risk identification.

另外,本实施例中未详细描述的内容以及所能实现的技术效果均可以参见上述各实施例中的相关描述,在此不再赘述。In addition, the content not described in detail in this embodiment and the technical effects that can be achieved can be referred to the relevant descriptions in the above embodiments, and will not be described again here.

图2所示实施例提及的截取目标文本的过程具体可以由第二模型执行,即图2所示的方法可以进一步细化为图3所示的方法。可选地,第二模型可以包括CNN、Bert、RNN和LSTM中的任一种。则图3为本发明实施例提供的又一种风险识别方法的流程图。如图3所示,该方法可以包括以下步骤:The process of intercepting the target text mentioned in the embodiment shown in FIG. 2 can be specifically executed by the second model, that is, the method shown in FIG. 2 can be further refined into the method shown in FIG. 3 . Optionally, the second model may include any one of CNN, Bert, RNN, and LSTM. Figure 3 is a flow chart of yet another risk identification method provided by an embodiment of the present invention. As shown in Figure 3, the method may include the following steps:

S301,确定潜在风险文本在对话文本中的目标位置。S301. Determine the target position of the potential risk text in the dialogue text.

上述步骤S301的具体实现过程可以参见图1所示实施例中相关步骤的具体描述,在此不再赘述。For the specific implementation process of the above step S301, please refer to the specific description of the relevant steps in the embodiment shown in FIG. 1, which will not be described again here.

S302,将对话文本、目标位置以及预设长度输入第二模型,以由第二模型将对话文本划分为至少一个对话段,并输出以目标位置为起点,按照预设长度从潜在风险文本所属的目标对话段中截取出的目标文本,目标文本中的其他对话文本在潜在风险文本之前产生。S302. Input the dialogue text, target position and preset length into the second model, so that the second model divides the dialogue text into at least one dialogue segment, and outputs starting from the target position and starting from the preset length from the segment to which the potential risk text belongs. The target text is intercepted from the target dialogue segment, and other dialogue texts in the target text are generated before the potential risk text.

风险识别装置可以将对话文本、潜在风险文本的目标位置以及预设长度输入内置的第二模型。第二模型可以将对话文本划分为至少一个对话段,并根据预设长度的要求,输出以目标位置为起点,从潜在风险文本所属的目标对话段中截取出的、符合该预设长度的目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生,即目标文本由潜在风险文本及其上文构成。The risk identification device can input the dialogue text, the target location of the potential risk text and the preset length into the built-in second model. The second model can divide the dialogue text into at least one dialogue segment, and according to the preset length requirement, output a target that meets the preset length and is intercepted from the target dialogue segment to which the potential risk text belongs, starting from the target position. text. Among them, other dialogue texts in the target text are generated before the potential risk text, that is, the target text consists of the potential risk text and its upper text.

S303,将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型确定潜在风险文本是否存在风险。S303: Enter the target text containing the potential risk text and the risk identification text as prompt information into the generative model, so that the generative model determines whether the potential risk text is risky.

上述步骤S303的具体实现过程可以参见图1所示实施例中相关步骤的具体描述,在此不再赘述。For the specific implementation process of the above step S303, please refer to the specific description of the relevant steps in the embodiment shown in FIG. 1, which will not be described again here.

本实施例中,通过第二模型较强的学习能力,能够实现对话文本的主题划分,以及截取出与潜在风险文本具有相同对话主题的目标文本。并且通过划分对话主题的方式还能够防止属于其他主题的对话文本被划分到该目标文本中,也即是防止其他话题的对话文本对后续风险识别造成干扰,从而提高后续文本风险识别的准确性。In this embodiment, through the strong learning ability of the second model, the topic division of the dialogue text can be achieved, and the target text with the same dialogue topic as the potential risk text can be intercepted. Moreover, by dividing the dialogue topics, it can also prevent dialogue texts belonging to other topics from being divided into the target text, that is, preventing dialogue texts on other topics from interfering with subsequent risk identification, thereby improving the accuracy of subsequent text risk identification.

另外,本实施例中未详细描述的内容以及所能实现的技术效果均可以参见上述各实施例中的相关描述,在此不再赘述。In addition, the content not described in detail in this embodiment and the technical effects that can be achieved can be referred to the relevant descriptions in the above embodiments, and will not be described again here.

根据上述图3所示实施例描述的风险识别装置的使用过程可知,风险识别装置在截取目标文本时可以使用第二模型。则为了保证第二模型截取目标文本的准确性,风险识别装置中的训练模块可以利用有监督训练的方式对第二模型进行训练。则图4为一种模型训练方法的流程图。如图4所示,该方法可以包括以下步骤:According to the use process of the risk identification device described in the embodiment shown in FIG. 3, it can be known that the risk identification device can use the second model when intercepting the target text. In order to ensure the accuracy of the second model in intercepting the target text, the training module in the risk identification device can train the second model using supervised training. Figure 4 is a flow chart of a model training method. As shown in Figure 4, the method may include the following steps:

S401,获取训练对话文本、风险文本在训练对话文本中训练位置,以及训练对话文本中包含风险文本的参考文本。S401: Obtain the training dialogue text, the training position of the risk text in the training dialogue text, and the reference text containing the risk text in the training dialogue text.

可选地,训练模块可以从训练对话文本集中获取训练对话文本,并且训练对话文本集可以是预先收集的、适用于任一企业的专有文本集。其中,包含风险文本的参考文本是包含在训练对话文本中的,风险文本在训练对话文本中的训练位置可以通过预先处理得到。Optionally, the training module can obtain the training dialogue text from the training dialogue text set, and the training dialogue text set can be a pre-collected proprietary text set suitable for any enterprise. Among them, the reference text containing the risk text is included in the training dialogue text, and the training position of the risk text in the training dialogue text can be obtained through pre-processing.

S402,将训练对话文本、训练位置以及预设长度输入第二模型,以由第二模型输出以训练位置为起点,按照预设长度从训练文本中截取出的预测文本。S402: Input the training dialogue text, training position and preset length into the second model, so that the second model outputs predicted text starting from the training position and intercepted from the training text according to the preset length.

S403,对参考文本和预测文本进行损失计算。S403, perform loss calculation on the reference text and predicted text.

S404,根据计算结果训练第二模型。S404, train the second model based on the calculation results.

基于步骤S401获取的训练对话文本、训练位置以及参考文本,训练模块可以将训练对话文本、训练位置以及预设长度作为训练数据,将参考文本作为监督信息对第二模型进行训练,以由第二模型输出以训练位置为起点,按照预设长度从训练文本中截取出的预测文本。Based on the training dialogue text, training location and reference text obtained in step S401, the training module can use the training dialogue text, training location and preset length as training data, and use the reference text as supervision information to train the second model to use the second model. The model output starts from the training position and extracts the predicted text from the training text according to the preset length.

接着,训练模块可以对参考文本和预测文本进行损失计算,并根据得到的损失计算结果对第二模型进行优化。可选地,损失计算过程使用的损失函数可以包括交叉熵损失函数、对数损失函数和平方损失函数中的一种。Then, the training module can perform loss calculation on the reference text and predicted text, and optimize the second model based on the obtained loss calculation results. Optionally, the loss function used in the loss calculation process may include one of a cross-entropy loss function, a logarithmic loss function, and a square loss function.

本实施例中,利用有监督训练的方式即借助包含风险文本的参考文本训练第二模型,使得第二模型的训练效果更好,进一步地,在模型使用阶段时第二模型输出的预测文本也更准确。In this embodiment, a supervised training method is used, that is, the second model is trained with reference text containing risk text, so that the training effect of the second model is better. Furthermore, during the model use stage, the predicted text output by the second model is also more acurrate.

另外,本实施例中未详细描述的内容以及所能实现的技术效果均可以参见上述各实施例中的相关描述,在此不再赘述。In addition, the content not described in detail in this embodiment and the technical effects that can be achieved can be referred to the relevant descriptions in the above embodiments, and will not be described again here.

为了便于理解,还可以从整个系统的角度说明上述各实施例提供的风险识别方法。图5为本发明实施例提供的一种风险识别系统的结构示意图,如图5所示,该风险识别系统包括:客户端和部署有生成式模型的服务端。To facilitate understanding, the risk identification methods provided by the above embodiments can also be described from the perspective of the entire system. Figure 5 is a schematic structural diagram of a risk identification system provided by an embodiment of the present invention. As shown in Figure 5, the risk identification system includes: a client and a server deployed with a generative model.

可选地,在实际中,考虑到客户端的处理能力不同,客户端和服务端可以执行上述实施例中的不同步骤来实现文本的风险识别。Optionally, in practice, considering the different processing capabilities of the client, the client and the server can perform different steps in the above embodiment to implement text risk identification.

当客户端处理能力较高时,客户端在获取到对话文本后,可以先确定对话文本中是否存在潜在风险文本,若存在,则进一步确定潜在风险文本在对话文本中的目标位置。然后,客户端可以将该目标位置作为起点,从对话文本中截取预设长度的目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生。接着,客户端可以将目标文本发送至服务端。When the client's processing capability is high, after obtaining the dialogue text, the client can first determine whether there is potentially risky text in the dialogue text, and if so, further determine the target location of the potential risk text in the dialogue text. The client can then use this target location as a starting point to intercept a preset length of target text from the conversation text. Among them, other dialogue texts in the target text are generated before the potential risk text. The client can then send the target text to the server.

服务端可以将风险识别文本以及客户端发送的包含潜在风险文本的目标文本作为提示信息,输入自身部署的生成式模型中,以由生成式模型确定潜在风险文本是否存在风险。The server can use the risk identification text and the target text containing the potential risk text sent by the client as prompt information and input it into the generative model deployed by itself, so that the generative model can determine whether the potential risk text is risky.

当客户端处理能力较低时,客户端可以发送用户输入的潜在风险文本至服务端,以由服务端接收该潜在风险文本并进一步进行风险识别。具体地,服务端可以接收潜在风险文本,以由潜在风险文本和在潜在风险文本之前产生的对话文本构成对话文本。然后,服务端可以确定潜在风险文本在对话文本中的目标位置,并以目标位置为起点,从对话文本中截取预设长度的目标文本。其中,目标文本中的其他对话文本在潜在风险文本之前产生。接着,服务端再将包含潜在风险文本的目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由生成式模型确定潜在风险文本是否存在风险。When the client's processing capability is low, the client can send potential risk text input by the user to the server, so that the server can receive the potential risk text and further identify the risk. Specifically, the server may receive the potential risk text to form a dialogue text consisting of the potential risk text and the dialogue text generated before the potential risk text. Then, the server can determine the target position of the potentially risky text in the conversation text, and use the target position as a starting point to intercept the target text of a preset length from the conversation text. Among them, other dialogue texts in the target text are generated before the potential risk text. Then, the server inputs the target text containing the potential risk text and the risk identification text as prompt information into the generative model, so that the generative model determines whether the potential risk text is risky.

本发明实施例提供的风险识别系统包括:客户端和部署有生成式模型的服务端。考虑到客户端的实际处理能力不同,客户端和服务端可以执行上述实施例中的不同步骤来实现文本的风险识别。基于潜在风险文本所处的目标位置,该系统能够进一步提取出潜在风险文本的上文。则服务端中部署的生成式模型可以借助潜在风险文本及其上文对潜在风险文本再次进行风险识别,也即是通过联系上文语境对潜在风险文本进行风险识别,以提高文本风险识别的准确性。The risk identification system provided by the embodiment of the present invention includes: a client and a server deployed with a generative model. Considering that the actual processing capabilities of the client are different, the client and the server can perform different steps in the above embodiments to implement text risk identification. Based on the target location of the potentially risky text, the system can further extract the context of the potentially risky text. Then the generative model deployed in the server can use the potential risk text and its context to identify the potential risk text again, that is, the potential risk text is risk identified by contacting the context above to improve the effectiveness of text risk identification. accuracy.

另外,本实施例中未详细描述的内容以及所能实现的技术效果均可以参见上述各实施例中的相关描述,在此不再赘述。In addition, the content not described in detail in this embodiment and the technical effects that can be achieved can be referred to the relevant descriptions in the above embodiments, and will not be described again here.

为了便于理解,可以结合企业招聘场景对以上提供的风险识别方法的具体实现过程进行示例性说明。则图6为本发明实施例提供的一种企业招聘场景下的风险识别过程的示意图,该过程具体如下:For ease of understanding, the specific implementation process of the risk identification method provided above can be illustratively described in conjunction with an enterprise recruitment scenario. Figure 6 is a schematic diagram of a risk identification process in an enterprise recruitment scenario provided by an embodiment of the present invention. The process is specifically as follows:

在企业招聘场景下,假设招聘者和应聘者之间共产生7条对话文本,具体如下:In the corporate recruitment scenario, it is assumed that a total of 7 dialogue texts are generated between the recruiter and the applicant, as follows:

招聘者:“你好,我是负责本次招聘的经理,请你简单介绍下你的工作经历。”;Recruiter: "Hello, I am the manager responsible for this recruitment. Please briefly introduce your work experience.";

应聘者:“好的,我之前在企业A工作5年,具有充足的工作经验。”;Applicant: "Okay, I have worked in Company A for 5 years and have sufficient work experience.";

招聘者:“挺好,那我们聊聊下一个事儿,距离我们公司不到一百米的地方有一家银行,我们公司和该银行有合作关系,公司员工可以随时在银行里办理取钱,存钱以及转账等业务。”;Recruiter: "Very good, let's talk about the next thing. There is a bank less than a hundred meters away from our company. Our company has a cooperative relationship with the bank. Company employees can withdraw money at the bank at any time. Services such as depositing money and transferring money.";

应聘者:“那还挺方便的。”;Applicant: "That's quite convenient.";

招聘者:“公司规定新员工入职前需要去该银行办理一个业务。”;Recruiter: "The company stipulates that new employees need to go to the bank to handle a business before joining the company.";

应聘者:“具体什么业务?”;Applicant: "What specific business is it?";

招聘者:“入职培训费用,这个是正规费用。”。Recruiter: "Onboarding training fee, this is a formal fee.".

则风险识别装置在获取到上述7条对话文本后,可以将这7条对话文本输入内置的第一模型,以由第一模型根据各条文本各自的特征向量,输出潜在风险文本“入职培训费用,这个是正规费用”的位置,即第7条对话文本所处的位置。After acquiring the above 7 dialogue texts, the risk identification device can input the 7 dialogue texts into the built-in first model, so that the first model can output the potential risk text "Induction Training Fee" based on the respective feature vectors of each text. , this is the position of "regular fee", which is where the dialogue text of Article 7 is located.

然后,风险识别装置可以利用内置的第二模型对7条对话文本进行对话主题划分,得到包括第1~2条对话文本的第一对话段,以及包括第3~7条对话文本的第二对话段。Then, the risk identification device can use the built-in second model to divide the seven dialogue texts into dialogue topics, and obtain the first dialogue segment including the 1st to 2nd dialogue texts, and the second dialogue segment including the 3rd to 7th dialogue texts. part.

接着,风险识别装置可以从两个对话段中找到第7条对话文本所属的第二对话段,并以第7条对话文本的位置为起点,从第二对话段中截取预设长度的文本。假设预设长度大于第二对话段的长度,则第二模型可以输出第二对话段即由第7条对话文本与其上文构成的第3~7条对话文本。若预设长度小于第二对话段的长度,则可以参见上述各实施例中提及的处理过程,比如删除第二对话段中与对话主题无关的文本,在此不再赘述。其中,第二模型的训练过程可以参见上述各实施例中的描述,在此不再赘述。Then, the risk identification device can find the second dialogue segment to which the seventh dialogue text belongs from the two dialogue segments, and use the position of the seventh dialogue text as a starting point to intercept a preset length of text from the second dialogue segment. Assuming that the preset length is greater than the length of the second dialogue segment, the second model can output the second dialogue segment, that is, the 3rd to 7th dialogue texts composed of the 7th dialogue text and its preceding text. If the preset length is less than the length of the second dialogue segment, you may refer to the processing procedures mentioned in the above embodiments, such as deleting text in the second dialogue segment that is irrelevant to the dialogue topic, which will not be described again here. For the training process of the second model, reference can be made to the descriptions in the above embodiments, and details will not be described again here.

最终,风险识别装置将第3~7条对话文本以及“检测下面对话内容中的‘入职培训费用,这个是正规费用’是否涉及欺骗,直接回答是或者否”作为提示信息,输入生成式模型中,以由生成式模型输出‘入职培训费用,这个是正规费用’是存在风险。并将风险识别结果“该条对话文本存在风险”作为风险提示信息展示在用户终端的显示界面上,以提示用户规避风险。Finally, the risk identification device takes the 3rd to 7th dialogue text and "Check whether the 'entry training fee, this is a formal fee' in the following dialogue content involves deception, and directly answer yes or no" as prompt information, and input it into the generative model , there is a risk in outputting 'onboarding training expenses, this is a formal expense' from the generative model. The risk identification result "This dialogue text is risky" is displayed on the display interface of the user terminal as risk prompt information to prompt the user to avoid risks.

另外,本实施例中未详细描述的内容以及所能实现的技术效果均可以参见上述各实施例中的相关描述,在此不再赘述。In addition, the content not described in detail in this embodiment and the technical effects that can be achieved can be referred to the relevant descriptions in the above embodiments, and will not be described again here.

以下将详细描述本发明的一个或多个实施例的风险识别装置。本领域技术人员可以理解,这些风险识别装置均可使用市售的硬件组件通过本方案所教导的步骤进行配置来构成。The risk identification device of one or more embodiments of the present invention will be described in detail below. Those skilled in the art can understand that these risk identification devices can be constructed using commercially available hardware components and configured through the steps taught in this solution.

图7为本发明实施例提供的一种风险识别装置的结构示意图,如图7所示,该装置包括:Figure 7 is a schematic structural diagram of a risk identification device provided by an embodiment of the present invention. As shown in Figure 7, the device includes:

位置确定模块11,用于确定潜在风险文本在对话文本中的目标位置。The position determination module 11 is used to determine the target position of the potential risk text in the dialogue text.

目标文本确定模块12,用于以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生。The target text determination module 12 is configured to take the target position as a starting point and intercept a target text of a preset length from the dialogue text. Other dialogue texts in the target text are generated before the potential risk text.

风险确定模块13,用于将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The risk determination module 13 is configured to input the target text containing the potential risk text and the risk identification text as prompt information into a generative model, so that the generative model determines whether the potential risk text contains risks.

可选地,所述位置确定模块11,用于分别对所述对话文本中各条文本进行特征提取;根据所述各条文本各自的特征向量,确定所述对话文本中是否存在潜在风险文本;若存在所述潜在风险文本,则确定所述潜在风险文本的所述目标位置。Optionally, the location determination module 11 is configured to perform feature extraction on each text in the dialogue text; determine whether there is a potential risk text in the dialogue text based on the respective feature vectors of each text; If the potential risk text exists, the target location of the potential risk text is determined.

可选地,所述位置确定模块11,还用于将所述对话文本输入第一模型,以由所述第一模型根据所述对话文本中各条文本各自的特征向量,输出所述潜在风险文本的所述目标位置。Optionally, the location determination module 11 is also configured to input the dialogue text into a first model, so that the first model outputs the potential risk according to the respective feature vectors of each text in the dialogue text. The target location of the text.

可选地,所述目标文本确定模块12,用于按照对话主题将所述对话文本划分为至少一个对话段;以所述目标位置为起点,按照预设长度从所述潜在风险文本所属的目标对话段中截取所述目标文本。Optionally, the target text determination module 12 is configured to divide the dialogue text into at least one dialogue segment according to the dialogue topic; starting from the target position, and starting from the target to which the potential risk text belongs according to a preset length. The target text is intercepted from the dialogue segment.

可选地,所述目标文本确定模块12,用于若所述目标对话段的长度小于所述预设长度,则将所述目标对话段确定为所述目标文本。Optionally, the target text determination module 12 is configured to determine the target dialogue segment as the target text if the length of the target dialogue segment is less than the preset length.

可选地,所述目标文本确定模块12,用于将所述对话文本、所述目标位置以及所述预设长度输入第二模型,以由所述第二模型将所述对话文本划分为至少一个对话段,并输出以所述目标位置为起点,按照所述预设长度从所述潜在风险文本所属的目标对话段中截取出的所述目标文本。Optionally, the target text determination module 12 is configured to input the dialogue text, the target position and the preset length into a second model, so that the second model divides the dialogue text into at least A dialogue segment, and output the target text that is intercepted from the target dialogue segment to which the potential risk text belongs according to the preset length, starting from the target position.

可选地,所述装置还包括:训练模块14,用于获取训练对话文本、风险文本在所述训练对话文本中训练位置,以及所述训练对话文本中包含所述风险文本的参考文本;将所述训练对话文本、所述训练位置以及预设长度输入所述第二模型,以由所述第二模型输出以所述训练位置为起点,按照所述预设长度从所述训练文本中截取出的预测文本;对所述参考文本和所述预测文本进行损失计算;根据计算结果训练所述第二模型。Optionally, the device further includes: a training module 14 for obtaining the training dialogue text, the training position of the risk text in the training dialogue text, and the reference text containing the risk text in the training dialogue text; The training dialogue text, the training position and the preset length are input to the second model, so that the output of the second model is intercepted from the training text according to the preset length starting from the training position. the predicted text; perform loss calculation on the reference text and the predicted text; and train the second model according to the calculation results.

图7所示装置可以执行图1至图4所示实施例的方法,本实施例未详细描述的部分,可参考对图1至图4所示实施例的相关说明。该技术方案的执行过程和技术效果参见图1至图4所示实施例中的描述,在此不再赘述。The device shown in Figure 7 can perform the method of the embodiment shown in Figures 1 to 4. For parts not described in detail in this embodiment, reference can be made to the relevant description of the embodiment shown in Figures 1 to 4. For the implementation process and technical effects of this technical solution, please refer to the description in the embodiment shown in Figures 1 to 4, and will not be described again here.

在一个可能的设计中,风险识别装置的结构可实现为一电子设备,如图8所示,该电子设备可以包括:处理器21和存储器22。其中,所述存储器22用于存储支持该电子设备执行上述图1至图4所示实施例中提供的风险识别方法的程序,所述处理器21被配置为用于执行所述存储器22中存储的程序。In a possible design, the structure of the risk identification device can be implemented as an electronic device, as shown in FIG. 8 . The electronic device can include: a processor 21 and a memory 22 . The memory 22 is used to store a program that supports the electronic device to execute the risk identification method provided in the embodiment shown in FIGS. 1 to 4 , and the processor 21 is configured to execute the program stored in the memory 22 . program of.

所述程序包括一条或多条计算机指令,其中,所述一条或多条计算机指令被所述处理器21执行时能够实现如下步骤:The program includes one or more computer instructions, wherein when the one or more computer instructions are executed by the processor 21, the following steps can be implemented:

确定潜在风险文本在对话文本中的目标位置;Determine where potentially risky text should be targeted within the conversation text;

以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;Taking the target position as a starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text;

将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The target text and the risk identification text containing the potential risk text are input into the generative model as prompt information, so that the generative model determines whether the potential risk text contains risks.

可选地,所述处理器21还用于执行前述图1至图4所示实施例中的全部或部分步骤。Optionally, the processor 21 is also configured to execute all or part of the steps in the aforementioned embodiments shown in FIGS. 1 to 4 .

其中,所述电子设备的结构中还可以包括通信接口23,用于该电子设备与其他设备或通信网络通信。Wherein, the structure of the electronic device may also include a communication interface 23 for the electronic device to communicate with other devices or communication networks.

另外,本发明实施例提供了一种计算机存储介质,用于储存上述电子设备所用的计算机软件指令,其包含用于执行上述图1至图4所示方法实施例中风险识别方法所涉及的程序。In addition, an embodiment of the present invention provides a computer storage medium for storing computer software instructions used in the above-mentioned electronic device, which includes programs involved in executing the risk identification method in the above-mentioned method embodiments shown in FIGS. 1 to 4 .

本发明实施例还提供了一种计算机程序产品,该计算机程序产品包括计算机程序指令,这些计算机程序指令被处理器读取并运行时,执行上述图1至图4所示方法实施例中的风险识别方法。Embodiments of the present invention also provide a computer program product. The computer program product includes computer program instructions. When these computer program instructions are read and run by a processor, the risks involved in executing the method embodiments shown in FIGS. 1 to 4 above are eliminated. recognition methods.

最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that it can still be used Modifications are made to the technical solutions described in the foregoing embodiments, or equivalent substitutions are made to some of the technical features; however, these modifications or substitutions do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (12)

1.一种风险识别方法,其特征在于,包括:1. A risk identification method, characterized by including: 确定潜在风险文本在对话文本中的目标位置;Determine where potentially risky text should be targeted within the conversation text; 以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;Taking the target position as a starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text; 将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The target text and the risk identification text containing the potential risk text are input into the generative model as prompt information, so that the generative model determines whether the potential risk text contains risks. 2.根据权利要求1所述的方法,其特征在于,所述确定潜在风险文本在对话文本中的目标位置,包括:2. The method according to claim 1, characterized in that determining the target position of the potential risk text in the dialogue text includes: 分别对所述对话文本中各条文本进行特征提取;Perform feature extraction on each text in the dialogue text respectively; 根据所述各条文本各自的特征向量,确定所述对话文本中是否存在潜在风险文本;Determine whether there is potential risk text in the dialogue text according to the respective feature vectors of each text; 若存在所述潜在风险文本,则确定所述潜在风险文本的所述目标位置。If the potential risk text exists, the target location of the potential risk text is determined. 3.根据权利要求1所述的方法,其特征在于,所述确定潜在风险文本在对话文本中的目标位置,包括:3. The method according to claim 1, characterized in that determining the target position of the potential risk text in the dialogue text includes: 将所述对话文本输入第一模型,以由所述第一模型根据所述对话文本中各条文本各自的特征向量,输出所述潜在风险文本的所述目标位置。The dialogue text is input into the first model, so that the first model outputs the target position of the potential risk text according to the respective feature vectors of each text in the dialogue text. 4.根据权利要求1所述的方法,其特征在于,所述以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,包括:4. The method according to claim 1, characterized in that taking the target position as a starting point and intercepting a target text of a preset length from the conversation text includes: 按照对话主题将所述对话文本划分为至少一个对话段;Divide the dialogue text into at least one dialogue segment according to the dialogue topic; 以所述目标位置为起点,按照预设长度从所述潜在风险文本所属的目标对话段中截取所述目标文本。Taking the target position as a starting point, the target text is intercepted from the target dialogue segment to which the potential risk text belongs according to a preset length. 5.根据权利要求4所述的方法,其特征在于,所述方法还包括:5. The method according to claim 4, characterized in that, the method further comprises: 若所述目标对话段的长度小于所述预设长度,则将所述目标对话段确定为所述目标文本。If the length of the target dialogue segment is less than the preset length, the target dialogue segment is determined as the target text. 6.根据权利要求1所述的方法,其特征在于,所述以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,包括:6. The method according to claim 1, characterized in that taking the target position as a starting point and intercepting a target text of a preset length from the conversation text includes: 将所述对话文本、所述目标位置以及所述预设长度输入第二模型,以由所述第二模型将所述对话文本划分为至少一个对话段,并输出以所述目标位置为起点,按照所述预设长度从所述潜在风险文本所属的目标对话段中截取出的所述目标文本。Input the dialogue text, the target position and the preset length into a second model, so that the second model divides the dialogue text into at least one dialogue segment, and outputs the target position as a starting point, The target text is intercepted from the target dialogue segment to which the potential risk text belongs according to the preset length. 7.根据权利要求6所述的方法,其特征在于,所述方法还包括:7. The method according to claim 6, characterized in that the method further comprises: 获取训练对话文本、风险文本在所述训练对话文本中训练位置,以及所述训练对话文本中包含所述风险文本的参考文本;Obtain the training dialogue text, the training position of the risk text in the training dialogue text, and the reference text containing the risk text in the training dialogue text; 将所述训练对话文本、所述训练位置以及预设长度输入所述第二模型,以由所述第二模型输出以所述训练位置为起点,按照所述预设长度从所述训练文本中截取出的预测文本;The training dialogue text, the training position and the preset length are input to the second model, so that the second model outputs the training text from the training text according to the preset length starting from the training position. The intercepted predictive text; 对所述参考文本和所述预测文本进行损失计算;Perform loss calculation on the reference text and the predicted text; 根据计算结果训练所述第二模型。The second model is trained according to the calculation results. 8.一种风险识别系统,其特征在于,所述系统包括:客户端和部署有生成式模型的服务端;8. A risk identification system, characterized in that the system includes: a client and a server deployed with a generative model; 所述客户端,用于确定潜在风险文本在对话文本中的目标位置;The client is used to determine the target position of the potential risk text in the conversation text; 以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;发送所述目标文本至所述服务端;Taking the target position as the starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text; send the target text to the server ; 所述服务端,用于将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入所述生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The server is configured to input the target text and risk identification text containing the potential risk text as prompt information into the generative model, so that the generative model determines whether the potential risk text exists. risk. 9.一种风险识别系统,其特征在于,所述系统包括:客户端和部署有生成式模型的服务端;9. A risk identification system, characterized in that the system includes: a client and a server deployed with a generative model; 所述客户端,用于发送用户输入的潜在风险文本;The client is used to send potentially risky text input by the user; 所述服务端,用于接收所述的潜在风险文本,以由所述潜在风险文本和在所述潜在风险文本之前产生的对话文本构成对话文本;The server is configured to receive the potential risk text to constitute a dialogue text consisting of the potential risk text and the dialogue text generated before the potential risk text; 确定潜在风险文本在对话文本中的目标位置;Determine where potentially risky text should be targeted within the conversation text; 以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;Taking the target position as a starting point, intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text; 将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入所述生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。The target text and risk identification text containing the potential risk text are input into the generative model as prompt information, so that the generative model determines whether the potential risk text contains risks. 10.一种风险识别装置,其特征在于,包括:10. A risk identification device, characterized in that it includes: 位置确定模块,用于确定潜在风险文本在对话文本中的目标位置;The position determination module is used to determine the target position of potentially risky text in the dialogue text; 目标文本确定模块,用于以所述目标位置为起点,从所述对话文本中截取预设长度的目标文本,所述目标文本中的其他对话文本在所述潜在风险文本之前产生;A target text determination module, configured to take the target position as a starting point and intercept a target text of a preset length from the dialogue text, and other dialogue texts in the target text are generated before the potential risk text; 风险确定模块,用于将包含所述潜在风险文本的所述目标文本以及风险识别文本作为提示信息,输入生成式模型中,以由所述生成式模型确定所述潜在风险文本是否存在风险。A risk determination module is configured to input the target text and risk identification text containing the potential risk text as prompt information into a generative model, so that the generative model determines whether the potential risk text contains risks. 11.一种电子设备,其特征在于,包括:存储器、处理器;其中,所述存储器上存储有可执行代码,当所述可执行代码被所述处理器执行时,使所述处理器执行如权利要求1至7中任一项所述的风险识别方法。11. An electronic device, characterized in that it includes: a memory and a processor; wherein executable code is stored on the memory, and when the executable code is executed by the processor, the processor is caused to execute The risk identification method according to any one of claims 1 to 7. 12.一种非暂时性机器可读存储介质,其特征在于,所述非暂时性机器可读存储介质上存储有可执行代码,当所述可执行代码被电子设备的处理器执行时,使所述处理器执行如权利要求1至7中任一项所述的风险识别方法。12. A non-transitory machine-readable storage medium, characterized in that executable code is stored on the non-transitory machine-readable storage medium. When the executable code is executed by a processor of an electronic device, The processor executes the risk identification method according to any one of claims 1 to 7.
CN202311198929.5A 2023-09-15 2023-09-15 Risk identification method, system, device, equipment and storage medium Pending CN117216662A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311198929.5A CN117216662A (en) 2023-09-15 2023-09-15 Risk identification method, system, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311198929.5A CN117216662A (en) 2023-09-15 2023-09-15 Risk identification method, system, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN117216662A true CN117216662A (en) 2023-12-12

Family

ID=89043845

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311198929.5A Pending CN117216662A (en) 2023-09-15 2023-09-15 Risk identification method, system, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN117216662A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160307201A1 (en) * 2010-11-29 2016-10-20 Biocatch Ltd. Contextual mapping of web-pages, and generation of fraud-relatedness score-values
CN113868398A (en) * 2021-10-14 2021-12-31 北京倍倾心智能科技中心(有限合伙) Dialogue dataset, construction method of security detection model, security evaluation method of dialogue system, medium and computing device
CN115017894A (en) * 2022-05-30 2022-09-06 建信金融科技有限责任公司 Public opinion risk identification method and device
WO2023165111A1 (en) * 2022-03-01 2023-09-07 达而观信息科技(上海)有限公司 Method and system for identifying user intention trajectory in customer service hotline

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160307201A1 (en) * 2010-11-29 2016-10-20 Biocatch Ltd. Contextual mapping of web-pages, and generation of fraud-relatedness score-values
CN113868398A (en) * 2021-10-14 2021-12-31 北京倍倾心智能科技中心(有限合伙) Dialogue dataset, construction method of security detection model, security evaluation method of dialogue system, medium and computing device
WO2023165111A1 (en) * 2022-03-01 2023-09-07 达而观信息科技(上海)有限公司 Method and system for identifying user intention trajectory in customer service hotline
CN115017894A (en) * 2022-05-30 2022-09-06 建信金融科技有限责任公司 Public opinion risk identification method and device

Similar Documents

Publication Publication Date Title
US10552544B2 (en) Methods and systems of automated assistant implementation and management
US11495227B2 (en) Artificial intelligence (AI) based user query intent analyzer
CN113782022B (en) Communication method, device, equipment and storage medium based on intention recognition model
CN111738018A (en) An intent understanding method, apparatus, device and storage medium
CN110782341A (en) Business collection method, device, equipment and medium
CN110955770A (en) Intelligent dialogue system
CN112818109A (en) Intelligent reply method, medium, device and computing equipment for mail
CN113239164A (en) Multi-round conversation process construction method and device, computer equipment and storage medium
CN113010664B (en) Data processing method, device and computer equipment
CN113988866A (en) A method for early warning and disposal of telecom network fraud based on big data analysis
CN113887214A (en) Artificial intelligence based wish presumption method and related equipment thereof
CN117609446A (en) Safe knowledge question-answering method and device
CN112784024A (en) Man-machine conversation method, device, equipment and storage medium
WO2024055603A1 (en) Method and apparatus for identifying text from minor
CN112992128B (en) Training method, device and system of intelligent voice robot
CN113111658A (en) Method, device, equipment and storage medium for checking information
CN117216662A (en) Risk identification method, system, device, equipment and storage medium
CN117932013A (en) LLM-based intention recognition method, device, equipment and storage medium
CN107623620A (en) Processing method, the webserver and the Intelligent dialogue system of randomness interaction data
CN116483972A (en) Method and device for acquiring message, electronic equipment and storage medium
CN110263135A (en) A kind of data exchange matching process, device, medium and electronic equipment
CN116596660A (en) Post-credit collection method, system and medium based on big data analysis and intelligent touch
CN112163066B (en) Artificial intelligent model training method, knowledge point pushing method, device and medium
CN110543556A (en) Dialogue configuration method, storage medium and electronic equipment
CN119136204B (en) Fraud prediction method, fraud prediction device, mobile equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载