+

CN110472995B - Store arrival prediction method and device, readable storage medium and electronic equipment - Google Patents

Store arrival prediction method and device, readable storage medium and electronic equipment Download PDF

Info

Publication number
CN110472995B
CN110472995B CN201910610494.8A CN201910610494A CN110472995B CN 110472995 B CN110472995 B CN 110472995B CN 201910610494 A CN201910610494 A CN 201910610494A CN 110472995 B CN110472995 B CN 110472995B
Authority
CN
China
Prior art keywords
store
user
search
user terminal
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910610494.8A
Other languages
Chinese (zh)
Other versions
CN110472995A (en
Inventor
张凯
王丛超
杨一帆
张弓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hanhai Information Technology Shanghai Co Ltd
Original Assignee
Hanhai Information Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hanhai Information Technology Shanghai Co Ltd filed Critical Hanhai Information Technology Shanghai Co Ltd
Priority to CN201910610494.8A priority Critical patent/CN110472995B/en
Publication of CN110472995A publication Critical patent/CN110472995A/en
Application granted granted Critical
Publication of CN110472995B publication Critical patent/CN110472995B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/52Network services specially adapted for the location of the user terminal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/029Location-based management or tracking services

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Databases & Information Systems (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Signal Processing (AREA)
  • Data Mining & Analysis (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本申请实施例提供了一种到店预测方法、装置、可读存储介质及电子设备,该方法包括:获得用户终端发起的搜索请求;对所述搜索请求进行特征提取,确定所述用户终端的用户的特征、所述搜索请求对应的店铺列表中各个店铺的特征以及用户‑店铺交叉特征;将所述用户的特征、所述各个店铺的特征以及所述用户‑店铺交叉特征输入预先训练的到店概率预测模型,确定所述用户到达所述各个店铺的概率;根据所述用户到达所述各个店铺的概率,确定所述用户是否到达目标店铺。本申请通过到店概率预测模型输出的各个店铺的概率确定出用户到达的目标店铺,实现了对用户的当前位置的准确定位,提高了用户终端向用户推荐的当前位置附近的店铺的可靠性。

The embodiment of the present application provides a store arrival prediction method, device, readable storage medium and electronic device, the method comprising: obtaining a search request initiated by a user terminal; performing feature extraction on the search request to determine the features of the user of the user terminal, the features of each store in the store list corresponding to the search request, and the user-store cross-features; inputting the features of the user, the features of each store, and the user-store cross-features into a pre-trained store arrival probability prediction model to determine the probability of the user arriving at each store; and determining whether the user arrives at the target store based on the probability of the user arriving at each store. The present application determines the target store that the user arrives at by using the probability of each store output by the store arrival probability prediction model, thereby achieving accurate positioning of the user's current location and improving the reliability of stores near the current location recommended to the user by the user terminal.

Description

到店预测方法、装置、可读存储介质及电子设备Store arrival prediction method, device, readable storage medium and electronic device

技术领域Technical Field

本申请涉及信息处理技术领域,尤其涉及一种到店预测方法、装置、可读存储介质及电子设备。The present application relates to the field of information processing technology, and in particular to a store arrival prediction method, device, readable storage medium and electronic device.

背景技术Background Art

随着移动互联网的发展,人们可以很方便地通过移动设备访问网络以获取服务,由此兴起了一批O2O(Online-to-Offline)本地生活化服务(例如:O2O附近搜索),极大的方便了人们的生活。以O2O附近搜索为例,用户通过该搜索功能可以查看前位置附近的美食、娱乐等生活资讯,该搜索功能的具体实现过程为:预先搜集各个商户的经纬度标注,实时获取用户的定位,计算用户的定位与已标注经纬度的各个商户的距离,将距离满足用户的筛选范围的商户按照一定的算法进行排序后得到筛选结果,并将筛选结果返回到用户使用的移动设备进行显示。With the development of mobile Internet, people can easily access the Internet through mobile devices to obtain services, thus a number of O2O (Online-to-Offline) local life services (such as O2O nearby search) have emerged, which greatly facilitates people's lives. Taking O2O nearby search as an example, users can use this search function to view food, entertainment and other life information near their current location. The specific implementation process of this search function is: pre-collect the longitude and latitude annotations of each merchant, obtain the user's location in real time, calculate the distance between the user's location and each merchant with annotated longitude and latitude, sort the merchants whose distance meets the user's screening range according to a certain algorithm to obtain the screening results, and return the screening results to the mobile device used by the user for display.

然而,上述搜索功能的具体实现过程存在一个问题:筛选结果过度依赖于用户的定位,当用户的定位未达到一定的准确度时,计算得到的用户的位置与各个商户的距离存在偏差,导致一部分商户无法被作为筛选结果并返回给用户,严重影响了用户的决策与使用体验。However, there is a problem in the specific implementation process of the above search function: the screening results are overly dependent on the user's location. When the user's location does not reach a certain accuracy, there is a deviation between the calculated user's location and the distance between each merchant, resulting in some merchants being unable to be used as screening results and returned to the user, seriously affecting the user's decision-making and usage experience.

因而,为使得计算得到的用户的定位与已标注经纬度的各个商户的距离满足一定的准确度,需保证用户的定位满足较高的精度要求。在实际情况中,用户的实时定位受网络状况影响较大,当用户在一些大型建筑物内(例如:在商场内)发起O2O附近搜索时,由于普通的定位技术无法定位大型建筑物的楼层信息,并且移动设备在室内无法接收到GPS信号,只能依赖网络运营商的基站实现用户的实时定位,而依赖网络运营商的基站获得的用户的实时定位存在漂移现象,无法保证用户在室内的实时定位满足较高的精度要求。Therefore, in order to ensure that the calculated distance between the user's location and the distance between the merchants with marked longitude and latitude meet a certain accuracy, it is necessary to ensure that the user's location meets a high precision requirement. In actual situations, the user's real-time location is greatly affected by the network conditions. When the user initiates an O2O nearby search in some large buildings (for example, in a shopping mall), the ordinary positioning technology cannot locate the floor information of large buildings, and the mobile device cannot receive GPS signals indoors. The user's real-time location can only be achieved by relying on the base station of the network operator. The real-time location of the user obtained by relying on the base station of the network operator has drift phenomenon, and it cannot be guaranteed that the user's real-time location indoors meets the high precision requirements.

因此,如何更精确地对用户的当前位置进行定位是本领域急需解决的问题。Therefore, how to locate the current position of the user more accurately is a problem that needs to be solved urgently in this field.

发明内容Summary of the invention

本申请实施例提供一种到店预测方法、装置、可读存储介质及电子设备,能实时地预测用户是否到达店铺,实现了对用户的当前位置的精准定位。The embodiments of the present application provide a store arrival prediction method, device, readable storage medium and electronic device, which can predict in real time whether a user has arrived at a store, thereby achieving accurate positioning of the user's current location.

本申请实施例第一方面提供了一种到店预测方法,所述方法包括:A first aspect of an embodiment of the present application provides a method for predicting store arrivals, the method comprising:

获得用户终端发起的搜索请求;Obtaining a search request initiated by a user terminal;

对所述搜索请求进行特征提取,确定所述用户终端的用户的特征、所述搜索请求对应的店铺列表中各个店铺的特征以及用户-店铺交叉特征,所述用户-店铺交叉特征是对所述用户的特征与所述各个店铺的特征进行特征交叉得到的;Extracting features from the search request to determine features of a user of the user terminal, features of each store in the store list corresponding to the search request, and user-store cross features, where the user-store cross features are obtained by cross-feature analysis of the features of the user and the features of each store;

将所述用户的特征、所述各个店铺的特征以及所述用户-店铺交叉特征输入预先训练的到店概率预测模型,确定所述用户到达所述各个店铺的概率;Inputting the characteristics of the user, the characteristics of each store, and the user-store cross-features into a pre-trained store arrival probability prediction model to determine the probability of the user arriving at each store;

根据所述用户到达所述各个店铺的概率,确定所述用户是否到达目标店铺,所述目标店铺为所述各个店铺中的一个。According to the probability of the user arriving at each of the stores, it is determined whether the user arrives at a target store, where the target store is one of the stores.

可选地,所述根据所述用户到达所述各个店铺的概率,确定所述用户是否到达目标店铺的步骤,包括:Optionally, the step of determining whether the user has arrived at a target store according to the probability of the user arriving at each store includes:

在所述用户到达所述目标店铺的概率大于预设的概率阈值的情况下,确定所述用户到达所述目标店铺;或When the probability that the user arrives at the target store is greater than a preset probability threshold, determining that the user arrives at the target store; or

在所述用户到达所述目标店铺的概率大于预设的概率阈值,且所述搜索请求对应的参数值在预设的生效参数值范围内的情况下,确定所述用户到达所述目标店铺。When the probability that the user arrives at the target store is greater than a preset probability threshold and the parameter value corresponding to the search request is within a preset effective parameter value range, it is determined that the user arrives at the target store.

可选地,所述方法还包括:Optionally, the method further comprises:

确定与用户到店相关联的用户行为类型;Determine the types of user behaviors associated with user visits to your store;

从所述用户终端的搜索日志中提取第一类搜索记录和第二类搜索记录,所述第一类搜索记录为符合所述用户行为类型的搜索记录,所述第二类搜索记录为对应的搜索时刻与所述第一类搜索记录的搜索时刻的时间差在预设时长内的搜索记录;Extracting a first category of search records and a second category of search records from the search log of the user terminal, wherein the first category of search records are search records that conform to the user behavior type, and the second category of search records are search records whose corresponding search moments have a time difference with the search moments of the first category of search records within a preset time length;

将所述第一类搜索记录和所述第二类搜索记录中符合所述用户行为类型的搜索记录标记为正样本,以及,将所述第二类搜索记录中不符合所述用户行为类型的搜索记录标记为负样本;Marking the search records that match the user behavior type in the first category of search records and the second category of search records as positive samples, and marking the search records that do not match the user behavior type in the second category of search records as negative samples;

根据所述正样本和所述负样本,对预设模型进行训练,得到所述到店概率预测模型。The preset model is trained according to the positive samples and the negative samples to obtain the store visit probability prediction model.

可选地,所述根据所述正样本和所述负样本,对预设模型进行训练,得到所述到店概率预测模型的步骤,包括:Optionally, the step of training a preset model according to the positive samples and the negative samples to obtain the store arrival probability prediction model includes:

对所述正样本和所述负样本分别进行特征提取,确定所述正样本和所述负样本各自对应的样本用户的特征、所述正样本和所述负样本各自对应的店铺列表中各个样本店铺的特征以及样本用户-样本店铺交叉特征,所述样本用户-样本店铺交叉特征是对所述样本用户的特征与所述各个样本店铺的特征进行特征交叉得到的;Perform feature extraction on the positive samples and the negative samples respectively, determine the features of the sample users corresponding to the positive samples and the negative samples respectively, the features of each sample store in the store list corresponding to the positive samples and the negative samples respectively, and the sample user-sample store cross-features, where the sample user-sample store cross-features are obtained by cross-feature analysis of the features of the sample users and the features of each sample store;

以所述样本用户的特征、所述各个样本店铺的特征以及所述样本用户-样本店铺交叉特征为训练样本,对所述预设模型进行训练,得到所述到店概率预测模型。The characteristics of the sample users, the characteristics of the sample stores, and the cross-characteristics of the sample users and sample stores are used as training samples to train the preset model and obtain the store visit probability prediction model.

可选地,在所述确定所述用户到达所述目标店铺的步骤之后,所述方法还包括:Optionally, after the step of determining that the user has arrived at the target store, the method further includes:

从所述用户终端的搜索日志中提取对应的搜索时刻在确定所述用户到达所述目标店铺之后的搜索记录;Extracting, from the search log of the user terminal, a search record corresponding to the search time after it is determined that the user has arrived at the target store;

在提取的搜索记录是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为正样本,并增加所述正样本的权重;In the case where the extracted search record is a search record for the target store, marking the extracted search record as a positive sample and increasing the weight of the positive sample;

若所述提取的搜索记录不是针对所述目标店铺的搜索请求,将所述提取的搜索记录标记为负样本,并减少所述负样本的权重;以及If the extracted search record is not a search request for the target store, marking the extracted search record as a negative sample and reducing the weight of the negative sample; and

根据增加权重后的正样本和减少权重后的负样本,对所述到店概率预测模型进行更新。The store arrival probability prediction model is updated according to the positive samples with increased weights and the negative samples with reduced weights.

可选地,所述用户的特征包括以下至少一者:所述用户终端扫描或连接到的WIFI名称及相应的信号强度、所述用户终端的设备类型、所述用户终端的经纬度、所述用户终端的IP地址、所述用户的用户画像、以及所述用户的消费偏好。Optionally, the user's characteristics include at least one of the following: the name of the WIFI scanned or connected to by the user terminal and the corresponding signal strength, the device type of the user terminal, the longitude and latitude of the user terminal, the IP address of the user terminal, the user's user portrait, and the user's consumption preferences.

可选地,所述各个店铺的特征包括以下至少一者:所述各个店铺的标识、所述各个店铺的WIFI名称、所述各个店铺的WIFI平均连接或扫描强度、所述各个店铺的经纬度、所述各个店铺所属的类目、所述各个店铺售卖的商品的价格区间、所述各个店铺的点击率以及所述各个店铺的访购率。Optionally, the characteristics of each store include at least one of the following: the logo of each store, the WIFI name of each store, the average WIFI connection or scanning strength of each store, the longitude and latitude of each store, the category to which each store belongs, the price range of the goods sold in each store, the click-through rate of each store, and the visit rate of each store.

可选地,所述用户-店铺交叉特征是通过以下至少一种方式得到的:Optionally, the user-store cross feature is obtained by at least one of the following methods:

根据所述用户终端的经纬度和所述各个店铺的经纬度,确定所述用户终端与所述各个店铺的直线距离;Determining the straight-line distance between the user terminal and each store according to the longitude and latitude of the user terminal and the longitude and latitude of each store;

对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端扫描或连接到的店铺的WIFI平均连接或扫描强度进行特征交叉;Performing feature cross-talk on the signal strength of the WIFI of the store scanned or connected by the user terminal and the average connection or scanning strength of the WIFI of the store scanned or connected by the user terminal;

对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端与所述用户终端扫描或连接到的店铺的直线距离进行特征交叉;和/或Performing feature intersection on the signal strength of the WIFI of the store scanned or connected by the user terminal and the straight-line distance between the user terminal and the store scanned or connected by the user terminal; and/or

对所述用户终端的用户点击或消费价格与所述各个店铺的人均价格进行特征交叉。A feature cross-talk is performed between the user click or consumption price of the user terminal and the per capita price of each store.

本申请实施例第二方面提供一种到店预测装置,所述装置包括:A second aspect of an embodiment of the present application provides a store arrival prediction device, the device comprising:

获得模块,用于获得用户终端发起的搜索请求;An acquisition module, used to obtain a search request initiated by a user terminal;

特征提取模块,用于对所述搜索请求进行特征提取,确定所述用户终端的用户的特征、所述搜索请求对应的店铺列表中各个店铺的特征以及用户-店铺交叉特征,所述用户-店铺交叉特征是对所述用户的特征与所述各个店铺的特征进行特征交叉得到的;A feature extraction module, configured to extract features from the search request, determine features of the user of the user terminal, features of each store in the store list corresponding to the search request, and user-store cross features, wherein the user-store cross features are obtained by cross-feature analysis of the features of the user and the features of each store;

概率预测模块,用于将所述用户的特征、所述各个店铺的特征以及所述用户-店铺交叉特征输入预先训练的到店概率预测模型,确定所述用户到达所述各个店铺的概率;以及A probability prediction module, used to input the characteristics of the user, the characteristics of each store, and the user-store cross-features into a pre-trained store arrival probability prediction model to determine the probability of the user arriving at each store; and

确定模块,用于根据所述用户到达所述各个店铺的概率,确定所述用户是否到达目标店铺,所述目标店铺为所述各个店铺中的一个。A determination module is used to determine whether the user has arrived at a target store according to the probability of the user arriving at each store, and the target store is one of the stores.

可选地,所述确定模块包括:Optionally, the determining module includes:

第一确定模块,用于在所述用户到达所述目标店铺的概率大于预设的概率阈值的情况下,确定所述用户到达所述目标店铺;或A first determining module is used to determine that the user has arrived at the target store when the probability that the user has arrived at the target store is greater than a preset probability threshold; or

第二确定模块,用于在所述用户到达所述目标店铺的概率大于预设的概率阈值,且所述搜索请求对应的参数值在预设的生效参数值范围内的情况下,确定所述用户到达所述目标店铺。The second determination module is used to determine that the user has arrived at the target store when the probability that the user has arrived at the target store is greater than a preset probability threshold and the parameter value corresponding to the search request is within a preset effective parameter value range.

可选地,所述装置还包括:Optionally, the device further comprises:

第三确定模块,用于确定与用户到店相关联的用户行为类型;A third determination module is used to determine the user behavior type associated with the user visiting the store;

第一提取模块,用于从所述用户终端的搜索日志中提取第一类搜索记录和第二类搜索记录,所述第一类搜索记录为符合所述用户行为类型的搜索记录,所述第二类搜索记录为对应的搜索时刻与所述第一类搜索记录的搜索时刻的时间差在预设时长内的搜索记录;A first extraction module is used to extract a first type of search record and a second type of search record from the search log of the user terminal, wherein the first type of search record is a search record that conforms to the user behavior type, and the second type of search record is a search record whose corresponding search time has a time difference with the search time of the first type of search record within a preset time length;

标记模块,用于将所述第一类搜索记录和所述第二类搜索记录中符合所述用户行为类型的搜索记录标记为正样本,以及,将所述第二类搜索记录中不符合所述用户行为类型的搜索记录标记为负样本;a marking module, configured to mark the search records in the first category of search records and the second category of search records that match the user behavior type as positive samples, and mark the search records in the second category of search records that do not match the user behavior type as negative samples;

训练模块,用于根据所述正样本和所述负样本,对预设模型进行训练,得到所述到店概率预测模型。The training module is used to train the preset model according to the positive samples and the negative samples to obtain the store arrival probability prediction model.

可选地,所述训练模块包括:Optionally, the training module includes:

特征提取子模块,用于对所述正样本和所述负样本分别进行特征提取,确定所述正样本和所述负样本各自对应的样本用户的特征、所述正样本和所述负样本各自对应的店铺列表中各个样本店铺的特征以及样本用户-样本店铺交叉特征,所述样本用户-样本店铺交叉特征是对所述样本用户的特征与所述各个样本店铺的特征进行特征交叉得到的;A feature extraction submodule is used to extract features from the positive samples and the negative samples respectively, and determine features of sample users corresponding to the positive samples and the negative samples, features of each sample store in the store list corresponding to the positive samples and the negative samples, and sample user-sample store cross features, where the sample user-sample store cross features are obtained by cross-feature analysis of the features of the sample users and the features of each sample store;

训练子模块,用于以所述样本用户的特征、所述各个样本店铺的特征以及所述样本用户-样本店铺交叉特征为训练样本,对所述预设模型进行训练,得到所述到店概率预测模型。The training submodule is used to train the preset model using the characteristics of the sample users, the characteristics of each sample store, and the cross-features of the sample users and sample stores as training samples to obtain the store arrival probability prediction model.

可选地,所述装置还包括:Optionally, the device further comprises:

第二提取模块,用于从所述用户终端的搜索日志中提取对应的搜索时刻在确定所述用户到达所述目标店铺之后的搜索记录;A second extraction module is used to extract, from the search log of the user terminal, the search record at the corresponding search time after it is determined that the user has arrived at the target store;

第一权重调整模块,用于在提取的搜索记录是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为正样本,并增加所述正样本的权重;a first weight adjustment module, configured to mark the extracted search record as a positive sample and increase the weight of the positive sample when the extracted search record is a search record for the target store;

第二权重调整模块,用于在所述提取的搜索记录不是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为负样本,并减少所述负样本的权重;以及a second weight adjustment module, configured to mark the extracted search record as a negative sample and reduce the weight of the negative sample when the extracted search record is not a search record for the target store; and

更新模块,用于根据增加权重后的正样本和减少权重后的负样本,对所述到店概率预测模型进行更新。The updating module is used to update the store arrival probability prediction model according to the positive samples with increased weights and the negative samples with reduced weights.

可选地,所述用户的特征包括以下至少一者:所述用户终端扫描或连接到的WIFI名称及相应的信号强度、所述用户终端的设备类型、所述用户终端的经纬度、所述用户终端的IP地址、所述用户的用户画像、以及所述用户的消费偏好。Optionally, the user's characteristics include at least one of the following: the name of the WIFI scanned or connected to by the user terminal and the corresponding signal strength, the device type of the user terminal, the longitude and latitude of the user terminal, the IP address of the user terminal, the user's user portrait, and the user's consumption preferences.

可选地,所述各个店铺的特征包括以下至少一者:所述各个店铺的标识、所述各个店铺的WIFI名称、所述各个店铺的WIFI平均连接或扫描强度、所述各个店铺的经纬度、所述各个店铺所属的类目、所述各个店铺售卖的商品的价格区间、所述各个店铺的点击率以及所述各个店铺的访购率。Optionally, the characteristics of each store include at least one of the following: the logo of each store, the WIFI name of each store, the average WIFI connection or scanning strength of each store, the longitude and latitude of each store, the category to which each store belongs, the price range of the goods sold in each store, the click-through rate of each store, and the visit rate of each store.

可选地,所述用户-店铺交叉特征是通过以下至少一种方式得到的:Optionally, the user-store cross feature is obtained by at least one of the following methods:

根据所述用户终端的经纬度和所述各个店铺的经纬度,确定所述用户终端与所述各个店铺的直线距离;Determining the straight-line distance between the user terminal and each store according to the longitude and latitude of the user terminal and the longitude and latitude of each store;

对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端扫描或连接到的店铺的WIFI平均连接或扫描强度进行特征交叉;Performing feature cross-talk on the signal strength of the WIFI of the store scanned or connected by the user terminal and the average connection or scanning strength of the WIFI of the store scanned or connected by the user terminal;

对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端与所述用户终端扫描或连接到的店铺的直线距离进行特征交叉;和/或Performing feature intersection on the signal strength of the WIFI of the store scanned or connected by the user terminal and the straight-line distance between the user terminal and the store scanned or connected by the user terminal; and/or

对所述用户终端的用户点击或消费价格与所述各个店铺的人均价格进行特征交叉。A feature cross-talk is performed between the user click or consumption price of the user terminal and the per capita price of each store.

本申请实施例第三方面提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本申请第一方面所述的方法中的步骤。A third aspect of an embodiment of the present application provides a computer-readable storage medium having a computer program stored thereon, which, when executed by a processor, implements the steps in the method described in the first aspect of the present application.

本申请实施例第四方面提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行时实现本申请第一方面所述的方法的步骤。A fourth aspect of an embodiment of the present application provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executed, implements the steps of the method described in the first aspect of the present application.

采用本申请实施例提供的一种到店预测方法,首先获取用户发起的搜索请求,然后对搜索请求进行特征提取,并将提取出的特征(包括:用户终端的用户的特征、搜索请求对应的店铺列表中各个店铺的特征以及用户-店铺交叉特征)输入到店概率预测模型以得到用户到达店铺列表中各个店铺的概率,最后再根据这些概率预测得到用户到达的目标店铺。本申请从用户维度、店铺维度以及用户-店铺交叉维度三个维度对用户终端发起的搜索请求进行特征提取,提高了到店概率预测模型输出的结果的准确度以及最终预测得到的目标店铺的准确度,此外,通过到店概率预测模型输出的各个店铺的概率确定出用户到达的目标店铺,实现了对用户的当前位置的准确定位,提高了用户终端向用户推荐的当前位置附近的店铺的可靠性,增强了用户的使用体验。A store arrival prediction method provided by an embodiment of the present application is adopted. First, a search request initiated by a user is obtained, and then feature extraction is performed on the search request. The extracted features (including: features of the user of the user terminal, features of each store in the store list corresponding to the search request, and user-store cross features) are input into the store probability prediction model to obtain the probability of the user arriving at each store in the store list, and finally the target store that the user arrives at is predicted based on these probabilities. The present application performs feature extraction on the search request initiated by the user terminal from three dimensions: user dimension, store dimension, and user-store cross dimension, which improves the accuracy of the results output by the store arrival probability prediction model and the accuracy of the target store finally predicted. In addition, the target store that the user arrives at is determined by the probability of each store output by the store arrival probability prediction model, which realizes the accurate positioning of the user's current location, improves the reliability of the stores near the current location recommended by the user terminal to the user, and enhances the user's experience.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings required for use in the description of the embodiments of the present application will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present application. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying any creative labor.

图1是本申请各个实施例提供的用户终端与后台服务器进行交互的示意图;FIG1 is a schematic diagram of the interaction between a user terminal and a backend server provided in various embodiments of the present application;

图2是本申请一实施例示出的一种到店预测方法的流程图;FIG2 is a flow chart of a method for predicting store arrivals according to an embodiment of the present application;

图3是本申请一实施例示出的获得用户-店铺交叉特征的流程图;FIG3 is a flow chart of obtaining user-store cross features according to an embodiment of the present application;

图4是本申请一实施例示出的一种到店概率预测模型的训练方法的流程图;FIG4 is a flow chart of a method for training a store arrival probability prediction model according to an embodiment of the present application;

图5是本申请一实施例示出的另一种到店概率预测模型的训练方法的流程图;FIG5 is a flowchart of another method for training a store arrival probability prediction model according to an embodiment of the present application;

图6是本申请一实施例示出的一种更新到店概率预测模型的方法的流程图;FIG6 is a flow chart of a method for updating a store visit probability prediction model according to an embodiment of the present application;

图7是本申请一实施例示出的一种到店预测装置的示意图。FIG. 7 is a schematic diagram of a store arrival prediction device according to an embodiment of the present application.

具体实施方式DETAILED DESCRIPTION

下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will be combined with the drawings in the embodiments of the present application to clearly and completely describe the technical solutions in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of this application.

在对本申请各个实施例进行说明之前,首先对相关技术进行说明。为获得用户的较为精准的定位,在相关技术中,采用对获得的用户的定位进行修正的方式,例如其中的一种修正方式为:在用户的当前位置的一定范围内寻找参考源,例如:若用户当前的网络状态是已连接WIFI,则可将该WIFI的坐标作为一个参考源,再对用户的定位进行修正;又例如:当用户处于大型建筑物内部(例如:大型商场内部)时,由于GPS信号被阻挡,则可以根据用户历史定位的周边的地磁信息对用户的定位进行修正。Before describing each embodiment of the present application, the related technology is described first. In order to obtain a more accurate positioning of the user, in the related technology, a method of correcting the obtained user positioning is adopted, for example, one of the correction methods is: looking for a reference source within a certain range of the user's current position, for example: if the user's current network status is connected to WIFI, the coordinates of the WIFI can be used as a reference source, and then the user's positioning is corrected; for example: when the user is inside a large building (for example: inside a large shopping mall), because the GPS signal is blocked, the user's positioning can be corrected according to the surrounding geomagnetic information of the user's historical positioning.

然而,上述两种对获得的用户的定位进行修正的方式需要获取对应的数据信息,代价高昂,例如:当采用WIFI的坐标作为参考源时,需要额外获得用户所处的区域中的各个WIFI的坐标信息,当利用地磁信息对用户的定位进行修正时,需额外获得用户所处的区域中的地磁信息。However, the above two methods of correcting the obtained user positioning require obtaining corresponding data information, which is costly. For example, when the coordinates of WIFI are used as a reference source, it is necessary to additionally obtain the coordinate information of each WIFI in the area where the user is located. When the user's positioning is corrected using geomagnetic information, it is necessary to additionally obtain the geomagnetic information in the area where the user is located.

在相关技术中,另一种获得用户的定位的方式是:搜集用户连接到的店铺的WIFI、用户在该店铺内的停留时间以及用户在该店铺的购买信息,然后结合业务制定相应的规则,再根据规则判断用户是否在该店铺内。但是该方式仅仅考虑用户连接的店铺的WIFI、用户在该店铺内的停留时间以及用户在该店铺的购买信息,即只从用户一侧来判断用户是否在该店铺内,此外,该方案过度依赖于店铺是否有WIFI、用户是否有购买行为,也不具备较强的普适性,因此这种定位方式对用户定位所得到的定位结果的准确度依然不高。In the related art, another way to obtain the user's location is to collect the WIFI of the store to which the user is connected, the time the user stays in the store, and the user's purchase information in the store, and then formulate corresponding rules in combination with the business, and then determine whether the user is in the store based on the rules. However, this method only considers the WIFI of the store to which the user is connected, the time the user stays in the store, and the user's purchase information in the store, that is, it only determines whether the user is in the store from the user's side. In addition, this solution is overly dependent on whether the store has WIFI and whether the user has purchase behavior, and it does not have strong universality. Therefore, the accuracy of the positioning results obtained by this positioning method for user positioning is still not high.

为了提高用户的定位的准确度,本申请实施例利用O2O(Online-to-Offline)本地生活化服务中许多业务只能在商家店内完成的特性,实时获取用户的搜索请求并标记出用户针对店铺做出了典型行为(例如:参与店内的团购验券活动、购买闪惠、签到、上传UGC等)的消费数据,再针对这些消费数据从用户、店铺、用户和店铺的交叉等多个维度进行特征提取,然后将提取出的特征输入预先训练好的到店概率预测模型,得到用户到达各个店铺的概率,最后再根据这些概率得到用户实际到达的目标店铺。In order to improve the accuracy of user positioning, the embodiment of the present application utilizes the fact that many businesses in O2O (Online-to-Offline) local life services can only be completed in merchants' stores, obtains users' search requests in real time and marks the consumption data of users' typical behaviors towards stores (for example, participating in group purchase and coupon verification activities in stores, purchasing flash discounts, signing in, uploading UGC, etc.), and then extracts features from multiple dimensions such as users, stores, and intersections between users and stores for these consumption data. The extracted features are then input into a pre-trained store arrival probability prediction model to obtain the probability of users arriving at each store, and finally, based on these probabilities, the target store that the user actually arrives at is obtained.

下面将对本申请实施例提供的一种到店预测方法进行详细说明。A store arrival prediction method provided in an embodiment of the present application is described in detail below.

图1是本申请各个实施例提供的用户终端与后台服务器进行交互的示意图。参照图1,后台服务器通过网络与一个或多个用户终端(例如:图1中的用户终端1至用户终端n)进行通信连接,以实现通信交互。后台服务器可以是网络服务器、数据库服务器等。用户终端可以是个人电脑(personal computer,PC)、平板电脑、智能手机等。FIG1 is a schematic diagram of the interaction between a user terminal and a backend server provided in various embodiments of the present application. Referring to FIG1 , the backend server is connected to one or more user terminals (e.g., user terminal 1 to user terminal n in FIG1 ) through a network to achieve communication interaction. The backend server may be a network server, a database server, etc. The user terminal may be a personal computer (PC), a tablet computer, a smart phone, etc.

图2是本申请一实施例示出的一种到店预测方法的流程图,该方法应用于图1中的后台服务器。参照图2,本申请一实施例提供的到店预测方法包括以下步骤:FIG2 is a flow chart of a method for predicting store arrivals according to an embodiment of the present application, and the method is applied to the backend server in FIG1. Referring to FIG2, the method for predicting store arrivals according to an embodiment of the present application includes the following steps:

步骤S11:获得用户终端发起的搜索请求。Step S11: Obtain a search request initiated by a user terminal.

在本实施例中,用户可在用户终端输入搜索请求,后台服务器用于接收用户终端发送的搜索请求。其中,用户终端安装有可支持附近搜索功能的终端应用软件,例如:搜索类应用软件、购物类应用软件或者可为用户提供其他服务的应用软件。通过这类应用软件,用户可以搜索当前位置的周边范围内的任何目标。用户终端接收到用户输入的搜索请求后,将搜索请求发送给后台服务器,然后接收后台服务器针对该次搜索请求返回的搜索结果并在页面上以店铺列表的形式将搜索结果展示给用户。In this embodiment, the user can input a search request in the user terminal, and the backend server is used to receive the search request sent by the user terminal. The user terminal is installed with terminal application software that can support the nearby search function, such as search application software, shopping application software, or application software that can provide other services to the user. Through such application software, the user can search for any target within the surrounding area of the current location. After receiving the search request input by the user, the user terminal sends the search request to the backend server, and then receives the search results returned by the backend server for the search request and displays the search results to the user in the form of a store list on the page.

用户终端接收到用户的搜索请求后,主动将针对此次搜索请求的用户数据发送给后台服务器,或者,用户终端接收到用户的搜索请求后,在下一次后台服务器向用户终端请求用户数据时,再将针对此次搜索请求的用户数据发送给后台服务器,当然,也可以采用其它的方式将针对搜索请求的用户数据发送给后台服务器,本申请各个实施例对此不作具体限制。After receiving the user's search request, the user terminal actively sends the user data for this search request to the background server. Alternatively, after receiving the user's search request, the user terminal sends the user data for this search request to the background server the next time the background server requests the user data from the user terminal. Of course, other methods can also be used to send the user data for the search request to the background server, and the various embodiments of the present application do not impose specific restrictions on this.

其中,用户数据包括用户行为数据、用户信息以及用户终端的设备状态信息。用户行为数据是指:用户对店铺列表中的任意店铺做出用户行为时动态生成的数据,用户行为可以是一般用户行为,例如点击、收藏、分享等行为;也可以是与到店相关的行为(即:只有到达店铺后才能完成的行为),例如:闪惠买单、团购验券、自助点餐、取号排队、上传UGC、用户签到等,当然,也可以为其它类型的用户行为,本申请包括但不限于上述列举出的多种用户行为。用户信息是指:用户画像(例如:用户的年龄、职业、性别等)、用户的消费偏好(例如:经常购买的商品的类目、价格消费区间)以及其它表征用户的个人特征的信息。用户终端的设备状态信息是指:用户终端扫描或连接到的WIFI名称及相应的信号强度、用户终端的设备类型、用户终端的经纬度、用户终端的IP地址等。Among them, user data includes user behavior data, user information and device status information of the user terminal. User behavior data refers to: data dynamically generated when a user performs user behavior on any store in the store list. User behavior can be general user behavior, such as click, favorite, share, etc.; it can also be behavior related to the store (that is, behavior that can only be completed after arriving at the store), such as: flash payment, group purchase coupon verification, self-service ordering, queuing, uploading UGC, user sign-in, etc., of course, it can also be other types of user behavior. This application includes but is not limited to the various user behaviors listed above. User information refers to: user portrait (for example: user's age, occupation, gender, etc.), user's consumption preferences (for example: categories of frequently purchased goods, price consumption range) and other information that characterizes the user's personal characteristics. The device status information of the user terminal refers to: the name of the WIFI scanned or connected to by the user terminal and the corresponding signal strength, the device type of the user terminal, the latitude and longitude of the user terminal, the IP address of the user terminal, etc.

示例地,后台服务器是某生活消费类APP的后台服务器,相应地,用户终端上安装的可支持附近搜索功能的终端应用软件为与后台服务器进行通信交互的生活消费类APP。用户在生活消费类APP的搜索栏中输入“火锅”,则生活消费类APP上会弹出展示有多个与“火锅”相关的店铺的页面,用户可任意点击一个店铺,查询该店铺的相关信息、参与该店铺内的闪惠买单、团购验券、用户签到等活动。生活消费类APP实时记录用户针对该次搜索请求产生的用户行为数据,同时通过用户的个人账户获取用户信息,以及通过用户终端内的应用程序获取用户终端的设备状态信息,利用用户行为数据、用户信息以及用户终端的设备状态信息生成针对此次搜索请求的用户数据并发送给后台服务器。For example, the backend server is the backend server of a certain consumer APP. Accordingly, the terminal application software installed on the user terminal that supports the nearby search function is a consumer APP that communicates and interacts with the backend server. When the user enters "hot pot" in the search bar of the consumer APP, a page showing multiple stores related to "hot pot" will pop up on the consumer APP. The user can click on any store to query the relevant information of the store, participate in flash purchases, group purchase coupons, user sign-in and other activities in the store. The consumer APP records the user behavior data generated by the user for this search request in real time, obtains user information through the user's personal account, and obtains the device status information of the user terminal through the application in the user terminal. The user behavior data, user information and device status information of the user terminal are used to generate user data for this search request and send it to the backend server.

步骤S12:对所述搜索请求进行特征提取,确定所述用户终端的用户的特征、所述搜索请求对应的店铺列表中各个店铺的特征以及用户-店铺交叉特征,所述用户-店铺交叉特征是对所述用户的特征与所述各个店铺的特征进行特征交叉得到的。Step S12: Perform feature extraction on the search request to determine the features of the user of the user terminal, the features of each store in the store list corresponding to the search request, and the user-store cross-features. The user-store cross-features are obtained by performing feature cross-feature extraction on the features of the user and the features of each store.

在本申请各个实施例中,从三个维度对搜索请求进行特征提取,三个维度分别为:用户维度、店铺维度以及用户和店铺的交叉维度。对所述搜索请求进行特征提取,具体为:对与搜索请求对应的用户数据进行特征提取。In various embodiments of the present application, feature extraction is performed on the search request from three dimensions: user dimension, store dimension, and cross dimension of user and store. Feature extraction is performed on the search request, specifically: feature extraction is performed on user data corresponding to the search request.

具体地,确定用户终端的用户的特征是指:从用户数据中提取出符合用户维度的数据,作为用户终端的用户的特征,例如用户终端的用户的特征可以是:用户终端扫描或连接到的WIFI名称及相应的信号强度、用户终端的设备类型、用户终端的经纬度、用户终端的IP地址、用户的用户画像、以及用户的消费偏好。当然,用户的特征还可以包括其它可以表征用户的个性的特征,本申请包括但不限于上述列举的用户的特征。Specifically, determining the characteristics of the user of the user terminal means: extracting data that conforms to the user dimension from the user data as the characteristics of the user of the user terminal. For example, the characteristics of the user of the user terminal can be: the name of the WIFI scanned or connected by the user terminal and the corresponding signal strength, the device type of the user terminal, the latitude and longitude of the user terminal, the IP address of the user terminal, the user portrait of the user, and the consumption preference of the user. Of course, the characteristics of the user can also include other characteristics that can characterize the personality of the user. This application includes but is not limited to the characteristics of the user listed above.

后台服务器预先存储有所有已在终端应用软件上注册的店铺的信息,例如:终端应用软件是生活消费类APP时,后台服务器预先存储有所有已在该生活消费类APP上注册的店铺的信息。由于用户终端发起搜索请求后,后台服务器会向用户终端返回符合搜索条件的店铺列表,因此,确定搜索请求对应的店铺列表中各个店铺的特征是指:从预先存储的所有已注册的店铺的信息中提取店铺列表中各个店铺的特征,其中,店铺列表中各个店铺的特征可以是:各个店铺的标识、各个店铺的WIFI名称、各个店铺的WIFI平均连接或扫描强度、各个店铺的经纬度、各个店铺所属的类目、各个店铺售卖的商品的价格区间、各个店铺的点击率以及各个店铺的访购率。具体地,各个店铺所属的类目表征各个店铺所售卖的商品的类型,例如:某个店铺售卖的商品是服饰,那么该店铺所属的类目是服饰类,又例如:某个店铺售卖的商品是小吃或饮料,那么该店铺所属的类目是餐饮类。各个店铺的访购率是指:在一定时间内,访问该店铺的所有顾客中,产生了购买行为的顾客所占的百分比。The backend server pre-stores the information of all stores registered on the terminal application software. For example, when the terminal application software is a consumer APP, the backend server pre-stores the information of all stores registered on the consumer APP. After the user terminal initiates a search request, the backend server returns a list of stores that meet the search conditions to the user terminal. Therefore, determining the characteristics of each store in the store list corresponding to the search request means extracting the characteristics of each store in the store list from the pre-stored information of all registered stores. The characteristics of each store in the store list can be: the logo of each store, the WIFI name of each store, the average WIFI connection or scanning strength of each store, the longitude and latitude of each store, the category to which each store belongs, the price range of the goods sold in each store, the click rate of each store, and the visit rate of each store. Specifically, the category to which each store belongs represents the type of goods sold by each store. For example, if a store sells clothing, then the category to which the store belongs is clothing. For another example, if a store sells snacks or beverages, then the category to which the store belongs is catering. The visit-to-purchase rate of each store refers to the percentage of customers who make purchases among all customers who visit the store within a certain period of time.

用户-店铺交叉特征是用户与各个店铺的关联特征。图3是本申请一实施例示出的获得用户-店铺交叉特征的流程图。参照图3,用户-店铺交叉特征具体可以通过以下步骤获得:The user-store cross-feature is the association feature between the user and each store. FIG3 is a flow chart of obtaining the user-store cross-feature according to an embodiment of the present application. Referring to FIG3, the user-store cross-feature can be obtained by the following steps:

步骤S121:根据所述用户终端的经纬度和所述各个店铺的经纬度,确定所述用户终端与所述各个店铺的直线距离。Step S121: determining the straight-line distance between the user terminal and each store according to the longitude and latitude of the user terminal and the longitude and latitude of each store.

步骤S122:对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端扫描或连接到的店铺的WIFI平均连接或扫描强度进行特征交叉。Step S122: performing feature cross-talk on the signal strength of the WIFI of the store scanned or connected by the user terminal and the average connection or scanning strength of the WIFI of the store scanned or connected by the user terminal.

将特征A与特征B进行特征交叉是指:采用预设的计算方法对特征A和特征B进行计算以得到特征A与特征B的关联特征C,例如:特征A为用户终端扫描或连接到的店铺的WIFI的信号强度,特征B为与用户终端扫描或连接到的店铺的WIFI平均连接或扫描强度,预设的计算方法为特征A和特征B的比值,则关联特征C为特征A除以特征B所得到的值。Performing feature cross-feature on feature A and feature B means: using a preset calculation method to calculate feature A and feature B to obtain an associated feature C of feature A and feature B. For example, feature A is the signal strength of the WIFI of the store scanned or connected to by the user terminal, and feature B is the average connection or scanning strength of the WIFI of the store scanned or connected to by the user terminal. The preset calculation method is the ratio of feature A to feature B, and the associated feature C is the value obtained by dividing feature A by feature B.

步骤S123:对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端与所述用户终端扫描或连接到的店铺的直线距离进行特征交叉;和/或对所述用户终端的用户点击或消费价格与所述各个店铺的人均价格进行特征交叉。Step S123: Perform feature cross-talk on the signal strength of the WIFI of the store scanned or connected by the user terminal and the straight-line distance between the user terminal and the store scanned or connected by the user terminal; and/or perform feature cross-talk on the user click or consumption price of the user terminal and the per capita price of each store.

各个店铺的人均价格即各个店铺的人均消费价格,用户点击价格即用户点击过的商品的价格,用户消费价格即为用户购买过的商品的价格。本实施例为了保证最终预测得到的目标店铺的准确度,在将用户终端扫描或连接到的店铺的WIFI的信号强度,与用户终端扫描或连接到的店铺的WIFI平均连接或扫描强度进行特征交叉之后,还可以通过以下三种方式中的任一种方式进行特征交叉:The per capita price of each store is the per capita consumption price of each store, the user click price is the price of the product clicked by the user, and the user consumption price is the price of the product purchased by the user. In order to ensure the accuracy of the target store finally predicted, this embodiment can perform feature crossover by any of the following three methods after performing feature crossover on the signal strength of the WIFI of the store scanned or connected by the user terminal and the average connection or scanning strength of the WIFI of the store scanned or connected by the user terminal:

1)用户终端扫描或连接到的店铺的WIFI的信号强度,与用户终端与用户终端扫描或连接到的店铺的直线距离进行特征交叉。1) The signal strength of the WIFI of the store scanned or connected by the user terminal is feature-crossed with the straight-line distance between the user terminal and the store scanned or connected by the user terminal.

2)将用户终端的用户点击价格(或用户消费价格)与各个店铺的人均价格进行特征交叉。2) Perform feature cross-talk between the user click price (or user consumption price) of the user terminal and the per capita price of each store.

3)将用户终端扫描或连接到的店铺的WIFI的信号强度,与用户终端与用户终端扫描或连接到的店铺的直线距离进行特征交叉,且将用户终端的用户点击价格(或用户消费价格)与各个店铺的人均价格进行特征交叉。3) The signal strength of the WIFI of the store scanned or connected by the user terminal is cross-featured with the straight-line distance between the user terminal and the store scanned or connected by the user terminal, and the user click price (or user consumption price) of the user terminal is cross-featured with the per capita price of each store.

步骤S13:将所述用户的特征、所述各个店铺的特征以及所述用户-店铺交叉特征输入预先训练的到店概率预测模型,确定所述用户到达所述各个店铺的概率。Step S13: Input the characteristics of the user, the characteristics of each store, and the user-store cross-features into a pre-trained store arrival probability prediction model to determine the probability of the user arriving at each store.

其中,到店概率预测模型是利用用户终端的搜索日志对预设模型进行训练得到的。具体的训练过程将在下文中进行详细的描述。The store arrival probability prediction model is obtained by training a preset model using the search logs of the user terminal. The specific training process will be described in detail below.

将用户的特征、各个店铺的特征以及用户-店铺交叉特征作为输入值输入到店概率预测模型后,到店概率预测模型会输出用户到达店铺列表中的各个店铺的概率值。用户到达各个店铺的概率值与用户到达该店铺的可能性成正比,概率值越高,用户到达该店铺的可能性越大,概率值越低,用户到达该店铺的可能性越小。After the user's features, the features of each store, and the user-store cross features are input into the store arrival probability prediction model, the store arrival probability prediction model will output the probability value of the user arriving at each store in the store list. The probability value of the user arriving at each store is proportional to the possibility of the user arriving at the store. The higher the probability value, the greater the possibility of the user arriving at the store, and the lower the probability value, the less likely the user is to arrive at the store.

步骤S14:根据所述用户到达所述各个店铺的概率,确定所述用户是否到达目标店铺,所述目标店铺为所述各个店铺中的一个。Step S14: determining whether the user has arrived at a target store based on the probability of the user arriving at each store, where the target store is one of the stores.

目标店铺为用户当前所在的店铺,确定出目标店铺,也即确定出用户当前的精准位置。确定用户是否到达目标店铺的具体步骤将在下文进行说明。The target store is the store where the user is currently located. Determining the target store means determining the user's current precise location. The specific steps for determining whether the user has reached the target store will be described below.

在一种实施方式中,后台服务器是某生活消费类APP的后台服务器,用户终端上安装的可支持附近搜索功能的软件为与后台服务器进行通信交互的生活消费类APP,当用户处于大型商场内部且通过该生活消费类APP搜索附近的店铺时,后台服务器根据用户的搜索请求返回多个满足搜索条件(搜索条件可以从多个方面设置,例如:距离用户当前位置的距离、店铺的价格区间、用户的好评度排名)的店铺,以供用户查看。例如用户可以在搜索栏输入“火锅”,并设置搜索条件为距离当前位置一千米的范围内,生活消费类APP将所有符合搜索条件的与“火锅”相关的店铺展示到页面中,用户可以对感兴趣的店铺进行查看或者做出与某个店铺相关的用户行为(如前文所述,用户行为可以包括:一般用户行为和与到店相关的行为),生活消费类APP实时记录用户产生的用户行为数据,并将用户行为数据、用户信息、用户终端的设备状态信息作为针对用户当前的搜索请求(搜索名称为“火锅”的搜索请求)的用户数据发送给后台服务器。后台服务器对用户数据进行特征提取,并将提取的特征输入至到店概率预测模型,以得到用户到达搜索结果中的与“火锅”相关的各个店铺的概率,然后再根据各个概率预测得到用户当前时刻所在的店铺。在本申请各个实施例中,当前时刻为包含用户发起搜索请求的时刻的一段较短的时长。In one embodiment, the backend server is a backend server of a certain consumer APP, and the software installed on the user terminal that supports the nearby search function is a consumer APP that communicates and interacts with the backend server. When the user is inside a large shopping mall and searches for nearby stores through the consumer APP, the backend server returns multiple stores that meet the search conditions (the search conditions can be set from multiple aspects, such as: the distance from the user's current location, the price range of the store, and the user's favorable rating ranking) according to the user's search request for the user to view. For example, the user enters "hot pot" in the search bar and sets the search condition to be within a range of one kilometer from the current location. The consumer APP displays all stores related to "hot pot" that meet the search conditions on the page, and the user can view the store of interest or perform user behaviors related to a certain store (as described above, user behaviors can include: general user behaviors and behaviors related to visiting the store). The consumer APP records the user behavior data generated by the user in real time, and sends the user behavior data, user information, and device status information of the user terminal as user data for the user's current search request (search request with the search name "hot pot") to the backend server. The backend server extracts features from the user data and inputs the extracted features into the store arrival probability prediction model to obtain the probability of the user arriving at each store related to "hot pot" in the search results, and then predicts the store where the user is currently located based on each probability. In various embodiments of the present application, the current time is a short period of time including the time when the user initiates the search request.

在本申请实施例中,首先获取用户发起的搜索请求,然后对搜索请求进行特征提取,并将提取出的特征(包括:用户终端的用户的特征、搜索请求对应的店铺列表中各个店铺的特征以及用户-店铺交叉特征)输入到店概率预测模型以得到用户到达店铺列表中各个店铺的概率,最后再根据这些概率预测得到用户到达的目标店铺。本申请从用户维度、店铺维度以及用户-店铺交叉维度三个维度对用户终端发起的搜索请求进行特征提取,提高了到店概率预测模型输出的结果的准确度以及最终预测得到的目标店铺的准确度,此外,通过到店概率预测模型输出的各个店铺的概率确定出用户到达的目标店铺,实现了对用户的当前位置的定位,提高了用户终端向用户推荐的当前位置附近的店铺的可靠性,增强了用户的使用体验。In an embodiment of the present application, the search request initiated by the user is first obtained, and then the search request is feature extracted, and the extracted features (including: the features of the user of the user terminal, the features of each store in the store list corresponding to the search request, and the user-store cross-features) are input into the store probability prediction model to obtain the probability of the user arriving at each store in the store list, and finally the target store to be reached by the user is predicted based on these probabilities. The present application performs feature extraction on the search request initiated by the user terminal from three dimensions: user dimension, store dimension, and user-store cross-dimension, which improves the accuracy of the results output by the store probability prediction model and the accuracy of the target store finally predicted. In addition, the target store to be reached by the user is determined by the probability of each store output by the store probability prediction model, which realizes the positioning of the user's current location, improves the reliability of the stores near the current location recommended by the user terminal to the user, and enhances the user's experience.

具体地,步骤S14可以包括:Specifically, step S14 may include:

步骤S141:在所述用户到达所述目标店铺的概率大于预设的概率阈值的情况下,确定所述用户到达所述目标店铺;或Step S141: when the probability that the user arrives at the target store is greater than a preset probability threshold, determining that the user arrives at the target store; or

步骤S142:在所述用户到达所述目标店铺的概率大于预设的概率阈值,且所述搜索请求对应的参数值在预设的生效参数值范围内的情况下,确定所述用户到达所述目标店铺。Step S142: When the probability that the user arrives at the target store is greater than a preset probability threshold, and the parameter value corresponding to the search request is within a preset effective parameter value range, it is determined that the user arrives at the target store.

在本实施例中,判断用户是否到达目标店铺有两种方式,第一种判定方式是:将到店概率预测模型输出的概率与预设的概率阈值进行比较,将概率大于预设的概率阈值的店铺作为用户到达的目标店铺;第二种判定方式是:将概率大于预设的概率阈值的店铺中,搜索请求对应的参数值在预设的生效参数值范围内的店铺作为用户到达的目标店铺,搜索请求对应的参数值与预设的生效参数值范围的关系用于辅助确定用户是否到达目标店铺。In this embodiment, there are two ways to determine whether the user has arrived at the target store. The first determination method is: compare the probability output by the store arrival probability prediction model with the preset probability threshold, and take the store with a probability greater than the preset probability threshold as the target store arrived by the user; the second determination method is: among the stores with a probability greater than the preset probability threshold, the store whose parameter value corresponding to the search request is within the preset effective parameter value range is taken as the target store arrived by the user, and the relationship between the parameter value corresponding to the search request and the preset effective parameter value range is used to assist in determining whether the user has arrived at the target store.

示例地,如果预设的概率阈值为0.8,到店概率预测模型输出的所有概率中只有一个店铺M的概率为0.9,那么根据第一种判定方式,店铺M为用户到达的目标店铺;根据第二种判定方式,如果店铺M是一个餐饮类店铺,则可将搜索请求对应的参数值设置为用户终端当前的时刻,将预设的生效参数值范围设置为午餐时间(例如:11:00-13:00),若用户终端当前的时刻刚好落在预设的生效参数值范围内,可确定用户到达店铺M。当然,搜索请求对应的参数值和预设的生效参数值范围可根据本申请实际应用过程中的需求确定。For example, if the preset probability threshold is 0.8, and among all the probabilities output by the store arrival probability prediction model, only one store M has a probability of 0.9, then according to the first determination method, store M is the target store that the user arrives at; according to the second determination method, if store M is a catering store, the parameter value corresponding to the search request can be set to the current time of the user terminal, and the preset effective parameter value range can be set to lunch time (for example: 11:00-13:00). If the current time of the user terminal happens to fall within the preset effective parameter value range, it can be determined that the user has arrived at store M. Of course, the parameter value corresponding to the search request and the preset effective parameter value range can be determined according to the needs in the actual application process of this application.

在实际情况中,如果后台服务器获取用户终端发起的搜索请求的周期较长,则可能存在到店概率预测模型输出的多个店铺的概率值大于预设阈值的现象。如果后台服务器获取用户终端发起的搜索请求的周期较短,则满足概率值大于预设阈值的店铺的数量通常较少,甚至为零,在此种情况下,如果只存在一个概率值大于预设阈值的店铺,则将该店铺作为用户当前所在的目标店铺,如果存在多个概率值大于预设阈值的店铺,则将多个店铺中概率值最高的店铺作为用户当前所在的店铺。预设阈值是通过计算到店概率预测模型的准确率和召回率,得到的用户到店判定是否生效的一个经验阈值,即:预设阈值是将利用到店概率预测模型预测出的目标店铺,与用户的实际到达的店铺进行对比分析后,结合具体的业务所得到的最佳概率值,只有在该最佳概率值下,预测得到的用户到达的目标店铺的准确度最高。In actual situations, if the backend server obtains the search request initiated by the user terminal for a long period, there may be a phenomenon that the probability values of multiple stores output by the store arrival probability prediction model are greater than the preset threshold. If the backend server obtains the search request initiated by the user terminal for a short period, the number of stores that meet the probability value greater than the preset threshold is usually small, or even zero. In this case, if there is only one store with a probability value greater than the preset threshold, the store is used as the target store where the user is currently located. If there are multiple stores with probability values greater than the preset threshold, the store with the highest probability value among the multiple stores is used as the store where the user is currently located. The preset threshold is an empirical threshold for whether the user's store arrival judgment is effective, obtained by calculating the accuracy and recall rate of the store arrival probability prediction model, that is, the preset threshold is the target store predicted by the store arrival probability prediction model, and the user's actual store is compared and analyzed, and the optimal probability value is obtained in combination with the specific business. Only under this optimal probability value, the accuracy of the predicted target store to be reached by the user is the highest.

在本申请实施例中,设置了两种可判定用户是否到达目标店铺的方法,增强了本申请的到店预测方法在实际应用过程中的灵活性,此外,第二种判定方式中增设了搜索请求对应的参数值是否在预设的生效参数值范围内这一辅助判定条件,提高了判定结果的准确度。In an embodiment of the present application, two methods are set up to determine whether the user has arrived at the target store, which enhances the flexibility of the store arrival prediction method of the present application in actual application. In addition, the second determination method adds an auxiliary determination condition of whether the parameter value corresponding to the search request is within the preset effective parameter value range, thereby improving the accuracy of the determination result.

下面将对到店概率预测模型的训练过程进行说明。The following is an explanation of the training process of the store visit probability prediction model.

图4是本申请一实施例示出的一种到店概率预测模型的训练方法的流程图。参照图4,该训练方法包括:FIG4 is a flow chart of a method for training a store arrival probability prediction model according to an embodiment of the present application. Referring to FIG4 , the training method includes:

步骤S21:确定与用户到店相关联的用户行为类型。Step S21: Determine the user behavior type associated with the user's visit to the store.

与用户到店相关联的用户行为是指:用户必须到达店铺后才能完成的用户行为,例如:闪惠买单、团购验券、自助点餐、取号排队、上传UGC、用户签到、连接WIFI等。对于一个店铺,如果用户产生了与到店相关的用户行为,那么用户实际到达该店铺的可能性较大。User behaviors associated with user visits to a store refer to user behaviors that can only be completed after the user arrives at the store, such as flash discount purchases, group purchase coupon verification, self-service ordering, number queuing, uploading UGC, user sign-in, connecting to WIFI, etc. For a store, if a user generates user behaviors related to visiting the store, then it is more likely that the user actually arrives at the store.

步骤S22:从所述用户终端的搜索日志中提取第一类搜索记录和第二类搜索记录,所述第一类搜索记录为符合所述用户行为类型的搜索记录,所述第二类搜索记录为对应的搜索时刻与所述第一类搜索记录的搜索时刻的时间差在预设时长内的搜索记录。Step S22: extracting first-category search records and second-category search records from the search log of the user terminal, wherein the first-category search records are search records that conform to the user behavior type, and the second-category search records are search records whose corresponding search time and the time difference between the search time of the first-category search record are within a preset time length.

用户终端的搜索日志中包含所有的搜索记录,用户发起的一次搜索请求可以对应多条搜索记录,每次发起搜索请求时生成的多条搜索记录组成了该次搜索请求的用户行为数据。举例来讲,用户在搜索栏中输入“服饰”,那么以该次搜索名称为“服饰”的搜索请求为搜索请求X,点击搜索后,用户终端会展示多个符合“中餐”这一搜索名称的店铺,如果用户1查看了店铺A并在店铺A中完成签到,那么用户1-搜索请求X-店铺A-签到为一条搜索记录,如果用户1查看了店铺A并在店铺A中参与了团购验券活动,那么用户1-搜索请求X-店铺A-团购验券为另一条搜索记录,类似地,用户1还可以基于其它店铺中生成多条搜索记录。The search log of the user terminal contains all search records. A search request initiated by a user can correspond to multiple search records. The multiple search records generated each time a search request is initiated constitute the user behavior data of the search request. For example, if the user enters "clothing" in the search bar, the search request with the search name "clothing" is search request X. After clicking the search, the user terminal will display multiple stores that match the search name "Chinese food". If user 1 visits store A and completes the check-in in store A, then user 1-search request X-store A-check-in is a search record. If user 1 visits store A and participates in the group purchase coupon verification activity in store A, then user 1-search request X-store A-group purchase coupon verification is another search record. Similarly, user 1 can also generate multiple search records based on other stores.

其中,第一类搜索记录为符合用户行为类型的搜索记录,只要搜索记录中携带的用户行为是用户必须到达相应的店铺后才能产生的用户行为,该条搜索记录即可被作为第一类搜索记录。例如:一条搜索记录为用户1-搜索请求X-店铺A-团购验券,由于团购验券是要求用户到达相应的店铺后才能完成的,因此该条搜索记录可作为一条第一类搜索记录,又例如:一条搜索记录为用户1-搜索请求X-店铺A-分享,由于分享不要求用户到达相应的店铺,因此该条搜索记录不能作为一条第一类搜索记录。Among them, the first type of search record is a search record that meets the user behavior type. As long as the user behavior carried in the search record is a user behavior that can only be generated after the user arrives at the corresponding store, the search record can be regarded as a first type of search record. For example: a search record is user 1-search request X-store A-group purchase coupon verification. Since group purchase coupon verification requires the user to arrive at the corresponding store before it can be completed, this search record can be regarded as a first type of search record. For another example: a search record is user 1-search request X-store A-sharing. Since sharing does not require the user to arrive at the corresponding store, this search record cannot be regarded as a first type of search record.

可选地,在选出多条第一类搜索记录后,还可以进一步设置筛选条件,例如筛选条件可以是时间范围条件,将所有搜索记录中对应的搜索时刻位于某一段时间内的第一类搜索记录筛选出来,作为新的第一类搜索记录。Optionally, after selecting multiple first-category search records, further filtering conditions may be set. For example, the filtering condition may be a time range condition, and first-category search records whose corresponding search times in all search records are within a certain period of time are filtered out as new first-category search records.

在本实施例中,第一类搜索记录只是搜索日志中的符合用户行为类型的搜索记录中的一部分搜索记录,为使得采集的搜索记录分布合理,还需采集第二类搜索记录。第二类搜索记录为对应的搜索时刻与第一类搜索记录的搜索时刻的时间差在预设时长内的搜索记录。举例来讲,一条第一类搜索记录的搜索时刻为10:00,若预设时长为1分钟时,则由该第一类搜索记录得到的第二类搜索记录可以为用户在9:59-10:01之间产生的搜索记录。In this embodiment, the first type of search records are only part of the search records in the search log that meet the user behavior type. In order to make the distribution of the collected search records reasonable, the second type of search records need to be collected. The second type of search records are search records whose time difference between the corresponding search time and the search time of the first type of search records is within the preset time length. For example, the search time of a first type of search record is 10:00, if the preset time length is 1 minute, then the second type of search record obtained from the first type of search record can be the search record generated by the user between 9:59-10:01.

本实施例中采用的提取第一类搜索记录和第二类搜索记录的方式保证了提取出的搜索记录具有较好的分布性(例如:预先提取出时间分布较为合理的第一类搜索记录,再基于这些第一类搜索记录提取出第二类搜索记录),避免了无法提取到一定数量的符合用户行为类型的搜索记录的现象,或者提取出的搜索记录的时间分布不合理(例如:搜索记录过度集中于某一个时间段)的现象。The method of extracting the first category search records and the second category search records adopted in the present embodiment ensures that the extracted search records have a good distribution (for example: pre-extracting the first category search records with a more reasonable time distribution, and then extracting the second category search records based on these first category search records), avoiding the phenomenon that a certain number of search records that meet the user behavior type cannot be extracted, or the phenomenon that the time distribution of the extracted search records is unreasonable (for example: the search records are overly concentrated in a certain time period).

步骤S23:将所述第一类搜索记录和所述第二类搜索记录中符合所述用户行为类型的搜索记录标记为正样本,以及,将所述第二类搜索记录中不符合所述用户行为类型的搜索记录标记为负样本。Step S23: Mark the search records in the first category of search records and the second category of search records that match the user behavior type as positive samples, and mark the search records in the second category of search records that do not match the user behavior type as negative samples.

正样本表示用户理论上到达了店铺,负样本表示用户理论上没有到达店铺。由于所有的第一类搜索记录中的用户行为均为与用户到店相关联的用户行为,因而,所有的第一类搜索记录均为正样本。由于第一类搜索记录只是搜索日志中的符合用户行为类型的搜索记录中的一部分搜索记录,因而,第二类搜索记录中还可能存在有多条符合用户行为类型的搜索记录,因此在划分正负样本时还需将第二类搜索记录中的符合用户行为类型的搜索记录标记为正样本,将不符合用户行为类型的搜索记录标记为负样本。A positive sample indicates that the user theoretically arrived at the store, and a negative sample indicates that the user theoretically did not arrive at the store. Since all user behaviors in the first category of search records are user behaviors associated with the user's visit to the store, all first category search records are positive samples. Since the first category of search records is only a part of the search records that meet the user behavior type in the search log, there may be multiple search records that meet the user behavior type in the second category of search records. Therefore, when dividing positive and negative samples, the search records that meet the user behavior type in the second category of search records need to be marked as positive samples, and the search records that do not meet the user behavior type need to be marked as negative samples.

步骤S24:根据所述正样本和所述负样本,对预设模型进行训练,得到所述到店概率预测模型。Step S24: training a preset model based on the positive samples and the negative samples to obtain the store arrival probability prediction model.

本实施例中将划分得到的正样本和负样本作为输入值,采用机器学习算法(例如:逻辑回归算法)对预设模型进行训练得到一个二分类预测模型(即:到店概率预测模型),其作用是:当输入从搜索请求中提取的特征时,可以输出用户到达店铺列表中的各个店铺的概率,店铺列表为搜索请求对应的搜索结果。In this embodiment, the positive samples and negative samples obtained by the division are used as input values, and a machine learning algorithm (for example, a logistic regression algorithm) is used to train the preset model to obtain a binary classification prediction model (i.e., a store arrival probability prediction model). When the features extracted from the search request are input, the probability of the user arriving at each store in the store list can be output. The store list is the search result corresponding to the search request.

图5是本申请一实施例示出的另一种到店概率预测模型的训练方法的流程图。参照图5,步骤S24包括:FIG5 is a flow chart of another method for training a store arrival probability prediction model according to an embodiment of the present application. Referring to FIG5 , step S24 includes:

步骤S241:对所述正样本和所述负样本分别进行特征提取,确定所述正样本和所述负样本各自对应的样本用户的特征、所述正样本和所述负样本各自对应的店铺列表中各个样本店铺的特征以及样本用户-样本店铺交叉特征,所述样本用户-样本店铺交叉特征是对所述样本用户的特征与所述各个样本店铺的特征进行特征交叉得到的。Step S241: Perform feature extraction on the positive samples and the negative samples respectively, determine the features of the sample users corresponding to the positive samples and the negative samples respectively, the features of each sample store in the store list corresponding to the positive samples and the negative samples respectively, and the sample user-sample store cross-features, wherein the sample user-sample store cross-features are obtained by performing feature cross-feature extraction on the features of the sample users and the features of each sample store.

步骤S242:以所述样本用户的特征、所述各个样本店铺的特征以及所述样本用户-样本店铺交叉特征为训练样本,对所述预设模型进行训练,得到所述到店概率预测模型。Step S242: The preset model is trained using the characteristics of the sample users, the characteristics of the sample stores, and the cross-features of the sample users and sample stores as training samples to obtain the store visit probability prediction model.

在本实施例中,每条搜索记录还携带有用户信息和用户终端的设备状态信息,在得到正样本和负样本后,可以从每一条样本中提取出对应的样本用户的特征、对应的样本店铺的特征以及样本用户-样本店铺交叉特征,然后再将这部分特征输入预设模型并对预设模型进行训练,得到到店概率预测模型。In this embodiment, each search record also carries user information and device status information of the user terminal. After obtaining positive samples and negative samples, the characteristics of the corresponding sample user, the characteristics of the corresponding sample store, and the cross-features of the sample user and the sample store can be extracted from each sample. These characteristics are then input into the preset model and the preset model is trained to obtain a store visit probability prediction model.

图6是本申请一实施例示出的一种更新到店概率预测模型的方法的流程图。参照6,该方法包括:FIG6 is a flow chart of a method for updating a store arrival probability prediction model according to an embodiment of the present application. Referring to FIG6, the method includes:

步骤S31:从所述用户终端的搜索日志中提取对应的搜索时刻在确定所述用户到达所述目标店铺之后的搜索记录。Step S31: extracting the search record corresponding to the search time after it is determined that the user has arrived at the target store from the search log of the user terminal.

在根据到店概率预测模型预测得到用户到达的目标店铺之后,从用户终端的搜索日志中提取对应的搜索时刻在确定用户到达目标店铺之后的搜索记录,将这部分搜索记录作为反馈记录,利用反馈记录可以实现对样本权重的调整,便于对到店概率预测模型进行更新。例如:用户实际上只到达了店铺A,但是可能对其它的店铺做出了与用户到店相关联的用户行为,此时可以将与店铺A相关的样本标记为正样本并调整权重值,将其它店铺的样本标记为正负并调整权重值,以保证每一条样本使用时的可靠性。After the target store that the user will arrive at is predicted according to the store arrival probability prediction model, the search records of the corresponding search time after the user arrives at the target store are extracted from the search log of the user terminal, and these search records are used as feedback records. The feedback records can be used to adjust the sample weights, which is convenient for updating the store arrival probability prediction model. For example, if the user actually only arrives at store A, but may have made user behaviors related to the user's store visit to other stores, the samples related to store A can be marked as positive samples and the weight values can be adjusted, and the samples of other stores can be marked as positive and negative and the weight values can be adjusted to ensure the reliability of each sample when used.

步骤S32:在提取的搜索记录是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为正样本,并增加所述正样本的权重。Step S32: when the extracted search record is a search record for the target store, mark the extracted search record as a positive sample and increase the weight of the positive sample.

举例来讲,某次预测得到用户到达的目标店铺为店铺A,那么在反馈记录中,可以将与店铺A相关的搜索记录作为权重值较大的正样本,例如:当与店铺A相关的搜索记录中的用户行为是团购验券行为(也可以为其它用户行为,例如:上传UGC、连接WIFI)时,可以将该搜索记录标记为权重值较大的正样本。For example, if a prediction shows that the target store that the user will arrive at is store A, then in the feedback record, the search record related to store A can be used as a positive sample with a larger weight value. For example, when the user behavior in the search record related to store A is a group purchase coupon verification behavior (it can also be other user behaviors, such as uploading UGC, connecting to WIFI), the search record can be marked as a positive sample with a larger weight value.

可选地,根据反馈记录中的用户行为的不同,本实施例对不同的正样本(反馈记录中与目标店铺相关的搜索记录)设置大小不同的权重值,例如:正样本中,用户行为是团购验券行为的样本所对应的权重值大于用户行为是点击行为的样本所对应的权重值。Optionally, according to different user behaviors in the feedback records, this embodiment sets different weight values for different positive samples (search records related to the target store in the feedback records). For example, among the positive samples, the weight value corresponding to the sample in which the user behavior is group purchase coupon verification behavior is greater than the weight value corresponding to the sample in which the user behavior is click behavior.

步骤S33:在所述提取的搜索记录不是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为负样本,并减少所述负样本的权重。Step S33: When the extracted search record is not a search record for the target store, the extracted search record is marked as a negative sample, and the weight of the negative sample is reduced.

在本实施例中,还可以调整与店铺A无关的搜索记录的权重,具体地,将与店铺A无关的搜索记录标记为负样本,并减少该负样本的权重。例如:当所提取的搜索记录中的用户行为是不是团购验券行为(或者其它用户行为,例如:上传UGC、连接WIFI)时,可以将该搜索记录标记为权重值较小的负样本。In this embodiment, the weight of search records that are not related to store A can also be adjusted. Specifically, the search records that are not related to store A are marked as negative samples, and the weight of the negative samples is reduced. For example, when the user behavior in the extracted search record is a group purchase coupon verification behavior (or other user behaviors, such as uploading UGC, connecting to WIFI), the search record can be marked as a negative sample with a smaller weight value.

可选地,不同的负样本的权重值的大小也是可以调整的,根据反馈记录中的用户行为的不同,本实施例对不同的负样本(反馈记录中与目标店铺无关的搜索记录)设置大小不同的权重值。例如:负样本中,用户行为是团购验券的样本所对应的权重值大于用户行为是点击行为的样本所对应的权重值。Optionally, the weight values of different negative samples can also be adjusted. According to different user behaviors in the feedback records, this embodiment sets different weight values for different negative samples (search records in the feedback records that are not related to the target store). For example, among the negative samples, the weight value corresponding to the sample whose user behavior is group purchase coupon verification is greater than the weight value corresponding to the sample whose user behavior is click behavior.

步骤S34:根据增加权重后的正样本和减少权重后的负样本,对所述到店概率预测模型进行更新。Step S34: updating the store arrival probability prediction model according to the weighted positive samples and the weighted negative samples.

在本实施例中,将所有的反馈记录作为之后对到店概率预测模型进行更新时的样本,实现了从搜索日志中挖掘反馈记录并反作用于模型的预测过程,根据确定出的用户到达的目标店铺与确定出目标店铺之后用户产生的反馈行为,对反馈记录中的正样本和负样本进行权重值的调整,保证了更新到店概率预测模型时采用的样本的可靠性,在可靠的样本的数量较多的情况下,实现了对到店概率预测模型的不断迭代优化,有效地提高了到店概率预测模型的预测结果的准确度,以及根据到店概率预测模型的预测结果预测得到的目标店铺的准确度。In this embodiment, all feedback records are used as samples for subsequent updating of the store arrival probability prediction model, thereby realizing the prediction process of mining feedback records from search logs and reacting to the model; weight values of positive and negative samples in the feedback records are adjusted according to the determined target store that the user has arrived at and the feedback behavior generated by the user after the target store is determined, thereby ensuring the reliability of the samples used when updating the store arrival probability prediction model; when there are a large number of reliable samples, continuous iterative optimization of the store arrival probability prediction model is realized, thereby effectively improving the accuracy of the prediction results of the store arrival probability prediction model and the accuracy of the target store predicted according to the prediction results of the store arrival probability prediction model.

本申请实施例中的到店预测方法可更为准确地对用户进行定位,提高了O2O附近搜索中距离计算的准确度,适用于多种需计算店铺到用户的距离的场景,例如各类服务型软件的周边搜索服务(包括:附近美食、果蔬生鲜、鲜花配送等),能为用户提供更优质的搜索体验。此外,本申请提出的到店预测方法不仅简单易行、成本低,而且还能利用反馈记录对到店预测模型不断地迭代更新,使得到店预测模型的输出结果的准确度可以随着搜索业务规模的发展以及搜索数据量(例如:搜索记录)的累积而越来越高,从而实现对用户更为精准地定位。The store arrival prediction method in the embodiment of the present application can locate the user more accurately, improve the accuracy of distance calculation in O2O nearby search, and is applicable to a variety of scenarios where the distance from the store to the user needs to be calculated, such as the surrounding search services of various service-oriented software (including: nearby food, fresh fruits and vegetables, flower delivery, etc.), which can provide users with a better search experience. In addition, the store arrival prediction method proposed in this application is not only simple and easy to implement, and low in cost, but also can use feedback records to continuously iterate and update the store arrival prediction model, so that the accuracy of the output results of the store arrival prediction model can be higher and higher with the development of the search business scale and the accumulation of search data (for example: search records), thereby achieving more accurate positioning of users.

基于同一发明构思,本申请一实施例提供一种到店预测装置。图7是本申请一实施例示出的一种到店预测装置的示意图。参照图7,该装置700包括:Based on the same inventive concept, an embodiment of the present application provides a store arrival prediction device. FIG7 is a schematic diagram of a store arrival prediction device shown in an embodiment of the present application. Referring to FIG7 , the device 700 includes:

获得模块701,用于获得用户终端发起的搜索请求;An acquisition module 701 is used to acquire a search request initiated by a user terminal;

特征提取模块702,用于对所述搜索请求进行特征提取,确定所述用户终端的用户的特征、所述搜索请求对应的店铺列表中各个店铺的特征以及用户-店铺交叉特征,所述用户-店铺交叉特征是对所述用户的特征与所述各个店铺的特征进行特征交叉得到的;A feature extraction module 702 is used to extract features from the search request, determine features of the user of the user terminal, features of each store in the store list corresponding to the search request, and user-store cross features, where the user-store cross features are obtained by cross-feature analysis of the features of the user and the features of each store;

概率预测模块703,用于将所述用户的特征、所述各个店铺的特征以及所述用户-店铺交叉特征输入预先训练的到店概率预测模型,确定所述用户到达所述各个店铺的概率;The probability prediction module 703 is used to input the characteristics of the user, the characteristics of each store, and the user-store cross-characteristics into a pre-trained store arrival probability prediction model to determine the probability of the user arriving at each store;

确定模块704,用于根据所述用户到达所述各个店铺的概率,确定所述用户是否到达目标店铺,所述目标店铺为所述各个店铺中的一个。The determination module 704 is used to determine whether the user has arrived at a target store according to the probability of the user arriving at each store, and the target store is one of the stores.

可选地,所述确定模块包括:Optionally, the determining module includes:

第一确定模块,用于在所述用户到达所述目标店铺的概率大于预设的概率阈值的情况下,确定所述用户到达所述目标店铺;或A first determining module is used to determine that the user has arrived at the target store when the probability that the user has arrived at the target store is greater than a preset probability threshold; or

第二确定模块,用于在所述用户到达所述目标店铺的概率大于预设的概率阈值,且所述搜索请求对应的参数值在预设的生效参数值范围内的情况下,确定所述用户到达所述目标店铺。The second determination module is used to determine that the user has arrived at the target store when the probability that the user has arrived at the target store is greater than a preset probability threshold and the parameter value corresponding to the search request is within a preset effective parameter value range.

可选地,所述装置还包括:Optionally, the device further comprises:

第三确定模块,用于确定与用户到店相关联的用户行为类型;A third determination module is used to determine the user behavior type associated with the user visiting the store;

第一提取模块,用于从所述用户终端的搜索日志中提取第一类搜索记录和第二类搜索记录,所述第一类搜索记录为符合所述用户行为类型的搜索记录,所述第二类搜索记录为对应的搜索时刻与所述第一类搜索记录的搜索时刻的时间差在预设时长内的搜索记录;A first extraction module is used to extract a first type of search record and a second type of search record from the search log of the user terminal, wherein the first type of search record is a search record that conforms to the user behavior type, and the second type of search record is a search record whose corresponding search time has a time difference with the search time of the first type of search record within a preset time length;

标记模块,用于将所述第一类搜索记录和所述第二类搜索记录中符合所述用户行为类型的搜索记录标记为正样本,以及,将所述第二类搜索记录中不符合所述用户行为类型的搜索记录标记为负样本;a marking module, configured to mark the search records in the first category of search records and the second category of search records that match the user behavior type as positive samples, and mark the search records in the second category of search records that do not match the user behavior type as negative samples;

训练模块,用于根据所述正样本和所述负样本,对预设模型进行训练,得到所述到店概率预测模型。The training module is used to train the preset model according to the positive samples and the negative samples to obtain the store arrival probability prediction model.

可选地,所述训练模块包括:Optionally, the training module includes:

特征提取子模块,用于对所述正样本和所述负样本分别进行特征提取,确定所述正样本和所述负样本各自对应的样本用户的特征、所述正样本和所述负样本各自对应的店铺列表中各个样本店铺的特征以及样本用户-样本店铺交叉特征,所述样本用户-样本店铺交叉特征是对所述样本用户的特征与所述各个样本店铺的特征进行特征交叉得到的;A feature extraction submodule is used to extract features from the positive samples and the negative samples respectively, and determine features of sample users corresponding to the positive samples and the negative samples, features of each sample store in the store list corresponding to the positive samples and the negative samples, and sample user-sample store cross features, where the sample user-sample store cross features are obtained by cross-feature analysis of the features of the sample users and the features of each sample store;

训练子模块,用于以所述样本用户的特征、所述各个样本店铺的特征以及所述样本用户-样本店铺交叉特征为训练样本,对所述预设模型进行训练,得到所述到店概率预测模型。The training submodule is used to train the preset model using the characteristics of the sample users, the characteristics of each sample store, and the cross-features of the sample users and sample stores as training samples to obtain the store arrival probability prediction model.

可选地,所述装置还包括:Optionally, the device further comprises:

第二提取模块,用于从所述用户终端的搜索日志中提取对应的搜索时刻在确定所述用户到达所述目标店铺之后的搜索记录;A second extraction module is used to extract, from the search log of the user terminal, the search record at the corresponding search time after it is determined that the user has arrived at the target store;

第一权重调整模块,用于在提取的搜索记录是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为正样本,并增加所述正样本的权重;a first weight adjustment module, configured to mark the extracted search record as a positive sample and increase the weight of the positive sample when the extracted search record is a search record for the target store;

第二权重调整模块,用于在所述提取的搜索记录不是针对所述目标店铺的搜索记录的情况下,将所述提取的搜索记录标记为负样本,并减少所述负样本的权重;以及a second weight adjustment module, configured to mark the extracted search record as a negative sample and reduce the weight of the negative sample when the extracted search record is not a search record for the target store; and

更新模块,用于根据增加权重后的正样本和减少权重后的负样本,对所述到店概率预测模型进行更新。The updating module is used to update the store arrival probability prediction model according to the positive samples with increased weights and the negative samples with reduced weights.

可选地,所述用户的特征包括以下至少一者:所述用户终端扫描或连接到的WIFI名称及相应的信号强度、所述用户终端的设备类型、所述用户终端的经纬度、所述用户终端的IP地址、所述用户的用户画像、以及所述用户的消费偏好。Optionally, the user's characteristics include at least one of the following: the name of the WIFI scanned or connected to by the user terminal and the corresponding signal strength, the device type of the user terminal, the longitude and latitude of the user terminal, the IP address of the user terminal, the user's user portrait, and the user's consumption preferences.

可选地,所述各个店铺的特征包括以下至少一者:所述各个店铺的标识、所述各个店铺的WIFI名称、所述各个店铺的WIFI平均连接或扫描强度、所述各个店铺的经纬度、所述各个店铺所属的类目、所述各个店铺售卖的商品的价格区间、所述各个店铺的点击率以及所述各个店铺的访购率。Optionally, the characteristics of each store include at least one of the following: the logo of each store, the WIFI name of each store, the average WIFI connection or scanning strength of each store, the longitude and latitude of each store, the category to which each store belongs, the price range of the goods sold in each store, the click-through rate of each store, and the visit rate of each store.

可选地,所述用户-店铺交叉特征是通过以下至少一种方式得到的:Optionally, the user-store cross feature is obtained by at least one of the following methods:

根据所述用户终端的经纬度和所述各个店铺的经纬度,确定所述用户终端与所述各个店铺的直线距离;Determining the straight-line distance between the user terminal and each store according to the longitude and latitude of the user terminal and the longitude and latitude of each store;

对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端扫描或连接到的店铺的WIFI平均连接或扫描强度进行特征交叉;Performing feature cross-talk on the signal strength of the WIFI of the store scanned or connected by the user terminal and the average connection or scanning strength of the WIFI of the store scanned or connected by the user terminal;

对所述用户终端扫描或连接到的店铺的WIFI的信号强度,与所述用户终端与所述用户终端扫描或连接到的店铺的直线距离进行特征交叉;和/或Performing feature intersection on the signal strength of the WIFI of the store scanned or connected by the user terminal and the straight-line distance between the user terminal and the store scanned or connected by the user terminal; and/or

对所述用户终端的用户点击或消费价格与所述各个店铺的人均价格进行特征交叉。A feature cross-talk is performed between the user click or consumption price of the user terminal and the per capita price of each store.

基于同一发明构思,本申请另一实施例提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本申请上述任一实施例所述的方法中的步骤。Based on the same inventive concept, another embodiment of the present application provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the steps in the method described in any of the above embodiments of the present application are implemented.

基于同一发明构思,本申请另一实施例提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行时实现本申请上述任一实施例所述的方法中的步骤。Based on the same inventive concept, another embodiment of the present application provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executed, implements the steps of the method described in any of the above embodiments of the present application.

对于装置实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。As for the device embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and the relevant parts can be referred to the partial description of the method embodiment.

本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments. The same or similar parts between the various embodiments can be referenced to each other.

本领域内的技术人员应明白,本申请实施例的实施例可提供为方法、装置、或计算机程序产品。因此,本申请实施例可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art will appreciate that the embodiments of the present application can be provided as methods, devices, or computer program products. Therefore, the present application can adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment in combination with software and hardware. Moreover, the present application can adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.

本申请实施例是参照根据本申请实施例的方法、终端设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application embodiment is described with reference to the flowchart and/or block diagram of the method, terminal device (system) and computer program product according to the embodiment of the present application. It should be understood that each process and/or box in the flowchart and/or block diagram, and the combination of the process and/or box in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor or other programmable data processing terminal device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing terminal device produce a device for realizing the function specified in one process or multiple processes in the flowchart and/or one box or multiple boxes in the block diagram.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal device to operate in a specific manner, so that the instructions stored in the computer-readable memory produce a manufactured product including an instruction device that implements the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上,使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing terminal device so that a series of operating steps are executed on the computer or other programmable terminal device to produce computer-implemented processing, so that the instructions executed on the computer or other programmable terminal device provide steps for implementing the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.

尽管已描述了本申请实施例的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例做出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本申请实施例范围的所有变更和修改。Although the preferred embodiments of the present application have been described, those skilled in the art may make additional changes and modifications to these embodiments once they have learned the basic creative concept. Therefore, the appended claims are intended to be interpreted as including the preferred embodiments and all changes and modifications that fall within the scope of the embodiments of the present application.

最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。Finally, it should be noted that, in this article, relational terms such as first and second, etc. are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or terminal device including a series of elements includes not only those elements, but also other elements not explicitly listed, or also includes elements inherent to such process, method, article or terminal device. In the absence of further restrictions, the elements defined by the sentence "comprise a ..." do not exclude the existence of other identical elements in the process, method, article or terminal device including the elements.

以上对本申请所提供的一种到店预测方法、装置、可读存储介质及电子设备,进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The above is a detailed introduction to a store arrival prediction method, device, readable storage medium and electronic device provided by the present application. Specific examples are used in this article to illustrate the principles and implementation methods of the present application. The description of the above embodiments is only used to help understand the method of the present application and its core idea; at the same time, for general technical personnel in this field, according to the idea of the present application, there will be changes in the specific implementation method and application scope. In summary, the content of this specification should not be understood as a limitation on the present application.

Claims (9)

1. A method of store arrival prediction, the method comprising:
obtaining a search request initiated by a user terminal;
Extracting features of the user terminal, features of each store in a store list corresponding to the search request and user-store crossing features are determined, wherein the user-store crossing features are obtained by performing feature crossing on the features of the user and the features of each store;
Inputting the characteristics of the user, the characteristics of each store and the user-store intersection characteristics into a pre-trained store arrival probability prediction model, and determining the probability of the user arriving at each store;
determining whether the user arrives at a target store according to the probability that the user arrives at each store, wherein the target store is one of the stores;
Wherein the characteristics of each store include at least one of: the identification of each store, the WIFI name of each store, the WIFI average connection or scanning intensity of each store, the longitude and latitude of each store, the category to which each store belongs, the price interval of the commodity sold by each store, the click rate of each store and the visit rate of each store;
the user-store intersection feature is obtained by at least one of:
The signal intensity of the WIFI of the store scanned or connected by the user terminal is subjected to characteristic intersection with the average connection or scanning intensity of the WIFI of the store scanned or connected by the user terminal;
The signal intensity of WIFI of the store scanned or connected by the user terminal is subjected to characteristic intersection with the linear distance between the user terminal and the store scanned or connected by the user terminal, wherein the linear distance between the user terminal and the store scanned or connected by the user terminal is determined according to the longitude and latitude of the user terminal and the longitude and latitude of each store;
and performing characteristic intersection on the user click or consumption price of the user terminal and the average price of each store.
2. The method of claim 1, wherein the step of determining whether the user arrives at a target store based on the probability that the user arrives at the respective store comprises:
determining that the user arrives at the target store under the condition that the probability of the user arriving at the target store is greater than a preset probability threshold;
Or determining that the user arrives at the target store under the condition that the probability of the user arriving at the target store is larger than a preset probability threshold and the parameter value corresponding to the search request is in a preset effective parameter value range.
3. The method according to claim 1, wherein the method further comprises:
determining a user behavior type associated with a user going to a store;
Extracting a first type of search record and a second type of search record from a search log of the user terminal, wherein the first type of search record is a search record conforming to the user behavior type, and the second type of search record is a search record with a time difference between a corresponding search time and a search time of the first type of search record within a preset duration;
marking the search records which are in accordance with the user behavior type in the first type search record and the second type search record as positive samples, and marking the search records which are not in accordance with the user behavior type in the second type search record as negative samples;
training a preset model according to the positive sample and the negative sample to obtain the arrival probability prediction model.
4. The method of claim 3, wherein the training the preset model based on the positive and negative samples to obtain the arrival probability prediction model comprises:
Respectively extracting the characteristics of the positive sample and the negative sample, and determining the characteristics of sample users corresponding to the positive sample and the negative sample, the characteristics of each sample store in a store list corresponding to the positive sample and the negative sample and sample user-sample store crossing characteristics, wherein the sample user-sample store crossing characteristics are obtained by carrying out characteristic crossing on the characteristics of the sample users and the characteristics of each sample store;
And training the preset model by taking the characteristics of the sample user, the characteristics of each sample store and the sample user sample store cross characteristics as training samples to obtain the store arrival probability prediction model.
5. The method of claim 2, wherein after the step of determining that the user arrived at the target store, the method further comprises:
Extracting a search record of the corresponding search moment after determining that the user arrives at the target store from a search log of the user terminal;
In the case where the extracted search record is a search record for the target store, marking the extracted search record as a positive sample, and increasing the weight of the positive sample;
marking the extracted search record as a negative sample and reducing the weight of the negative sample in the event that the extracted search record is not a search record for the target store; and updating the arrival probability prediction model according to the positive sample after increasing the weight and the negative sample after reducing the weight.
6. The method of claim 1, wherein the characteristics of the user include at least one of: the WIFI name and corresponding signal strength that the user terminal scans or connects to, the equipment type of the user terminal, the longitude and latitude of the user terminal, the IP address of the user terminal, the user portrait of the user, and the consumption preference of the user.
7. An arrival prediction apparatus, the apparatus comprising:
the acquisition module is used for acquiring a search request initiated by the user terminal;
The feature extraction module is used for carrying out feature extraction on the search request, and determining the features of the user terminal, the features of each store in the store list corresponding to the search request and the user-store cross features, wherein the user-store cross features are obtained by carrying out feature cross on the features of the user and the features of each store;
The probability prediction module is used for inputting the characteristics of the user, the characteristics of each store and the user-store intersection characteristics into a pre-trained store arrival probability prediction model and determining the probability of the user arriving at each store;
The determining module is used for determining whether the user arrives at a target store according to the probability that the user arrives at each store, wherein the target store is one of the stores;
Wherein the characteristics of each store include at least one of: the identification of each store, the WIFI name of each store, the WIFI average connection or scanning intensity of each store, the longitude and latitude of each store, the category to which each store belongs, the price interval of the commodity sold by each store, the click rate of each store and the visit rate of each store;
the user-store intersection feature is obtained by at least one of:
The signal intensity of the WIFI of the store scanned or connected by the user terminal is subjected to characteristic intersection with the average connection or scanning intensity of the WIFI of the store scanned or connected by the user terminal;
The signal intensity of WIFI of the store scanned or connected by the user terminal is subjected to characteristic intersection with the linear distance between the user terminal and the store scanned or connected by the user terminal, wherein the linear distance between the user terminal and the store scanned or connected by the user terminal is determined according to the longitude and latitude of the user terminal and the longitude and latitude of each store;
and performing characteristic intersection on the user click or consumption price of the user terminal and the average price of each store.
8. A computer readable storage medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the steps of the method according to any of claims 1-6.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor performs the steps of the method according to any of claims 1-6.
CN201910610494.8A 2019-07-08 2019-07-08 Store arrival prediction method and device, readable storage medium and electronic equipment Active CN110472995B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910610494.8A CN110472995B (en) 2019-07-08 2019-07-08 Store arrival prediction method and device, readable storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910610494.8A CN110472995B (en) 2019-07-08 2019-07-08 Store arrival prediction method and device, readable storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN110472995A CN110472995A (en) 2019-11-19
CN110472995B true CN110472995B (en) 2024-11-01

Family

ID=68507089

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910610494.8A Active CN110472995B (en) 2019-07-08 2019-07-08 Store arrival prediction method and device, readable storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN110472995B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111194005A (en) * 2019-12-05 2020-05-22 中国科学院地理科学与资源研究所 An indoor pedestrian semantic location extraction method and prediction method
JP6932761B2 (en) * 2019-12-10 2021-09-08 東芝テック株式会社 Shopping support device and shopping support program, server and its control program, and shopping support method
CN111275479B (en) * 2020-01-07 2023-11-10 北京爱笔科技有限公司 People flow prediction method, device and system
CN113543117B (en) * 2020-04-22 2022-10-04 中国移动通信集团重庆有限公司 Method, device and computing device for predicting number portability users
CN111831967B (en) * 2020-06-19 2024-11-15 北京嘀嘀无限科技发展有限公司 A method, device, electronic device and medium for identifying arrival at a store
CN112561569B (en) * 2020-12-07 2024-02-27 上海明略人工智能(集团)有限公司 Dual-model-based store arrival prediction method, system, electronic equipment and storage medium
CN112566033B (en) * 2021-02-24 2021-05-11 浙江口碑网络技术有限公司 Method and device for uploading and acquiring characteristics of wireless signals
CN118075873B (en) * 2024-04-19 2024-06-21 浙江口碑网络技术有限公司 Positioning method and data processing method based on wireless network data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107203518A (en) * 2016-03-16 2017-09-26 阿里巴巴集团控股有限公司 Method, system and device, the electronic equipment of on-line system personalized recommendation
CN107481093A (en) * 2017-07-21 2017-12-15 北京京东尚科信息技术有限公司 Personalized shop Forecasting Methodology and device

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663033A (en) * 2012-03-23 2012-09-12 汉海信息技术(上海)有限公司 Method for searching interest points in designated area of map by hand-drawing way
US9223831B2 (en) * 2013-01-02 2015-12-29 Codeq Llc System, method and computer program product for searching summaries of mobile apps reviews
CN105446972B (en) * 2014-06-17 2022-06-10 阿里巴巴集团控股有限公司 Searching method, device and system based on and fused with user relationship data
US9703929B2 (en) * 2014-10-21 2017-07-11 uBiome, Inc. Method and system for microbiome-derived diagnostics and therapeutics
JP6478734B2 (en) * 2014-10-31 2019-03-06 株式会社東芝 Item recommendation device, item recommendation method and program
CN106485562B (en) * 2015-09-01 2020-12-04 苏宁云计算有限公司 Commodity information recommendation method and system based on user historical behaviors
CN106055607B (en) * 2016-05-25 2020-05-19 百度在线网络技术(北京)有限公司 User visit prediction model establishment method and device and user visit prediction method and device
CN107609960A (en) * 2017-10-18 2018-01-19 口碑(上海)信息技术有限公司 Rationale for the recommendation generation method and device
US11250340B2 (en) * 2017-12-14 2022-02-15 Microsoft Technology Licensing, Llc Feature contributors and influencers in machine learned predictive models
CN108230040B (en) * 2018-01-17 2020-09-25 口碑(上海)信息技术有限公司 Store arrival prediction method and device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107203518A (en) * 2016-03-16 2017-09-26 阿里巴巴集团控股有限公司 Method, system and device, the electronic equipment of on-line system personalized recommendation
CN107481093A (en) * 2017-07-21 2017-12-15 北京京东尚科信息技术有限公司 Personalized shop Forecasting Methodology and device

Also Published As

Publication number Publication date
CN110472995A (en) 2019-11-19

Similar Documents

Publication Publication Date Title
CN110472995B (en) Store arrival prediction method and device, readable storage medium and electronic equipment
US12299725B2 (en) System and method for generating geographic zone information for consumers
US20210334875A1 (en) Commodity recommendation method and commodity recommendation device
CN111949834A (en) Site selection method and site selection platform
KR102000076B1 (en) Method and server for recommending online sales channel on online shoppingmall intergrated management system
KR101868583B1 (en) Method for providing affiliate store recommendation service using bigdata analysis with objective information
CN102667840A (en) Context information using system, device and method
CN105279672A (en) Lead recommendations
JP5802716B2 (en) Information processing system, program, and verification method
KR20200115694A (en) System and method for providing of trading area information using big data
CN108429776B (en) Network object pushing method, device, client, interaction equipment and system
US20150178768A1 (en) System and method for intermediating electronic commerce using offline transaction information
KR102286661B1 (en) Chat based on-demand shopping curation apparatus
Shi et al. A generalized count model on customers' purchases in O2O market
TWI778335B (en) Information promotion method, mobile device and computer program product
CN112232924B (en) Regional store planning method
CN110796300A (en) User consumption level-based travel service recommendation system, method and terminal
KR102592795B1 (en) System for trading steel scrap material
KR100799628B1 (en) How to provide commercial business analysis service using real-time integration of company information
KR102439510B1 (en) Customer-tailored store-linked discount rate and store recommendation system and method
CN106296253A (en) A kind of ad data security control and the method and system of issue
CN110659920A (en) Method and device for recommending service object
KR20220112205A (en) Method And Apparatus for Evaluating Market Price of Used Product
CN115345690A (en) Product recommendation method, device, equipment and storage medium
KR102698606B1 (en) Company sales forecasting system and company sales forecasting method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载