CN112149502A

CN112149502A - Unfavorable geology positioning forecasting method based on convolutional neural network

Info

Publication number: CN112149502A
Application number: CN202010838249.5A
Authority: CN
Inventors: 陈再励; 吴立; 程瑶; 董道军; 闫天俊; 李丽平; 张美霞
Original assignee: China University of Geosciences Wuhan
Current assignee: China University of Geosciences Wuhan
Priority date: 2020-08-19
Filing date: 2020-08-19
Publication date: 2020-12-29

Abstract

The present invention provides a method for predicting poor geological location based on convolutional neural network, comprising: firstly establishing a geophysical detection image data set; then building a target prediction and positioning neural network model based on feature extraction, using the image data set, The target prediction and positioning neural network model is trained to obtain a trained target prediction and positioning neural network model; finally, the image result data obtained by a certain physical detection method is input into the trained target prediction and positioning neural network model, and the actual bad geology is carried out. Positioning prediction. The beneficial effects of the invention are: the location scale and nature state of the unfavorable geological bodies within the range passed in the construction process of underground engineering such as tunnels can be accurately predicted, the decision basis for engineering design and construction management is provided, and the existing geological prediction geophysical prospecting is reduced. It can improve the safety of engineering construction by solving the problems of low interpretability of the method, relying on expert experience, and low prediction accuracy.

Description

A prediction method of bad geological location based on convolutional neural network

技术领域technical field

本发明涉及不良地质定位预报技术领域，尤其涉及一种基于卷积神经网络的不良地质定位预报方法。The invention relates to the technical field of bad geological positioning prediction, in particular to a bad geological positioning forecasting method based on a convolutional neural network.

背景技术Background technique

在部分实际工程项目中，我们发现富水破碎带这一不良地质条件为隧道建设带来重大风险的问题，当前多采用基于物探法进行隧道地质超前预报，但现有的超前地质预报，通过物探传感器检测得到的波形数据图像，需要经过具有经验的专家来解释，在预报过程中存在着对物探传感得到测量结果图的解释性低、依赖专家经验、预测准确率不高的问题，因此针对某一特定不良地质(如富水破碎带)在超前预报图像中进行精准预测定位，十分关键。In some actual engineering projects, we found that the poor geological conditions of the water-rich broken zone bring major risks to the tunnel construction. Currently, the geophysical prospecting method is used to carry out advanced geological forecasting of tunnels. The waveform data image detected by the sensor needs to be interpreted by experienced experts. In the prediction process, there are problems such as low interpretability of the measurement result map obtained by geophysical sensing, relying on expert experience, and low prediction accuracy. It is very important to accurately predict and locate a certain unfavorable geology (such as the water-rich fracture zone) in the advance forecast image.

近年随着深度学习(Deep Learning)的快速发展，卷积神经网络模型(CNN，Convolutional Neural Networks)通过其特征采样、权重共享、运算降维的特性，在图像分类、目标检测、图像理解的领域取得了广泛的应用。本方法基于深度卷积神经网络对物探法(地质雷达、TSP、瞬变电磁等)探测图像进行自动辨识及概率分类，研发具有较好泛化性能的预测网络模型系统，提升超前地质预报中对富水破碎带这一不良地质的可解释性及准确度。With the rapid development of Deep Learning in recent years, Convolutional Neural Networks (CNN, Convolutional Neural Networks), through its features of feature sampling, weight sharing, and operational dimensionality reduction, are used in the fields of image classification, target detection, and image understanding. has been widely used. This method is based on the deep convolutional neural network to automatically identify and probabilize the detection images of geophysical methods (geo-radar, TSP, transient electromagnetic, etc.) Interpretability and accuracy of the unfavorable geology of the water-rich fracture zone.

发明内容SUMMARY OF THE INVENTION

为了解决上述问题，本发明提供了一种基于卷积神经网络的不良地质定位预报方法，主要包括以下步骤：In order to solve the above problems, the present invention provides a method for predicting poor geological location based on a convolutional neural network, which mainly includes the following steps:

本申请以富水破碎带这一不良地质超前预报的示例说明，本方法可以适用于物探法图像数据，实现富水破碎带或其他不良地质的精准预报。This application uses an example of an unfavorable geological advance forecast of the water-rich fracture zone, and this method can be applied to geophysical image data to achieve accurate prediction of the water-rich fracture zone or other bad geology.

一种基于卷积神经网络的不良地质定位预报方法，包括以下步骤：A method for predicting poor geological location based on convolutional neural network, comprising the following steps:

S101：建立物探法探测图像数据集；S101: establish a geophysical detection image data set;

S102：构建基于特征提取的目标预测定位神经网络模型；S102: construct a target prediction and positioning neural network model based on feature extraction;

S103：采用所述图像数据集，对所述目标预测定位神经网络模型进行训练，得到训练好的目标预测定位神经网络模型；S103: Using the image data set, train the target prediction and positioning neural network model to obtain a trained target prediction and positioning neural network model;

S104：输入某物探法得到的图像结果数据至所述训练好的目标预测定位神经网络模型中，进行实际不良地质定位预测。S104: Input the image result data obtained by a geophysical method into the trained neural network model for target prediction and positioning, and perform actual bad geological positioning prediction.

进一步地，步骤S101中，建立物探法探测图像数据集；具体包括：Further, in step S101, a geophysical detection image data set is established; specifically, it includes:

S201：通过超前地质预报案例收集、实际现场项目数据采集，建立物探法探测结果初步图像数据集；所述初步图像数据集包括基于物探法进行超前地质预报得到的多张结果图像；S201: Establish a preliminary image data set of geophysical detection results through advanced geological prediction case collection and actual on-site project data collection; the preliminary image data set includes a plurality of result images obtained from advanced geological prediction based on the geophysical method;

S202：根据富水带破碎带预报机理及其图像学特征，对所述初步图像数据集中各图像上的异常地质进行预报；S202: Predict the abnormal geology on each image in the preliminary image data set according to the prediction mechanism of the fractured zone of the water-rich zone and its image characteristics;

S203：结合预报结论，并采用专家经验法再次确定所述初步图像数据集中各图像上富水破碎带的位置，以富水破碎带定位区域为标签内容对所述初步图像数据集中各图像进行数据标注，得到标注后的初步图像数据集；S203: Combine the prediction conclusion, and use the expert experience method to re-determine the position of the water-rich crushing zone on each image in the preliminary image data set, and use the location area of the water-rich crushing zone as the label content to perform data analysis on each image in the preliminary image data set Annotate, get the initial image dataset after annotation;

S204：采用多种图像数据增广方法，对所述标注后的初步图像数据集进行扩容增广，并耦合部分物探法探测富水破碎带负样本数据，得到最终的图像数据集。S204: Using a variety of image data augmentation methods, expand and augment the marked preliminary image data set, and couple with some geophysical methods to detect negative sample data of the water-rich broken zone to obtain a final image data set.

进一步地，步骤S201中，所述物探法包括TSP、地质雷达和瞬变电磁等；所述初步图像数据集中样本容量大于500幅图像。Further, in step S201, the geophysical method includes TSP, geological radar, transient electromagnetic, etc.; the sample size in the preliminary image data set is greater than 500 images.

进一步地，步骤S102中，所述的基于特征提取的目标预测定位神经网络模型包括依次连接的特征提取基础网络和定位预测网络；Further, in step S102, the target prediction and positioning neural network model based on feature extraction includes a feature extraction basic network and a positioning prediction network that are connected in sequence;

其中，所述特征提取基础网络包括依次连接的：CBR块、最大池化层、BaseRN1层、2个BaseRN0层、BaseRN1层、3个BaseRN0层、BaseRN1层、5个BaseRN0层；Wherein, the feature extraction basic network includes sequentially connected: CBR block, maximum pooling layer, BaseRN1 layer, 2 BaseRN0 layers, BaseRN1 layer, 3 BaseRN0 layers, BaseRN1 layer, 5 BaseRN0 layers;

所述定位预测网络包括依次连接的最大池化层、全连接层和Sigmoid层。The localization prediction network includes a max pooling layer, a fully connected layer and a sigmoid layer that are connected in sequence.

进一步地，所述CBR块包括依次连接的卷基层、正则化层和激活层；其中，C代表卷积层，B代表正则化层Batch Norm，R代表激活层，采用Leaky-ReLU激活函数。Further, the CBR block includes a convolutional base layer, a regularization layer and an activation layer connected in sequence; wherein, C represents the convolution layer, B represents the regularization layer Batch Norm, and R represents the activation layer, using the Leaky-ReLU activation function.

进一步地，BaseRN1层和BaseRN0层均基于残差网络思想设计，引入残差项，便于构建深度网络模型；其中，BaseRN0层和BaseRN1层中主干均为CBR-CBR-CB模块，且BaseRN0层和BaseRN1层中主干CBR-CBR-CB模块参数一致，具体为：第一个CBR块的滤波器尺寸为1×1，64个通道；第二个CBR块的滤波器尺寸为3×3，64个通道；第三个CB块的滤波器尺寸为1×1，256个通道；BaseRN0层中分支CB块的滤波器尺寸与主干的CB块滤波器尺寸一致，BaseRN1中分支CBR块的滤波器尺寸与主干的第一个CBR块的滤波器尺寸一致；所述CB块包括依次连接的卷基层和正则化层。Further, both the BaseRN1 layer and the BaseRN0 layer are designed based on the residual network idea, and the residual term is introduced to facilitate the construction of a deep network model; among them, the backbones in the BaseRN0 layer and the BaseRN1 layer are CBR-CBR-CB modules, and the BaseRN0 layer and BaseRN1 The parameters of the backbone CBR-CBR-CB modules in the layer are the same, specifically: the filter size of the first CBR block is 1×1, 64 channels; the filter size of the second CBR block is 3×3, 64 channels. ; The filter size of the third CB block is 1 × 1, 256 channels; the filter size of the branch CB block in the BaseRN0 layer is consistent with the filter size of the backbone CB block, and the filter size of the branch CBR block in BaseRN1 is the same as that of the backbone. The filter size of the first CBR block is the same; the CB block includes a convolutional base layer and a regularization layer connected in sequence.

进一步地，步骤S103中，在所述特征提取基础网络中，特征提取层的输出特征映射以16倍的因子进行四级下采样，在空间分辨率和提取特征强度之间进行折衷,激活层Leaky-ReLU的最后输出为14×14×1024；所述特征提取网络采用四个阶段对输入图像进行2倍的降采样，实现16倍的下采样：具体如下：Further, in step S103, in the feature extraction basic network, the output feature map of the feature extraction layer is down-sampled by a factor of 16 in four levels, and a compromise is made between the spatial resolution and the extracted feature strength, and the activation layer Leaky - The final output of ReLU is 14×14×1024; the feature extraction network uses four stages to downsample the input image by 2 times to achieve 16 times downsampling: the details are as follows:

第一阶段：输入尺寸大小为W×H×C的三通道RGB图像，在第一阶段输入进入一个CBR块；其中，W＝256为图像的宽，H＝256为图像的高，C＝3代表图像三通道；此CBR块中滤波器尺寸为1×1，64个通道，经过CBR块一次下采样，此时输出的图像尺寸大小为128×128×64；The first stage: input a three-channel RGB image with a size of W×H×C, and enter a CBR block in the first stage; where W=256 is the width of the image, H=256 is the height of the image, and C=3 Represents three channels of the image; the filter size in this CBR block is 1×1, 64 channels, after the CBR block is downsampled once, the output image size is 128×128×64;

第二阶段：首先通过一个过滤器为3×3和步幅2的最大池化层，然后再通过一个BaseRN1层和2个BaseRN0层；通过第二阶段后输出的图像尺寸大小为64×64×256；The second stage: first pass a max pooling layer with a filter of 3 × 3 and stride 2, and then pass through a BaseRN1 layer and 2 BaseRN0 layers; the size of the output image after passing through the second stage is 64 × 64 × 256;

第三阶段：将第二阶段的输出输入至依次连接的1个BaseRN1层和3个BaseRN0层，通过BaseRN0层、BaseRN1层的组合得到更深层的网络模型结构，输出图形尺寸大小为32×32×512；The third stage: Input the output of the second stage to 1 BaseRN1 layer and 3 BaseRN0 layers connected in sequence, and obtain a deeper network model structure through the combination of the BaseRN0 layer and the BaseRN1 layer, and the output graph size is 32×32× 512;

第四阶段：将第三阶段的输出输入至依次连接的1个BaseRN1层和5个BaseRN0层，通过BaseRN0、BaseRN1的组合进一步得到更深层的网络模型结构，输出图像尺寸大小为16×16×1024。The fourth stage: Input the output of the third stage to 1 BaseRN1 layer and 5 BaseRN0 layers connected in sequence, and further obtain a deeper network model structure through the combination of BaseRN0 and BaseRN1, and the output image size is 16×16×1024 .

进一步地，在定位预测网络中，将特征提取网络的输出结果输入定位预测网络中，通过最大池化层有效降低模型参数误差造成的特征估计均值的偏移影响，最后连接全连接层及Sigmoid层进行回归定位，通过多任务损失函数的设计实现对位置和分类的同时检测。Further, in the localization prediction network, the output of the feature extraction network is input into the localization prediction network, the maximum pooling layer is used to effectively reduce the offset effect of the feature estimation mean caused by the model parameter error, and finally the fully connected layer and the sigmoid layer are connected. Perform regression positioning, and realize simultaneous detection of position and classification through the design of multi-task loss function.

进一步地，步骤S103中，训练时，为实现对某一不良地质的精准定位预报，设计多任务耦合的损失函数，在定位预测网络最后通过sigmoid回归方法实现对不良地质的概率置信度判断及定位区域数据输出；具体的：Further, in step S103, during training, in order to achieve accurate positioning prediction for a certain unfavorable geology, a multi-task coupling loss function is designed, and the sigmoid regression method is finally used in the positioning prediction network to realize the probability confidence judgment and positioning of the unfavorable geology. Area data output; specific:

损失函数的设计针对不良地质目标预测置信度的判断和目标定位两大任务，分别对应构建目标损失函数，其中：The design of the loss function is aimed at the two tasks of judging the confidence of bad geological target prediction and target positioning, and correspondingly constructing the target loss function, among which:

目标预测置信度：目标预测置信度损失函数L_conf(o,p)，表示预测的目标区域为所要监测的不良地质的概率，采用二值交叉熵损失表征：Target prediction confidence: the target prediction confidence loss function L _conf (o,p), which represents the probability that the predicted target area is the bad geology to be monitored, and is represented by binary cross-entropy loss:

其中，o_i∈{0,1}表示第i个预测框中是否存在目标，0表示不存在，1表示存在；

表示第i个预测框中是否存在目标的概率，通过对目标预测定位神经网络模型输出的结果p_i进行sigmoid求值得到，

属于(0,1)；i＝1,2,…,I，I为预测的结果框个数；L_conf(o,p)的值受预测框个数和值的影响，范围不定，L_conf(o,p)越小，损失越小说明预测结果越准确；Among them, o _i ∈ {0,1} indicates whether there is a target in the i-th prediction box, 0 indicates no existence, and 1 indicates existence;

Indicates the probability of whether there is a target in the i-th prediction box, which is obtained by sigmoid evaluation of the result p _i output by the target prediction and positioning neural network model,

Belongs to (0,1); i=1,2,...,I, I is the number of predicted result boxes; the value of L _conf (o,p) is affected by the number and value of predicted boxes, and the range is indeterminate, L _conf The smaller the (o, p), the smaller the loss, the more accurate the prediction result;

目标定位：目标定位损失函数L_loc(μ,σ)，为提高结果定位精确度，对定位框的预测包围框Boundingbox结果(x,y,w,h)引入高斯分布建模，得到各个参数的均值与方差；Boundingbox为一个长方形框，(x,y)表示其中心点位置，(w,h)表示其大小；Target positioning: target positioning loss function L _loc (μ,σ), in order to improve the accuracy of the result positioning, the Gaussian distribution modeling is introduced into the predicted bounding box results (x, y, w, h) of the positioning box, and the results of each parameter are obtained. Mean and variance; Boundingbox is a rectangular box, (x, y) represents the position of its center point, and (w, h) represents its size;

目标定位损失函数采用负对数似然损失(NLL,negative log likelihood loss)表征：

The target localization loss function is represented by a negative log likelihood loss (NLL, negative log likelihood loss):

其中，

in,

其中，W×H是输入图像的宽、高，将输入图像分割为H行W列个网格；I是预测框个数，i、j、k索引得到第k行第j列的第i个预测框及其基坐标；γ_t是对(x,y,w,h)预测结果权重影响的超参数，是模型训练的一个超参数，随着训练逐步逼近一个稳定值，可以有效保障训练结果的准确性；G_ijk代表在H×W的坐标系下，第k行第j列像素点位置上第i个预测框的结果真实值{x,y,w,h}_truth，满足均值为μ，方差为σ的正太分布G～N(μ,σ²)；μ_t,σ_t,t∈{x,y,w,h}是模型预测结果，通过sigmoid函数转化到(0,1)范围内，表示在当前t＝i,j,k索引的预测框内的中心坐标，μ_w,μ_h表示以μ_x,μ_y为中心点的预测区域矩形框的长宽；同时用方差量σ_t,t∈{x,y,w,h}表征预测结果可靠性，0表示可靠，1表示不可靠；Among them, W×H is the width and height of the input image, and the input image is divided into grids of H rows and W columns; I is the number of prediction frames, and the i, j, and k indices get the i-th row of the k-th row and the j-th column. Prediction frame and its base coordinates; γ _t is a hyperparameter that affects the weight of (x, y, w, h) prediction results, and is a hyperparameter for model training. As the training gradually approaches a stable value, the training results can be effectively guaranteed accuracy; G _ijk represents the true value {x, y, w, h} _truth of the result of the i-th prediction frame at the pixel position of the k-th row and the j-th column in the H×W coordinate system, satisfying the mean value μ , the normal distribution G～N(μ,σ ² ) with variance σ; μ _t ,σ _t , t∈{x,y,w,h} is the model prediction result, which is transformed to the (0,1) range by the sigmoid function , represents the center coordinates in the prediction frame of the current t=i, j, k index, μ _w , μ _h represent the length and width of the rectangular frame of the prediction area with μ _x and μ _y as the center points; at the same time, the variance σ is used _t , t∈{x,y,w,h} represents the reliability of the prediction result, 0 means reliable, 1 means unreliable;

综合L_conf(o,p)和L_loc(μ,σ)得到多任务损失函数，通过权重因子λ₁,λ₂耦合两个损失函数得到综合的不良地质定位预报损失函数L(o,p,μ,σ)：Synthesize L _conf (o,p) and L _loc (μ,σ) to obtain a multi-task loss function, and couple the two loss functions with weight factors λ ₁ , λ ₂ to obtain a comprehensive bad geological location prediction loss function L(o,p, μ,σ):

L(o,p,μ,σ)＝λ₁L_conf(o,p)+λ₂L_loc(μ,σ)L(o,p,μ,σ)＝λ ₁ L _conf (o,p)+λ ₂ L _loc (μ,σ)

其中，λ₁,λ₂这两个权重因子根据具体任务需求设定。Among them, the two weighting factors λ ₁ and λ ₂ are set according to specific task requirements.

进一步地，在S101中构建的图像数据集属于小样本数据集；步骤S103中，通过迁移学习方法对目标预测定位神经网络模型进行训练，以提升训练速度。Further, the image data set constructed in S101 belongs to the small sample data set; in step S103 , the target prediction and positioning neural network model is trained by the transfer learning method to improve the training speed.

本发明提供的技术方案带来的有益效果是：本申请所提出的技术方案可以准确地预报隧道等地下工程建设过程中所通过范围内的不良地质体的位置规模和性质状态，为工程设计及施工管理部分提供决策依据，降低现有地质预报物探法解释性低、依赖专家经验、预测准确率不高的问题，提升工程施工的安全性。The beneficial effects brought about by the technical solution provided by the present invention are: the technical solution proposed in the present application can accurately predict the location scale and nature state of the unfavorable geological bodies within the range passed during the construction of underground projects such as tunnels, which is beneficial for engineering design and construction. The construction management part provides the basis for decision-making, reduces the problems of low interpretability of existing geophysical prediction methods, relies on expert experience, and low prediction accuracy, and improves the safety of engineering construction.

附图说明Description of drawings

下面将结合附图及实施例对本发明作进一步说明，附图中：The present invention will be further described below in conjunction with the accompanying drawings and embodiments, in which:

图1是本发明实施例中一种基于卷积神经网络的不良地质定位预报方法的流程图；1 is a flowchart of a method for predicting poor geological location based on a convolutional neural network in an embodiment of the present invention;

图2是本发明实施例中目标预测定位神经网络的结构图。FIG. 2 is a structural diagram of a target prediction and positioning neural network in an embodiment of the present invention.

具体实施方式Detailed ways

为了对本发明的技术特征、目的和效果有更加清楚的理解，现对照附图详细说明本发明的具体实施方式。In order to have a clearer understanding of the technical features, objects and effects of the present invention, the specific embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

本发明的实施例提供了一种基于卷积神经网络的不良地质定位预报方法。Embodiments of the present invention provide a method for predicting poor geological location based on a convolutional neural network.

请参考图1，图1是本发明实施例中一种基于卷积神经网络的不良地质定位预报方法的流程图，具体包括如下步骤：Please refer to FIG. 1. FIG. 1 is a flowchart of a method for predicting poor geological location based on a convolutional neural network in an embodiment of the present invention, which specifically includes the following steps:

为解决物探法超前地质预报因探测数据样本量少造成的训练过拟合、预测精度低的问题，基于图像数据增广方法实现数据集样本扩容。In order to solve the problems of over-fitting of training and low prediction accuracy caused by the small sample size of the detection data in the advance geological prediction of the geophysical method, the sample expansion of the data set is realized based on the image data augmentation method.

步骤S101中，建立物探法探测图像数据集；具体包括：In step S101, a geophysical detection image data set is established; specifically, it includes:

S201：通过超前地质预报案例收集、实际现场项目数据采集，建立物探法探测结果初步图像数据集；所述初步图像数据集包括基于物探方法(TSP、地质雷达、瞬变电磁等)进行超前地质预报得到的多张结果图像，收集整理后得到样本集合，为要处理的原始数据；所述初步图像数据集中样本容量大于500幅图像；S201: Establish a preliminary image data set of geophysical detection results through advanced geological prediction case collection and actual field project data collection; the preliminary image data set includes advanced geological prediction based on geophysical methods (TSP, geological radar, transient electromagnetic, etc.). The obtained multiple result images are collected and arranged to obtain a sample set, which is the original data to be processed; the sample capacity in the preliminary image data set is greater than 500 images;

S202：根据富水破碎带预报机理及其图像学特征，对所述初步图像数据集中各图像上的异常地质进行预报；并指导特征提取模型的网络结构设计；S202: Predict the abnormal geology on each image in the preliminary image data set according to the prediction mechanism of the water-rich fracture zone and its image characteristics; and guide the network structure design of the feature extraction model;

S204：采用多种图像数据增广方法，对所述标注后的初步图像数据集进行扩容增广，并耦合部分物探法探测富水破碎带负样本数据(即探测结果表明不含富水破碎带的数据)，得到最终的图像数据集；所述图像数据增广方法包括：图像镜像、平移、缩放、旋转、裁剪、高斯噪声等；S204: Using a variety of image data augmentation methods to expand and expand the labeled preliminary image data set, and couple with some geophysical methods to detect the negative sample data of the water-rich fractured zone (that is, the detection result indicates that the water-rich fractured zone is not included) data) to obtain the final image data set; the image data augmentation method includes: image mirroring, translation, scaling, rotation, cropping, Gaussian noise, etc.;

其中，最终的图像数据集中的图像数据大于1000幅；按8：2的比例，分为训练数据集和测试数据集，完成图像数据集构建。Among them, the image data in the final image data set is more than 1000; according to the ratio of 8:2, it is divided into a training data set and a test data set, and the construction of the image data set is completed.

针对隧道工程复杂地质环境下不良地质类型识别分类难度大的问题，基于深度学习的特征提取方法，设计具有目标预测定位功能的神经网络模型，判断图像中是否存在富水破碎带定位区域。Aiming at the difficulty of identifying and classifying unfavorable geological types in the complex geological environment of tunnel engineering, based on the feature extraction method of deep learning, a neural network model with the function of target prediction and positioning is designed to determine whether there is a water-rich fracture zone in the image.

根据多种不良地质在探测数据上的图形图像学机理，选择设计合适的目标预测定位网络模型结构，基于一个通用原则，更深层次的网络能够学习更复杂的函数和输入的表示形式，使得模型性能更好，本方法选择引入ResNet残差网络概念设计基础模块，设计超深度卷积网络模型结构。According to the graphic imagery mechanism of various bad geology on the detection data, the appropriate target prediction and positioning network model structure is selected and designed. Based on a general principle, deeper networks can learn more complex functions and input representations, so that the model performance can be improved. Better, this method chooses to introduce the ResNet residual network concept to design the basic module, and design the super-deep convolutional network model structure.

请参阅图2，图2是本发明实施例中目标预测定位神经网络的结构图；步骤S102中，所述的基于特征提取的目标预测定位神经网络模型包括依次连接的特征提取基础网络和定位预测网络；Please refer to FIG. 2. FIG. 2 is a structural diagram of a target prediction and positioning neural network in an embodiment of the present invention; in step S102, the feature extraction-based target prediction and positioning neural network model includes a feature extraction basic network and a positioning prediction network that are sequentially connected. network;

其中，所述CBR块包括依次连接的卷基层、正则化层和激活层；其中，C代表卷积层，B代表正则化层Batch Norm，R代表激活层；在本发明实施例中，采用Leaky-ReLU激活函数；避免传统ReLU激活函数在负区间神经元静默的问题；Wherein, the CBR block includes a convolutional base layer, a regularization layer and an activation layer connected in sequence; wherein, C represents a convolutional layer, B represents a regularization layer Batch Norm, and R represents an activation layer; in the embodiment of the present invention, Leaky -ReLU activation function; avoid the problem of silent neurons in the negative interval of the traditional ReLU activation function;

BaseRN1层和BaseRN0层均基于残差网络思想设计，引入残差项，便于构建深度网络模型；其中，BaseRN0层和BaseRN1层中主干均为CBR-CBR-CB模块，且BaseRN0层和BaseRN1层中主干CBR-CBR-CB模块参数一致，具体为：第一个CBR块的滤波器尺寸为1×1，64个通道；第二个CBR块的滤波器尺寸为3×3，64个通道；第三个CB块的滤波器尺寸为1×1，256个通道；BaseRN0层中分支CB块的滤波器尺寸与主干的CB块滤波器尺寸一致，BaseRN1中分支CBR块的滤波器尺寸与主干的第一个CBR块的滤波器尺寸一致；所述CB块包括依次连接的卷基层和正则化层；Both the BaseRN1 layer and the BaseRN0 layer are designed based on the residual network idea, and the residual term is introduced to facilitate the construction of a deep network model; among them, the backbone in the BaseRN0 layer and the BaseRN1 layer are CBR-CBR-CB modules, and the backbone in the BaseRN0 layer and the BaseRN1 layer The CBR-CBR-CB module parameters are the same, specifically: the filter size of the first CBR block is 1×1, 64 channels; the filter size of the second CBR block is 3×3, 64 channels; the third The filter size of each CB block is 1×1, 256 channels; the filter size of the branch CB block in the BaseRN0 layer is the same as the filter size of the backbone CB block, and the filter size of the branch CBR block in BaseRN1 is the same as the first one of the backbone. The filter size of each CBR block is the same; the CB block includes a volume base layer and a regularization layer connected in sequence;

步骤S103中，在所述特征提取基础网络中，特征提取层的输出特征映射以16倍的因子进行四级下采样，在空间分辨率和提取特征强度之间进行折衷,激活层Leaky-ReLU的最后输出为14×14×1024；所述特征提取网络采用四个阶段对输入图像进行2倍的降采样，实现16倍的下采样：具体如下：In step S103, in the feature extraction basic network, the output feature map of the feature extraction layer is down-sampled in four levels by a factor of 16, and a compromise is made between the spatial resolution and the extracted feature strength, and the activation layer Leaky-ReLU The final output is 14×14×1024; the feature extraction network uses four stages to downsample the input image by 2 times to achieve 16 times downsampling: the details are as follows:

第一阶段：输入尺寸大小为W×H×C的三通道RGB图像，在第一阶段输入进入一个CBR块，如图2所示；其中，W＝256为图像的宽，H＝256为图像的高，C＝3代表图像三通道；此CBR块中滤波器尺寸为1×1，64个通道，经过CBR块一次下采样，此时输出的图像尺寸大小为128×128×64；The first stage: input a three-channel RGB image with a size of W×H×C, and enter a CBR block in the first stage, as shown in Figure 2; where W=256 is the width of the image, and H=256 is the image The height of C=3 represents three channels of the image; the filter size in this CBR block is 1×1, 64 channels, after the CBR block is downsampled once, the output image size is 128×128×64;

BaseRN0层和BaseRN1层均基于残差网络的思想设计，是对ResNet残差模块的改进升级版本；BaseRN0层、BaseRN1层后均接leaky-ReLU激活函数，是一种通用处理方式；The BaseRN0 layer and the BaseRN1 layer are both designed based on the idea of the residual network, which is an improved and upgraded version of the ResNet residual module; the BaseRN0 layer and the BaseRN1 layer are connected to the leaky-ReLU activation function, which is a general processing method;

第四阶段：将第三阶段的输出输入至依次连接的1个BaseRN1层和5个BaseRN0层，通过BaseRN0、BaseRN1的不同组合进一步得到更深层的网络模型结构，输出图像尺寸大小为16×16×1024；The fourth stage: Input the output of the third stage to 1 BaseRN1 layer and 5 BaseRN0 layers connected in sequence, and further obtain a deeper network model structure through different combinations of BaseRN0 and BaseRN1, and the output image size is 16×16× 1024;

其中，在第三阶段和第四阶段，也可以根据需求，通过增加或者减少BaseRNO层和BaseRN1层来提升模型深度。Among them, in the third and fourth stages, the model depth can also be improved by adding or reducing the BaseRNO layer and the BaseRN1 layer according to the needs.

在定位预测网络中，将特征提取网络的输出结果输入定位预测网络中，通过最大池化层有效降低模型参数误差造成的特征估计均值的偏移影响，最后连接全连接层及Sigmoid层进行回归定位，通过多任务损失函数的设计实现对预测和定位的同时检测；预测是判断是否存在待检测的不良地质；定位是若存在该不良地质，对不良地质的区域范围进行定位；In the localization prediction network, the output of the feature extraction network is input into the localization prediction network, and the maximum pooling layer is used to effectively reduce the offset of the feature estimation mean caused by the model parameter error. Finally, the fully connected layer and the sigmoid layer are connected for regression positioning. , the simultaneous detection of prediction and positioning is realized through the design of multi-task loss function; prediction is to judge whether there is bad geology to be detected; positioning is to locate the area of bad geology if there is such bad geology;

步骤S103中，训练时，为实现对某一不良地质的精准定位预报，设计多任务耦合的损失函数，在定位预测网络最后通过sigmoid回归方法实现对不良地质的概率置信度判断及定位区域数据输出；具体的：In step S103, during training, in order to realize the accurate positioning prediction of a certain unfavorable geology, a multi-task coupling loss function is designed, and the sigmoid regression method is finally used in the positioning prediction network to realize the probability confidence judgment of the unfavorable geology and the data output of the location area. ;specific:

属于(0,1)；i＝1,2,…,I，I为预测框个数；L_conf(o,p)的值受预测的结果框个数和值的影响，范围不定，L_conf(o,p)越小，损失越小说明预测结果越准确；Among them, o _i ∈ {0,1} indicates whether there is a target in the i-th prediction box, 0 indicates no existence, and 1 indicates existence;

Belongs to (0,1); i=1,2,...,I, I is the number of prediction boxes; the value of L _conf (o,p) is affected by the number and value of the predicted result boxes, the range is indeterminate, L _conf The smaller the (o, p), the smaller the loss, the more accurate the prediction result;

目标定位：目标定位损失函数L_loc(μ,σ)，为提高结果定位精确度，对定位框的预测包围框Boundingbox结果(x,y,w,h)引入高斯分布建模，得到各个参数的均值

与方差

通过对均值、方差都训练得到损失最小的结果实现预测精度的提升，作为输出特征张量中定位区域的表征，这种方式不仅有区域定位，同时对定位的不确定性进行了评价，得到的结果是定位最精确的结果；Boundingbox为一个长方形框，(x,y)表示其中心点位置，(w,h)表示其大小；Target positioning: target positioning loss function L _loc (μ,σ), in order to improve the accuracy of the result positioning, the Gaussian distribution modeling is introduced into the predicted bounding box results (x, y, w, h) of the positioning box, and the results of each parameter are obtained. mean

with variance

The prediction accuracy is improved by training the mean and variance to obtain the result with the smallest loss, which is used as the representation of the positioning area in the output feature tensor. This method not only has regional positioning, but also evaluates the uncertainty of positioning. The result is The most accurate result of positioning; Boundingbox is a rectangular box, (x, y) represents the position of its center point, and (w, h) represents its size;

目标定位损失函数采用负对数似然损失(NLL,negative log likelihood loss)表征：The target localization loss function is represented by a negative log likelihood loss (NLL, negative log likelihood loss):

其中，

in,

其中，W×H是输入图像的宽、高，将输入图像分割为H行W列个网格；I是预测框个数，Among them, W×H is the width and height of the input image, and the input image is divided into grids of H rows and W columns; I is the number of prediction frames,

由于每次预测的结果是多个目标预测框，所以通过j∈[1,W]代表列，k∈[1,H]代表行索引到对应网格，i∈[1,I]代表这个网格预测的第i个预测框区域，通过i、j、k索引得到第k行第j列的第i个预测框及其基坐标，每个网格上有多个尺度的预测框来做结果逼近；Since the result of each prediction is multiple target prediction frames, j∈[1,W] represents the column, k∈[1,H] represents the row index to the corresponding grid, and i∈[1,I] represents the grid For the i-th prediction frame area of grid prediction, the i-th prediction frame and its base coordinates of the k-th row and the j-th column are obtained through the i, j, and k indices. There are multiple scales of prediction frames on each grid to make the result. approach;

γ_t是对(x,y,w,h)预测结果权重影响的超参数，是模型训练的一个超参数，随着训练逐步逼近一个稳定值，可以有效保障训练结果的准确性；G_ijk代表在H×W的坐标系下，第k行第j列像素点位置上第i个预测框的结果真实值{x,y,w,h}_truth，满足均值为μ，方差为σ的正太分布G～N(μ,σ²)；μ_t、σ_t,t∈{x,y,w,h}是模型预测结果，μ_x,μ_y通过sigmoid函数转化到(0,1)范围内，表示在当前t＝kji索引的预测框内的中心坐标，μ_w,μ_h表示以μ_x,μ_y为中心点的预测框的长宽；γ _t is a hyperparameter that affects the weight of (x, y, w, h) prediction results, and is a hyperparameter for model training. As the training gradually approaches a stable value, the accuracy of the training results can be effectively guaranteed; G _ijk represents In the H×W coordinate system, the true value {x,y,w,h} _truth of the result of the i-th prediction frame at the pixel point position of the k-th row and the j-th column satisfies a normal distribution with a mean value of μ and a variance of σ G～N(μ,σ ² ); μ _t , σ _t , t∈{x,y,w,h} are the model prediction results, μ _x , μ _y are transformed into the range of (0,1) by the sigmoid function, Represents the center coordinates in the prediction frame of the current t=kji index, μ _w , μ _h represent the length and width of the prediction frame with μ _x and μ _y as the center points;

同时用方差量σ_t,t∈{x,y,w,h}表征预测结果可靠性，0表示可靠，1表示不可靠，求解过程如下：At the same time, the variance σ _t , t∈{x,y,w,h} is used to represent the reliability of the prediction results, 0 means reliable, 1 means unreliable, and the solution process is as follows:

t₁∈{x,y}

t ₁ ∈{x,y}

t₂∈{w,h}

t ₂ ∈{w,h}

t₃∈{x,y,w,h}

t ₃ ∈{x,y,w,h}

综合得到目标定位与目标预测置信度概率的多任务损失函数，通过权重因子λ₁,λ₂耦合两个损失函数得到综合的不良地质定位预报损失函数L(o,p,μ,σ)：The multi-task loss function of target positioning and target prediction confidence probability is comprehensively obtained, and the combined two loss functions are coupled by weight factors λ ₁ , λ ₂ to obtain a comprehensive bad geological positioning prediction loss function L(o,p,μ,σ):

其中，λ₁,λ₂这两个权重因子根据具体任务需求设定，有些场景需要优先确定目标是否为异常地质情况及其概率准确性，那么λ₁权重提升；有些任务场景关注异常地质的位置预测准确性，则λ₂权重提升。Among them, the two weight factors λ ₁ , λ ₂ are set according to the specific task requirements. In some scenarios, it is necessary to prioritize whether the target is abnormal geological conditions and its probability accuracy, then the weight of λ ₁ is increased; some task scenarios focus on the location of abnormal geological conditions Prediction accuracy, the λ ₂ weight is improved.

在S101中构建的图像数据集属于小样本数据集；步骤S103中，通过迁移学习方法提升训练速度；具体步骤为：The image data set constructed in S101 belongs to the small sample data set; in step S103, the training speed is improved by the transfer learning method; the specific steps are:

S301：先基于大样本数据集A预先训练原始目标预测定位神经网络模型；得到初步训练后的目标预测定位神经网络模型A；所述大样本数据集可以使用ImageNet等；S301: first pre-train the original target prediction and positioning neural network model based on the large sample data set A; obtain the target prediction and positioning neural network model A after preliminary training; the large sample data set can use ImageNet, etc.;

S302：冻结模型A的前t层参数，将所述图像数据集中的训练数据集作为小样本数据集B对模型A进行迁移训练；得到训练后的目标预测定位神经网络模型B；其中，t属于(1,L)，为根据经验预先设定的值；S302: Freeze the parameters of the first t layers of the model A, and use the training data set in the image data set as the small sample data set B to perform migration training on the model A; obtain the trained target prediction and positioning neural network model B; wherein, t belongs to (1, L), which are preset values based on experience;

S303：采用所述图像数据集中的测试数据集对所述训练后的目标预测定位神经网络模型进行测试；并判断测试是否达到要求；若是，则将当前的目标预测定位神经网络模型作为训练好的目标预测定位神经网络模型；否则，采用所述训练数据集对所述模型B的后t至L层参数进行重新训练，根据训练数据集标签反馈训练，得到模型B的所有参数；同时可以微调模型参数，来优化结果，直到训练后的目标预测定位神经网络模型B能够满足任务要求。S303: Use the test data set in the image data set to test the trained target prediction and positioning neural network model; and determine whether the test meets the requirements; if so, use the current target prediction and positioning neural network model as the trained neural network model Target prediction and positioning neural network model; otherwise, use the training data set to retrain the parameters of the last t to L layers of the model B, and feed back the training according to the training data set label to obtain all the parameters of the model B; at the same time, the model can be fine-tuned. parameters to optimize the results until the trained target prediction and positioning neural network model B can meet the task requirements.

微调Fine-tune其实是在大样本ImageNet数据集上已经训练好了原始目标预测定位神经网络模型的所有参数，然后冻结A的前t层参数，这前t层参数不需要再训练了；然后使用构建的小样本的训练数据集，进行训练A的t-L层参数，得到t-L层的参数；根据测试评估的结果(评估的阈值可以根据任务需求的精度具体设定)，判断参数是否训练较好、模型能否满足任务要求，如果不达标则重复以上步骤进行训练，同时可以调整冻结的层数等参数，来优化结果。Fine-tuning Fine-tune actually trains all parameters of the original target prediction and positioning neural network model on the large-sample ImageNet data set, and then freezes the parameters of the first t layer of A. The parameters of the first t layer do not need to be retrained; then use The small-sample training data set constructed is used to train the parameters of the t-L layer of A to obtain the parameters of the t-L layer; according to the results of the test evaluation (the evaluation threshold can be specifically set according to the accuracy of the task requirements), it is judged whether the parameters are well trained, Whether the model can meet the task requirements, if not, repeat the above steps for training, and adjust parameters such as the number of frozen layers to optimize the results.

训练过程中需要有训练数据集和测试数据集，训练过程是重复N次epoch来不断的更新参数，直至结果最优，每次训练过程中，先通过训练数据集训练得到模型参数，然后通过测试数据集评估该组参数，然后进行下次epoch的训练，直到完成设定的N次epoch训练或者评估结果满足预先设置的要求The training process requires a training data set and a test data set. The training process is to repeat N epochs to continuously update the parameters until the result is optimal. In each training process, the model parameters are obtained by training the training data set first, and then pass the test. The data set evaluates the set of parameters, and then performs the training of the next epoch until the set N epoch training is completed or the evaluation results meet the preset requirements

在实际应用中，输入某物探法得到的图像结果数据至所述训练好的目标预测定位神经网络模型中，自动输出是否存在欲预报的不良地质概率置信度及其所在区域，降低对专业人员及专家经验的依赖，提高工作效率。In practical applications, input the image result data obtained by a geophysical method into the trained neural network model for target prediction and positioning, and automatically output whether there is a bad geological probability confidence level to be predicted and its location, reducing the need for professionals and professionals. Rely on expert experience to improve work efficiency.

本发明的有益效果包括：The beneficial effects of the present invention include:

1)构建了地质超前预报数据集，数据集是深度学习及网络模型训练的基础；1) Constructed a geological advance forecast data set, which is the basis for deep learning and network model training;

2)基于残差网络思想，设计了超高深度的深层特征提取网络模型，能够通过模型的深度激活检测地质超前预报图像中隐性、抽象的信息，完成相应不良地质特征的提取；2) Based on the residual network idea, an ultra-high-depth deep feature extraction network model is designed, which can detect the recessive and abstract information in the geological advance forecast image through the deep activation of the model, and complete the extraction of the corresponding bad geological features;

3)在特征提取完成后，通过设计多任务损失函数实现某不良地质类型概率置信度判断和区域定位；3) After the feature extraction is completed, a multi-task loss function is designed to realize the probability confidence judgment and regional positioning of a certain unfavorable geological type;

4)该方法可以随着图像数据集的不断收集扩充而不断升级进化，提升性能及检测精度、范围等等。4) The method can be continuously upgraded and evolved with the continuous collection and expansion of image data sets, improving performance, detection accuracy, range, and so on.

本申请所提出的技术方案可以准确地预报隧道等地下工程建设过程中所通过范围内的不良地质体的位置规模和性质状态，为工程设计及施工管理部分提供决策依据，降低现有地质预报物探法解释性低、依赖专家经验、预测准确率不高的问题，提升工程施工的安全性。The technical solution proposed in this application can accurately predict the location, scale and nature of unfavorable geological bodies within the scope of the construction of tunnels and other underground projects, provide decision-making basis for engineering design and construction management, and reduce existing geological prediction and geophysical exploration. It can improve the safety of engineering construction by solving the problems of low interpretability of the method, relying on expert experience, and low prediction accuracy.

以上所述仅为本发明的较佳实施例，并不用以限制本发明，凡在本发明的精神和原则之内，所作的任何修改、等同替换、改进等，均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection of the present invention. within the range.

Claims

1. A bad geological positioning forecasting method based on a convolutional neural network is characterized by comprising the following steps: the method comprises the following steps:

s101: establishing a geophysical prospecting image data set;

s102: constructing a target prediction positioning neural network model based on feature extraction;

s103: training the target prediction positioning neural network model by adopting the image data set to obtain a trained target prediction positioning neural network model;

s104: and inputting image result data obtained by a geophysical prospecting method into the trained target prediction positioning neural network model to perform actual unfavorable geological positioning prediction.

2. The method for forecasting unfavorable geologic positioning based on convolutional neural network as defined in claim 1, wherein: in the step S101, a geophysical prospecting image data set is established; the method specifically comprises the following steps:

s201: establishing a geophysical prospecting method detection result preliminary image data set through advanced geological forecast case collection or actual field project data acquisition; the preliminary image data set comprises a plurality of result images obtained by performing advanced geological forecast based on a geophysical prospecting method;

s202: forecasting abnormal geology on each image in the preliminary image data set according to a water-rich fractured zone forecasting mechanism and the imaging characteristics of the water-rich fractured zone forecasting mechanism;

s203: determining the position of a water-rich broken band on each image in the preliminary image data set again by adopting an expert experience method in combination with a forecast conclusion, and performing data annotation on each image in the preliminary image data set by taking a water-rich broken band positioning area as label content to obtain an annotated preliminary image data set;

s204: and adopting an image data augmentation method to perform capacity expansion augmentation on the marked preliminary image data set, and coupling a partial geophysical prospecting method to detect the sample data of the water-rich broken belt to obtain a final image data set.

3. The method for forecasting the unfavorable geologic localization of a convolutional neural network as claimed in claim 2, wherein: in step S201, the geophysical prospecting method includes TSP, geological radar, and transient electromagnetism; the preliminary image dataset has a sample capacity greater than 500 images.

4. The method for forecasting unfavorable geologic positioning based on convolutional neural network as defined in claim 1, wherein: in step S102, the target prediction positioning neural network model based on feature extraction includes a feature extraction basic network and a positioning prediction network that are connected in sequence;

wherein, the characteristic extraction basic network comprises the following connected in sequence: CBR block, max pooling layer, BaseRN1 layer, 2 BaseRN0 layers, BaseRN1 layers, 3 BaseRN0 layers, BaseRN1 layers, 5 BaseRN0 layers;

the positioning prediction network comprises a maximum pooling layer, a full connection layer and a Sigmoid layer which are sequentially connected.

5. The method for forecasting unfavorable geologic positioning based on convolutional neural network as defined in claim 4, wherein: the CBR block comprises a volume base layer, a regularization layer and an activation layer which are sequentially connected; wherein C represents a convolution layer, B represents a regularization layer Batch Norm, R represents an activation layer, and a Leaky-ReLU activation function is adopted.

6. The method for forecasting unfavorable geologic positioning based on a convolutional neural network as claimed in claim 5, wherein: the BaseRN1 layer and the BaseRN0 layer are designed based on a residual error network concept, and a residual error item is introduced, so that a depth network model is conveniently constructed; wherein, the trunks in the BaseRN0 layer and the BaseRN1 layer are both CBR-CBR-CB modules, and the parameters of the trunks CBR-CBR-CB modules in the BaseRN0 layer and the BaseRN1 layer are consistent, and the concrete steps are as follows: the filter size of the first CBR block is 1 × 1, 64 channels; the filter size of the second CBR block is 3 × 3, 64 channels; the filter size of the third CB block is 1 × 1, 256 channels; the filter size of the branched CB block in the BaseRN0 layer is consistent with the CB block filter size of the trunk, and the filter size of the branched CBR block in the BaseRN1 is consistent with the filter size of the first CBR block of the trunk; the CB block comprises a volume base layer and a regularization layer which are connected in sequence.

7. The method for forecasting unfavorable geologic positioning based on convolutional neural network as defined in claim 4, wherein: in step S103, in the feature extraction basic network, the output feature mapping of the feature extraction layer is subjected to four-level down-sampling by a factor of 16 times, a tradeoff is made between spatial resolution and extracted feature strength, and the final output of the active layer leak-ReLU is 14 × 14 × 1024; the feature extraction network performs 2-time down-sampling on the input image in four stages to realize 16-time down-sampling: the method comprises the following specific steps:

the first stage is as follows: inputting a three-channel RGB image with the size of W multiplied by H multiplied by C, and inputting the three-channel RGB image into a CBR block at a first stage; w-256 is the width of the image, H-256 is the height of the image, and C-3 represents three channels of the image; the filter size in the CBR block is 1 multiplied by 1, 64 channels are sampled by the CBR block for one time, and the size of the output image is 128 multiplied by 64;

and a second stage: first pass through a maximum pooling layer of 3 × 3 and stride 2 with a filter, then pass through a BaseRN1 layer and 2 BaseRN0 layers; the size of the image output after the second stage is 64 × 64 × 256;

and a third stage: inputting the output of the second stage to 1 BaseRN1 layer and 3 BaseRN0 layers which are connected in sequence, obtaining a deeper network model structure through the combination of the BaseRN0 layer and the BaseRN1 layer, and enabling the size of an output graph to be 32 multiplied by 512;

a fourth stage: the output of the third stage is input to 1 BaseRN1 layer and 5 BaseRN0 layers connected in sequence, a further deeper network model structure is obtained by the combination of BaseRN0 and BaseRN1, and the size of the output image is 16 × 16 × 1024.

8. The method for forecasting unfavorable geologic positioning based on convolutional neural network as defined in claim 4, wherein: in the positioning prediction network, the output result of the feature extraction network is input into the positioning prediction network, the offset influence of the feature estimation mean value caused by the parameter error of the model is effectively reduced through the maximum pooling layer, finally, the full-connection layer and the Sigmoid layer are connected for regression positioning, and the simultaneous detection of prediction and positioning is realized through the design of a multi-task loss function.

9. The method for forecasting unfavorable geologic positioning based on a convolutional neural network as claimed in claim 8, wherein: in the step S103, during training, in order to realize accurate positioning forecast of a certain unfavorable geology, a multitask coupling loss function is designed, and finally probability confidence judgment and positioning area data output of the unfavorable geology are realized through a sigmoid regression method in a positioning prediction network; specifically, the method comprises the following steps:

the design of the loss function aims at two tasks of judging the prediction confidence coefficient of the unfavorable geological target and positioning the target, and the target loss function is respectively and correspondingly constructed, wherein:

target prediction confidence: target prediction confidence loss function L_conf(o, p) representing the probability that the predicted target region is the unfavorable geology to be monitored, and adopting binary cross entropy loss characterization:

wherein o is_iE {0,1} represents whether a target exists in the ith prediction box, 0 represents absence and 1 represents existence;

representing the probability of whether the target exists in the ith prediction box, and locating the result p output by the neural network model through predicting the target_iThe sigmoid is evaluated to obtain the result,

belongs to (0, 1); i is 1,2, …, and I is the number of prediction frames; l is_confThe value of (o, p) is influenced by the number and value of the prediction frames, the range is not fixed, L_confThe smaller the (o, p), the smaller the loss, the more accurate the prediction result;

target positioning: target location loss function L_loc(mu, sigma), in order to improve the result positioning accuracy, Gaussian distribution modeling is introduced to the prediction bounding box result (x, y, w, h) of the positioning box to obtain the mean value and the variance of each parameter; the bounding box is a rectangular box, (x, y) represents the center position of the box, and (w, h) represents the size of the box;

the target positioning loss function is characterized by adopting negative log likelihood loss:

wherein,

w multiplied by H is the width and the height of an input image, and the input image is divided into H rows and W columns of grids; i is the number of the prediction frames, and the ith prediction frame and the base coordinate of the jth row and the jth column of the kth line are obtained by indexing I, j and k; gamma ray_tIs a hyper-parameter affecting the (x, y, w, h) prediction result weight; g_ijkRepresenting the real value { x, y, W, H } of the result of the ith prediction box at the position of the jth row pixel point on the kth line under the H multiplied by W coordinate system_truthA positive distribution G to N (mu, sigma) satisfying a mean value of mu and a variance of sigma²)；μ_t,σ_tAnd t ∈ { x, y, w, h } is the model prediction result, passing sigmThe oid function translates to a (0,1) range, representing the center coordinate, μ, within the current t ═ i, j, k indexed prediction box_w,μ_hIs expressed in μ_x,μ_yThe length and width of a rectangular frame of the prediction region which is a central point; using the square error amount sigma simultaneously_tThe t belongs to { x, y, w, h } representation prediction result reliability, 0 represents reliable, and 1 represents unreliable;

synthesis L_conf(o, p) and L_loc(mu, sigma) to obtain a multitask loss function by a weighting factor lambda₁,λ₂And coupling the two loss functions to obtain a comprehensive bad geology positioning forecast loss function L (o, p, mu, sigma):

L(o,p,μ,σ)＝λ₁L_conf(o,p)+λ₂L_loc(μ,σ)

wherein λ is₁,λ₂These two weighting factors are set according to the specific task requirements.

10. The method for forecasting unfavorable geologic positioning based on convolutional neural network as defined in claim 1, wherein: the image dataset constructed in S101 belongs to a small sample dataset; in step S103, the target prediction positioning neural network model is trained by a transfer learning method to increase the training speed.