The invention content is as follows:
the invention aims to determine the energy storage output under different loads and different amounts of generated energy when new energy generation is unstable, thereby enhancing the peak-clipping and valley-filling effect, reducing waste such as wind curtailment, and improving economy.
The invention specifically adopts the following technical scheme:
a VAE-CGAN-based energy storage output prediction method is characterized by comprising the following steps:
step 1: collecting historical operation data of the energy storage device, wherein the historical operation data comprise battery voltage, battery ampere-hour capacity, discharge rate, battery operating voltage, the proportion of conventional energy generation and the proportion of new energy generation;
step 2: directly inputting the historical operation data acquired in step 1 into a VAE model for training to generate data characteristic information, namely generating new data that contains the information of the historical operation data;
step 3: constructing a generator and a discriminator based on the CGAN, wherein the generator is used for generating simulation sample data and the discriminator is used for discriminating the sources of the simulation sample data and the real sample data;
step 4: inputting the characteristic information obtained in step 2 as condition information into the generator and the discriminator, and performing adversarial training on the generator and the discriminator, namely, the discriminator receives the simulation sample data and the real sample data acquired in step 1 and judges the sources of the two kinds of data;
step 5: updating the parameters of the generator and the discriminator, namely optimizing the generator and the discriminator;
step 6: adding 1 to the number of iterations and returning to step 5 until the discriminator cannot distinguish the source of the sample data, and then outputting the trained generator;
step 7: acquiring real-time operation data of the energy storage device, inputting the real-time operation data into the generator output in step 6, and predicting the energy storage output of the energy storage device.
The invention further adopts the following preferred technical scheme:
the step 2 comprises the following steps:
step 201: constructing an encoder by adopting a layer of LSTM neural network; a decoder is constructed by adopting a layer of LSTM neural network and a layer of full connection layer;
step 202: the encoder receives a first high-dimensional data sequence X = [x1, x2, … xn] of the historical operation data, and then maps the high-dimensional data sequence X into a mean vector μ of length m and a standard deviation vector σ of length m;
step 203: according to the mean vector μ, the standard deviation vector σ and a parameter sequence δ = [δ1, δ2, … δm], the encoder calculates a hidden variable sequence Z = [z1, z2, … zm] by the following formula;
z_i = μ_i + δ_i·exp(σ_i)
wherein z_i is the ith value in the hidden variable sequence Z, μ_i is the ith value in the mean vector μ, σ_i is the ith value in the standard deviation vector σ, and δ_i is a parameter obtained by random sampling and obeying the standard normal distribution, i.e. δ_i ~ N(0,1), i = 1, 2, …, m;
step 204: inputting the hidden variable sequence Z calculated in step 203 into the decoder, and restoring the hidden variable sequence Z into a second high-dimensional data sequence X', wherein the second high-dimensional data sequence X' is the characteristic information sequence.
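For illustration, a minimal Python sketch of the sampling in step 203 is given below; it assumes NumPy is available, and the variable names (mu, sigma, delta) are illustrative only and not part of the claimed method.

import numpy as np

def sample_hidden_sequence(mu, sigma, rng=np.random.default_rng()):
    # Reparameterization as in step 203: z_i = mu_i + delta_i * exp(sigma_i),
    # with delta_i drawn from the standard normal distribution N(0, 1).
    mu = np.asarray(mu, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    delta = rng.standard_normal(mu.shape)
    return mu + delta * np.exp(sigma)

# Example with m = 4 hidden variables (the numbers are arbitrary)
z = sample_hidden_sequence(mu=[0.1, -0.2, 0.0, 0.5], sigma=[-1.0, -0.5, 0.0, -2.0])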
In step 2, the following objective function is adopted for training:
L = max loss = ∫ q(z|x) log P(x|z) dz − KL(q(z|x) || P(z))
wherein the first term L1 is the self-encoding reconstruction error and the second term L2 is the KL divergence; P(x|z) is the prior distribution of the hidden variable z and represents the decoder in the VAE, and q(z|x) is the posterior distribution of the value z in the hidden variable sequence derived from any value x in the real data sequence and represents the encoder in the VAE;
when the self-encoding reconstruction error takes its maximum value and the KL divergence takes its minimum value, the training is finished and the characteristic information is obtained.
In step 2, the battery voltage, the battery ampere-hour capacity and the discharge rate are trained with the same VAE model;
the proportion of conventional energy generation and the proportion of new energy generation are each trained with a separate VAE model.
In step 4, random noise information z_noise is set and input into the generator together with the condition information; the generator uses the nonlinear mapping capability of the neural network to map the noise information z_noise and the condition information Y into simulation sample data G(z_noise|y).
In step 4, the generator generating the simulation sample comprises the following steps:
step 401: when the length of the input random noise sequence Z_noise is n and the length of the condition information sequence Y is m, the generator establishes a fully connected layer for the random noise sequence Z_noise and a fully connected layer for the condition information sequence Y, the numbers of neurons of which are consistent with the lengths of the random noise sequence Z_noise and the condition information sequence Y respectively;
step 402: the data of the fully connected layer is corrected such that its mean value is approximately 0 and its variance is approximately 1, i.e., towards a standard normal distribution N (0, 1).
Step 403: random noise Z to be correctednoiseSplicing the correction sequence with the condition information Y correction sequence to form a spliced sequence with the length of (n + m)/10;
step 404: multiplying the same appearance probability p by (n + m)/10 neurons in the spliced sequence with the length of (n + m)/10 obtained in the previous layer to be 0.5;
step 405: in step 404, half of the neurons are temporarily deleted in the current generation, and the undeleted neurons are output from the generator as the simulation sample data.
In step 4, the discriminator discriminating the source of the input data comprises the following steps:
step 406: the discriminator receives the condition information Y, the real sample data sequence X and the simulation sample data sequence generated by the generator;
step 407: the discriminator generates a first hidden layer for each of the condition information sequence Y, the real sample data sequence X and the simulation sample data sequence, wherein the number of neurons in the hidden layer is i, and a first weight matrix W_maxout1 is established; the values of the neurons in the first hidden layer are calculated by the following formula;
t'_i = w_i1×t_1 + w_i2×t_2 + … + w_in×t_n
wherein w_i1 is the 1st element of the ith row of the weight matrix W_maxout1, t'_i is the value of the ith neuron in the first hidden layer, and t_i is the ith data of the input sequence.
Step 408: dividing neurons in a first hidden layer of a condition information sequence Y, a first hidden layer of a real sample data sequence X and a first hidden layer of a simulation sample data sequence into s groups, selecting the neuron with the largest value from each group of the groups as the output of the first hidden layer, and generating a condition information neural network layer, a real sample data neural network layer and a simulation sample data neural network layer, wherein the neural network layer contains s neurons;
step 409: splicing the generation condition information neural network layer, the real sample data neural network layer and the simulation sample data neural network layer to obtain a first neural network layer, wherein the first neural network layer is provided with 3s neurons;
step 410: setting the neuron deletion probability to be 0.5, and processing the first neural network layer in the step 409 to obtain a second neural network layer with random length;
step 411: establishing a second weight matrix W_maxout2, mapping the second neural network layer by the following formula to obtain a second hidden layer, selecting the neuron with the maximum value in the second hidden layer, mapping it into the interval (0,1), and taking the mapped data as the output result of the discriminator;
q'_i = w'_i1×q_1 + w'_i2×q_2 + … + w'_in×q_n
wherein w'_i1 is the 1st element of the ith row of the weight matrix W_maxout2, q'_i is the value of the ith neuron in the second hidden layer, and q_j is the jth data of the input second neural network layer.
In step 411, when the output result of the discriminator is 0.5, the discriminator cannot determine the source of the received data sample; otherwise, the discriminator judges that the received data sample originates from the generator.
In step 5, the constraint condition (value function) of the CGAN model is:
min_G max_D V(D,G) = E_{x~Pdata(x)}[log D(x|y)] + E_{z~Pz(z)}[log(1 − D(G(z_noise|y)))]
wherein G(z_noise|y) is the sample data generated by the generator according to the condition information y and the noise information z_noise; D(G(z_noise|y)) is the output of the discriminator; the mathematical expectation of log D(x|y) is replaced by the mean over the generated sample data and the real sample data; z is a random number and Pz(z) is the probability distribution function of the random number z; E_{z~Pz(z)}[·] denotes the mathematical expectation when z obeys the probability distribution Pz(z); Pdata(x) is the probability distribution of the input real sample data; P denotes a probability distribution; V(D,G) represents the game function between the generator and the discriminator.
In said step 5, the generator is optimized by the following formula:
wherein G is the constraint condition of the generator, E denotes the mathematical expectation, Pdata(x|y) is the probability that x comes from the real sample data under the conditional constraint of y, and PG(x|y) is the probability that x comes from the sample data generated by the generator under the conditional constraint of y;
when the formula takes its minimum value, the generator is considered to be optimized.
In said step 5, the discriminator is optimized by the following formula:
wherein D is a discriminator constraint condition;
the discriminator optimization is considered to be completed when the formula takes the maximum value.
When the output result of the discriminator is 0.5, the game between the discriminator and the generator is considered to be balanced, the training is finished, and the trained generator is output;
the invention has the following beneficial effects:
the energy storage output is predicted by using the VAE-CGAN model, so that not only can condition information under a historical scene be modeled, but also unknown condition information can be dealt with, the condition information hidden in an input sequence is trained through the VAE model, the CGAN model of a continuous condition input space is constructed, an optimal generator is obtained through confrontation, and the accuracy of energy storage output prediction is guaranteed. Therefore, the optimal output of the stored energy under the condition of large working condition change can be determined more comprehensively and accurately.
Detailed Description
The following detailed description of the embodiments of the invention is provided in connection with the accompanying drawings.
As shown in fig. 1, the energy storage output prediction method based on VAE-CGAN of the present invention specifically includes the following steps:
step 1: the method comprises the steps of collecting battery voltage, battery ampere hours, discharge rate C and other data such as the power generation ratio of traditional energy and new energy to serve as input sequences of a VAE model, collecting different input sequences to serve as input of the VAE for extracting features according to different systems used by an energy storage battery in actual conditions, wherein the feature sequences with strong correlation such as the battery voltage, the battery ampere hours and the discharge rate C form a multi-element time sequence, analyzing the multi-element time sequence by using the same VAE, and training the data without correlation such as the power generation ratio of the traditional energy and the new energy by using a single VAE. Here, the VAE model input sequence is represented by X ═ X1, X2, … xn ].
Step 2: training the various types of data acquired in step 1 with the VAE model to obtain a characteristic information sequence.
Specifically, as shown in fig. 2, step 2 includes the following steps:
step 201: adopting a layer of LSTM (Long short-term memory) neural network to construct an encoder; a decoder is constructed using a layer of LSTM neural network and a layer of fully-connected layers.
Step 202: the encoder receives a first high-dimensional data sequence X ═ X of historical operating data1,x2,…xn]Then, mapping the high-dimensional data sequence X of the historical operating data into a mean vector mu with the length of m and a standard deviation vector sigma with the length of m.
Step 203: the encoder calculates the mean vector mu, the standard deviation vector sigma and the parameter sequence delta [ delta 1, delta 2, … delta m ═ m]And calculating a hidden variable sequence Z ═ Z by the following formula1,z2,…zm]。
zi=μi+δi·exp(σi) (1)
Wherein z isiIs the ith value, mu, in the hidden variable sequence ZiIs the ith value, σ, in the mean vector μiFor the ith value in the standard deviation vector σ, δ i is a parameter obtained by random sampling in a sample set that obeys a bernoulli distribution, and obeys a standard normal distribution, i.e., δ i to N (0,1), i ═ 1,2, …, m.
Step 204: inputting the hidden variable sequence Z calculated in step 203 into the decoder and restoring it into a second high-dimensional data sequence X' = [x1', x2', … xn'], which has the same dimension as the VAE model input sequence; finally the fully connected layer reconstructs the high-dimensional data feature X' to obtain the generated m-dimensional characteristic information sequence Y.
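A minimal PyTorch sketch of the encoder/decoder structure of steps 201-204 is given below; PyTorch is assumed to be available, and the layer sizes (input_dim, hidden_dim, latent_dim) are illustrative choices rather than values fixed by the invention. Repeating the latent vector over the sequence length so that the decoder LSTM can unroll it back to the original length is one possible reading of the decoding step.

import torch
import torch.nn as nn

class LSTMVAE(nn.Module):
    # One-layer LSTM encoder; one-layer LSTM decoder plus a fully connected layer (steps 201-204).
    def __init__(self, input_dim=3, hidden_dim=32, latent_dim=8):
        super().__init__()
        self.encoder_lstm = nn.LSTM(input_dim, hidden_dim, batch_first=True)
        self.to_mu = nn.Linear(hidden_dim, latent_dim)        # mean vector mu (length m)
        self.to_sigma = nn.Linear(hidden_dim, latent_dim)     # sigma, with exp(sigma) the standard deviation
        self.decoder_lstm = nn.LSTM(latent_dim, hidden_dim, batch_first=True)
        self.decoder_fc = nn.Linear(hidden_dim, input_dim)    # reconstructs the sequence X'

    def encode(self, x):
        _, (h, _) = self.encoder_lstm(x)       # final hidden state of the encoder LSTM
        h = h[-1]
        return self.to_mu(h), self.to_sigma(h)

    def reparameterize(self, mu, sigma):
        delta = torch.randn_like(mu)           # delta_i ~ N(0, 1)
        return mu + delta * torch.exp(sigma)   # z_i = mu_i + delta_i * exp(sigma_i), formula (1)

    def decode(self, z, seq_len):
        z_seq = z.unsqueeze(1).repeat(1, seq_len, 1)   # repeat z at every time step
        out, _ = self.decoder_lstm(z_seq)
        return self.decoder_fc(out)                    # second high-dimensional sequence X'

    def forward(self, x):                              # x: (batch, time steps, features)
        mu, sigma = self.encode(x)
        z = self.reparameterize(mu, sigma)
        return self.decode(z, x.size(1)), mu, sigma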
Specifically, the total constraint L in the VAE model training process is calculated and expressed by the following formula;
L=max loss=∫q(z|x)log P(x|z)dz-KL(q(z|x)||P(z)) (2)
the former part of the objective function is the self-encoding reconstruction error, also called the variational lower bound; taking its maximum value means that when X is used as the input of the encoder, the m hidden variables z_i extracted by the encoder allow the decoder to recover X from Z with the maximum probability.
The self-encoding reconstruction error L1 is calculated by the following equation:
L1 = ∫ q(z|x) log P(x|z) dz    (3)
in the continuous optimization of the encoder, the self-encoding reconstruction error is continuously increased by changing parameters in the LSTM neural network.
The second part is the KL divergence; when it takes its minimum value, σ_i = 0 and μ_i = 0, and it follows from formula (1) that z_i obeys the standard normal distribution, so q(z|x) = P(z) and the KL divergence is zero. The KL divergence L2 is calculated by the following formula:
L2 = KL(q(z|x) || P(z))    (4)
P(x|z) is the prior distribution of the hidden variable z and represents the decoder in the VAE, while q(z|x) is the posterior distribution of z derived from x and represents the encoder in the VAE. The VAE model seeks to fit these two distributions as closely as possible; P(z) in the VAE model is a Gaussian distribution with mean 0 and variance 1. When q(z|x) and P(z|x) are adjusted to be completely consistent, the KL divergence vanishes to 0 and the variational lower bound Lb coincides with log P(x). Therefore, whatever the value of log P(x), Lb can be made equal to it by adjustment; since Lb is the lower bound of log P(x), maximizing log P(x) is equivalent to maximizing Lb, i.e. maximizing the self-encoding reconstruction error, and an optimal encoder model is obtained. At this point the trained VAE model can generalize the data characteristics of the input data and generate a characteristic sequence that serves as the condition information sequence used in the CGAN model.
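A short sketch of the total constraint L = L1 − L2 (reconstruction term maximized, KL divergence minimized) under the usual Gaussian-VAE reading is shown below; the closed-form KL term assumes P(z) = N(0, 1) as stated above, and the squared-error reconstruction likelihood is an illustrative choice, not the only admissible one.

import torch

def vae_objective(x, x_recon, mu, sigma):
    # L = L1 - L2, to be maximized (equivalently, -L is minimized with an optimizer such as Adam).
    # L1: self-encoding reconstruction term, here the negative squared reconstruction error.
    # L2: KL(q(z|x) || N(0,1)) in closed form for a diagonal Gaussian q, where exp(sigma) is the std.
    l1 = -((x_recon - x) ** 2).sum(dim=(1, 2))
    var = torch.exp(2.0 * sigma)
    l2 = 0.5 * (mu ** 2 + var - 2.0 * sigma - 1.0).sum(dim=1)
    return (l1 - l2).mean()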
Step 3: constructing a generator and a discriminator based on the CGAN, wherein the generator is used for generating simulation sample data and the discriminator is used for discriminating the sources of the simulation sample data and the real sample data.
Step 4: inputting the characteristic information obtained in step 2 as condition information into the generator and the discriminator, and performing adversarial training on the generator and the discriminator, that is, the discriminator receives the simulation sample data and the real sample data acquired in step 1 and judges the sources of the two kinds of data.
In addition, random noise information z_noise is set and input into the generator together with the condition information; the generator uses the nonlinear mapping capability of the neural network to map the noise information z_noise and the condition information Y into the simulation sample data G(z_noise|y).
Specifically, as shown in fig. 3-4, the generation of the simulation sample by the generator comprises the following steps:
Step 401: when the length of the input random noise sequence Z_noise is n and the length of the condition information sequence Y is m, the generator establishes a fully connected layer for the random noise sequence Z_noise and a fully connected layer for the condition information sequence Y, the numbers of neurons of which are consistent with the lengths of the random noise sequence Z_noise and the condition information sequence Y respectively; the fully connected layers aggregate and classify the features of the input sequences without changing their lengths.
Step 402: the data of the fully connected layer is corrected such that its mean value is approximately 0 and its variance is approximately 1, i.e., towards a standard normal distribution N (0, 1). Preferably, in the present invention, the data of the full-link layer is corrected using the Batch-Normalization algorithm.
Step 403: the corrected random noise sequence Z_noise and the corrected condition information sequence Y are spliced to form a spliced sequence of length (n+m)/10. In one embodiment of the invention, the length of the input random noise sequence Z_noise is 200 and the length of the condition information sequence Y is 1000, so a spliced sequence of length 120 is obtained in this step.
Step 404: the neurons in the stitched sequence obtained in step 403 are multiplied by the same probability of occurrence p, which is 0.5. Preferably, the concatenation sequence is processed using the dropout algorithm at this step 404.
Step 405: in step 404, about half of the neurons are temporarily deleted in this generation. Therefore, only part of input sequences participate in the generation in the process of generating the sample data by the generator, and the return value only adjusts the neurons participating in the generation, so that the over-fitting phenomenon of the generator can be avoided. The remaining undeleted neurons are output from the generator and input to the discriminator as generation sample data.
Specifically, as shown in fig. 3-4, the step of identifying the source of the data sample by the discriminator specifically comprises the following steps:
step 406: the discriminator receives the condition information Y, the real sample data sequence X and the simulation sample data sequence generated by the generator;
Step 407: the discriminator generates a first hidden layer for each of the condition information sequence Y, the real sample data sequence X and the simulation sample data sequence, wherein the number of neurons in the hidden layer is i. Preferably, in this step the maxout algorithm is adopted to map each data sequence; specifically, a weight matrix W_maxout1 is set and the values of the hidden-layer neurons are calculated by the following formula:
t'_i = w_i1×t_1 + w_i2×t_2 + … + w_in×t_n    (5)
wherein w_i1 is the 1st element of the ith row of the weight matrix W_maxout1, t'_i is the value of the ith neuron in the first hidden layer, and t_i is the ith data of the input sequence.
Step 408: dividing neurons in a first hidden layer of a condition information sequence Y, a first hidden layer of a real sample data sequence X and a first hidden layer of a simulation sample data sequence into s groups, selecting the neuron with the largest value from each group of the groups as the output of the first hidden layer, and generating a condition information neural network layer, a real sample data neural network layer and a simulation sample data neural network layer, wherein the neural network layer contains s neurons. Preferably, in the present invention, the neurons in the first hidden layer are divided into 5 groups, thereby obtaining a neural network layer containing 5 neurons.
Step 409: and splicing the generation condition information neural network layer, the real sample data neural network layer and the simulation sample data neural network layer to obtain a first neural network layer, wherein the first neural network layer is provided with 3s neurons. Preferably, a neural network layer containing 15 neurons is obtained in one embodiment of the present invention.
Step 410: setting the neuron deletion probability to 0.5, and processing the first neural network layer of step 409 to obtain a second neural network layer with a random length. Specifically, in one embodiment of the present invention, the first neural network layer is processed using the dropout algorithm with the neuron deletion probability p = 0.5, that is, in one optimization pass each of the 15 neurons has a 50% probability of being temporarily deleted; the deleted neurons are restored and the deletion is repeated with probability p in the next optimization pass. A neural network layer of random length is thus obtained.
Step 411: mapping the second neural network layer by the following formula to obtain a second hidden layer, selecting the neuron with the maximum value in the second hidden layer, mapping it into the interval (0,1), and taking the mapped data as the output result of the discriminator. That is, when the output result of the discriminator is 0.5, the discriminator cannot judge the source of the received data sample; otherwise, the discriminator judges that the received data sample originates from the generator.
q'_i = w'_i1×q_1 + w'_i2×q_2 + … + w'_in×q_n    (6)
wherein w'_i1 is the 1st element of the ith row of the weight matrix W_maxout2, q'_i is the value of the ith neuron in the second hidden layer, and q_j is the jth data of the input second neural network layer.
Preferably, in step 411, the maxout algorithm is used to map the second neural network layer; and mapping the neurons of the second hidden layer by adopting a sigmoid function.
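A possible PyTorch sketch of the discriminator of steps 406-411 is shown below. The maxout grouping (s = 5 groups per branch), the dropout with p = 0.5 and the final sigmoid follow the embodiment; for a runnable example the common conditional form D(x|y) is used, i.e. a condition branch plus a single sample branch that receives either the real sequence or the simulation sequence, which simplifies the described splicing of three branches. Hidden sizes and sequence lengths are illustrative assumptions.

import torch
import torch.nn as nn

class MaxoutBranch(nn.Module):
    # One branch: first hidden layer (weight matrix W_maxout1) followed by maxout over s groups.
    def __init__(self, in_len, hidden=25, groups=5):
        super().__init__()
        self.fc = nn.Linear(in_len, hidden)    # t'_i = w_i1*t_1 + ... + w_in*t_n
        self.groups = groups

    def forward(self, x):
        h = self.fc(x)
        h = h.view(x.size(0), self.groups, -1) # divide the neurons into s groups
        return h.max(dim=2).values             # keep the largest neuron in each group

class CGANDiscriminator(nn.Module):
    def __init__(self, cond_len=1000, sample_len=120, groups=5):
        super().__init__()
        self.branch_y = MaxoutBranch(cond_len, groups=groups)    # condition information branch
        self.branch_x = MaxoutBranch(sample_len, groups=groups)  # sample branch (real or simulated)
        self.dropout = nn.Dropout(p=0.5)                         # step 410: delete neurons with p = 0.5
        self.fc2 = nn.Linear(2 * groups, groups)                 # second weight matrix W_maxout2 (step 411)

    def forward(self, y_cond, x_sample):
        layer1 = torch.cat([self.branch_y(y_cond),
                            self.branch_x(x_sample)], dim=1)     # spliced neural network layer
        hidden2 = self.fc2(self.dropout(layer1))                 # q'_i = w'_i1*q_1 + ... + w'_in*q_n
        best = hidden2.max(dim=1).values                         # neuron with the maximum value
        return torch.sigmoid(best)                               # mapped into (0, 1); 0.5 = undecidable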
Step 5: the parameters of the generator and the discriminator are updated, i.e. the generator and the discriminator are optimized. Specifically, after the generator completes one generation, the neural network calculates a loss function of the generated data and back-propagates it to modify the parameters of the generator. Once the generator has adjusted its parameters, the discriminator judges the generated sample sequence; when the output of the discriminator is 0.5, the discriminator cannot tell that the data come from the generator. When the discriminator output is not equal to 0.5, the generator continues the above generation and parameter-adjustment process until the discriminator cannot distinguish the source of the sample sequence generated by the generator. At that point the discriminator considers the sample sequence generated by the generator to come from the real data with probability 50%, and judges it to come from the generator with probability 50%, i.e. the game between the discriminator and the generator reaches an equilibrium state; the obtained generator model parameters can then be regarded as an optimal generator, i.e. the generator model for energy storage output prediction is established.
The constraint condition (value function) of the CGAN model is:
min_G max_D V(D,G) = E_{x~Pdata(x)}[log D(x|y)] + E_{z~Pz(z)}[log(1 − D(G(z_noise|y)))]
wherein G(z_noise|y) is the sample data generated by the generator according to the condition information y and the noise information z_noise; D(G(z_noise|y)) is the output of the discriminator; the mathematical expectation of log D(x|y) is replaced by the mean over the generated sample data and the real sample data; z is a random number and Pz(z) is the probability distribution function of the random number z; E_{z~Pz(z)}[·] denotes the mathematical expectation when z obeys the probability distribution Pz(z); Pdata(x) is the probability distribution of the input real sample data; P denotes a probability distribution; V(D,G) represents the game function between the generator and the discriminator.
In step 5, the generator and the discriminator are optimized by two formulas, the generator constraint condition (equation (7)) and the discriminator constraint condition (equation (8)), respectively.
According to their respective constraint conditions, the generator and the discriminator in the CGAN model continuously adjust their model parameters so that the constraint function of the generator is minimized and that of the discriminator is maximized; with this aim, the generator parameters are optimized through back-propagation of the neural network, the generator parameters comprising the mapping functions of the fully connected layers in the generator, the values on the neurons, the number of neurons, and so on.
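To make the alternating optimization of step 5 concrete, the following sketch shows one adversarial update round built on the illustrative generator and discriminator classes above. The binary-cross-entropy formulation of the value function and the use of the Adam optimizer are common practical choices and are assumptions of this sketch, not requirements of the invention.

import torch
import torch.nn.functional as F

def train_step(gen, disc, y_cond, x_real, opt_g, opt_d, noise_len=200):
    batch = y_cond.size(0)
    z = torch.randn(batch, noise_len)                      # random noise z_noise

    # Discriminator update: maximize V(D, G), i.e. push D(x|y) towards 1 and D(G(z|y)) towards 0.
    with torch.no_grad():
        x_fake = gen(z, y_cond)
    d_real = disc(y_cond, x_real)
    d_fake = disc(y_cond, x_fake)
    loss_d = F.binary_cross_entropy(d_real, torch.ones_like(d_real)) \
           + F.binary_cross_entropy(d_fake, torch.zeros_like(d_fake))
    opt_d.zero_grad()
    loss_d.backward()
    opt_d.step()

    # Generator update: minimize its constraint, i.e. make D(G(z|y)) approach 1.
    x_fake = gen(z, y_cond)
    d_fake = disc(y_cond, x_fake)
    loss_g = F.binary_cross_entropy(d_fake, torch.ones_like(d_fake))
    opt_g.zero_grad()
    loss_g.backward()
    opt_g.step()

    # When d_fake stays near 0.5, the game is considered balanced (step 6).
    return loss_d.item(), loss_g.item(), d_fake.mean().item()

# Typical (hypothetical) setup: opt_g = torch.optim.Adam(gen.parameters(), lr=1e-4),
# and similarly opt_d for the discriminator.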
Step 6: the number of iterations is increased by 1 and the process returns to step 5 until the discriminator cannot distinguish the source of the sample data. Namely, when the output result of the discriminator is 0.5, the game between the discriminator and the generator is considered to be balanced, the training is finished, so that a stronger generator is obtained, and the generator is output as an output prediction model of the energy storage device.
Step 7: acquiring real-time operation data of the energy storage device, inputting the real-time operation data into the generator output in step 6, and predicting the energy storage output of the energy storage device.
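As a usage illustration of step 7 (assuming a generator trained as sketched above and real-time measurements preprocessed into a condition sequence in the same way as the historical data), the prediction reduces to a single forward pass:

import torch

def predict_storage_output(trained_gen, realtime_condition, noise_len=200):
    # Feed real-time operation data, encoded as the condition sequence, through the trained generator.
    trained_gen.eval()                          # disable dropout for prediction
    with torch.no_grad():
        z = torch.randn(realtime_condition.size(0), noise_len)
        return trained_gen(z, realtime_condition)

# Hypothetical call: one real-time condition sequence of length 1000
# prediction = predict_storage_output(gen, torch.randn(1, 1000))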
Predicting the energy storage output with the VAE-CGAN model not only allows condition information in historical scenarios to be modeled but also allows unknown condition information to be handled: the condition information hidden in the input sequence is learned by the VAE model, a CGAN model with a continuous condition input space is constructed, and an optimal generator is obtained through adversarial training, which guarantees the accuracy of the energy storage output prediction. Therefore, the optimal output of the stored energy in working environments with strong uncertainty can be determined more comprehensively and accurately.
While the best mode for carrying out the invention has been described in detail and illustrated in the accompanying drawings, it is to be understood that the same is by way of illustration and example only and is not to be taken by way of limitation, the scope of the invention should be determined by the appended claims and any changes or modifications which fall within the true spirit and scope of the invention should be construed as broadly described herein.