+

CN115641630A - A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration - Google Patents

A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration Download PDF

Info

Publication number
CN115641630A
CN115641630A CN202211196489.5A CN202211196489A CN115641630A CN 115641630 A CN115641630 A CN 115641630A CN 202211196489 A CN202211196489 A CN 202211196489A CN 115641630 A CN115641630 A CN 115641630A
Authority
CN
China
Prior art keywords
hypergraph
representing
matrix
image
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211196489.5A
Other languages
Chinese (zh)
Inventor
樊肖锦
祝烈煌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN202211196489.5A priority Critical patent/CN115641630A/en
Publication of CN115641630A publication Critical patent/CN115641630A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

The invention relates to a hypergraph and multi-task cooperation based small-sample multi-pose face recognition method, and belongs to the technical field of artificial intelligence face recognition. The invention utilizes hypergraph and non-negative matrix decomposition to obtain an image similar to a front image, and designs a multi-pose face recognition framework based on hypergraph deflexion. The frame first separates the non-attitude deflected images. On the basis, a feature coding method based on improved support vector description is provided, features of the image without posture deflection are extracted, and joint optimization is carried out on the features and a classifier based on dictionary learning, so that the features are extracted and classified. The feature coding method utilizes improved support vector data description and triangular coding to make the extracted features more discriminative. Meanwhile, an effective feature extraction and feature classification optimization model is established, a solution closer to global optimum is easy to obtain, and the recognition performance is improved.

Description

Small sample multi-pose face recognition method based on hypergraph and multi-task cooperation
Technical Field
The invention relates to a face recognition method, in particular to a small sample multi-pose face recognition method based on hypergraph and multi-task cooperation, and belongs to the technical field of artificial intelligence face recognition.
Background
As a new identity recognition technology, face recognition is rapidly developed under the wave of artificial intelligence, and becomes a popular research field in recent years. The multi-pose face recognition mainly researches the influence of face angle change on the face recognition effect. How to solve the influence of the gesture on the face recognition, improve the multi-gesture face recognition effect and be beneficial to applying the face recognition technology to more identity recognition scenes.
The existing method based on posture correction mainly converts multi-posture human faces into front human faces. Currently, methods for fitting a frontal face include two types: one is an affine transformation method, which transforms a multi-pose face into a frontal face, but the transformation relation is simple, but face classification feature information is lost, and the recognition effect is not ideal. The other method is a virtual fusion method, which fuses a plurality of pieces of multi-pose face image information to construct a front face image, performs pose correction to a certain extent, and incorporates classification errors into a target function, thereby realizing the face classification capability.
Although the method based on the posture correction achieves better results in processing the face recognition and some scenes, the problem of recognizing the face image of the multi-posture thumbnail can be better solved. However, in practical applications, these multi-pose homogeneous images pose a great challenge to these methods because it is difficult for the pose deflection images to reveal the relationship between multiple samples.
In summary, the existing multi-pose face recognition method based on small samples has the defects and limitations. First, image features without attitude deflection are not easily extracted. Secondly, the extracted features are not highly discriminatory. Therefore, the conventional method cannot reveal the relationship between samples of the same kind, and it is difficult to learn the dictionary efficiently.
Disclosure of Invention
The invention aims to overcome the defects and shortcomings in the prior art, and creatively provides a small sample multi-pose face recognition method based on hypergraph and multi-task cooperation for effectively solving the technical problem of multi-pose small sample face recognition.
Inspired by non-negative matrix decomposition, each image related to posture change can be decomposed through the non-negative matrix decomposition, one matrix obtained by decomposition is used as an image without posture deflection, and the other matrix is used as a posture change matrix. And finally, obtaining an image without posture deviation through multiple iterative decomposition. Compared with other single-resolution image identification methods, the methods can better deal with the identification problem of multi-resolution images. The hypergraph may represent a complex relationship between samples. And (4) inspiring by the hypergraph, regarding each image as a node in the hypergraph, performing non-negative matrix decomposition on the hypergraph formed by a plurality of images, and extracting the image with better performance and no posture deflection.
On the basis, the invention provides a small sample multi-pose face recognition method based on hypergraph and multi-task cooperation for the first time.
The innovation points of the invention are as follows: a hypergraph-based deflexion and multi-task collaborative optimization method is adopted, and an image similar to a front image is obtained by utilizing a hypergraph and non-negative matrix factorization. On the basis, a feature coding method based on improved support vector description is provided, and is combined with a classifier based on dictionary learning to be optimized for feature extraction and feature classification.
(1) A multi-pose face recognition framework based on hypergraph deflexion is provided. The framework firstly separates the non-attitude deflection images, then extracts the features of the non-attitude deflection images by using the proposed feature coding method based on the improved support vector data description, and identifies the extracted features.
(2) A feature coding method based on improved support vector description is provided. The feature coding method utilizes improved support vector data description and triangular coding to make the extracted features more discriminative.
(3) An effective feature extraction and feature classification optimization model is established, so that a solution closer to global optimum is easy to obtain, and the identification performance of the algorithm is improved.
Advantageous effects
Compared with the prior art, the invention has the following advantages:
1. the invention introduces the hypergraph and embeds the hypergraph into a non-negative matrix decomposition process to more comprehensively disclose the relationship among samples of the same type.
2. The invention designs an improved feature coding method for supporting vector data description, and improves the identification of extracted features.
Drawings
FIG. 1 is a schematic diagram of hypergraph deflexion and multitask co-optimization in the present invention.
Fig. 2 is a schematic view of a flow chart of a face recognition process in the invention.
Fig. 3 is a schematic diagram of a process of extracting a front image.
Fig. 4 is a schematic diagram of encoding.
Detailed Description
The present invention is further described below with reference to examples in order to make the technical contents clearer and easier to understand. The described embodiments are merely representative of the invention, rather than complete. The invention belongs to the protection scope based on the embodiment of the invention.
A small sample multi-pose face recognition method based on hypergraph and multi-task cooperation comprises the steps of image orthogonalization, feature extraction, feature classification and joint optimization, and specifically comprises the following steps:
step 1: and carrying out image obverse processing.
As shown in fig. 1, feature discrimination enhancement is performed based on non-negative matrix factorization and hypergraph embedding. Comprises the following steps.
Step 1.1: non-negative matrix factorization is performed.
Given any non-negative matrix X 0 Are all decomposed into two non-negative matrices
Figure BDA0003870742510000031
P T The sum, as follows:
Figure BDA0003870742510000032
wherein,
Figure BDA0003870742510000033
as a basis matrix, the matrix is,
Figure BDA0003870742510000034
is a submatrix, F represents a Frobenius norm, T represents a matrix transpose,
Figure BDA0003870742510000035
representing dimension symbols, m representing a sample X 0 N represents the number of samples of the source data, and r represents the number of samples of the target data. s.t. is used to prompt that a constraint is followed.
Then, update
Figure BDA0003870742510000036
And P T
Figure BDA0003870742510000037
Wherein X represents the source data, P represents the transformation matrix, ij represents the ijth iteration, and jk represents the jk iteration.
Step 1.2: and (6) carrying out hypergraph embedding.
The hypergraph G is an ordered binary group G = (V, e). V is a non-empty set of nodes/vertices, called a set of vertices. e is a cluster of non-empty subsets whose elements are called super-edges.
Unlike the normal graph, each edge of the hypergraph not only connects two vertices, but also can connect more vertices.
Given a hypergraph G = (V, e), V = { V = { V) 1 ,v 2 ,…,v k Is a finite set of data points, where v is i (i =1,2, …, k) is a vertex. e.g. of the type j Is a super edge, and the super edge set e satisfies the following conditions:
Figure BDA0003870742510000041
e 1 ∪e 2 ∪e 3 …∪e t =V
where t represents the super edge number.
Each super edge e i All have a corresponding weight w j The vertex hyper-edges will form a correlation matrix
Figure BDA0003870742510000042
The elements in the matrix are calculated using the following formula:
Figure BDA0003870742510000043
degree d of each vertex in hypergraph G i Defined as the sum of the weights of the super edges to which it belongs. Degree of overcrowding ρ i The number of nodes defined as the number of the nodes to which the super edge belongs is calculated as follows:
Figure BDA0003870742510000044
wherein w j Representing the super-edge weight.
Let D v Is a diagonal matrix whose main diagonal elements are
Figure BDA0003870742510000048
Let D e And W is the sum of j And w j The diagonal matrices generated, j =1,2, …, t, respectively.
Then, the regularized hypergraph laplacian matrix L is calculated by the following equation H
Figure BDA0003870742510000045
Wherein,
Figure BDA0003870742510000046
representing a diagonal matrix D e The inverse of (c).
Further, the present invention proposes a new feature encoding method capable of obtaining features of each image with almost no attitude deflection, which features have good category resolution. The method comprises the following specific steps:
representing a given data set as
Figure BDA0003870742510000047
Each column in Y represents an image sample.
First, for each image, noise in the image is removed (gaussian filter removal may be applied).
Then, whether the pixel of each image is negative is checked, the negative value is assigned to be 0, and the positive value retains the original value to obtain Y W ,Y W Representing a preprocessed image set.
Then, construct Y W Of the regularized hypergraph laplacian matrix L H . Let the number of hypergraph edges be t, the number of hypergraphs be N, N = t, and the number of vertices included in each hypergraph edge be s. The vertex included in each super edge is represented by Y n W Generated by itself and its nearest s-1 neighbor, Y n W Represents Y W Column n.
In particular, w j The calculation method of (2) is as follows:
Figure BDA0003870742510000051
Figure BDA0003870742510000052
where exp () represents an exponential function.
To obtain Y W And L H The objective function is then as follows:
Figure BDA0003870742510000053
wherein,
Figure BDA0003870742510000054
represents Y W Is not negative, the error produced. Tr (P) T L H P) regular terms representing a hypergraph capable of protecting local dataAnd the performance of the algorithm is improved by the structure. Because the problem is difficult to solve, an iterative solution method is directly adopted to solve the problem. The lagrange function Δ corresponding to the above equation is:
Figure BDA0003870742510000055
wherein Ψ is Ψ mk Correspond to
Figure BDA0003870742510000056
Of lagrange multipliers of [ phi ], [ mk Representing the mk th iteration of Ψ. Phi is from phi nk Corresponds to P mk Matrices of lagrange multipliers of > 0, phi nk Represents the nk iteration of Φ; λ denotes the coefficient and Tr denotes the trace of the matrix.
For ease of calculation, Δ of the above equation is rewritten as:
Figure BDA0003870742510000057
wherein,
Figure BDA0003870742510000061
Tr(B)=Tr(B T ). B denotes an exemplary matrix.
By taking Δ and
Figure BDA0003870742510000062
the partial derivative of (a) yields:
Figure BDA0003870742510000063
Figure BDA0003870742510000064
according to KKT conditions (Karush-Kuhn-Tucker conditions, carrocon-Kuen-Take conditions)
Figure BDA0003870742510000065
And phi nk P nk =0, yielding:
Figure BDA0003870742510000066
Figure BDA0003870742510000067
in the above equation, the subscript of each variable indicates the number of iterations of the variable.
Figure BDA0003870742510000068
And P nk Updating is carried out by the following method:
Figure BDA0003870742510000069
Figure BDA00038707425100000610
wherein,
Figure BDA00038707425100000611
representing the multiplication of the elements of the two matrices. Output the output
Figure BDA00038707425100000612
Is a collection of images that are geometrically without a pose deflection. D v Representing a diagonal matrix.
Fig. 3 shows a process of extracting a near-frontal image from an image relating to a change in posture. F denotes the original image set with attitude deflection, Y W Representing the set of images obtained after preprocessing Y,
Figure BDA00038707425100000613
an image set representing an approximate frontal image obtained by decomposition and iteration, and P represents a pose transformation matrix. In FIG. 3Firstly, preprocessing each image in the original image set to obtain a non-negative image set without noise pollution. The hypergraph is then embedded in a non-negative matrix factorization to preserve the structure of the decomposed image. Finally, through matrix decomposition and iterative updating, an image set with almost no deflection is obtained.
Step 2: and (5) feature extraction. Comprises the following steps.
Step 2.1: feature coding based on improved support vector data description.
First, the present invention proposes an improved support vector data description with which to obtain the sphere center and radius of each cluster. Then, feature encoding is performed using the radius and center of the sphere corresponding to each cluster.
The existing support vector data description considers that each data point plays the same role in calculating the radius of each cluster, but this does not coincide with the actual situation. Therefore, the invention assigns a learned weight to each data in model learning and proposes an improved support vector data description, as follows:
Figure BDA0003870742510000071
wherein r represents the radius of the sphere, y i Denotes the ith sample, ρ (y) i ) Denotes y i B represents the center of the sphere, num represents the number of samples, χ i The value of the relaxation variable is represented by,
Figure BDA0003870742510000079
are parameters.
For the convenience of solving, the above equation is written in the form of lagrange function:
Figure BDA0003870742510000072
wherein,
Figure BDA0003870742510000073
representing the Lagrangian function, alpha, beta representing the corresponding Lagrangian factors, alpha i 、β i The ith elements representing alpha and beta, respectively.
Order to
Figure BDA0003870742510000074
And
Figure BDA0003870742510000075
obtaining:
Figure BDA0003870742510000076
wherein Q = (C)<x i ,x j >) num×num ,Ω=(<x i ,x j >) num×1 ,e=(1,1,1,…,1) T ,x i 、x j Is to remove the ith and jth samples of the attitude deflection. α = [ α = 1 ,α 2 ,…,α num ]α is obtained using a linear algorithm and T represents transposition.
r represents a vector composed of the radii of the SVDD clustering sphere, and is obtained by the following formula:
Figure BDA0003870742510000077
where γ is the set of support vectors, i.e. the sample points used in the above equation are support vectors. Whether a sample point is a support vector or not, the condition needs to be satisfied: if the sample point y i Is a support vector, then it corresponds to alpha i Is non-zero. r = [ r ] 1 ,r 2 ,…,r C ]And C is the number of clusters in the data set.
Step 2.2: and carrying out triangular coding.
For each image with its attitude deflection removed, decomposing it into
Figure BDA0003870742510000078
Blocks, each block being encoded。
Specifically, for an image q with attitude deflection removed, it is decomposed into
Figure BDA0003870742510000086
And (5) blocking.
For any block q j
Figure BDA0003870742510000081
Representing the number of decomposed blocks included in the image, is coded as U (q) j ):
U(q j )=[U 1 (q j ) U 2 (q j )…U C (q j )] T
Wherein, U i (q j )=[U i,1 (q j ) U i,2 (q j )],i=1,2,…,C,
Figure BDA0003870742510000082
U i,1 (q j ) And U i,2 (q j ) All are obtained by triangular coding.
U i,1 (q j )=max{0,d(s)-s i (q j )},s i (q j )=||q j -o i || 2 Denotes a symbol from q j To o i A distance of o i Denotes the center of the SVDD sphere formed by the jth cluster, and d(s) is all s i (q j ) Is measured. U shape i,2 (q j )=max{0,A(m)-m i (q j )},
Figure BDA0003870742510000083
Is all m i Is measured.
Fig. 4 shows a schematic diagram of the encoding. q. q.s j Representing the jth block of image q, image q being divided into
Figure BDA0003870742510000084
And (5) blocking. O is i Representing the center of the SVDD sphere formed by the ith cluster, clustering a plurality of sample points into a cluster, r i Is shown asRadius, s, of i clustered SVDD spheres i (q j ) Denotes q j And O i The distance between them. O is j The center of the SVDD sphere formed in the jth cluster is shown. r is j Is the radius of the SVDD sphere formed for the jth cluster. s j (q j ) Denotes q j To O j The distance between them.
Thus, the image is encoded as F q The expression is as follows:
F q =[(U(q 1 )) T (U(q 2 )) T …(U(q N )) T ] T
and step 3: and (5) classifying the features. Comprises the following steps.
Through the operations of deflexion removal and feature coding in the steps 1 and 2, the influence of the posture change on the face recognition is greatly reduced. In order to further improve the recognition rate of the algorithm, the dictionary is learned, the learned dictionary is used for representing the test samples, and the classes of the test samples are determined according to the representation residual errors.
Specifically, the model of the classifier based on dictionary learning is as follows:
Figure BDA0003870742510000085
where X is a training sample, D is a learned dictionary, Z is a representation coefficient, D i Represents the ith atom in D. F represents the Frobenius norm.
And 4, step 4: and (5) a joint optimization stage. Comprises the following steps.
In order to obtain a global optimal solution of HDMCO (HDMCO, a short term for representing a proposed algorithm), feature extraction and feature classification are jointly optimized as follows:
Figure BDA0003870742510000091
thus, α, D and Z were obtained. Z represents a coefficient.
Wherein α is derived from the formula:
Figure BDA0003870742510000092
the value of alpha is obtained by using a linear algorithm.
D is obtained from the following formula:
Figure BDA0003870742510000093
the above transformation is solved as follows:
Figure BDA0003870742510000094
wherein J represents an intermediate matrix, θ represents a coefficient, v i Representing the ith atom of the matrix V.
D is obtained by iteratively solving the variables in the above equation.
D is obtained by solving the following formula:
Figure BDA0003870742510000095
solutions of the above formula are as follows:
Figure BDA0003870742510000101
where shrnk (x, a) = signmax (| x | -a, 0), x represents an example variable. η represents a coefficient.
And 5: and predicting a classification result by using a classifier obtained by learning, classifying the sample, and realizing face recognition.

Claims (2)

1. A small sample multi-pose face recognition method based on hypergraph and multi-task cooperation is characterized by comprising image orthogonalization, feature extraction, feature classification and joint optimization;
step 1: image surface treatment;
based on nonnegative matrix decomposition and hypergraph embedding, the method carries out feature discrimination enhancement and comprises the following steps;
step 1.1: performing non-negative matrix factorization;
given any non-negative matrix X 0 Are all decomposed into two non-negative matrices
Figure FDA0003870742500000011
P T The sum, as follows:
Figure FDA0003870742500000012
wherein,
Figure FDA0003870742500000013
as a basis matrix, the matrix is,
Figure FDA0003870742500000014
is a submatrix, F represents a Frobenius norm, T represents a matrix transpose,
Figure FDA0003870742500000015
representing dimension symbols, m representing a sample X 0 N represents the number of samples of the source data, r represents the number of samples of the target data; s.t. for prompting that a constraint is followed;
then, update
Figure FDA0003870742500000016
And P T
Figure FDA0003870742500000017
Wherein X represents source data, P represents a transformation matrix, ij represents the ijth iteration, and jk represents the jk iteration;
step 1.2: carrying out hypergraph embedding;
hypergraph G is an ordered binary group G = (V, e); v is a non-empty set with nodes/vertices as elements, called a vertex set; e is a cluster of non-empty subsets whose elements are called super-edges;
each edge of the hypergraph not only connects two vertexes, but also can connect more vertexes;
given a hypergraph G = (V, e), V = { V = { V) 1 ,v 2 ,…,v k Is a finite set of data points, where v is i (i =1,2, …, k) is a vertex; e.g. of the type j Is a super edge, and the super edge set e satisfies the following conditions:
Figure FDA0003870742500000018
e 1 ∪e 2 ∪e 3 …∪e t =V
wherein t represents a super edge number;
each super edge e i All have a corresponding weight w j The vertex hyper-edges will form a correlation matrix
Figure FDA0003870742500000021
The elements in the matrix are calculated using the following formula:
Figure FDA0003870742500000022
degree d of each vertex in hypergraph G i Defining as the sum of the weights of the super edges to which it belongs; degree of overcrowding ρ i The number of nodes defined as the number of the nodes to which the super edge belongs is calculated as follows:
Figure FDA0003870742500000023
wherein, w j Representing a super-edge weight;
let D v Is a diagonal matrix whose main diagonal elements are
Figure FDA0003870742500000024
Let D e And W is the sum of j And w j The diagonal matrices, j =1,2, …, t, generated respectively;
then, the regularized hypergraph laplacian matrix L is calculated by the following equation H
Figure FDA0003870742500000025
Wherein,
Figure FDA0003870742500000026
representing a diagonal matrix D e The inverse of (1);
step 2: extracting characteristics;
step 2.1: feature encoding based on the improved support vector data description;
using a support vector data description to obtain the sphere center and radius of each cluster; then, carrying out feature coding by utilizing the radius and the center of the ball corresponding to each cluster;
assigning a learned weight to each data in model learning, and proposing a support vector data description, which is as follows:
Figure FDA0003870742500000027
wherein r represents the radius of the sphere, y i Denotes the ith sample, ρ (y) i ) Denotes y i B represents the center of the sphere, num represents the number of samples, χ i The value of the relaxation variable is represented by,
Figure FDA0003870742500000028
is a parameter;
the above equation is written in the form of a lagrange function:
Figure FDA0003870742500000031
wherein,
Figure FDA0003870742500000032
representing the Lagrangian function, alpha, beta representing the corresponding Lagrangian factors, alpha i 、β i The ith element representing α and β, respectively;
order to
Figure FDA0003870742500000033
And
Figure FDA0003870742500000034
obtaining:
Figure FDA0003870742500000035
wherein Q = (< x) i ,x j >) num×num ,Ω=(<x i ,x j >) num×1 ,e=(1,1,1,…,1) T ,x i 、x j Is the ith and jth samples from which attitude deflection is removed; α = [ α = 12 ,…,α num ]Alpha is obtained by using a linear algorithm, and T represents transposition;
r represents a vector consisting of the radii of the SVDD cluster sphere, and is obtained by the following formula:
Figure FDA0003870742500000036
wherein γ is a set of support vectors, i.e., the sample points used in the above formula are support vectors; whether a sample point is a support vector or not, the condition needs to be satisfied: if the sample point y i Is a support vector, then it corresponds to alpha i Is non-zero; r = [ r ] 1 ,r 2 ,…,r C ]C is a cluster in the data setThe number of (2);
step 2.2: carrying out triangular coding;
for each image with its attitude deflection removed, decomposing it into
Figure FDA0003870742500000037
Blocks, each block being encoded;
for an image q with its attitude deflection removed, it is decomposed into
Figure FDA0003870742500000038
A block; for any block q j
Figure FDA0003870742500000039
Figure FDA00038707425000000310
Representing the number of decomposed blocks included in the image, is encoded as U (q) j ):
U(q j )=[U 1 (q j ) U 2 (q j )…U C (q j )] T
Wherein,
Figure FDA00038707425000000311
U i,1 (q j ) And U i,2 (q j ) All are obtained by triangular coding;
U i,1 (q j )=max{0,d(s)-s i (q j )},s i (q j )=||q j -o i || 2 denotes a symbol from q j To o i A distance of o i Denotes the center of the SVDD sphere formed by the jth cluster, and d(s) is all s i (q j ) The mean value of (a); u shape i,2 (q j )=max{0,A(m)-m i (q j )},
Figure FDA0003870742500000041
Figure FDA0003870742500000042
Is all m i The mean value of (a);
wherein q is j Representing the jth block of image q, image q being divided into
Figure FDA0003870742500000043
A block; o is i Representing the center of the SVDD sphere formed by the ith cluster, clustering a plurality of sample points into a cluster, r i Denotes the radius, s, of the SVDD sphere formed by the ith cluster i (q j ) Denotes q j And O i The distance between them; o is j Represents the center of the SVDD sphere formed by the jth cluster; r is j Is the radius of the SVDD sphere formed in cluster j; s j (q j ) Represents q j To O j The distance between them;
the image is coded as F q The expression is as follows:
Figure FDA0003870742500000044
and step 3: classifying the characteristics;
representing the test sample by learning the dictionary and using the learned dictionary, determining a category of the test sample from the representation residuals;
the model of the classifier based on dictionary learning is as follows:
Figure FDA0003870742500000045
where X is a training sample, D is a learned dictionary, Z is a representation coefficient, D i Represents the ith atom in D; f represents a Frobenius norm;
and 4, step 4: a joint optimization stage;
in order to obtain a global optimal solution of the algorithm HDMCO, feature extraction and feature classification are optimized in a combined mode, and the method specifically comprises the following steps:
Figure FDA0003870742500000046
thereby obtaining α, D and Z; z represents a coefficient;
wherein α is derived from the formula:
Figure FDA0003870742500000047
Figure FDA0003870742500000051
obtaining the value of alpha by utilizing a linear algorithm;
d is obtained from the following formula:
Figure FDA0003870742500000052
the above transformation is solved as follows:
Figure FDA0003870742500000053
wherein J represents an intermediate matrix, θ represents a coefficient, v i The ith atom representing the matrix V;
d is obtained by iteratively solving the variables in the above formula;
d is obtained by solving the following formula:
Figure FDA0003870742500000054
solutions of the above formula are as follows:
Figure FDA0003870742500000055
wherein shrink (x, a) = signmax (| x | -a, 0), x represents an exemplified variable; η represents a coefficient;
and 5: and predicting a classification result by using a classifier obtained by learning, classifying the sample, and realizing face recognition.
2. The method for recognizing the multi-pose face with the small sample based on the hypergraph and the multi-task cooperation as claimed in claim 1, wherein in the step 1, a feature coding method is adopted to obtain the features of each image with almost no pose deflection, and the features have good category resolution, which is as follows:
representing a given data set as
Figure FDA0003870742500000056
Each column in Y represents an image sample;
firstly, removing noise in each image;
then, whether the pixel of each image is negative is checked, the negative value is assigned to be 0, and the positive value retains the original value to obtain Y W ,Y W Representing a pre-processed image set;
then, construct Y W Is represented by the regularized hypergraph laplacian matrix L H (ii) a Setting the number of the edges of the hypergraph as t, the number of the hypergraphs as N, N = t, and the number of vertexes contained in each hypergraph as s; the vertex included in each super edge is represented by Y n W Generated by itself and its nearest s-1 neighbor, Y n W Represents Y W The nth column of (1);
w j the calculation method of (2) is as follows:
Figure FDA0003870742500000061
Figure FDA0003870742500000062
wherein exp () represents an exponential function;
to obtain Y W And L H The objective function is then as follows:
Figure FDA0003870742500000063
wherein,
Figure FDA0003870742500000064
Figure FDA0003870742500000065
represents Y W Error resulting from the non-negative decomposition of (d); tr (P) T L H P) a regularization term representing a hypergraph;
by adopting an iterative solution method, the lagrangian function delta corresponding to the above formula is:
Figure FDA0003870742500000066
wherein Ψ is Ψ mk Correspond to
Figure FDA0003870742500000067
Of lagrange multipliers of [ phi ], [ mk The mk th iteration representing Ψ; phi is from phi nk Corresponds to P mk Matrices composed of lagrange multipliers no less than 0, phi nk Represents the nk iteration of Φ; λ represents a coefficient, tr represents a trace of the matrix;
rewriting Δ of the above formula as:
Figure FDA0003870742500000068
wherein,
Figure FDA0003870742500000069
b represents an exemplary matrix;
by respectivelyTaking a sum of
Figure FDA0003870742500000071
The partial derivative of (a) yields:
Figure FDA0003870742500000072
Figure FDA0003870742500000073
according to the KKT condition
Figure FDA0003870742500000074
And phi nk P nk =0, yielding:
Figure FDA0003870742500000075
Figure FDA0003870742500000076
in the above formula, the subscript of each variable represents the number of iterations of the variable;
Figure FDA0003870742500000077
and P nk Updating is carried out by the following method:
Figure FDA0003870742500000078
Figure FDA0003870742500000079
wherein,
Figure FDA00038707425000000710
representing the multiplication of elements of two matrices; output of
Figure FDA00038707425000000711
Is a set of images that are geometrically without pose deflection; d v Representing a diagonal matrix.
CN202211196489.5A 2022-09-29 2022-09-29 A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration Pending CN115641630A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211196489.5A CN115641630A (en) 2022-09-29 2022-09-29 A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211196489.5A CN115641630A (en) 2022-09-29 2022-09-29 A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration

Publications (1)

Publication Number Publication Date
CN115641630A true CN115641630A (en) 2023-01-24

Family

ID=84942888

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211196489.5A Pending CN115641630A (en) 2022-09-29 2022-09-29 A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration

Country Status (1)

Country Link
CN (1) CN115641630A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060008814A (en) * 2004-07-24 2006-01-27 삼성전자주식회사 How to recognize face images with various poses
US20200057809A1 (en) * 2018-08-14 2020-02-20 Hrl Laboratories, Llc Hypergraph-based method for segmenting and clustering customer observables for vehicles
CN110895700A (en) * 2018-09-12 2020-03-20 北京京东尚科信息技术有限公司 Image recognition method and system
CN111126169A (en) * 2019-12-03 2020-05-08 重庆邮电大学 Face recognition method and system based on orthogonalization of graph-regularized non-negative matrix factorization
CN112418041A (en) * 2020-11-16 2021-02-26 武汉大学 Multi-pose face recognition method based on face orthogonalization

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20060008814A (en) * 2004-07-24 2006-01-27 삼성전자주식회사 How to recognize face images with various poses
US20200057809A1 (en) * 2018-08-14 2020-02-20 Hrl Laboratories, Llc Hypergraph-based method for segmenting and clustering customer observables for vehicles
CN110895700A (en) * 2018-09-12 2020-03-20 北京京东尚科信息技术有限公司 Image recognition method and system
CN111126169A (en) * 2019-12-03 2020-05-08 重庆邮电大学 Face recognition method and system based on orthogonalization of graph-regularized non-negative matrix factorization
CN112418041A (en) * 2020-11-16 2021-02-26 武汉大学 Multi-pose face recognition method based on face orthogonalization

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李向利;贾梦雪;: "基于预处理的超图非负矩阵分解算法", 计算机科学, no. 07, 31 July 2020 (2020-07-31), pages 76 - 82 *

Similar Documents

Publication Publication Date Title
Bu et al. Hyperspectral and multispectral image fusion via graph Laplacian-guided coupled tensor decomposition
CN107609541B (en) Human body posture estimation method based on deformable convolution neural network
CN109359608B (en) A face recognition method based on deep learning model
CN112307995B (en) Semi-supervised pedestrian re-identification method based on feature decoupling learning
Dambreville et al. A framework for image segmentation using shape models and kernel space shape priors
WO2021227366A1 (en) Method for automatically and accurately detecting plurality of small targets
CN114492581B (en) A method based on transfer learning and attention mechanism meta-learning applied to small sample image classification
CN111666873A (en) Training method, recognition method and system based on multitask deep learning network
Bose et al. Light weight structure texture feature analysis for character recognition using progressive stochastic learning algorithm
CN114863151B (en) Image dimension reduction clustering method based on fuzzy theory
CN113111814A (en) Regularization constraint-based semi-supervised pedestrian re-identification method and device
Huang et al. Age classification with deep learning face representation
CN107346327A (en) The zero sample Hash picture retrieval method based on supervision transfer
CN112348001B (en) Training method, recognition method, device, equipment and medium for expression recognition model
CN111126169B (en) Face recognition method and system based on orthogonalization graph regular nonnegative matrix factorization
CN106650744A (en) Image object co-segmentation method guided by local shape migration
CN111062928A (en) Method for identifying lesion in medical CT image
CN113421318A (en) Font style migration method and system based on multitask generation countermeasure network
CN114049541B (en) Visual scene recognition method based on structured information feature decoupling and knowledge transfer
CN113313153B (en) Low-rank NMF image clustering method and system based on adaptive graph regularization
CN111832391B (en) Image Dimensionality Reduction Method and Image Recognition Method Based on Low Rank Discriminant Embedding Method Based on Truncated Kernel Norm
CN106529604A (en) Adaptive image tag robust prediction method and system
CN115601648A (en) Open set image classification method based on cluster analysis
CN112257600A (en) Face recognition method and system
CN115641630A (en) A small-sample multi-pose face recognition method based on hypergraph and multi-task collaboration

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载