WO2024224504A1 - Verification device, verification method, and program

Info

Publication number: WO2024224504A1
Application number: PCT/JP2023/016420
Authority: WO
Inventors: 直人桐淵; 奈実芦澤; 亮平鈴木
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2023-04-26
Filing date: 2023-04-26
Publication date: 2024-10-31
Anticipated expiration: 2025-10-26

Abstract

Provided is a verification device capable of verifying the sameness of an AI that continuously learns and changes. This verification device includes: an acquisition unit which acquires a first output data group obtained when input data is given to a machine learning model group considered to be the same, and second output data obtained when the input data is given to a machine learning model to be verified; and a verification unit which uses the first output data group and the second output data to verify, by statistical hypothesis testing, whether or not the machine learning model to be verified is the same as the machine learning model group.

Description

Verification device, verification method, and program

The present invention relates to a technology for verifying the identity of multiple AIs (hereinafter also referred to as machine learning models).

When using an AI provided by a third party, there is a risk that the AI has been spoofed or tampered with, so there is a demand to confirm that the AI actually in use is the same as the intended AI.

Digital signatures (see Non-Patent Document 1) are known as a conventional technology for verifying identity.

Author: Eiichiro Fujisaki, "Knowledge Base Group 1, Part 3, Chapter 6, Digital Signature", [online], 2010, Institute of Electronics, Information and Communication Engineers, [Retrieved April 19, 2023], Internet <URL: https://www.ieice-hbkb.org/files/01/01gun_03hen_06.pdf>

Some AIs continuously learn and change in order to adapt to the external environment or to improve their accuracy. The technology of Non-Patent Document 1 can verify the identity of AIs that are completely identical, but for an AI that continuously learns and changes, it cannot determine that the AI before the change and the AI after the change are the same AI. For such an AI, there is a demand to judge the AI before and after a change to be the same AI when the change stays within the range expected by the creator or user of the AI, and to judge it not to be the same AI when erroneous learning, spoofing, or tampering has changed it beyond the expected range. The problem is that, precisely because such an AI changes, it is difficult to notice whether a change beyond the expected range has occurred.

As before, there is also a demand to detect possible spoofing of or tampering with an AI. Because an AI that continuously learns and changes is expected to change, spoofing or tampering by a third party is difficult to notice.

It is therefore desirable to verify, for an AI that continuously learns and changes, whether it has changed only within the expected range, and whether it is the AI that the user or creator expects (that is, whether it has not been spoofed or tampered with).

An object of the present invention is to provide a verification device, a verification method, and a program that can verify the identity even of an AI that continuously learns and changes.

In order to solve the above problem, according to one aspect of the present invention, a verification device includes an acquisition unit that acquires a first output data group obtained when input data is provided to a group of machine learning models considered to be identical, and second output data obtained when the input data is provided to a machine learning model to be verified, and a verification unit that uses the first output data group and the second output data to verify, by statistical hypothesis testing, whether the machine learning model to be verified is identical to the group of machine learning models.

The present invention has the effect of making it possible to verify the identity even of an AI that continuously learns and changes.

FIG. 1 is a functional block diagram of a verification device according to the first embodiment.
FIG. 2 is a diagram showing an example of a processing flow of the verification device according to the first embodiment.
FIG. 3 is a diagram showing an example of the configuration of a computer to which the present technique is applied.

Embodiments of the present invention are described below. In the drawings used in the following description, components having the same function and steps performing the same processing are given the same reference numerals, and duplicate explanations are omitted. In the following description, processing performed on an element-by-element basis of a vector or matrix is applied to all elements of that vector or matrix, unless otherwise specified.

<Key Points of the First Embodiment>
The distribution of the characteristics of the AIs considered to be identical and the distribution of the characteristics of the AI to be verified are obtained, and identity is verified by examining, through statistical hypothesis testing, whether the two distributions differ. Here, the output of an AI for a given input is called an "AI characteristic."

Even for an AI that changes, it is possible to confirm whether the AI to be verified is the AI the user expects, or whether the AI has changed in a way not anticipated by the user or creator, by verifying its identity against information about the expected AI obtained in advance.

Note that in this embodiment, not only completely identical AIs but also AIs within the expected range (the range of allowable change) are regarded as the same AI. This configuration makes it easier to keep up with an AI that continuously learns and changes.

<First Embodiment>
FIG. 1 is a functional block diagram of a verification device 100 according to the first embodiment, and FIG. 2 shows its processing flow.

The verification device 100 includes an acquisition unit 110, a storage unit 120, and a verification unit 130.

The verification device 100 takes as input the output data Y = {(y_{1,1}, ..., y_{1,K}), ..., (y_{N,1}, ..., y_{N,K})} obtained when input data X = (x_1, ..., x_K) is given to a group of AIs considered to be identical, and the output data Y_m = (y_{m,1}, ..., y_{m,K}) obtained when the same input data X = (x_1, ..., x_K) is given to the AI to be verified. It verifies whether the AI to be verified is identical to the group of AIs considered to be identical, and outputs the verification result.

The verification device 100 is, for example, a special device configured by loading a special program into a publicly known or dedicated computer having a central processing unit (CPU), a main memory (RAM: Random Access Memory), and the like. The verification device 100 executes each process under the control of, for example, the central processing unit. Data input to the verification device 100 and data obtained in each process are stored, for example, in the main memory, and the data stored in the main memory is read out to the central processing unit as necessary and used for other processes. At least a part of each processing unit of the verification device 100 may be configured by hardware such as an integrated circuit. Each storage unit provided in the verification device 100 can be configured by, for example, a main memory such as a RAM, or by middleware such as a relational database or a key-value store. However, each storage unit does not necessarily need to be provided inside the verification device 100; it may be configured as an auxiliary storage device made up of a hard disk, an optical disk, or a semiconductor memory element such as flash memory, and may be provided outside the verification device 100.

Each unit is described below.

<Acquisition Unit 110 and Storage Unit 120>
The acquisition unit 110 retrieves input data X = (x_1, ..., x_K) stored in advance in a storage unit (not shown) and outputs it to each AI included in the group of AIs considered to be identical. The group of AIs considered to be identical includes N AIs. N is an integer of 1 or more, chosen large enough for the verification. The n-th AI receives the input data X = (x_1, ..., x_K), obtains output data (y_{n,1}, ..., y_{n,K}), and outputs it to the acquisition unit 110. The same process is performed for n = 1, 2, ..., N, and the acquisition unit 110 acquires the output data Y = {(y_{1,1}, ..., y_{1,K}), ..., (y_{N,1}, ..., y_{N,K})} (S110) and stores it in the storage unit 120. This process is carried out before the output data Y_m = (y_{m,1}, ..., y_{m,K}), obtained when the input data X = (x_1, ..., x_K) is given to the AI to be verified, is input.

Once the AI to be verified has been decided, the acquisition unit 110 outputs the input data X = (x_1, ..., x_K) to the AI to be verified. The AI to be verified receives the input data X = (x_1, ..., x_K), obtains output data Y_m = (y_{m,1}, ..., y_{m,K}), and outputs it to the acquisition unit 110. The acquisition unit 110 acquires the output data Y_m (S110) and outputs it to the verification unit 130.
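The acquisition step can be pictured with a minimal sketch, assuming each AI is exposed as a callable that maps one input to one scalar output. The names `Model` and `acquire_outputs` and the toy linear models are hypothetical and do not appear in the source; they only illustrate the shapes of Y (N x K) and Y_m (length K).

```python
from typing import Callable, List, Sequence

# Hypothetical interface: a model maps one input x_k to one scalar output y_k.
Model = Callable[[float], float]


def acquire_outputs(models: Sequence[Model], inputs: Sequence[float]) -> List[List[float]]:
    """Return Y, where Y[n][k] is the output of the n-th model for the k-th input."""
    return [[model(x) for x in inputs] for model in models]


# Toy stand-ins for the N = 3 models considered identical (slightly different slopes).
reference_models = [lambda x, a=a: a * x for a in (1.00, 1.01, 0.99)]
inputs = [0.0, 1.0, 2.0]  # X = (x_1, ..., x_K) with K = 3

# First output data group: acquired and stored before the model to be verified arrives.
Y = acquire_outputs(reference_models, inputs)

# Second output data: the same inputs given to the model to be verified.
Y_m = acquire_outputs([lambda x: 1.02 * x], inputs)[0]
```

Feeding the identical input data X to every model is what makes the two output samples comparable in the later verification step.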

Here, the group of AIs considered to be identical refers to a group of AIs within the range expected by the creator or user of the AI (the range of allowable change or difference), and the creator or user may set this range and its conditions as appropriate for the application. The group of AIs considered to be identical may be obtained, for example, by acquiring an AI that continuously learns and changes N times, once each time a predetermined period elapses, or by distributing an AI that continuously learns and changes, as it exists at a certain point in time, to N clients and acquiring the changed AIs from the N clients after a predetermined period has elapsed. The single original AI may be included as one of the N AIs in the group of AIs considered to be identical, or it may be left out of the group. An AI that has changed beyond the range expected by the creator or user is not included in the group of AIs considered to be identical, even if it was derived from the single original AI. Whether an AI has changed beyond the expected range may be determined automatically, by setting a threshold or the like and comparing it with the changing parameters used inside the AI or with the output data of the AI, or it may be determined manually.
In addition, when random parameters are used as the pre-learning initial values of the parameters used inside the AI, multiple AIs that have the same structure but different random parameters can be prepared. In that case, having the same structure may be taken as the condition for being regarded as the same AI, and these multiple AIs may be used as the group of AIs considered to be identical. Furthermore, the group of AIs may be obtained by combining these methods. For example, N AIs that have the same structure but different random parameters may be prepared and distributed to N clients, and the changed AIs may be acquired from the N clients after a predetermined period has elapsed.
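As a rough illustration of the automatic, threshold-based screening mentioned above, the following sketch keeps a candidate AI in the group only if its outputs stay close to those of the original AI. The specific criterion (maximum absolute output deviation), the function names, and the threshold value are all assumptions made for illustration; the source only states that a threshold may be compared with parameters or output data.

```python
from typing import List, Sequence


def within_expected_range(base: Sequence[float], candidate: Sequence[float],
                          threshold: float) -> bool:
    """True if no candidate output deviates from the original AI's output by more than threshold."""
    return max(abs(c - b) for b, c in zip(base, candidate)) <= threshold


def build_reference_group(base: Sequence[float], candidates: Sequence[Sequence[float]],
                          threshold: float) -> List[Sequence[float]]:
    """Keep only the candidates whose change stays within the expected range."""
    return [c for c in candidates if within_expected_range(base, c, threshold)]


base_outputs = [0.0, 1.0, 2.0]       # outputs of the original AI (toy values)
candidate_outputs = [
    [0.0, 1.01, 2.02],               # small drift: kept
    [0.0, 1.50, 3.00],               # large drift: excluded
]
group = build_reference_group(base_outputs, candidate_outputs, threshold=0.1)
```

Screening on outputs rather than internal parameters is one possible choice here; a parameter-based comparison would follow the same pattern.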
<Verification Unit 130>
The verification unit 130 receives the output data Y_m = (y_{m,1}, ..., y_{m,K}), retrieves the output data Y = {(y_{1,1}, ..., y_{1,K}), ..., (y_{N,1}, ..., y_{N,K})} from the storage unit 120, verifies by statistical hypothesis testing whether the AI to be verified is identical to the group of AIs considered to be identical (S130), and outputs the verification result. The verification result may be any information indicating whether the AI to be verified can be regarded as identical to the group of AIs considered to be identical. For example, it is a value (e.g., 1) indicating that "the AI to be verified can be regarded as identical to the group of AIs considered to be identical," or a value (e.g., 0) indicating that "the AI to be verified cannot be regarded as identical to the group of AIs considered to be identical."

The verification method used by the verification unit 130 is described below.

(1) When the elements y_{p,k} of the output data Y_m or Y (p is m, 1, 2, ..., N, and k = 1, 2, ..., K) are scalar values, the output data Y_m and Y are each regarded as a sample drawn from some population. A statistical hypothesis test is used to examine whether the probability distributions of the two populations differ, and the result of the test is used to decide whether the AIs differ as well. In other words, if the probability distributions of the populations differ, the AIs are regarded as different, and if the probability distributions are the same, the AIs can be regarded as the same. The statistical hypothesis test used here is one that can examine whether the probability distributions of two populations differ; examples include the two-sample Kolmogorov-Smirnov test and the Mann-Whitney U test. As another way of verifying the identity of AIs, one could regard each AI as a function, approximate a similarity from the distance between inputs and outputs, and decide whether the AI to be verified can be regarded as identical to the group of AIs considered to be identical by comparing the similarity with a threshold. In that case, however, an appropriate threshold for the similarity must be set. Statistical hypothesis testing, on the other hand, does not require a separate appropriate threshold to be set, and whether the probability distributions of the populations differ can be examined using conventional methods.
(2) When the elements y_{p,k} of the output data Y_m or Y are vectors (multidimensional), the output data (y_{g,1}, ..., y_{g,K}) of an arbitrary AI (g is any one of 1, 2, ..., N) is selected from the output data Y = {(y_{1,1}, ..., y_{1,K}), ..., (y_{N,1}, ..., y_{N,K})}, and, taking the output data y_{g,k} as the reference, the distance d_{p',k} to each element y_{p',k} is calculated. Here, p' is m, 1, 2, ..., N other than g (p' ≠ g). For example, if there is a single AI from which the other N-1 AIs in the group of AIs considered to be identical were derived, the output data of that original AI may be selected. If the distance is calculated so that d_{p',k} is a scalar value, the same processing as in the scalar case can be applied (using the distance d_{p',k} in place of the element y_{p,k}): whether the probability distributions of the populations differ is examined by statistical hypothesis testing, and the result of the test is used to decide whether the AIs differ as well. One example of such a distance is the Euclidean distance:

$$d_{p',k} = \sqrt{\sum_{q=1}^{Q_k} \left( y_{p',k,q} - y_{g,k,q} \right)^2}$$

Here, y_{p',k,q} and y_{g,k,q} are the values in the q-th dimension of the vector-valued elements y_{p',k} and y_{g,k}, respectively, and Q_k is the number of dimensions of the vector-valued elements y_{p',k} and y_{g,k}. Note that this way of calculating the distance is only an example, and any other method may be used as long as it can convert vector values into scalar values.
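The two cases above can be sketched in standard-library Python as follows. This is an illustrative sketch under several assumptions, not the patented implementation: the reference outputs are pooled across all N models into one sample, the p-value uses the asymptotic Kolmogorov series (an approximation), and the significance level `alpha = 0.05` is a value chosen here because the source does not state one. All function names are hypothetical.

```python
import math
from typing import Sequence


def euclidean_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Case (2): reduce a vector-valued output to a scalar distance from the reference output."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def ks_statistic(sample_a: Sequence[float], sample_b: Sequence[float]) -> float:
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between the empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        while i < len(a) and a[i] <= x:  # step past all values equal to x (handles ties)
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


def ks_p_value(d: float, n_a: int, n_b: int, terms: int = 100) -> float:
    """Asymptotic p-value from the Kolmogorov distribution (approximate for small samples)."""
    lam = math.sqrt(n_a * n_b / (n_a + n_b)) * d
    if lam < 0.2:  # the series converges slowly here; the p-value is effectively 1
        return 1.0
    total = sum((-1) ** (j - 1) * math.exp(-2.0 * j * j * lam * lam)
                for j in range(1, terms + 1))
    return min(1.0, max(0.0, 2.0 * total))


def verify(reference_group: Sequence[Sequence[float]], target: Sequence[float],
           alpha: float = 0.05) -> int:
    """Case (1): return 1 if the test does not reject 'same distribution', else 0."""
    pooled = [y for outputs in reference_group for y in outputs]  # assumption: pool all N models
    d = ks_statistic(pooled, target)
    return 1 if ks_p_value(d, len(pooled), len(target)) >= alpha else 0


inputs = [float(k) for k in range(20)]                      # toy input data X, K = 20
Y = [[a * x for x in inputs] for a in (1.00, 1.01, 0.99)]   # toy reference group, N = 3
result_similar = verify(Y, [1.005 * x for x in inputs])     # outputs close to the group
result_shifted = verify(Y, [x + 100.0 for x in inputs])     # outputs far from the group
```

For vector-valued outputs, each output would first be replaced by `euclidean_distance(y_pk, y_gk)` against the chosen reference model g, and the resulting scalar distances passed to `verify` in the same way. Note that failing to reject the null hypothesis is treated here as "can be regarded as identical," following the source's interpretation of the test result.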

<Effects>
With the above configuration, the identity even of an AI that continuously learns and changes can be verified from the output data alone, without referring to the internal structure or internal parameters of the AI.

<Modification>
In this embodiment, the acquisition unit 110 stores the output data Y = {(y_{1,1}, ..., y_{1,K}), ..., (y_{N,1}, ..., y_{N,K})} in the storage unit 120. However, when verification is performed immediately after, or at the same time as, the output data Y is received, the output data Y may be output directly to the verification unit 130 without going through the storage unit 120. Likewise, the acquisition unit 110 outputs the output data Y_m to the verification unit 130, but when verification is not performed immediately after, or at the same time as, the output data Y_m is received, the output data Y_m may be stored in the storage unit 120 until verification is performed. In this case, the verification unit 130 retrieves the output data Y_m from the storage unit 120 at the time of verification.

In this embodiment, the acquisition unit 110 outputs the input data X = (x_1, ..., x_K) to each AI included in the group of AIs considered to be identical and to the AI to be verified. However, this processing may be performed by an external device, with the acquisition unit 110 configured only to acquire the output data Y = {(y_{1,1}, ..., y_{1,K}), ..., (y_{N,1}, ..., y_{N,K})} and Y_m = (y_{m,1}, ..., y_{m,K}) and to store or output them.

<Other Modifications>
The present invention is not limited to the above embodiment and modifications. For example, the various processes described above may be executed not only in chronological order as described, but also in parallel or individually, depending on the processing capacity of the device executing the processes or as necessary. Other changes may be made as appropriate without departing from the spirit of the present invention.

<Program and Recording Medium>
The various processes described above can be implemented by loading a program that executes each step of the above method into the recording unit 2020 of the computer 2000 shown in FIG. 3, and causing the control unit 2010, the input unit 2030, the output unit 2040, the display unit 2050, and the like to operate.

The program describing this processing can be recorded on a computer-readable recording medium. The computer-readable recording medium may be of any kind, for example a magnetic recording device, an optical disk, a magneto-optical recording medium, or a semiconductor memory.

The program is distributed, for example, by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM on which the program is recorded. Furthermore, the program may be distributed by storing it in a storage device of a server computer and transferring it from the server computer to other computers via a network.

A computer that executes such a program first stores, for example, the program recorded on a portable recording medium or the program transferred from a server computer in its own storage device. Then, when executing a process, the computer reads the program stored in its own storage device and executes the process according to the read program. As another form of execution, the computer may read the program directly from the portable recording medium and execute the process according to the program, or it may execute the process according to the received program each time the program is transferred to it from the server computer. The above processing may also be executed by a so-called ASP (Application Service Provider) type service, in which the program is not transferred from the server computer to the computer and the processing function is realized only through execution instructions and acquisition of results. Note that the program in this embodiment includes information that is used for processing by an electronic computer and is equivalent to a program (such as data that is not a direct command to the computer but has properties that define the processing of the computer).

In addition, in this embodiment the device is configured by executing a predetermined program on a computer, but at least part of the processing may be realized by hardware.

Claims

1. A verification device comprising:
an acquisition unit that acquires a first output data group obtained when input data is provided to a group of machine learning models that are considered to be identical, and second output data obtained when the input data is provided to a machine learning model to be verified; and
a verification unit that verifies whether the machine learning model to be verified is identical to the machine learning model group by statistical hypothesis testing using the first output data group and the second output data.

2. A verification method comprising:
an acquisition step in which an acquisition unit acquires a first output data group obtained when input data is provided to a group of machine learning models considered to be identical, and second output data obtained when the input data is provided to a machine learning model to be verified; and
a verification step in which a verification unit verifies whether the machine learning model to be verified is identical to the machine learning model group by a statistical hypothesis test using the first output data group and the second output data.

3. A program for causing a computer to function as the verification device of claim 1.