JP7544883B2

JP7544883B2 - Interaction Between LUT and AMVP

Info

Publication number: JP7544883B2
Application number: JP2023016139A
Authority: JP
Inventors: リージャン; カイジャン; ホンビンリウ; ユエワン
Original assignee: Beijing ByteDance Network Technology Co Ltd; ByteDance Inc
Current assignee: Beijing ByteDance Network Technology Co Ltd; ByteDance Inc
Priority date: 2018-06-29
Filing date: 2023-02-06
Publication date: 2024-09-03
Anticipated expiration: 2039-07-01
Also published as: JP2021530940A; US11528501B2; EP4325861A3; CN114125450A; KR20210024504A; SG11202013028PA; KR102641872B1; JP7295231B2; CA3105330A1; CN114125450B; CN110662043A; TW202015422A; MX2020013828A; WO2020003284A1; EP3797516A1; US20210014525A1; US20230064498A1; EP4325861A2; TWI719525B; CA3105330C

Description

関連出願の相互参照
パリ条約に基づく適用可能な特許法および／または規則に基づいて、本願は、２０１８年６月２９日出願の国際特許出願ＰＣＴ／ＣＮ２０１８／０９３６６３号、２０１８年９月１２日出願の国際特許出願ＰＣＴ／ＣＮ２０１８／１０５１９３号、２０１９年１月１６日出願の国際特許出願ＰＣＴ／ＣＮ２０１９／０７２０５８号の優先権および利益を適時に主張することを目的とする。米国の法律の下、あらゆる目的のために、国際特許出願ＰＣＴ／ＣＮ２０１８／０９３６６３号、国際特許出願第ＰＣＴ／ＣＮ２０１８／１０５１９３号、および国際特許出願第ＰＣＴ／ＣＮ２０１９／０７２０５８の開示の全文は、本願の開示の一部として参照により援用される。 CROSS-REFERENCE TO RELATED APPLICATIONS Under applicable patent laws and/or regulations under the Paris Convention, this application is intended to timely claim priority to and the benefit of International Patent Application No. PCT/CN2018/093663, filed June 29, 2018, International Patent Application No. PCT/CN2018/105193, filed September 12, 2018, and International Patent Application No. PCT/CN2019/072058, filed January 16, 2019. Under the laws of the United States, for all purposes, the entire disclosures of International Patent Application No. PCT/CN2018/093663, International Patent Application No. PCT/CN2018/105193, and International Patent Application No. PCT/CN2019/072058 are incorporated by reference as part of the disclosure of this application.

この特許明細書は、映像符号化および復号化技術、デバイスおよびシステムに関する。 This patent specification relates to video encoding and decoding techniques, devices and systems.

映像圧縮の進歩にもかかわらず、デジタル映像は、依然として、インターネットおよび他のデジタル通信ネットワークにおいて最大の帯域幅の使用量を占めている。映像の受信および表示が可能な接続されたユーザ機器の数が増加するにつれ、デジタル映像の使用に対する帯域幅需要は増大し続けることが期待される。 Despite advances in video compression, digital video still accounts for the largest bandwidth usage on the Internet and other digital communications networks. As the number of connected user devices capable of receiving and displaying video increases, the bandwidth demands for digital video use are expected to continue to grow.

本明細書は、デジタル映像を符号化および復号化するための方法、システム、およびデバイスを開示する。 This specification discloses methods, systems, and devices for encoding and decoding digital video.

１つの例示的な態様において、映像復号化の方法は、テーブルを維持することであって、各テーブルは、動き候補のセットを含み、各動き候補は、対応する動き情報に関連付けられる、ことと、第１の映像ブロックと、第１の映像ブロックを含む映像のビットストリーム表現との間で変換を行うことであって、変換を行うことは、動き候補のセットのうちの少なくとも一部を予測因子として使用して第１の映像ブロックの動き情報を処理する、こととを含むように提供される。 In one exemplary aspect, a method of video decoding is provided that includes maintaining tables, each table including a set of motion candidates, each motion candidate being associated with corresponding motion information, and converting between a first video block and a bitstream representation of a video including the first video block, the converting including processing the motion information of the first video block using at least a portion of the set of motion candidates as predictors.

さらに別の代表的な態様では、本明細書で説明される様々な技法は、非一時的なコンピュータ可読媒体に記憶されるコンピュータプログラム製品として実施され得る。このコンピュータプログラム製品は、本明細書に記載の方法を実行するためのプログラムコードを含む。 In yet another representative aspect, the various techniques described herein may be implemented as a computer program product stored on a non-transitory computer-readable medium. The computer program product includes program code for performing the methods described herein.

１つ以上の実装形態の詳細は、添付の添付ファイル、図面、および以下の説明に記載されている。他の特徴は、説明および図面、並びに特許請求の範囲の記載から明らかとなろう。 Details of one or more implementations are set forth in the accompanying attachments, drawings, and description below. Other features will be apparent from the description and drawings, and from the claims.

映像エンコーダの実装形態の例を示すブロック図である。FIG. 2 is a block diagram illustrating an example implementation of a video encoder. Ｈ．２６４映像符号化規格におけるマクロブロックの分割を示す。1 shows a macroblock division in the H.264 video coding standard. 符号化ブロック（ＣＢ：ＣｏｄｉｎｇＢｌｏｃｋ）を予測ブロック（ＰＵ：ＰｒｅｄｉｃｔｉｏｎＢｌｏｃｋ）に分割する例を示す。An example of dividing a coding block (CB) into prediction blocks (PU) will be shown. ＣＴＢをＣＢおよび変換ブロック（ＴＢ：ＴｒａｎｓｆｏｒｍＢｌｏｃｋ）に細分するための例示的な実装形態を示す。実線はＣＢ境界を示し、点線はＴＢ境界を示し、その分割を含むＣＴＢの例、および対応する４分木を含む。1 illustrates an example implementation for subdividing a CTB into CBs and Transform Blocks (TBs), with solid lines indicating CB boundaries and dotted lines indicating TB boundaries, including an example CTB with its division, and the corresponding quadtree. 映像データを分割するための４分木２分木（ＱＴＢＴ：ＱｕａｄＴｒｅｅＢｉｎａｒｙＴｒｅｅ）構造の一例を示す。1 shows an example of a quad tree binary tree (QTBT) structure for dividing video data. 映像ブロックの分割の例を示す。1 shows an example of video block division. ４分木の分割の例を示す。An example of a quadtree division is shown below. ツリー型信号通知の例を示す。1 shows an example of a tree-type signaling. マージ候補リスト構築のための導出処理の一例を示す。13 illustrates an example of a derivation process for building a merge candidate list. 空間的マージ候補の位置の例を示す。1 shows examples of spatial merge candidate locations. 空間的マージ候補の冗長性チェックに考慮される候補対の例を示す。1 shows examples of candidate pairs considered for redundancy check of spatial merge candidates. Ｎ×２Ｎおよび２Ｎ×Ｎパーティションの第２のＰＵの位置の例を示す。4 shows examples of the location of the second PU for N×2N and 2N×N partitions. 時間的マージ候補のための動きベクトルのスケーリングを示す。13 illustrates the scaling of motion vectors for temporal merge candidates. 時間的マージ候補の候補位置とその同一位置のピクチャを示す。Shows the candidate locations of temporal merge candidates and their co-located pictures. 結合双方向予測マージ候補の例を示す。1 illustrates an example of a combined bi-predictive merge candidate. 動きベクトル予測候補の導出処理の例を示す。13 shows an example of a process for deriving motion vector prediction candidates. 空間的動きベクトル候補のための動きベクトルのスケーリングの例を示す。13 illustrates an example of motion vector scaling for spatial motion vector candidates. ＣＵの動き予測のための例示的なＡＴＭＶＰ（ＡｌｔｅｒｎａｔｉｖｅＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎを示す。1 illustrates an exemplary Alternative Temporal Motion Vector Prediction (ATMVP) for motion prediction of a CU. ソースブロックおよびソースピクチャの識別の一例を絵で示す。1 illustrates a pictorial example of source block and source picture identification. ４つのサブブロックおよび近傍のブロックを有する１つのＣＵの例を示す。An example of one CU with four sub-blocks and neighboring blocks is shown. バイラテラルマッチングの例を示す。1 shows an example of bilateral matching. テンプレートマッチングの例を示す。An example of template matching will be given below. ＦＲＵＣ（ＦｒａｍｅＲａｔｅＵｐＣｏｎｖｅｒｓｉｏｎ）におけるユニラテラル動き推定（ＭＥ：ＭｏｔｉｏｎＥｓｔｉｍａｔｉｏｎ）の例を示す。An example of unilateral motion estimation (ME) in Frame Rate Up Conversion (FRUC) is shown. バイラテラルテンプレートマッチングに基づくＤＭＶＲの例を示す。1 shows an example of DMVR based on bilateral template matching. 空間的マージ候補を導出するために使用する空間的に近傍のブロックの例を示す。1 shows an example of spatially neighboring blocks used to derive spatial merging candidates. ルックアップテーブル更新のための代表的な位置の選択方法の一例を示す。13 shows an example of how to select a representative location for updating a lookup table. 新しい動き情報のセットでルックアップテーブルを更新する例を示す。An example of updating a lookup table with a new set of motion information is given. 新しい動き情報のセットでルックアップテーブルを更新する例を示す。An example of updating a lookup table with a new set of motion information is given. 本明細書に記載されるビジュアルメディアの復号化またはビジュアルメディアの符号化技術を実装するためのハードウェアプラットフォームの一例を示すブロック図である。FIG. 2 is a block diagram illustrating an example of a hardware platform for implementing the visual media decoding or visual media encoding techniques described herein. 映像ビットストリーム処理の別の例示的な方法のフローチャートである。4 is a flow chart of another exemplary method of video bitstream processing. 提案されるＨＭＶＰ方法による復号化フローチャートの一例を示す。1 shows an example of a decoding flowchart according to the proposed HMVP method. 提案されるＨＭＶＰ方法を用いたテーブルの更新例を示す。An example of updating a table using the proposed HMVP method is shown. 冗長性除去に基づくＬＵＴ更新方法（１つの冗長性動き候補を除去する）の例を示す。1 shows an example of a redundancy elimination based LUT update method (eliminating one redundant motion candidate). 冗長性除去に基づくＬＵＴ更新方法（１つの冗長性動き候補を除去する）の例を示す。1 shows an example of a redundancy elimination based LUT update method (eliminating one redundant motion candidate). 冗長性除去に基づくＬＵＴ更新方法（複数の冗長性動き候補を除去する）の例を示す。1 shows an example of a redundancy elimination based LUT update method (eliminating multiple redundant motion candidates). 冗長性除去に基づくＬＵＴ更新方法（複数の冗長性動き候補を除去する）の例を示す。1 shows an example of a redundancy elimination based LUT update method (eliminating multiple redundant motion candidates). タイプ１のブロックとタイプ２のブロックとの相違点の一例を示す。An example of the difference between type 1 blocks and type 2 blocks is shown below.

映像の圧縮率を改善するために、研究者らは、映像を符号化する新しい技術を絶えず求めている。 To improve video compression rates, researchers are constantly seeking new techniques for encoding video.

１．導入 1. Introduction

本明細書は、映像符号化技術に関する。具体的には、映像符号化における動き情報の符号化（例えば、マージモード、ＡＭＶＰモード）に関する。ＨＥＶＣのような既存の映像符号化規格に適用してもよいし、規格（ＶｅｒｓａｔｉｌｅＶｉｄｅｏＣｏｄｉｎｇ）を確定させるために適用してもよい。本発明は、将来の映像符号化規格または映像コーデックにも適用可能である。 This specification relates to video coding technology. Specifically, it relates to coding of motion information in video coding (e.g., merge mode, AMVP mode). It may be applied to existing video coding standards such as HEVC, or may be applied to finalize a standard (Versatile Video Coding). The present invention is also applicable to future video coding standards or video codecs.

簡単な説明 Brief description

映像符号化規格は、主に周知のＩＴＵ－ＴおよびＩＳＯ／ＩＥＣ規格の開発によって発展してきた。ＩＴＵ－ＴはＨ．２６１とＨ．２６３を作り、ＩＳＯ／ＩＥＣはＭＰＥＧ－１とＭＰＥＧ－４Ｖｉｓｕａｌを作り、両団体はＨ．２６２／ＭＰＥＧ－２ＶｉｄｅｏとＨ．２６４／ＭＰＥＧ－４ＡＶＣ（ＡｄｖａｎｃｅｄＶｉｄｅｏＣｏｄｉｎｇ）とＨ．２６５／ＨＥＶＣ規格を共同で作った。Ｈ．２６２以来、映像符号化規格は、時間予測と変換符号化が利用されるハイブリッド映像符号化構造に基づく。典型的なＨＥＶＣエンコーダフレームワークの一例を図１に示す。 Video coding standards have evolved primarily through the development of well-known ITU-T and ISO/IEC standards. ITU-T produced H.261 and H.263, ISO/IEC produced MPEG-1 and MPEG-4 Visual, and the two organizations jointly produced H.262/MPEG-2 Video, H.264/MPEG-4 AVC (Advanced Video Coding), and H.265/HEVC standards. Since H.262, video coding standards have been based on a hybrid video coding structure in which temporal prediction and transform coding are utilized. An example of a typical HEVC encoder framework is shown in Figure 1.

２．１パーティション構造 2.1 Partition structure

２．１．１Ｈ．２６４／ＡＶＣにおけるパーティションツリー構造 2.1.1 Partition tree structure in H.264/AVC

以前の規格における符号化層のコアは、１６×１６ブロックの輝度サンプルを含み、通常の４：２：０カラーサンプリングの場合、２つの対応する８×８ブロックの彩度サンプル含むマクロブロックであった。 The core of the coding layer in previous standards was a macroblock containing a 16x16 block of luma samples and, in the case of normal 4:2:0 color sampling, two corresponding 8x8 blocks of chroma samples.

イントラ符号化されたブロックは、画素間の空間的相関を利用するために空間予測を使用する。２つのパーティションを規定する。１６×１６および４×４である。 Intra-coded blocks use spatial prediction to exploit spatial correlation between pixels. We define two partitions: 16x16 and 4x4.

インター符号化されたブロックは、ピクチャ間の動きを推定することで、空間的予測の代わりに時間予測を用いる。動きは、１６×１６マクロブロックまたはそのサブマクロブロックパーティションのいずれかに対して独立して推定できる。１６×８、８×１６、８×８、８×４、４×８、４×４（図２参照）。１つのサブマクロブロックパーティション当たり１つの動きベクトル（ＭＶ）のみが許可される。 Inter-coded blocks use temporal prediction instead of spatial prediction by estimating the motion between pictures. Motion can be estimated independently for a 16x16 macroblock or any of its sub-macroblock partitions: 16x8, 8x16, 8x8, 8x4, 4x8, 4x4 (see Figure 2). Only one motion vector (MV) per sub-macroblock partition is allowed.

２．１．２ＨＥＶＣにおけるパーティションツリー構造 2.1.2 Partition tree structure in HEVC

ＨＥＶＣにおいて、ＣＴＵは、様々な局所的特徴に適応するように、符号化ツリーと呼ばれる４分木構造を用いてＣＵに分割される。インターピクチャ（時間的）予測またはイントラピクチャ（空間的）予測を使用した、ピクチャ領域を符号化するかどうかの決定は、ＣＵレベルで行われる。各ＣＵは、ＰＵ分割タイプに応じて１つ、２つまたは４つのＰＵに更に分割することができる。１つのＰＵの内部では、同じ予測処理が適用され、ＰＵ単位で関連情報がデコーダに送信される。ＰＵ分割タイプに基づく予測処理を適用して残差ブロックを得た後、ＣＵのための符号化ツリーに類似した別の４分木構造に基づいて、ＣＵを変換ユニット（ＴＵ）に分割することができる。ＨＥＶＣ構造の重要な特徴の１つは、ＣＵ、ＰＵ、ＴＵを含む複数のパーティション概念を有することである。 In HEVC, CTUs are partitioned into CUs using a quad-tree structure called a coding tree to accommodate various local features. The decision to code a picture region using inter-picture (temporal) or intra-picture (spatial) prediction is made at the CU level. Each CU can be further partitioned into one, two or four PUs depending on the PU partition type. Inside one PU, the same prediction process is applied and related information is sent to the decoder on a PU-by-PU basis. After applying the prediction process based on the PU partition type to obtain the residual block, the CU can be partitioned into transform units (TUs) based on another quad-tree structure similar to the coding tree for CUs. One of the key features of the HEVC structure is that it has a multiple partition concept including CU, PU and TU.

以下、ＨＥＶＣを使用したハイブリッド映像符号化に関連する様々な特徴に焦点を当てる。 Below we highlight various features related to hybrid video coding using HEVC.

１）符号化ツリーユニットおよび符号化ツリーブロック（ＣＴＢ）構造。ＨＥＶＣにおける類似した構造は、符号化ツリーユニット（ＣＴＵ）であり、この符号化ツリーユニットは、エンコーダによって選択されたサイズを有し、従来のマクロブロックよりも大きくてもよい。ＣＴＵは、輝度ＣＴＢと、対応する彩度ＣＴＢおよび構文要素とからなる。輝度ＣＴＢのサイズＬ×Ｌは、Ｌ＝１６、３２、または６４のサンプルとして選択することができ、より大きいサイズは、一般的に、より優れた圧縮を有効にする。ＨＥＶＣは、次いで、ツリー構造および４分木様の信号通知を使用して、ＣＴＢをより小さなブロックに分割することをサポートする。 1) Coding Tree Unit and Coding Tree Block (CTB) Structure. A similar structure in HEVC is the coding tree unit (CTU), which has a size selected by the encoder and may be larger than a conventional macroblock. A CTU consists of a luma CTB and a corresponding chroma CTB and syntax elements. The size LxL of the luma CTB can be selected as L=16, 32, or 64 samples, with larger sizes generally enabling better compression. HEVC then supports splitting the CTB into smaller blocks using a tree structure and quadtree-like signaling.

２）符号化ユニット（ＣＵ）および符号化ブロック（ＣＢ）：ＣＴＵの４分木の構文は、その輝度および彩度ＣＢのサイズおよび位置を指定する。４分木のルートはＣＴＵに関連付けられる。従って、輝度ＣＴＢのサイズは、輝度ＣＢに対してサポートされる最大のサイズである。ＣＴＵを輝度ＣＢおよび彩度ＣＢに分割することは、共に信号通知されることである。１つの輝度ＣＢおよび通常２つの彩度ＣＢは、関連する構文と共に、１つの符号化ユニット（ＣＵ）を形成する。ＣＴＢは、１つのＣＵのみを含んでもよく、または複数のＣＵを形成するように分割されてもよく、各ＣＵは、それに関連付けられた予測ユニット（ＰＵ）への分割と、１つの変換ユニットのツリー（ＴＵ）とを有する。 2) Coding Units (CUs) and Coding Blocks (CBs): The quadtree syntax of a CTU specifies the size and location of its luma and chroma CBs. The root of the quadtree is associated with the CTU. The size of the luma CTB is therefore the maximum size supported for the luma CB. The partitioning of a CTU into luma and chroma CBs is signaled together. One luma CB and usually two chroma CBs, together with the associated syntax, form one coding unit (CU). A CTB may contain only one CU or may be partitioned to form multiple CUs, each with its associated partition into prediction units (PUs) and one tree of transform units (TUs).

３）予測ユニットおよび予測ブロック（ＰＢ）：インターピクチャまたはイントラピクチャ予測を使用してピクチャ領域を符号化するかどうかの決定は、ＣＵレベルで行われる。ＰＵの分割構造は、そのルートがＣＵレベルにある。基本的な予測タイプの決定に基づいて、次に、輝度および彩度ＣＢのサイズをさらに分割し、輝度および彩度予測ブロック（ＰＢ）から予測することができる。ＨＥＶＣは、６４×６４から４×４までの可変ＰＢサイズのサンプルをサポートする。図３は、Ｍ×ＭのＣＵのための許可されたＰＢの例を示す。 3) Prediction Units and Prediction Blocks (PBs): The decision of whether to code a picture region using inter-picture or intra-picture prediction is made at the CU level. The partitioning structure of the PU has its root at the CU level. Based on the decision of the basic prediction type, the size of the luma and chroma CBs can then be further partitioned and predicted from the luma and chroma prediction blocks (PBs). HEVC supports variable PB size samples from 64x64 to 4x4. Figure 3 shows an example of allowed PBs for an MxM CU.

４）ＴＵおよび変換ブロック：予測残差は、ブロック変換を使用して符号化される。ＴＵツリー構造は、そのルートがＣＵレベルにある。この輝度ＣＢ残差は、輝度変換ブロック（ＴＢ）と同一であってもよいし、小さな輝度ＴＢにさらに分割されてもよい。彩度ＴＢについても同様である。正方形ＴＢサイズ４×４、８×８、１６×１６、および３２×３２に対して、離散コサイン変換（ＤＣＴ）の整数基底関数に類似した整数基底関数が規定される。輝度イントラピクチャ予測残差の４×４変換のために、離散サイン変換（ＤＳＴ）の形式から導出される整数変換が代替的に指定される。 4) TUs and Transform Blocks: The prediction residual is coded using a block transform. The TU tree structure has its root at the CU level. This luma CB residual may be identical to the luma transform block (TB) or may be further split into smaller luma TBs. Similarly for the chroma TB. For square TB sizes 4x4, 8x8, 16x16, and 32x32, integer basis functions similar to those of the discrete cosine transform (DCT) are specified. For the 4x4 transform of the luma intra-picture prediction residual, an integer transform derived from a form of the discrete sine transform (DST) is alternatively specified.

図４は、ＣＴＢをＣＢ［及び変換ブロック（ＴＢ）］に細分する例を示す。実線はＣＢ境界を示し、点線はＴＢ境界を示す。（ａ）ＣＴＢとその分割（ｂ）対応する４分木。 Figure 4 shows an example of subdivision of a CTB into CBs [and transform blocks (TBs)]. Solid lines indicate CB boundaries, dotted lines indicate TB boundaries. (a) CTB and its division (b) corresponding quadtree.

２．１．２．１変換ブロックおよびユニットへのツリー構造の分割 2.1.2.1 Dividing the tree structure into transformation blocks and units

残差符号化の場合、ＣＢは、変換ブロック（ＴＢ）に再帰的に分割することができる。この分割は、残差４分木によって信号通知される。図４に示すように、１つのブロックを再帰的に象限に分割することができるように、正方形のＣＢおよびＴＢの分割のみを指定する。サイズＭ×Ｍの所与の輝度ＣＢに対して、フラグは、それがサイズＭ／２×Ｍ／２の４つのブロックに分割されるかどうかを信号通知する。さらなる分割が可能である場合、ＳＰＳに示される残留４分木の最大深さによって信号通知されるように、各象限には、それが４つの象限に分割されているかどうかを示すフラグが割り当てられる。残差４分木の結果得られる葉ノードブロックは、変換符号化によってさらに処理される変換ブロックである。エンコーダは、それが使用することになる最大および最小輝度ＴＢサイズを示す。ＣＢサイズが最大ＴＢサイズよりも大きい場合、分割は非明示的に行われる。分割により、示された最小値よりも小さい輝度ＴＢサイズとなる場合、分割を行わないことが、非明示的に行われる。輝度ＴＢサイズが４×４である場合を除き、彩度ＴＢサイズは、各次元において輝度ＴＢサイズの半分であり、この場合、４つの４×４輝度ＴＢによって覆われる領域には１つの４×４彩度ＴＢが使用される。イントラピクチャ予測ＣＵの場合、最近の近傍のＴＢ（ＣＢ内またはＣＢ外）の復号サンプルを、イントラピクチャ予測のための参照データとして用いる。 For residual coding, the CB can be recursively split into transform blocks (TBs). This split is signaled by the residual quadtree. We only specify square CB and TB splits so that one block can be recursively split into quadrants as shown in Figure 4. For a given luma CB of size MxM, a flag signals whether it is split into four blocks of size M/2xM/2. If further splits are possible, each quadrant is assigned a flag indicating whether it is split into four quadrants, as signaled by the maximum depth of the residual quadtree indicated in the SPS. The leaf node blocks resulting from the residual quadtree are the transform blocks that are further processed by transform coding. The encoder indicates the maximum and minimum luma TB sizes it will use. If the CB size is larger than the maximum TB size, the split is done implicitly. If the split would result in a luma TB size smaller than the indicated minimum, no split is done implicitly. The chroma TB size is half the luma TB size in each dimension, except when the luma TB size is 4x4, in which case one 4x4 chroma TB is used for the area covered by four 4x4 luma TBs. For intra-picture predicted CUs, decoded samples of the nearest neighboring TBs (inside or outside the CB) are used as reference data for intra-picture prediction.

従来の規格とは対照的に、ＨＥＶＣ設計により、インターピクチャ予測ＣＵのために１つのＴＢが複数のＰＢにまたがることが可能となり、４分木構造のＴＢの分割の潜在的な符号化効率の利点が最大となる。 In contrast to previous standards, the HEVC design allows a TB to span multiple PBs for inter-picture predicted CUs, maximizing the potential coding efficiency benefits of quadtree TB partitioning.

２．１．２．２親子ノード 2.1.2.2 Parent and child nodes

ＣＴＢは、４分木構造に基づいて分割され、そのノードは符号化ユニットである。４分木構造における複数のノードは、葉ノードおよび非葉ノードを含む。葉ノードは、ツリー構造内に子ノードを持たない（すなわち、葉ノードはそれ以上分割されない）。非葉ノードは、ツリー構造のルートノードを含む。ルートノードは、映像データの最初の映像ブロック（例えば、ＣＴＢ）に対応する。複数のノードのうちのそれぞれの非ルートノードにおいて、それぞれの非ルートノードは、それぞれの非ルートノードのツリー構造における親ノードに対応する映像ブロックのサブブロックである映像ブロックに対応する。複数の非葉ノードのそれぞれの非葉ノードは、ツリー構造において１つ以上の子ノードを有する。 The CTB is divided based on a quadtree structure, whose nodes are coding units. The multiple nodes in the quadtree structure include leaf nodes and non-leaf nodes. The leaf nodes do not have child nodes in the tree structure (i.e., the leaf nodes are not further divided). The non-leaf nodes include a root node of the tree structure. The root node corresponds to a first video block (e.g., the CTB) of the video data. In each non-root node of the multiple nodes, the non-root node corresponds to a video block that is a sub-block of a video block corresponding to a parent node in the tree structure of the respective non-root node. Each non-leaf node of the multiple non-leaf nodes has one or more child nodes in the tree structure.

２．１．３ＪＥＭにおけるより大きいＣＴＵを有する４分木＋２分木ブロック構造 2.1.3 Quadtree + binary tree block structure with larger CTU in JEM

ＨＥＶＣを超えた将来の映像符号化技術を探索するため、２０１５年には、ＶＣＥＧとＭＰＥＧが共同でＪＶＥＴ（ＪｏｉｎｔＶｉｄｅｏＥｘｐｌｏｒａｔｉｏｎＴｅａｍ）を設立した。それ以来、多くの新しい方法がＪＶＥＴによって採用され、ＪＥＭ（ＪｏｉｎｔＥｘｐｌｏｒａｔｉｏｎＭｏｄｅ）と呼ばれる参照ソフトウェアに組み込まれてきた。 To explore future video coding technologies beyond HEVC, VCEG and MPEG jointly established the Joint Video Exploration Team (JVET) in 2015. Since then, many new methods have been adopted by JVET and incorporated into reference software called Joint Exploration Mode (JEM).

２．１．３．１ＱＴＢＴブロックの分割構造 2.1.3.1 Division structure of QTBT block

ＨＥＶＣとは異なり、ＱＴＢＴ構造は、複数のパーティションタイプの概念を削除する。すなわち、ＣＵ、ＰＵ、ＴＵのコンセプトの切り離しを取り除き、ＣＵパーティションの形状の柔軟性を向上させる。ＱＴＢＴブロック構造において、ＣＵは正方形または長方形のいずれかを有することができる。図５に示すように、まず、符号化ツリーユニット（ＣＴＵ）を４分木構造で分割する。４分木の葉ノードは、２分木構造によってさらに分割される。２分木の分割には、対称水平分割と対称垂直分割の２つの分割タイプがある。２分木の葉ノードは、符号化ユニット（ＣＵ）と呼ばれ、このセグメント化は、それ以上の分割を行うことなく、予測および変換処理に使用される。これは、ＱＴＢＴの符号化されたブロック構造において、ＣＵ、ＰＵおよびＴＵが同じブロックサイズを有することを意味する。ＪＥＭにおいて、ＣＵは、しばしば異なる色成分の符号化ブロック（ＣＢ）からなり、例えば、４：２：０彩度フォーマットのＰおよびＢスライスの場合、１つのＣＵは１つの輝度ＣＢおよび２つの彩度ＣＢを含み、また、ＣＵは、しばしば単一の成分のＣＢからなり、例えば、Ｉスライスの場合、１つのＣＵは、１つの輝度ＣＢのみ、または、２つの彩度ＣＢのみを含む。 Unlike HEVC, the QTBT structure removes the concept of multiple partition types, i.e., it removes the separation of the concepts of CU, PU, and TU, and improves the flexibility of the shape of CU partitions. In the QTBT block structure, a CU can have either a square or a rectangle. As shown in Figure 5, first, a coding tree unit (CTU) is divided by a quadtree structure. The leaf nodes of the quadtree are further divided by a binary tree structure. There are two types of division in the binary tree: symmetric horizontal division and symmetric vertical division. The leaf nodes of the binary tree are called coding units (CUs), and this segmentation is used for prediction and transform processing without further division. This means that in the coded block structure of QTBT, CUs, PUs, and TUs have the same block size. In JEM, a CU often consists of coding blocks (CBs) of different color components, e.g., for P and B slices in 4:2:0 chroma format, one CU contains one luma CB and two chroma CBs, and a CU often consists of CBs of a single component, e.g., for I slices, one CU contains only one luma CB or only two chroma CBs.

ＱＴＢＴ分割スキームに対して以下のパラメータを規定する。
－ＣＴＵのサイズ：１つの４分木のルートノードのサイズ、ＨＥＶＣと同じ概念
－ＭｉｎＱＴＳｉｚｅ：最小許容の４分木の葉ノードサイズ
－ＭａｘＢＴＳｉｚｅ：最大許容の２分木ルートノードサイズ
－ＭａｘＢＴＤｅｐｔｈ：最大許容の２分木の深さ
－ＭｉｎＢＴＳｉｚｅ：最小許容の２分木の葉ノードのサイズ The following parameters are specified for the QTBT partitioning scheme:
-CTU size: size of root node of one quadtree, same concept as HEVC -MinQTSize: minimum allowable quadtree leaf node size -MaxBTSize: maximum allowable binary tree root node size -MaxBTDepth: maximum allowable binary tree depth -MinBTSize: minimum allowable binary tree leaf node size

ＱＴＢＴの分割構造の一例において、ＣＴＵのサイズを、２つの対応する６４×６４ブロックの彩度サンプルを有する１２８×１２８の輝度サンプルとして設定し、ＭｉｎＱＴＳｉｚｅを１６×１６として設定し、ＭａｘＢＴＳｉｚｅを６４×６４として設定し、ＭｉｎＢＴＳｉｚｅ（幅および高さの両方について）を４×４として設定し、ＭａｘＢＴＤｅｐｔｈを４として設定する。４分木の分割は、まずＣＴＵに適用され、４分木の葉ノードを生成する。４分木の葉ノードは、１６×１６（即ち、ＭｉｎＱＴＳｉｚｅ）から１２８×１２８（即ち、ＣＴＵサイズ）までのサイズを有することが可能である。葉４分木のノードが１２８×１２８である場合、サイズがＭａｘＢＴＳｉｚｅ（すなわち、６４×６４）を超えるため、２分木によってさらに分割されない。そうでない場合、葉４分木のノードは、２分木によってさらに分割されてもよい。従って、この４分木の葉ノードは、２分木のルートノードでもあり、その２分木の深さは０である。２分木の深さがＭａｘＢＴＤｅｐｔｈ（すなわち、４）に達した場合、それ以上の分割は考慮されない。２分木のノードの幅がＭｉｎＢＴＳｉｚｅ（すなわち、４）に等しい場合、それ以上の水平分割は考慮されない。同様に、２分木のノードの高さがＭｉｎＢＴＳｉｚｅに等しい場合、それ以上の垂直分割は考慮されない。２分木の葉ノードは、さらに分割することなく、予測および変換処理によってさらに処理される。ＪＥＭにおいて、最大ＣＴＵサイズは、２５６×２５６個の輝度サンプルである。 In one example of a QTBT partitioning structure, the size of the CTU is set as 128x128 luma samples with two corresponding 64x64 blocks of chroma samples, MinQTSize is set as 16x16, MaxBTSize is set as 64x64, MinBTSize (for both width and height) is set as 4x4, and MaxBTDepth is set as 4. A quadtree partition is first applied to the CTU to generate quadtree leaf nodes. The quadtree leaf nodes can have sizes from 16x16 (i.e., MinQTSize) to 128x128 (i.e., CTU size). If a leaf quadtree node is 128x128, it will not be further partitioned by the bipartite tree since its size exceeds MaxBTSize (i.e., 64x64). Otherwise, the leaf quadtree node may be further split by the binary tree. Thus, the leaf node of this quadtree is also the root node of the binary tree, whose depth is 0. If the depth of the binary tree reaches MaxBTDepth (i.e., 4), no further splits are considered. If the width of a binary tree node is equal to MinBTSize (i.e., 4), no further horizontal splits are considered. Similarly, if the height of a binary tree node is equal to MinBTSize, no further vertical splits are considered. The leaf node of the binary tree is further processed by the prediction and transform process without further splits. In JEM, the maximum CTU size is 256x256 luma samples.

図５（左）はＱＴＢＴを用いたブロックの分割の例を示し、図５（右）は対応するツリー表現を示す。実線は４分木の分割を表し、点線は２分木の分割を表す。２分木の各分割（即ち、非葉）ノードにおいて、１つのフラグが、どの分割タイプ（即ち、水平または垂直）が使用されるかを示すために信号通知される。ここで、０は、水平分割を表し、１は、垂直分割を表す。４分木の分割の場合、４分木の分割は常にブロックを水平および垂直に分割し、等分したサイズの４つのサブブロックを生成するため、分割タイプを示す必要がない。 Figure 5 (left) shows an example of block partitioning using QTBT, and Figure 5 (right) shows the corresponding tree representation. Solid lines represent quadtree partitioning, and dotted lines represent binary tree partitioning. At each partition (i.e., non-leaf) node of the binary tree, a flag is signaled to indicate which partition type (i.e., horizontal or vertical) is used, where 0 represents a horizontal partition and 1 represents a vertical partition. In the case of quadtree partitioning, there is no need to indicate the partition type, since a quadtree partition always partitions a block horizontally and vertically, generating four sub-blocks of equal size.

さらに、ＱＴＢＴ方式は、輝度および彩度が別個のＱＴＢＴ構造を有する能力をサポートする。現在、ＰおよびＢスライスの場合、１つのＣＴＵにおける輝度および彩度ＣＴＢは、同じＱＴＢＴ構造を共有する。しかしながら、Ｉスライスの場合、輝度ＣＴＢはＱＴＢＴ構造によってＣＵに分割され、彩度ＣＴＢは別のＱＴＢＴ構造によって彩度ＣＵに分割される。これは、１つのＩスライスにおける１つのＣＵが１つの輝度成分の１つの符号化ブロックまたは２つの彩度成分の１つの符号化ブロックからなり、１つのＰまたはＢスライスにおける１つのＣＵが３つの色成分すべての符号化ブロックからなることを意味する。 Furthermore, the QTBT scheme supports the ability for luma and chroma to have separate QTBT structures. Currently, for P and B slices, the luma and chroma CTBs in one CTU share the same QTBT structure. However, for I slices, the luma CTBs are split into CUs by a QTBT structure, and the chroma CTBs are split into chroma CUs by another QTBT structure. This means that one CU in one I slice consists of one coded block of one luma component or one coded block of two chroma components, and one CU in one P or B slice consists of coded blocks of all three color components.

ＨＥＶＣにおいて、小さなブロックのためのインター予測は、動き補償のメモリアクセスを低減するために制限され、その結果、４×８および８×４ブロックのために双予測はサポートされず、４×４ブロックのためにインター予測はサポートされない。ＪＥＭのＱＴＢＴにおいて、これらの制限は取り除かれる。 In HEVC, inter prediction for small blocks is restricted to reduce memory accesses for motion compensation, resulting in no bi-prediction being supported for 4x8 and 8x4 blocks, and no inter prediction being supported for 4x4 blocks. In JEM's QTBT, these restrictions are removed.

２．１．４ＶＶＣの３分木 2.1.4 VVC ternary tree

いくつかの実施形態において、４分木および２分木以外のツリータイプがサポートされる。本実装形態において、図６（ｄ）、（ｅ）に示すように、３分木（ＴＴ）パーティションを２つ以上、すなわち、水平および垂直中心側の３分木を導入する。 In some embodiments, tree types other than quadtrees and binary trees are supported. In this implementation, we introduce two or more ternary tree (TT) partitions, i.e., horizontal and vertical center-side ternary trees, as shown in Figures 6(d) and (e).

図６は、（ａ）４分木分割、（ｂ）垂直２分木分割、（ｃ）水平２分木分割、（ｄ）垂直中心側３分木分割、（ｅ）水平中心側３分木分割を示す。 Figure 6 shows (a) quadtree division, (b) vertical binary tree division, (c) horizontal binary tree division, (d) vertical center side ternary tree division, and (e) horizontal center side ternary tree division.

いくつかの実装形態において、２つのレベルのツリー、すなわち、領域ツリー（４分木）および予測ツリー（２分木または３分木）がある。ＣＴＵは、まず、領域ツリー（ＲＴ）によって分割される。ＲＴ葉は、予測ツリー（ＰＴ）によってさらに分割されてもよい。ＰＴ葉はまた、最大ＰＴ深さに達するまで、ＰＴでさらに分割されてもよい。ＰＴ葉が基本符号化ユニットである。便宜上、ここでもＣＵと呼ぶ。１つのＣＵをさらに分割することはできない。予測および変換は両方ともＪＥＭと同様にＣＵに適用される。パーティション構造全体を「マルチタイプツリー」と呼ぶ。 In some implementations, there are two levels of trees: region tree (quadtree) and prediction tree (binary or ternary). CTUs are first split by region tree (RT). RT leaves may be further split by prediction tree (PT). PT leaves may also be further split by PT until a maximum PT depth is reached. PT leaves are the basic coding units. For convenience, we still refer to them as CUs. A CU cannot be further split. Both prediction and transformation are applied to CUs, similar to JEM. We refer to the whole partition structure as a "multi-type tree".

２．１．５分割構造 2.1.5 Split structure

この応答で使用されるツリー構造は、マルチツリータイプ（Ｍｕｌｔｉ－ＴｒｅｅＴｙｐｅ：ＭＴＴ）と呼ばれ、ＱＴＢＴを一般化したものである。ＱＴＢＴにおいて、図５に示すように、まず、符号化ツリーユニット（ＣＴＵ）を４分木構造で分割する。４分木の葉ノードは、２分木構造によってさらに分割される。 The tree structure used in this response is called the Multi-Tree Type (MTT), which is a generalization of QTBT. In QTBT, as shown in Figure 5, the coding tree unit (CTU) is first divided into a quadtree structure. The leaf nodes of the quadtree are further divided into binary trees.

ＭＴＴの基本構造は、２つのタイプのツリーノードを構成する。図７に示すように、領域ツリー（ＲＴ）および予測ツリー（ＰＴ）は、９つのタイプのパーティションをサポートする。 The basic structure of MTT consists of two types of tree nodes: region tree (RT) and prediction tree (PT), which support nine types of partitions, as shown in Figure 7.

図７は、（ａ）４分木分割、（ｂ）垂直２分木分割、（ｃ）水平２分木分割、（ｄ）垂直３分木分割、（ｅ）水平３分木分割、（ｆ）水平上方非対称２分木分割、（ｇ）水平下方非対称２分木分割、（ｈ）垂直左非対称２分木分割、（ｉ）垂直右非対称２分木分割を示す。 Figure 7 shows (a) quadtree partitioning, (b) vertical binary tree partitioning, (c) horizontal binary tree partitioning, (d) vertical ternary tree partitioning, (e) horizontal ternary tree partitioning, (f) horizontal upper asymmetric binary tree partitioning, (g) horizontal lower asymmetric binary tree partitioning, (h) vertical left asymmetric binary tree partitioning, and (i) vertical right asymmetric binary tree partitioning.

１つの領域ツリーは、１つのＣＴＵを４×４サイズの領域ツリーの葉ノードになるように正方形のブロックに再帰的に分割することができる。領域ツリーにおける各ノードにおいて、予測ツリーは、２分木（ＢＴ）、３分木（ＴＴ）、および非対称２分木（ＡＢＴ）の３つのツリータイプのうちの１つから形成されることができる。ＰＴ分割において、予測ツリーの枝に４分木のパーティションを有することは禁止される。ＪＥＭにおけるように、輝度ツリーおよび彩度ツリーは、Ｉ個のスライスに分けられる。ＲＴおよびＰＴの信号通知方法を図８に示す。 A region tree can recursively split a CTU into square blocks to become leaf nodes of the region tree of size 4x4. At each node in the region tree, a prediction tree can be formed from one of three tree types: binary tree (BT), ternary tree (TT), and asymmetric binary tree (ABT). In PT partitioning, it is forbidden to have quad tree partitions in the branches of the prediction tree. As in JEM, the luma and chroma trees are divided into I slices. The signaling method of RT and PT is shown in Figure 8.

２．２ＨＥＶＣ／Ｈ．２６５におけるインター予測 2.2 Inter prediction in HEVC/H.265

各インター予測されたＰＵは、１つまたは２つの参照ピクチャリストのための動きパラメータを有する。動きパラメータは、動きベクトルおよび参照ピクチャインデックスを含む。２つの参照ピクチャリストのうちの１つの参照ピクチャリストの使用は、ｉｎｔｅｒ＿ｐｒｅｄ＿ｉｄｃを使用して信号通知されてもよい。動きベクトルは、予測因子に関連する差分として明確に符号化されてもよく、このような符号化モードは、ＡＭＶＰモードと呼ばれる。 Each inter-predicted PU has motion parameters for one or two reference picture lists. The motion parameters include a motion vector and a reference picture index. The use of one of the two reference picture lists may be signaled using inter_pred_idc. The motion vector may be explicitly coded as a differential related to the predictor, and such a coding mode is called AMVP mode.

１つのＣＵがスキップモードにて符号化される場合、１つのＰＵがこのＣＵに関連付けられ、有意な残差係数がなく、符号化された動きベクトル差分も参照ピクチャインデックスもない。マージモードを指定し、これにより、現在のＰＵのための動きパラメータを、空間的および時間的候補を含む近傍のＰＵから取得する。マージモードは、スキップモードのためだけでなく、任意のインター予測されたＰＵに適用することができる。マージモードの代替としては、動きパラメータの明確な送信であり、各参照ピクチャリストおよび参照ピクチャリストの使用に対する参照ピクチャインデックスに対応する動きベクトルをＰＵごとに明確に信号通知することである。 When a CU is coded in skip mode, one PU is associated with this CU, there are no significant residual coefficients, no coded motion vector differentials, and no reference picture indexes. A merge mode is specified, whereby motion parameters for the current PU are obtained from neighboring PUs, including spatial and temporal candidates. The merge mode can be applied to any inter-predicted PU, not just for skip mode. An alternative to the merge mode is an explicit transmission of motion parameters, explicitly signaling per PU the motion vectors corresponding to each reference picture list and the reference picture index for the use of the reference picture list.

２つの参照ピクチャリストのうちの１つを使用することを信号通知が示す場合、サンプルのうちの１つのブロックからＰＵを生成する。これを「単一予測」と呼ぶ。ＰスライスおよびＢスライスの両方に対して単一予測が利用可能である。 If the signaling indicates to use one of two reference picture lists, generate the PU from one block of samples. This is called "uni-prediction". Uni-prediction is available for both P and B slices.

両方の参照ピクチャリストを使用することを信号通知が示す場合、サンプルのうちの２つのブロックからＰＵを生成する。これを「双方向予測」と呼ぶ。Ｂスライスのみに双方向予測が利用可能である。 If the signaling indicates to use both reference picture lists, generate a PU from two blocks of samples. This is called "bi-prediction". Bi-prediction is available for B slices only.

以下、ＨＥＶＣに規定されるインター予測モードについて詳細に説明する。まず、マージモードについて説明する。 The inter prediction modes defined in HEVC are explained in detail below. First, merge mode is explained.

２．２．１マージモード 2.2.1 Merge mode

２．２．１．１マージモードの候補の導出 2.2.1.1 Deriving merge mode candidates

マージモードを使用してＰＵを予測する場合、ビットストリームからマージ候補リストにおけるエントリを指すインデックスを構文解析し、これを使用して動き情報を検索する。このリストの構成は、ＨＥＶＣ規格で規定されており、以下のステップのシーケンスに基づいてまとめることができる。
・ステップ１：初期候補の導出
ｏステップ１．１：空間的候補の導出
ｏステップ１．２：空間的候補の冗長性チェック
ｏステップ１．３：時間的候補の導出
・ステップ２：追加候補の挿入
ｏステップ２．１：双方向予測候補の作成
ｏステップ２．２：動きゼロ候補の挿入 When predicting a PU using merge mode, we parse an index from the bitstream that points to an entry in the merge candidate list and use it to look up the motion information. The construction of this list is specified in the HEVC standard and can be summarized based on the following sequence of steps:
Step 1: Derive initial candidates o Step 1.1: Derive spatial candidates o Step 1.2: Redundancy check of spatial candidates o Step 1.3: Derive temporal candidates Step 2: Insert additional candidates o Step 2.1: Create bi-prediction candidates o Step 2.2: Insert zero motion candidates

これらのステップは図９にも概略的に示されている。空間的マージ候補導出のために、５つの異なる位置にある候補の中から最大４つのマージ候補を選択する。時間的マージ候補導出のために、２つの候補の中から最大１つのマージ候補を選択する。デコーダ側ではＰＵごとに一定数の候補を想定しているので、候補数がスライスヘッダで信号通知されるマージ候補の最大数（ＭａｘＮｕｍＭｅｒｇｅＣａｎｄ）に達しない場合、追加候補を生成する。候補の数は一定であるので、最良マージ候補のインデックスは、短縮された単項２値化（ＴＵ：ｔｒｕｎｃａｔｅｄｕｎａｒｙｂｉｎａｒｉｚａｔｉｏｎ）を使用して符号化される。ＣＵのサイズが８に等しい場合、現在のＣＵのすべてのＰＵは、２Ｎ×２Ｎ予測ユニットのマージ候補リストと同じ１つのマージ候補リストを共有する。 These steps are also shown diagrammatically in Fig. 9. For spatial merge candidate derivation, select up to four merge candidates among candidates at five different positions. For temporal merge candidate derivation, select up to one merge candidate among two candidates. Since the decoder side assumes a constant number of candidates per PU, generate additional candidates if the number of candidates does not reach the maximum number of merge candidates (MaxNumMergeCand) signaled in the slice header. Since the number of candidates is constant, the index of the best merge candidate is coded using truncated unary binarization (TU). If the size of the CU is equal to 8, all PUs of the current CU share one merge candidate list, which is the same as the merge candidate list of the 2Nx2N prediction unit.

以下、上述したステップに関連付けられた動作を詳しく説明する。 The operations associated with the steps above are explained in detail below.

２．２．１．２空間的候補の導出 2.2.1.2 Deriving spatial candidates

空間的マージ候補の導出において、図１０に示す位置にある候補の中から、最大４つのマージ候補を選択する。導出の順序はＡ_１、Ｂ_１、Ｂ_０、Ａ_０、Ｂ_２である。位置Ａ_１、Ｂ_１、Ｂ_０、Ａ_０のいずれかのＰＵが利用可能でない場合（例えば、別のスライスまたはタイルに属しているため）、またはイントラ符号化された場合にのみ、位置Ｂ_２が考慮される。位置Ａ_１の候補を加えた後、残りの候補を加えると、冗長性チェックを受け、それにより、同じ動き情報を有する候補を確実にリストから排除でき、符号化効率を向上させることができる。計算の複雑性を低減するために、前述の冗長性チェックにおいて、考えられる候補対のすべてを考慮することはしない。代わりに、図１１において矢印でリンクされた対のみを考慮し、冗長性チェックに使用される対応する候補が同じ動き情報を有していない場合にのみ、その候補をリストに加える。重複した動き情報の別のソースは、２Ｎ×２Ｎとは異なるパーティションに関連付けられた「第２のＰＵ」である。一例として、図１２は、それぞれＮ×２Ｎおよび２Ｎ×Ｎの場合の第２のＰＵを示す。現在のＰＵをＮ×２Ｎに分割する場合、リスト構築に位置Ａ_１の候補は考慮されない。実際、この候補を加えることにより、同じ動き情報を有する２つの予測ユニットが導かれることとなり、１つの符号化ユニットに１つのＰＵのみを有するためには冗長である。同様に、現在のＰＵを２Ｎ×Ｎに分割する場合、位置Ｂ_１は考慮されない。 In the derivation of spatial merging candidates, up to four merging candidates are selected from the candidates at the positions shown in FIG. 10. The order of derivation is _A1 , _B1 , _B0 , _A0 , _B2 . Position B2 is considered only if any PU at positions _A1 , _B1 , _B0 , _A0 is unavailable (e.g., because it belongs to another slice or tile) or is intra _- coded. After adding the candidate at position _A1 , adding the remaining candidates is subjected to a redundancy check, which can ensure that candidates with the same motion information are removed from the list and improve coding efficiency. To reduce computational complexity, the aforementioned redundancy check does not consider all possible candidate pairs. Instead, it considers only the pairs linked by arrows in FIG. 11 and adds a candidate to the list only if the corresponding candidate used for the redundancy check does not have the same motion information. Another source of duplicated motion information is a "second PU" associated with a partition different from 2N×2N. As an example, Fig. 12 shows the second PU for Nx2N and 2NxN, respectively. When splitting the current PU into Nx2N, the candidate at position _A1 is not considered for list construction. In fact, adding this candidate would lead to two prediction units with the same motion information, which is redundant for having only one PU in one coding unit. Similarly, when splitting the current PU into 2NxN, position _B1 is not considered.

２．２．１．３時間的候補の導出 2.2.1.3 Deriving temporal candidates

このステップにおいて、１つの候補のみがリストに追加される。具体的には、この時間的マージ候補の導出において、所与の参照ピクチャリストにおける現在のピクチャとの間に最小のＰＯＣ差を有するピクチャに属する同一位置ＰＵに基づいて、スケーリングされた動きベクトルを導出する。スライスヘッダにおいて、同一位置のＰＵ（ｃｏ－ｌｏｃａｔｅｄＰＵ）の導出に用いられる参照ピクチャリストが明確に信号通知される。図１３に点線で示すように、時間的マージ候補のスケーリングされた動きベクトルが得られる。これは、ＰＯＣ距離ｔｂおよびｔｄを利用して、同一位置のＰＵの動きベクトルからスケーリングしたものである。ｔｂは、現在のピクチャの参照ピクチャと現在のピクチャのＰＯＣ差として規定され、ｔｄは、同一位置のＰＵの参照ピクチャと同一位置のピクチャのＰＯＣ差として規定する。時間的マージ候補の参照ピクチャインデックスをゼロに等しく設定する。このスケーリング処理の実際的な実現については、ＨＥＶＣ仕様に記載されている。Ｂスライスの場合、２つの動きベクトル、即ち、１つは参照ピクチャリスト０のためのもの、もう１つは参照ピクチャリスト１のためのものを取得し、これらを組み合わせることによって、双方向予測マージ候補を形成する。時間的マージ候補のための動きベクトルのスケーリングの説明。 In this step, only one candidate is added to the list. Specifically, in the derivation of this temporal merge candidate, a scaled motion vector is derived based on the co-located PU belonging to the picture with the smallest POC difference with the current picture in a given reference picture list. In the slice header, the reference picture list used for the derivation of the co-located PU is explicitly signaled. As shown by the dotted line in Figure 13, the scaled motion vector of the temporal merge candidate is obtained, which is scaled from the motion vector of the co-located PU using the POC distances tb and td. tb is defined as the POC difference between the reference picture of the current picture and the current picture, and td is defined as the POC difference between the reference picture of the co-located PU and the co-located picture. The reference picture index of the temporal merge candidate is set equal to zero. The practical realization of this scaling process is described in the HEVC specification. For B slices, we take two motion vectors, one for reference picture list 0 and one for reference picture list 1, and combine them to form a bi-predictive merge candidate. Description of motion vector scaling for temporal merge candidates.

参照フレームに属する同一位置のＰＵ（Ｙ）において、図１４に示すように、候補Ｃ_０と候補Ｃ_１との間で時間的候補の位置を選択する。位置Ｃ_０のＰＵが利用可能でない場合、イントラ符号化されている場合、または現在のＣＴＵの外側にある場合、位置Ｃ_１が使用される。そうでない場合、位置Ｃ_０が時間的マージ候補の導出に使用される。 For the co-located PU(Y) belonging to the reference frame, select a location of the temporal candidate between candidates _C0 and _C1 as shown in Fig. 14. If the PU at location _C0 is not available, is intra-coded, or is outside the current CTU, location _C1 is used. Otherwise, location _C0 is used to derive the temporal merging candidate.

２．２．１．４追加候補の挿入 2.2.1.4 Inserting additional candidates

空間的－時間的マージ候補の他に、２つの追加のタイプのマージ候補、すなわち、結合双方向予測マージ候補およびゼロマージ候補がある。空間的－時間的マージ候補を利用して、結合双方向予測マージ候補を生成する。結合双方向予測マージ候補は、Ｂスライスのみに使用される。最初の候補の第１の参照ピクチャリスト動きパラメータと別の候補の第２の参照ピクチャリスト動きパラメータとを組み合わせることで、結合双方向予測候補を生成する。これら２つのタプルが異なる動きの仮説を提供する場合、これらのタプルは、新しい双方向予測候補を形成する。一例として、図１５は、オリジナルリスト（左側）における、ｍｖＬ０およびｒｅｆＩｄｘＬ０、またはｍｖＬ１およびｒｅｆＩｄｘＬ１を有する２つの候補を用いて、最終リスト（右側）に加えられる結合双方向予測マージ候補を生成する場合を示す。ここで規定される、これらの追加のマージ候補を生成するために考慮される組み合わせについては、様々な規則が存在する。 Besides spatial-temporal merge candidates, there are two additional types of merge candidates: joint bi-predictive merge candidates and zero merge candidates. The joint bi-predictive merge candidates are utilized to generate joint bi-predictive merge candidates. The joint bi-predictive merge candidates are used only for B slices. The joint bi-predictive candidate is generated by combining the first reference picture list motion parameters of the first candidate with the second reference picture list motion parameters of another candidate. If these two tuples provide different motion hypotheses, they form a new bi-predictive candidate. As an example, FIG. 15 shows the case where two candidates with mvL0 and refIdxL0 or mvL1 and refIdxL1 in the original list (left) are used to generate a joint bi-predictive merge candidate that is added to the final list (right). There are various rules for the combinations considered to generate these additional merge candidates, as specified here.

ゼロ動き候補を挿入し、マージ候補リストにおける残りのエントリを埋めることにより、ＭａｘＮｕｍＭｅｒｇｅＣａｎｄ容量にヒットする。これらの候補は、空間的変位がゼロであり、新しいゼロ動き候補をリストに加える度にゼロから始まり増加する参照ピクチャインデックスを有する。これらの候補が使用する参照フレームの数は、それぞれ、一方向予測の場合は１つ、双方向予測の場合は２つである。最終的には、これらの候補に対して冗長性チェックは行われない。 By inserting zero motion candidates and filling the remaining entries in the merge candidate list, the MaxNumMergeCand capacity is hit. These candidates have a spatial displacement of zero and a reference picture index that starts from zero and increases each time a new zero motion candidate is added to the list. The number of reference frames used by these candidates is one for unidirectional prediction and two for bidirectional prediction, respectively. Finally, no redundancy check is performed on these candidates.

２．２．１．５並列処理のための動き推定領域 2.2.1.5 Motion estimation regions for parallel processing

符号化処理を高速化するために、動き推定を並列に行うことができ、それによって、所与の領域内のすべての予測ユニットの動きベクトルを同時に導出する。１つの予測ユニットは、その関連する動き推定が完了するまで、隣接するＰＵから動きパラメータを導出することができないので、空間的近傍からのマージ候補の導出は、並列処理に干渉する可能性がある。符号化効率と処理待ち時間との間のトレードオフを緩和するために、ＨＥＶＣは、動き推定領域（ＭＥＲ：ＭｏｔｉｏｎＥｓｔｉｍａｔｉｏｎＲｅｇｉｏｎ）を規定し、そのサイズは、「ｌｏｇ２＿ｐａｒａｌｌｅｌ＿ｍｅｒｇｅ＿ｌｅｖｅｌ＿ｍｉｎｕｓ２」構文要素を使用してピクチャパラメータセットにおいて信号通知される。１つのＭＥＲを規定するとき、同じ領域にあるマージ候補は使用不可としてマークされ、それゆえにリスト構築においては考慮されない。
７．３．２．３ピクチャパラメータセットＲＢＳＰ構文
７．３．２．３．１一般ピクチャパラメータセットＲＢＳＰ構文 To speed up the encoding process, motion estimation can be done in parallel, thereby deriving motion vectors for all prediction units in a given region simultaneously. Since one prediction unit cannot derive motion parameters from neighboring PUs until its associated motion estimation is completed, the derivation of merge candidates from spatial neighborhoods can interfere with parallel processing. To mitigate the tradeoff between coding efficiency and processing latency, HEVC specifies a Motion Estimation Region (MER), whose size is signaled in the picture parameter set using the "log2_parallel_merge_level_minus2" syntax element. When specifying one MER, merge candidates in the same region are marked as unavailable and therefore not considered in the list construction.
7.3.2.3 Picture parameter set RBSP syntax 7.3.2.3.1 General picture parameter set RBSP syntax

ｌｏｇ２＿ｐａｒａｌｌｅｌ＿ｍｅｒｇｅ＿ｌｅｖｅｌ＿ｍｉｎｕｓ２＋２は、８．５．３．２．２．２節で指定されたマージモードの輝度動きベクトルの導出処理と、８．５．３．２．３節で指定された空間的マージ候補の導出処理で使用される変数Ｌｏｇ２ＰａｒＭｒｇＬｅｖｅｌの値を指定する。ｌｏｇ２＿ｐａｒａｌｌｅｌ＿ｍｅｒｇｅ＿ｌｅｖｅｌ＿ｍｉｎｕｓ２の値は、０～ＣｔｂＬｏｇ２ＳｉｚｅＹ－２を含む範囲内とする。
変数Ｌｏｇ２ＰａｒＭｒｇＬｅｖｅｌは、以下のように導出される。
Ｌｏｇ２ＰａｒＭｒｇＬｅｖｅｌ＝ｌｏｇ２＿ｐａｒａｌｌｅｌ＿ｍｅｒｇｅ＿ｌｅｖｅｌ＿ｍｉｎｕｓ２＋２（７－３７）
注３：Ｌｏｇ２ＰａｒＭｒｇＬｅｖｅｌの値は、マージ候補リストを並列に導出する組み込み能力を示す。例えば、Ｌｏｇ２ＰａｒＭｒｇＬｅｖｅｌが６に等しい場合、６４×６４ブロックに含まれたすべての予測ユニット（ＰＵ）および符号化ユニット（ＣＵ）のためのマージ候補リストを並列に導出することができる。 log2_parallel_merge_level_minus2+2 specifies the value of the variable Log2ParMrgLevel used in the derivation process of luminance motion vectors in the merge mode specified in Section 8.5.3.2.2.2 and in the derivation process of spatial merge candidates specified in Section 8.5.3.2.3. The value of log2_parallel_merge_level_minus2 is in the range of 0 to CtbLog2SizeY-2 inclusive.
The variable Log2ParMrgLevel is derived as follows:
Log2ParMrgLevel=log2_parallel_merge_level_minus2+2 (7-37)
NOTE 3: The value of Log2ParMrgLevel indicates the built-in capability of deriving merge candidate lists in parallel. For example, if Log2ParMrgLevel is equal to 6, then merge candidate lists for all prediction units (PUs) and coding units (CUs) contained in a 64x64 block can be derived in parallel.

２．２．２ＡＭＶＰモードにおける動きベクトル予測 2.2.2 Motion vector prediction in AMVP mode

動きベクトル予測は、動きベクトルと近傍のＰＵとの間の空間的－時間的相関を利用し、これを動きパラメータの明確な伝送に用いる。まず、左側、上側の時間的に近傍のＰＵ位置の可用性をチェックし、冗長な候補を取り除き、ゼロベクトルを加えることで、候補リストの長さを一定にすることで、動きベクトル候補リストを構築する。次いで、エンコーダは、候補リストから最良の予測因子を選択し、選択された候補を示す対応するインデックスを送信することができる。マージインデックスの信号通知と同様に、最良の動きベクトル候補のインデックスは、短縮された単項を使用して符号化される。この場合の符号化対象の最大値は２である（例えば、図２～図８）。以下の章では、動きベクトル予測候補の導出処理の詳細を説明する。 Motion vector prediction exploits the spatial-temporal correlation between motion vectors and neighboring PUs, which is used for explicit transmission of motion parameters. First, a motion vector candidate list is constructed by checking the availability of the left, upper temporally neighboring PU positions, removing redundant candidates, and adding zero vectors to keep the length of the candidate list constant. Then, the encoder can select the best predictor from the candidate list and transmit the corresponding index indicating the selected candidate. Similar to the merge index signaling, the index of the best motion vector candidate is coded using a shortened unary term. The maximum value to be coded in this case is 2 (e.g., Figures 2 to 8). The following sections will explain the details of the motion vector prediction candidate derivation process.

２．２．２．１動きベクトル予測候補の導出 2.2.2.1 Deriving motion vector prediction candidates

図１６に、動きベクトル予測候補の導出処理をまとめる。 Figure 16 summarizes the process of deriving motion vector prediction candidates.

動きベクトル予測において、空間的動きベクトル候補と時間的動きベクトル候補という２つのタイプの動きベクトル候補が考慮される。空間的動きベクトル候補を導出するために、図１１に示したように、５つの異なる位置にある各ＰＵの動きベクトルに基づいて、最終的には２つの動きベクトル候補を導出する。 In motion vector prediction, two types of motion vector candidates are considered: spatial motion vector candidates and temporal motion vector candidates. To derive spatial motion vector candidates, two motion vector candidates are ultimately derived based on the motion vectors of each PU at five different positions, as shown in FIG. 11.

時間的動きベクトル候補の導出のために、２つの異なる同一位置の配置に基づいて導出された２つの候補から１つの動きベクトル候補を選択する。空間的－時間的候補の最初のリストを作成した後、リストにおける重複した動きベクトル候補を除去する。可能性のある候補の数が２よりも多い場合、関連づけられた参照ピクチャリストにおける参照ピクチャインデックスが１よりも大きい動きベクトル候補をリストから削除する。空間的－時間的動きベクトル候補の数が２未満である場合は、追加のゼロ動きベクトル候補をリストに加える。 For derivation of a temporal motion vector candidate, select one motion vector candidate from two candidates derived based on two different co-location arrangements. After creating an initial list of spatial-temporal candidates, remove duplicate motion vector candidates in the list. If the number of possible candidates is greater than two, remove from the list the motion vector candidates whose reference picture index in the associated reference picture list is greater than 1. If the number of spatial-temporal motion vector candidates is less than two, add an additional zero motion vector candidate to the list.

２．２．２．２空間的動きベクトル候補 2.2.2.2 Spatial motion vector candidates

空間的動きベクトル候補の導出において、図１１に示したような位置にあるＰＵから導出された５つの可能性のある候補のうち、動きマージと同じ位置にあるものを最大２つの候補を考慮する。現在のＰＵの左側のための導出の順序は、Ａ_０、Ａ_１、スケーリングされたＡ_０、スケーリングされたＡ_１として規定される。現在のＰＵの上側のための導出の順序は、Ｂ_０、Ｂ_１、Ｂ_２、スケーリングされたＢ_０、スケーリングされたＢ_１、スケーリングされたＢ_２として規定される。そのため、辺ごとに、動きベクトル候補として使用できる場合が４つ、すなわち空間的スケーリングを使用する必要がない２つの場合と、空間的スケーリングを使用する２つの場合とがある。４つの異なる場合をまとめると、以下のようになる。
・空間的スケーリングなし
－（１）同じ参照ピクチャリスト、かつ、同じ参照ピクチャインデックス（同じＰＯＣ）
－（２）異なる参照ピクチャリスト、かつ、同じ参照ピクチャ（同じＰＯＣ）
・空間的スケーリング
－（３）同じ参照ピクチャリスト、かつ、異なる参照ピクチャ（異なるＰＯＣ）
－（４）異なる参照ピクチャリスト、かつ、異なる参照ピクチャ（異なるＰＯＣ） In deriving spatial motion vector candidates, we consider up to two candidates out of five possible candidates derived from PUs located as shown in Fig. 11 that are in the same position as the motion merge. The order of derivation for the left side of the current PU is defined as _A0 , _A1 , scaled _A0 , scaled _A1 . The order of derivation for the top side of the current PU is defined as _B0 , _B1 , _B2 , scaled _B0 , scaled _B1 , scaled _B2 . So, for each side, there are four possible cases for motion vector candidates: two cases that do not require spatial scaling and two cases that use spatial scaling. The four different cases can be summarized as follows:
No spatial scaling: (1) Same reference picture list and same reference picture index (same POC)
- (2) Different reference picture lists and the same reference picture (same POC)
Spatial Scaling - (3) Same reference picture list but different reference pictures (different POC)
(4) Different reference picture lists and different reference pictures (different POCs)

最初に非空間的スケーリングの場合をチェックし、次に空間的スケーリングを行う。参照ピクチャリストにかかわらず、ＰＯＣが近傍のＰＵの参照ピクチャと現在のＰＵの参照ピクチャとで異なる場合、空間的スケーリングを考慮する。左側候補のすべてのＰＵが利用可能でないか、またはイントラ符号化されている場合、上側の動きベクトルのスケーリングは、左側および上側ＭＶ候補の並列導出に役立つ。そうでない場合、上側の動きベクトルに対して空間的スケーリングは許可されない。 Check the non-spatial scaling case first, then do spatial scaling. Consider spatial scaling if POC is different between neighboring PU's reference picture and current PU's reference picture, regardless of reference picture list. Scaling top motion vector helps parallel derivation of left and top MV candidates if all PUs of left candidate are not available or are intra-coded. Otherwise, no spatial scaling is allowed for top motion vector.

空間的スケーリング処理において、図１７に示すように、時間的スケーリングと同様にして、近傍のＰＵの動きベクトルをスケーリングする。主な違いは、現在のＰＵの参照ピクチャリストおよびインデックスを入力として与え、実際のスケーリング処理は時間的スケーリングと同じであることである。 In the spatial scaling process, we scale the motion vectors of neighboring PUs in a similar manner to temporal scaling, as shown in Figure 17. The main difference is that the reference picture list and index of the current PU are given as input, and the actual scaling process is the same as temporal scaling.

２．２．２．３時間的動きベクトル候補 2.2.2.3 Temporal motion vector candidates

参照ピクチャインデックスを導出する以外は、時間的マージ候補を導出するための処理は、すべて、空間的動きベクトル候補を導出するための処理と同じである（図６参照）。参照ピクチャインデックスはデコーダに信号通知される。 Other than deriving the reference picture index, all the processes for deriving temporal merge candidates are the same as the processes for deriving spatial motion vector candidates (see Figure 6). The reference picture index is signaled to the decoder.

２．２．２．４ＡＭＶＰ情報の信号通知 2.2.2.4 Signaling of AMVP information

ＡＭＶＰモードの場合、ビットストリームにおいて、４つの部分、すなわち、予測方向、参照インデックス、ＭＶＤ、およびｍｖ予測因子候補インデックスを信号通知することができる。
構文テーブル： For AMVP mode, four parts can be signaled in the bitstream: prediction direction, reference index, MVD, and mv predictor candidate index.
Syntax table:

７．３．８．９動きベクトル差構文 7.3.8.9 Motion Vector Difference Syntax

２．３ＪＥＭ（ＪｏｉｎｔＥｘｐｌｏｒａｔｉｏｎＭｏｄｅｌ）における新しいインター予測方法 2.3 New inter-prediction method in JEM (Joint Exploration Model)

２．３．１サブＣＵに基づく動きベクトル予測 2.3.1 Sub-CU based motion vector prediction

ＱＴＢＴを有するＪＥＭにおいて、各ＣＵは、各予測方向に対して最大１つの動きパラメータのセットを有することができる。エンコーダにおいて、大きなＣＵをサブＣＵに分割し、大きなＣＵのすべてのサブＣＵの動き情報を導出することにより、２つのサブＣＵレベルの動きベクトル予測方法を考慮する。ＡＴＭＶＰ（ＡｌｔｅｒｎａｔｉｖｅＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）方法により、各ＣＵが、配列された参照ピクチャにおける現在のＣＵよりも小さい複数のブロックから複数の動き情報のセットをフェッチすることが可能となる。ＳＴＭＶＰ（Ｓｐａｔｉａｌ－ＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）法において、時間的動きベクトル予測因子および空間的近傍動きベクトルを使用して、サブＣＵの動きベクトルを再帰的に導出する。 In JEM with QTBT, each CU can have at most one set of motion parameters for each prediction direction. In the encoder, we consider two sub-CU level motion vector prediction methods by splitting a large CU into sub-CUs and deriving motion information for all sub-CUs of the large CU. The Alternative Temporal Motion Vector Prediction (ATMVP) method allows each CU to fetch multiple sets of motion information from multiple blocks smaller than the current CU in the aligned reference picture. In the Spatial-Temporal Motion Vector Prediction (STMVP) method, we use the temporal motion vector predictor and spatial neighborhood motion vectors to recursively derive the motion vectors of the sub-CUs.

サブＣＵ動き予測のためにより正確な動きフィールドを維持するために、参照フレームの動き圧縮は現在無効にされている。 To maintain a more accurate motion field for sub-CU motion estimation, reference frame motion compression is currently disabled.

２．３．１．１代替の時間的動きベクトル予測 2.3.1.1 Alternative temporal motion vector prediction

ＡＴＭＶＰ（ＡｌｔｅｒｎａｔｉｖｅＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）において、ＴＭＶＰ（ＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）法は、現在のＣＵより小さいブロックから複数セットの動き情報（動きベクトルおよび参照インデックスを含む）をフェッチすることで修正される。図１８に示すように、サブＣＵは、正方形のＮ×Ｎブロックである（デフォルトでは、Ｎは４に設定される）。 In Alternative Temporal Motion Vector Prediction (ATMVP), the Temporal Motion Vector Prediction (TMVP) method is modified by fetching multiple sets of motion information (including motion vectors and reference indexes) from blocks smaller than the current CU. As shown in Figure 18, a sub-CU is a square NxN block (by default, N is set to 4).

ＡＴＭＶＰは、ＣＵ内のサブＣＵの動きベクトルを２つのステップで予測する。第１のステップは、参照ピクチャにおける対応するブロックを、いわゆる時間的ベクトルで特定することである。この参照ピクチャを動きソースピクチャと呼ぶ。第２のステップは、図１８に示すように、現在のＣＵをサブＣＵに分割し、各サブＣＵに対応するブロックから各サブＣＵの動きベクトルならびに参照インデックスを取得する。 ATMVP predicts motion vectors of sub-CUs in a CU in two steps. The first step is to identify the corresponding block in a reference picture with a so-called temporal vector. This reference picture is called the motion source picture. The second step is to split the current CU into sub-CUs and obtain the motion vector and reference index of each sub-CU from the block corresponding to each sub-CU, as shown in Figure 18.

第１のステップにおいて、現在のＣＵの空間的に近傍のブロックの動き情報によって、参照ピクチャおよび対応するブロックを決定する。近傍のブロックの繰り返し走査処理を回避するために、現在のＣＵのマージ候補リストにおける最初のマージ候補を用いる。最初の利用可能な動きベクトルおよびその関連する参照インデックスを、時間的ベクトルおよび動きソースピクチャのインデックスに設定する。このように、ＡＴＭＶＰでは、ＴＭＶＰに比べて、対応するブロックをより正確に特定することができ、対応するブロック（配列されたブロックと呼ばれることがある）は、常に現在のＣＵに対して右下または中心位置にある。１つの例において、最初のマージ候補が左側の近傍のブロック（即ち、図１９のＡ_１）からのものである場合、関連するＭＶおよび参照ピクチャを利用して、ソースブロックおよびソースピクチャを特定する。 In the first step, the reference picture and corresponding block are determined by the motion information of the spatially neighboring blocks of the current CU. To avoid the repeated scanning process of the neighboring blocks, the first merging candidate in the merging candidate list of the current CU is used. The first available motion vector and its associated reference index are set to the temporal vector and the index of the motion source picture. In this way, the corresponding block can be identified more accurately in ATMVP compared with TMVP, and the corresponding block (sometimes called the aligned block) is always in the lower right or center position with respect to the current CU. In one example, if the first merging candidate is from the left neighboring block (i.e., A ₁ in FIG. 19 ), the associated MV and reference picture are utilized to identify the source block and source picture.

図１９は、ソースブロックおよびソースピクチャの特定の例を示す。 Figure 19 shows specific examples of source blocks and source pictures.

第２のステップにおいて、現在のＣＵの座標に時間ベクトルを加えることで、動きソースピクチャにおける時間的ベクトルによって、サブＣＵの対応するブロックを特定する。サブＣＵごとに、その対応するブロックの動き情報（中心サンプルを覆う最小の動きグリッド）を使用して、サブＣＵの動き情報を導出する。対応するＮ×Ｎブロックの動き情報を特定した後、ＨＥＶＣのＴＭＶＰと同様に、現在のサブＣＵの動きベクトルおよび参照インデックスに変換され、動きスケーリングや他の手順が適用される。例えば、デコーダは、低遅延条件（すなわち、現在のピクチャのすべての参照ピクチャのＰＯＣが現在のピクチャのＰＯＣよりも小さい）が満たされているかどうかをチェックし、場合によっては、動きベクトルＭＶｘ（参照ピクチャリストＸに対応する動きベクトル）を使用して、各サブＣＵの動きベクトルＭＶｙ（Ｘが０または１に等しく、Ｙが１－Ｘに等しい）を予測する。 In the second step, the corresponding block of the sub-CU is identified by its temporal vector in the motion source picture by adding the temporal vector to the coordinates of the current CU. For each sub-CU, the motion information of its corresponding block (the smallest motion grid covering the center sample) is used to derive the motion information of the sub-CU. After identifying the motion information of the corresponding N×N block, it is converted into the motion vector and reference index of the current sub-CU, and motion scaling and other procedures are applied, similar to TMVP in HEVC. For example, the decoder checks whether the low-latency condition (i.e., the POC of all reference pictures of the current picture is smaller than the POC of the current picture) is met, and possibly predicts the motion vector MVy (X equals 0 or 1, and Y equals 1-X) of each sub-CU using the motion vector MVx (the motion vector corresponding to the reference picture list X).

２．３．１．２空間的－時間的動きベクトル予測 2.3.1.2 Spatial-temporal motion vector prediction

この方法において、サブＣＵの動きベクトルは、ラスタスキャンの順に沿って再帰的に導出される。図２０にこの概念を示す。４つの４×４サブＣＵであるＡ、Ｂ、Ｃ、およびＤを含む８×８ＣＵを考える。現在のフレームの近傍の４×４ブロックには、ａ、ｂ、ｃ、ｄというラベルが付けられている。 In this method, motion vectors for sub-CUs are derived recursively along the raster scan order. Figure 20 illustrates this concept. Consider an 8x8 CU that contains four 4x4 sub-CUs, A, B, C, and D. The neighboring 4x4 blocks in the current frame are labeled a, b, c, and d.

サブＣＵのＡの動きの導出は、その２つの空間的近傍を特定することで始まる。第１の近傍は、サブＣＵのＡの上のＮ×Ｎブロックである（ブロックｃ）。このブロックｃが利用可能でないか、またはイントラ符号化されている場合、サブＣＵＡより上の他のＮ×Ｎ個のブロックをチェックする（ブロックｃから始まり、左から右へ）。第２の近傍は、サブＣＵのＡの左側のブロックである（ブロックｂ）。ブロックｂが利用可能でないか、またはイントラ符号化されている場合、サブＣＵＡの左側の他のブロックをチェックする（ブロックｂから始まり、上から下へ）。各リストの近傍のブロックから得られた動き情報を、所与のリストの第１の参照フレームにスケーリングする。次に、ＨＥＶＣに規定されているＴＭＶＰ（ＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｏｒ）導出と同様の手順に従って、サブブロックＡのＴＭＶＰを導出する。位置Ｄにおける配列されたブロックの動き情報をフェッチし、それに応じてスケーリングする。最後に、動き情報を検索し、スケーリングした後、参照リストごとにすべての利用可能な動きベクトル（３まで）を別々に平均する。この平均化された動きベクトルを現在のサブＣＵの動きベクトルとする。 The motion derivation of sub-CU A starts by identifying its two spatial neighbors. The first neighbor is the N×N block above sub-CU A (block c). If this block c is not available or is intra-coded, check the other N×N blocks above sub-CU A (starting from block c, going from left to right). The second neighbor is the block to the left of sub-CU A (block b). If block b is not available or is intra-coded, check the other blocks to the left of sub-CU A (starting from block b, going from top to bottom). The motion information obtained from the neighboring blocks in each list is scaled to the first reference frame of a given list. Then, derive the TMVP for sub-block A following a procedure similar to the TMVP derivation specified in HEVC. Fetch the motion information of the aligned blocks at position D and scale accordingly. Finally, after retrieving and scaling the motion information, we average all available motion vectors (up to 3) separately for each reference list. Let this averaged motion vector be the motion vector of the current sub-CU.

図２０は、４つのサブブロック（Ａ－Ｄ）およびその近傍のブロックを有する１つのＣＵの例を示す。 Figure 20 shows an example of one CU with four subblocks (A-D) and their neighboring blocks.

２．３．１．３サブＣＵ動き予測モード信号通知 2.3.1.3 Sub-CU motion prediction mode signal notification

サブＣＵモードは追加のマージ候補として有効とされ、モードを信号通知するために追加の構文要素は必要とされない。ＡＴＭＶＰモードおよびＳＴＭＶＰモードを表すように、各ＣＵのマージ候補リストに２つの追加のマージ候補を加える。シーケンスパラメータセットがＡＴＭＶＰおよびＳＴＭＶＰが有効であることを示す場合、７個までのマージ候補を使用する。追加のマージ候補の符号化ロジックは、ＨＭにおけるマージ候補の場合と同じであり、つまり、ＰまたはＢスライスにおける各ＣＵについて、２つの追加のマージ候補に対して２回以上のＲＤチェックが必要となる。 Sub-CU mode is enabled as an additional merge candidate, and no additional syntax elements are required to signal the mode. Add two additional merge candidates to the merge candidate list of each CU to represent ATMVP and STMVP modes. Use up to seven merge candidates if the sequence parameter set indicates that ATMVP and STMVP are enabled. The encoding logic of the additional merge candidates is the same as that of the merge candidates in HM, i.e., for each CU in a P or B slice, two or more RD checks are required for the two additional merge candidates.

ＪＥＭにおいて、マージインデックスのすべてのビンは、ＣＡＢＡＣによって符号化されたコンテキストである。一方、ＨＥＶＣにおいては、最初のビンのみが符号化されたコンテキストであり、残りのビンはバイパス符号化されたコンテキストである。 In JEM, all bins of the merge index are CABAC coded contexts, whereas in HEVC, only the first bin is a coded context and the remaining bins are bypass coded contexts.

２．３．２適応型動きベクトル差解像度 2.3.2 Adaptive motion vector difference resolution

ＨＥＶＣにおいて、ｕｓｅ＿ｉｎｔｅｇｅｒ＿ｍｖ＿ｆｌａｇがスライスヘッダにおいて０であるとき、１／４輝度サンプルの単位で動きベクトル差分（ＭＶＤ：ＭｏｔｉｏｎＶｅｃｔｏｒＤｉｆｆｅｒｅｎｃｅ）（動きベクトルとＰＵの予測動きベクトルとの差）が信号通知される。ＪＥＭにおいて、ＬＡＭＶＲ（ＬｏｃａｌｌｙＡｄａｐｔｉｖｅＭｏｔｉｏｎＶｅｃｔｏｒＲｅｓｏｌｕｔｉｏｎ）が導入される。ＪＥＭにおいて、ＭＶＤは、１／４輝度サンプル、整数輝度サンプル、または４つの輝度サンプルの単位復号化できる。ＭＶＤ解像度は符号化ユニット（ＣＵ）レベルで制御され、ＭＶＤ解像度フラグは、少なくとも１つの非ゼロＭＶＤの構成要素を有する各ＣＵに対して条件付きで信号通知される。 In HEVC, when use_integer_mv_flag is 0 in the slice header, the Motion Vector Difference (MVD) (the difference between the motion vector and the predicted motion vector of the PU) is signaled in units of 1/4 luma samples. In JEM, Locally Adaptive Motion Vector Resolution (LAMVR) is introduced. In JEM, MVD can be decoded in units of 1/4 luma samples, integer luma samples, or 4 luma samples. MVD resolution is controlled at the coding unit (CU) level, and the MVD resolution flag is conditionally signaled for each CU that has at least one non-zero MVD component.

少なくとも１つの非ゼロＭＶＤの構成要素を有するＣＵの場合、１／４輝度サンプルＭＶ精度がＣＵにおいて使用されるか否かを示すために、第１のフラグが信号通知される。第１のフラグ（１に等しい）が、１／４輝度サンプルＭＶ精度が使用されていないことを示す場合、整数輝度サンプルＭＶ精度が使用されるかまたは４輝度サンプルＭＶ精度が使用されるかを示すために、別のフラグが信号通知される。 For a CU with at least one non-zero MVD component, a first flag is signaled to indicate whether quarter luma sample MV precision is used in the CU. If the first flag (equal to 1) indicates that quarter luma sample MV precision is not used, another flag is signaled to indicate whether integer luma sample MV precision or 4 luma sample MV precision is used.

ＣＵの第１のＭＶＤ解像度フラグがゼロであるか、またはＣＵに対して符号化されていない（つまり、ＣＵにおけるすべてのＭＶＤがゼロである）場合、ＣＵに対して１／４輝度サンプルＭＶ解像度が使用される。ＣＵが整数輝度サンプルＭＶ精度または４輝度サンプルＭＶ精度を使用する場合、ＣＵのＡＭＶＰ候補リストにおけるＭＶＰを対応する精度に丸める。 If the first MVD resolution flag of a CU is zero or is not coded for the CU (i.e., all MVDs in the CU are zero), then 1/4 luma sample MV resolution is used for the CU. If the CU uses integer luma sample MV precision or 4 luma sample MV precision, round the MVPs in the CU's AMVP candidate list to the corresponding precision.

エンコーダにおいて、ＣＵレベルのＲＤチェックは、どのＭＶＤ解像度をＣＵに用いるかを決定するために用いられる。すなわち、１つのＭＶＤ解像度ごとに３回、ＣＵレベルのＲＤチェックを行う。エンコーダの速度を速めるために、ＪＥＭにおいては、以下のエン符号化方式が適用される。 In the encoder, the CU-level RD check is used to determine which MVD resolution to use for the CU. That is, the CU-level RD check is performed three times for each MVD resolution. To increase the encoder speed, the following encoding method is applied in JEM:

通常の１／４輝度サンプルＭＶＤ解像度を有するＣＵのＲＤチェック中、現在のＣＵの動き情報（整数輝度サンプル精度）が記憶される。整数輝度サンプルおよび４輝度サンプルのＭＶＤ解像度を有する同じＣＵのＲＤチェック中に、記憶された動き情報（丸められた後）は、更なる小範囲の動きベクトル改良の開始点として使用されるので、時間がかかる動き推定処理が３回重複しない。 During RD check of a CU with normal 1/4 luma sample MVD resolution, the motion information (integer luma sample precision) of the current CU is stored. During RD check of the same CU with integer luma sample and 4 luma sample MVD resolution, the stored motion information (after rounding) is used as a starting point for further small range motion vector refinement, so that the time consuming motion estimation process is not duplicated three times.

４輝度サンプルＭＶＤ解像度を有するＣＵのＲＤチェックを条件付きで呼び出す。ＣＵの場合、整数輝度サンプルＭＶＤ解像度のＲＤコストが１／４輝度サンプルＭＶＤ解像度のそれよりもはるかに大きい場合、ＣＵのための４輝度サンプルＭＶＤ解像度のＲＤチェックは省略される。 Conditionally invoke RD check for CUs with 4 luma sample MVD resolution. For a CU, if the RD cost of integer luma sample MVD resolution is much greater than that of 1/4 luma sample MVD resolution, then the RD check of 4 luma sample MVD resolution for the CU is omitted.

２．３．３パターンマッチング動きベクトル導出 2.3.3 Pattern matching motion vector derivation

ＰＭＭＶＤ（ＰａｔｔｅｒｎＭａｔｃｈｅｄＭｏｔｉｏｎＶｅｃｔｏｒＤｅｒｉｖａｔｉｏｎ）モードは、ＦＲＵＣ（Ｆｒａｍｅ－ＲａｔｅＵｐＣｏｎｖｅｒｓｉｏｎ）技術に基づく特殊なマージモードである。このモードでは、ブロックの動き情報は信号通知されず、デコーダ側で導出される。 PMMVD (Pattern Matched Motion Vector Derivation) mode is a special merge mode based on FRUC (Frame-Rate Up Conversion) technology. In this mode, the motion information of the blocks is not signaled but derived on the decoder side.

そのマージフラグが真である場合、ＦＲＵＣフラグは、ＣＵに信号通知される。ＦＲＵＣフラグが偽である場合、マージインデックスは信号通知され、通常のマージモードが使用される。ＦＲＵＣフラグが真である場合、追加のＦＲＵＣモードフラグを信号通知して、どの方法（バイラテラルマッチングまたはテンプレートマッチング）を使用してブロックの動き情報を導出するかを示す。 If that merge flag is true, then the FRUC flag is signaled to the CU. If the FRUC flag is false, then the merge index is signaled and normal merge mode is used. If the FRUC flag is true, then an additional FRUC mode flag is signaled to indicate which method (bilateral matching or template matching) is used to derive the motion information for the block.

エンコーダ側では、ＣＵのためにＦＲＵＣマージモードを使用するかどうかの決定は、通常のマージ候補に対して行われるのと同じように、ＲＤコストの選択に基づく。つまり、ＲＤコスト選択を使用して、１つのＣＵに対して２つのマッチングモード（バイラテラルマッチングおよびテンプレートマッチング）を両方チェックする。最小コストに導くものが、更に、他のＣＵモードと比較される。ＦＲＵＣマッチングモードが最も効率的なものである場合、ＣＵに対してＦＲＵＣフラグを真に設定し、関連するマッチングモードを使用する。 On the encoder side, the decision to use FRUC merge mode for a CU is based on the RD cost selection, just as it is done for normal merge candidates. That is, we check both two matching modes (bilateral matching and template matching) for one CU using the RD cost selection. The one that leads to the minimum cost is further compared with other CU modes. If the FRUC matching mode is the most efficient one, we set the FRUC flag to true for the CU and use the associated matching mode.

ＦＲＵＣマージモードにおける動き導出処理は、２つのステップを有する。まず、ＣＵレベルの動き探索を実行し、次に、サブＣＵレベルの動き改良を実行する。ＣＵレベルでは、バイラテラルマッチングまたはテンプレートマッチングに基づいて、ＣＵ全体のための初期の動きベクトルを導出する。まず、ＭＶ候補のリストを生成し、最小マッチングコストに導く候補を、さらなるＣＵレベル改善の開開始点として選択する。そして、開始点付近のバイラテラルマッチングまたはテンプレートマッチングに基づく局所検索を行い、最小マッチングコストとなるＭＶ結果をＣＵ全体のＭＶとする。続いて、導出されたＣＵ動きベクトルを開始点として、サブＣＵレベルでの動き情報をさらに改良する。 The motion derivation process in FRUC merge mode has two steps. First, a CU-level motion search is performed, and then a sub-CU-level motion refinement is performed. At the CU level, an initial motion vector for the entire CU is derived based on bilateral matching or template matching. First, a list of MV candidates is generated, and the candidate that leads to the minimum matching cost is selected as the opening starting point for further CU-level refinement. Then, a local search based on bilateral matching or template matching is performed near the starting point, and the MV result that results in the minimum matching cost is taken as the MV for the entire CU. Next, the derived CU motion vector is used as the starting point to further refine the motion information at the sub-CU level.

例えば、Ｗ×ＨＣＵ動き情報導出のために、以下の導出処理を行う。第１のステージにおいて、Ｗ×ＨＣＵ全体のためのＭＶが導出される。第２のステージにおいて、ＣＵは、Ｍ×Ｍ個のサブＣＵにさらに分割される。Ｍの値は、（１６）のように計算されるが、Ｄは、予め規定義された分割深さであり、ＪＥＭにおいてデフォルトで３に設定される。そして、各サブＣＵのＭＶを導出する。 For example, to derive WxH CU motion information, the following derivation process is performed: In the first stage, MVs for the entire WxH CU are derived. In the second stage, the CU is further divided into MxM sub-CUs. The value of M is calculated as in (16), where D is a predefined division depth, which is set to 3 by default in JEM. Then, MVs for each sub-CU are derived.

図２１に示すように、このバイラテラルマッチングは、２つの異なる参照ピクチャにおける現在のＣＵの動き軌跡に沿った２つのブロック間の最も近いマッチングを見出すことにより、現在のＣＵの動き情報を導出するために用いられる。連続した動き軌跡を仮定すると、２つの参照ブロックを指す動きベクトルＭＶ０およびＭＶ１は、現在のピクチャと２つの参照ピクチャとの間の時間的距離、例えばＴＤ０およびＴＤ１に比例する。特殊なケースとしては、現在のピクチャが時間的に２つの参照ピクチャの間にあり、現在のピクチャから２つの参照ピクチャまでの時間的な距離が同じである場合、バイラテラルマッチングはミラーに基づく双方向ＭＶとなる。 As shown in Figure 21, this bilateral matching is used to derive the motion information of the current CU by finding the closest match between two blocks along the motion trajectory of the current CU in two different reference pictures. Assuming a continuous motion trajectory, the motion vectors MV0 and MV1 pointing to the two reference blocks are proportional to the temporal distance between the current picture and the two reference pictures, e.g., TD0 and TD1. As a special case, when the current picture is temporally between the two reference pictures and the temporal distance from the current picture to the two reference pictures is the same, the bilateral matching becomes a mirror-based bidirectional MV.

図２２に示すように、現在のピクチャにおけるテンプレート（現在のＣＵの上側および／または左側の近傍のブロック）と、参照ピクチャにおけるブロック（テンプレートと同じサイズ）との間の最も近いマッチングを見出すことで、テンプレートマッチングを使用して、現在のＣＵの動き情報を導出する。前述のＦＲＵＣマージモード以外に、テンプレートマッチングは、ＡＭＶＰモードにも適用される。ＪＥＭにおいて、ＨＥＶＣと同様、ＡＭＶＰは２つの候補を有する。テンプレートマッチング法を用いることで、新しい候補を導出する。テンプレートマッチングによって新規に導出された候補が、第１の既存のＡＭＶＰ候補と異なる場合、ＡＭＶＰ候補リストの最初に挿入し、次に、リストサイズを２（第２の既存のＡＭＶＰ候補を取り除くことを意味する）に設定する。ＡＭＶＰモードに適用される場合、ＣＵレベル検索のみが適用される。 As shown in FIG. 22, template matching is used to derive motion information of the current CU by finding the closest match between a template in the current picture (nearby blocks above and/or to the left of the current CU) and a block in the reference picture (same size as the template). Besides the aforementioned FRUC merge mode, template matching is also applied to the AMVP mode. In JEM, similar to HEVC, AMVP has two candidates. A new candidate is derived by using the template matching method. If the newly derived candidate by template matching is different from the first existing AMVP candidate, it is inserted at the beginning of the AMVP candidate list, and then the list size is set to 2 (meaning removing the second existing AMVP candidate). When applied to the AMVP mode, only CU level search is applied.

２．３．３．１ＣＵレベルＭＶ候補セット 2.3.3.1 CU-level MV candidate set

ＣＵレベルのＭＶ候補セットは、以下からなる。
（ｉ）現在のＣＵがＡＭＶＰモードになっている場合の元のＡＭＶＰ候補
（ｉｉ）すべてのマージ候補、
（ｉｉｉ）補間ＭＶフィールド内の複数のＭＶ。
（ｉｖ）上と左の近傍の動きベクトル The CU-level MV candidate set consists of:
(i) the original AMVP candidate if the current CU is in AMVP mode; (ii) all merge candidates;
(iii) Multiple MVs in an interpolated MV field.
(iv) Motion vectors of the top and left neighbors

バイラテラルマッチングを使用する場合、マージ候補の各有効なＭＶを入力として使用して、バイラテラルマッチングを仮定してＭＶ対を生成する。例えば、マージ候補の１つの有効なＭＶは、参照リストＡにおいて（ＭＶａ，ｒｅｆａ）である。そして、その対をなすバイラテラルＭＶの参照ピクチャｒｅｆｂが他の参照リストＢにおいて見出され、ｒｅｆａおよびｒｅｆｂは、時間的に現在のピクチャの異なる側にある。参照リストＢにおいてこのようなｒｅｆｂが利用可能でない場合、ｒｅｆｂをｒｅｆａとは異なる参照として決定し、現在のピクチャとの時間的距離はリストＢにおける最小値である。ｒｅｆｂを決定した後、現在のピクチャとｒｅｆａ，ｒｅｆｂとの時間距離に基づいてＭＶａをスケーリングすることでＭＶｂを導出する。 When bilateral matching is used, each valid MV of the merge candidate is used as input to generate an MV pair assuming bilateral matching. For example, one valid MV of the merge candidate is (MVa, refa) in reference list A. Then, the reference picture refb of the paired bilateral MV is found in another reference list B, where refa and refb are on different sides of the current picture in time. If no such refb is available in reference list B, determine refb as a reference different from refa, whose temporal distance to the current picture is the minimum value in list B. After determining refb, derive MVb by scaling MVa based on the temporal distance between the current picture and refa and refb.

補間されたＭＶフィールドからの４つのＭＶもＣＵレベル候補リストに追加する。具体的には、現在のＣＵの（０，０）、（Ｗ／２，０）、（０，Ｈ／２）、（Ｗ／２，Ｈ／２）の位置の補間ＭＶを加算する。 Add the four MVs from the interpolated MV field to the CU level candidate list as well. Specifically, add the interpolated MVs at positions (0,0), (W/2,0), (0,H/2), and (W/2,H/2) of the current CU.

ＡＭＶＰモードでＦＲＵＣを適用する場合、元のＡＭＶＰ候補をＣＵレベルＭＶ候補セットにも加える。 When applying FRUC in AMVP mode, the original AMVP candidate is also added to the CU-level MV candidate set.

ＣＵレベルにおいて、ＡＭＶＰＣＵのための最大１５個のＭＶおよびマージＣＵのための最大１３個のＭＶを候補リストに加える。 At the CU level, add up to 15 MVs for AMVP CU and up to 13 MVs for merged CU to the candidate list.

２．３．３．２サブＣＵレベルＭＶ候補セット 2.3.3.2 Sub-CU level MV candidate set

サブＣＵレベルのＭＶ候補セットは、以下からなる。
（ｉ）ＣＵレベルの検索から決定されたＭＶ、
（ｉｉ）上、左、左上、右上の近傍のＭＶ、
（ｉｉｉ）参照ピクチャからの並置されたＭＶのスケーリングされたバージョン、
（ｉｖ）最大４つのＡＴＭＶＰ候補、
（ｖ）最大４つのＳＴＭＶＰ候補 The sub-CU level MV candidate set consists of:
(i) MVs determined from CU-level searches;
(ii) MVs in the top, left, top left, and top right neighborhoods;
(iii) a scaled version of the collocated MV from the reference picture;
(iv) up to four ATMVP candidates;
(v) Up to four STMVP candidates

参照ピクチャからのスケーリングされたＭＶは、以下のように導出される。両方のリストにおける参照ピクチャをすべてトラバースする。参照ピクチャにおけるサブＣＵの配列位置にあるＭＶは、開始ＣＵレベルＭＶの参照に対してスケーリングされる。 The scaled MV from the reference picture is derived as follows: traverse all reference pictures in both lists. The MV at the sub-CU's alignment position in the reference picture is scaled with respect to the reference of the starting CU level MV.

ＡＴＭＶＰおよびＳＴＭＶＰの候補は、最初の４つの候補に限定される ATMVP and STMVP candidates are limited to the first four candidates

サブＣＵレベルにおいて、最大１７個のＭＶが候補リストに追加される。 At the sub-CU level, a maximum of 17 MVs are added to the candidate list.

２．３．３．３補間ＭＶフィールドの生成 2.3.3.3 Generation of interpolated MV fields

フレームを符号化する前に、一方のＭＥに基づいてピクチャ全体に対して補間動きフィールドを生成する。そして、この動きフィールドを後にＣＵレベルまたはサブＣＵレベルのＭＶ候補として使用してもよい。 Before encoding a frame, an interpolated motion field is generated for the entire picture based on one ME. This motion field may then be used as a CU-level or sub-CU-level MV candidate.

まず、両方の参照リストにおける各参照ピクチャの動きフィールドは、４×４ブロックレベルでトラバースされる。各４×４ブロックにおいて、現在のピクチャ（図２３に示す）の４×４ブロックを通過するブロックに関連する動きで、補間動きがまだ割り当てられていない場合、時間的距離ＴＤ０およびＴＤ１に基づいて（ＨＥＶＣにおけるＴＭＶＰのＭＶスケーリングと同様に）、参照ブロックの動きを現在のピクチャにスケーリングし、スケーリングされた動きを現在のフレームのブロックに割り当てる。４×４ブロックにスケーリングされたＭＶが割り当てられていない場合、ブロックの動きは、補間された動きフィールドにおいて利用不可能であるとマークされる。 First, the motion field of each reference picture in both reference lists is traversed at the 4x4 block level. For each 4x4 block, if the motion associated with the block passing through the 4x4 block in the current picture (shown in Figure 23) does not already have an interpolated motion assigned to it, scale the motion of the reference block to the current picture based on the temporal distances TD0 and TD1 (similar to MV scaling of TMVP in HEVC) and assign the scaled motion to the block in the current frame. If the 4x4 block does not have a scaled MV assigned to it, the motion of the block is marked as unavailable in the interpolated motion field.

２．３．３．４補間およびマッチングコスト 2.3.3.4 Interpolation and matching costs

１つの動きベクトルが１つの分数のサンプル位置を指す場合、動き補償補間が必要である。複雑性を低減するために、通常の８タップＨＥＶＣ補間の代わりに、バイラテラルマッチングおよびテンプレートマッチングの両方に双線形補間を使用する。 When one motion vector points to one fractional sample location, motion compensated interpolation is needed. To reduce complexity, we use bilinear interpolation for both bilateral and template matching instead of the usual 8-tap HEVC interpolation.

マッチングコストの計算は、異なるステップでは少し異なる。ＣＵレベルの候補セットから候補を選択する場合、マッチングコストは、バイラテラルマッチングまたはテンプレートマッチングの差分の絶対値の和（ＳＡＤ）である。開始ＭＶを決定した後、サブＣＵレベル検索におけるバイラテラルマッチングのマッチングコストＣを以下のように算出する。 The calculation of the matching cost is slightly different in different steps. When selecting a candidate from the CU-level candidate set, the matching cost is the sum of absolute differences (SAD) of bilateral matching or template matching. After determining the starting MV, we calculate the matching cost C of bilateral matching in the sub-CU level search as follows:

ここで、ｗは、経験的に４に設定された重み係数であり、ＭＶおよびＭＶ^ｓは、それぞれ、現在のＭＶおよび開始ＭＶを示す。ＳＡＤは、依然として、サブＣＵレベル検索におけるテンプレートマッチングのマッチングコストとして使用される。 where w is a weighting factor empirically set to 4, and MV and MV ^s denote the current MV and the starting MV, respectively. The SAD is still used as the matching cost for template matching in the sub-CU level search.

ＦＲＵＣモードにおいて、ＭＶは、輝度サンプルのみを使用することによって導出される。導出された動きは、ＭＣインター予測のために、輝度および彩度の両方に使用される。ＭＶを決定した後、輝度用の８タップ補間フィルタおよび彩度用の４タップ補間フィルタを使用して、最終的なＭＣを行う。 In FRUC mode, MV is derived by using only luma samples. The derived motion is used for both luma and chroma for MC inter prediction. After determining MV, the final MC is performed using an 8-tap interpolation filter for luma and a 4-tap interpolation filter for chroma.

２．３．３．５ＭＶの改良 2.3.3.5 MV improvements

ＭＶ改良は、バイラテラルマッチングコストまたはテンプレートマッチングコストの基準を有するパターンに基づくＭＶ検索である。ＪＥＭでは、２つの検索パターン、即ち、ＵＣＢＤＳ（ＵｎｒｅｓｔｒｉｃｔｅｄＣｅｎｔｅｒ－ＢｉａｓｅｄＤｉａｍｏｎｄＳｅａｒｃｈ）およびＣＵレベルおよびサブＣＵレベルでのＭＶ改良のための適応的横断検索をそれぞれサポートする。ＣＵおよびサブＣＵレベルのＭＶ改善の両方のために、ＭＶは、１／４輝度サンプルＭＶの正確度で直接検索され、これに続いて１／８輝度サンプルＭＶの改良が行われる。ＣＵおよびサブＣＵステップのためのＭＶ改良の検索範囲は、８つの輝度サンプルに等しく設定される。 MV refinement is a pattern-based MV search with the criteria of bilateral matching cost or template matching cost. JEM supports two search patterns, namely Unrestricted Center-Biased Diamond Search (UCBDS) and adaptive cross-search for MV refinement at CU level and sub-CU level, respectively. For both CU and sub-CU level MV refinement, MVs are directly searched with the accuracy of 1/4 luma sample MV, followed by refinement of 1/8 luma sample MV. The search range of MV refinement for CU and sub-CU steps is set equal to 8 luma samples.

２．３．３．６テンプレートマッチングＦＲＵＣマージモードにおける予測方向の選択 2.3.3.6 Prediction direction selection in template matching FRUC merge mode

バイラテラルマッチングマージモードにおいては、２つの異なる参照ピクチャにおける現在のＣＵの動き軌跡に沿った２つのブロック間の最も近いマッチングに基づいて、ＣＵの動き情報を導出するため、双方向予測が常に適用される。テンプレートマッチングマージモードについては、そのような限定はない。テンプレートマッチングマージモードにおいて、エンコーダは、ｌｉｓｔ０からの単一予測、ｌｉｓｔ１からの単一予測、またはＣＵのための双方向予測のうちから選択することができる。選択は、テンプレートマッチングコストに基づいて、以下のように行う。
ｃｏｓｔＢｉ≦ｆａｃｔｏｒ＊ｍｉｎ（ｃｏｓｔ０，ｃｏｓｔ１）の場合
双方向予測を用いる。
それ以外の場合において、ｃｏｓｔ０≦ｃｏｓｔ１の場合
ｌｉｓｔ０からの単一予測を用いる。
そうでない場合、
ｌｉｓｔ１からの単一予測を用いる。 In the bilateral matching merge mode, bidirectional prediction is always applied to derive the motion information of a CU based on the closest matching between two blocks along the motion trajectory of the current CU in two different reference pictures. For the template matching merge mode, there is no such limitation. In the template matching merge mode, the encoder can select among uni-prediction from list0, uni-prediction from list1, or bi-prediction for a CU. The selection is made based on the template matching cost as follows:
If costBi≦factor*min(cost0, cost1), then use bidirectional prediction.
Otherwise, if cost0≦cost1, use single prediction from list0.
If not,
Use a single prediction from list1.

ここで、ｃｏｓｔ０はｌｉｓｔ０テンプレートマッチングのＳＡＤであり、ｃｏｓｔ１はｌｉｓｔ１テンプレートマッチングのＳＡＤであり、ｃｏｓｔＢｉは双方向予測テンプレートマッチングのＳＡＤである。ｆａｃｔｏｒの値が１．２５である場合、選択処理が双方向予測に偏っていることを意味する。
このインター予測方向選択は、ＣＵレベルのテンプレートマッチング処理にのみ適用される。 Here, cost0 is the SAD of list0 template matching, cost1 is the SAD of list1 template matching, and costBi is the SAD of bidirectional prediction template matching. When the factor value is 1.25, it means that the selection process is biased towards bidirectional prediction.
This inter prediction direction selection is applied only to the template matching process at the CU level.

２．３．４デコーダ側動きベクトル改良 2.3.4 Decoder-side motion vector improvements

双方向予測演算において、１つのブロック領域を予測するために、ｌｉｓｔ０の動きベクトル（ＭＶ）およびｌｉｓｔ１のＭＶをそれぞれ使用して構成される双予測ブロックを組み合わせ、１つの予測信号を形成する。ＤＭＶＲ（Ｄｅｃｏｄｅｒ－ｓｉｄｅＭｏｔｉｏｎＶｅｃｔｏｒＲｅｆｉｎｅｍｅｎｔ）方法において、バイラテラルテンプレートマッチング処理によって、双方向予測の２つの動きベクトルをさらに改良する。追加の動き情報を送信することなく改良されたＭＶを得るために、デコーダにおいてバイラテラルテンプレートマッチングを適用し、バイラテラルテンプレートと参照ピクチャにおける再構成サンプルとの間の歪みに基づく検索を行う。 In bidirectional prediction operations, bi-predictive blocks constructed using the motion vectors (MVs) in list0 and list1, respectively, are combined to form one prediction signal to predict one block region. In the Decoder-side Motion Vector Refinement (DMVR) method, the two motion vectors of bidirectional prediction are further refined by a bilateral template matching process. To obtain the refined MVs without transmitting additional motion information, bilateral template matching is applied in the decoder to perform a distortion-based search between the bilateral template and the reconstructed samples in the reference picture.

ＤＭＶＲにおいて、図２３に示すように、ｌｉｓｔ０の最初のＭＶ０とｌｉｓｔ１のＭＶ１とから、それぞれ２つの予測ブロックの重み付け結合（すなわち、平均）としてバイラテラルテンプレートを生成する。テンプレートマッチング操作は、生成されたテンプレートと参照ピクチャにおけるサンプル領域（最初の予測ブロックの付近）との間のコスト尺度を計算することからなる。２つの参照ピクチャの各々について、テンプレートコストが最小となるＭＶを、そのリストの更新されたＭＶと見なし、元のＭＶに置き換える。ＪＥＭにおいて、各リストに対して９つのＭＶ候補を検索する。９つのＭＶ候補は、元のＭＶと、水平または垂直方向のいずれかまたは両方向に元のＭＶに対してオフセットしている１つの輝度サンプルを有する８つの周囲のＭＶを含む。最後に、２つの新しいＭＶ、即ち、図２４に示すようなＭＶ０’およびＭＶ１’を使用して、最終的な双方向予測結果を生成する。差分の絶対値の和（ＳＡＤ）をコスト尺度として使用する。 In DMVR, we generate a bilateral template from the first MV0 in list0 and MV1 in list1 as a weighted combination (i.e., average) of two prediction blocks, respectively, as shown in Fig. 23. The template matching operation consists of calculating a cost measure between the generated template and a sample region in the reference picture (near the first prediction block). For each of the two reference pictures, the MV with the smallest template cost is considered as the updated MV of that list and replaces the original MV. In JEM, we search for nine MV candidates for each list. The nine MV candidates include the original MV and eight surrounding MVs with one luma sample offset with respect to the original MV in either horizontal or vertical directions or both directions. Finally, we generate the final bidirectional prediction result using two new MVs, i.e., MV0' and MV1' as shown in Fig. 24. We use the sum of absolute differences (SAD) as the cost measure.

ＤＭＶＲは、追加の構文要素を送信することなく、過去の参照ピクチャからの１つのＭＶと、将来の参照ピクチャからの１つのＭＶとの間の双方向予測のマージモードに適用される。ＪＥＭにおいて、ＣＵに対してＬＩＣ、アフィン動き、ＦＲＵＣ、またはサブＣＵマージ候補が有効である場合、ＤＭＶＲは適用されない。 DMVR applies to bi-prediction merge mode between one MV from a past reference picture and one MV from a future reference picture without transmitting additional syntax elements. In JEM, DMVR is not applied if LIC, affine motion, FRUC, or sub-CU merge candidates are enabled for the CU.

２．３．５バイラテラルマッチングの改良を伴うマージ／スキップモード 2.3.5 Merge/skip mode with improved bilateral matching

まず、利用可能な候補の数が最大候補サイズ１９に達するまで、空間的に近傍のブロックおよび時間的に近傍のブロックの動きベクトルおよび参照インデックスを冗長性チェック付き候補リストに挿入することで、マージ候補リストを構築する。マージ／スキップモードのマージ候補リストは、予め規定された挿入順に基づいて、ＨＥＶＣ（結合候補およびゼロ候補）に用いられる空間的候補（図１１）、時間的候補、アフィン候補、ＡＴＭＶＰ（ＡｄｖａｎｃｅｄＴｅｍｐｏｒａｌＭＶＰ）候補、ＳＴＭＶＰ（ＳｐａｔｉａｌＴｅｍｐｏｒａｌＭＶＰ）候補、および追加候補を挿入することで構築される。 First, a merge candidate list is constructed by inserting motion vectors and reference indices of spatially and temporally neighboring blocks into a candidate list with redundancy check until the number of available candidates reaches the maximum candidate size of 19. The merge candidate list for merge/skip mode is constructed by inserting spatial candidates (FIG. 11) used for HEVC (merge candidates and zero candidates), temporal candidates, affine candidates, Advanced Temporal MVP (ATMVP) candidates, Spatial Temporal MVP (STMVP) candidates, and additional candidates based on a predefined insertion order.

－ブロック１～４の空間的候補 - Spatial candidates for blocks 1 to 4

－ブロック１～４の外挿アフィン候補 - Extrapolation affine candidates for blocks 1 to 4

－ＡＴＭＶＰ -ATMVP

－ＳＴＭＶＰ -STMVP

－仮想アフィン候補 - Virtual affine candidates

－空間的候補（ブロック５）（利用可能な候補の数が６よりも少ない場合にのみ使用される）。 - Spatial Candidates (Block 5) (used only if the number of available candidates is less than 6).

－外挿アフィン候補（ブロック５） - Extrapolation affine candidates (Block 5)

－時間的候補（ＨＥＶＣのように導出） -Temporal candidates (derived like HEVC)

－外挿アフィン候補に続く非隣接空間的候補（図２５に示すブロック６～４９）。 - Non-adjacent spatial candidates following the extrapolated affine candidates (blocks 6 to 49 shown in Figure 25).

－結合候補 - Candidates for joining

－ゼロ候補 -Zero candidates

なお、ＩＣフラグは、ＳＴＭＶＰおよびアフィンを除き、マージ候補から継承される。また、最初の４つの空間的候補について、双方向予測のものを単一予測のものの前に挿入する。 Note that the IC flag is inherited from the merge candidate, except for STMVP and affine. Also, for the first four spatial candidates, the bi-predictive candidates are inserted before the uni-predictive ones.

いくつかの実施形態において、現在のブロックに接続されていないブロックにアクセスすることができる。非隣接ブロックが非イントラモードで符号化されている場合、関連する動き情報を追加のマージ候補として追加してもよい。 In some embodiments, blocks that are not connected to the current block can be accessed. If the non-adjacent blocks are coded in a non-intra mode, the associated motion information may be added as additional merging candidates.

２．３．６共有マージリストＪＶＥＴ－Ｍ０１７０ 2.3.6 Shared merge list JVET-M0170

小さなスキップ／マージ符号化されたＣＵを並列処理することを有効にするために、ＣＵ分割木における１つの祖先ノードのすべての葉の符号化ユニット（ＣＵ）に対して同じマージ候補リストを共有することが提案される。祖先ノードをマージ共有ノードと呼ぶ。マージ共有ノードが葉ＣＵであるように見せかけて、マージ共有ノードにおいて共有マージ候補リストを生成する。 To enable parallel processing of small skip/merge coded CUs, we propose to share the same merge candidate list for all leaf coding units (CUs) of an ancestor node in the CU partition tree. We call the ancestor node a merge-share node. We generate a shared merge candidate list at the merge-share node, pretending that the merge-share node is a leaf CU.

Ｔｙｐｅ－２の定義において、復号化の構文解析段階において、ＣＴＵ内部のＣＵごとにマージ共有ノードを決定する。また、マージ共有ノードは、葉ＣＵの祖先ノードであり、以下の２つの基準を満たさなければならない。 In the definition of Type-2, during the parsing stage of decoding, a merge-shared node is determined for each CU within a CTU. In addition, the merge-shared node is an ancestor node of the leaf CU, and must satisfy the following two criteria:

マージ共有ノードのサイズは、サイズ閾値以上であること。 The size of the merge shared node must be greater than or equal to the size threshold.

マージ共有ノードにおいて、子ＣＵのサイズは、サイズ閾値よりも小さいこと。 In a merge shared node, the size of the child CU is smaller than the size threshold.

さらに、マージ共有ノードのサンプルがピクチャ境界の外側にないことを保証する必要がある。構文解析段階において、祖先ノードが基準（１）および（２）を満たすが、ピクチャ境界の外側にいくつかのサンプルを有する場合、この祖先ノードはマージ共有ノードではないので、先に進んでその子ＣＵのためのマージ共有ノードを見出す。 Furthermore, we need to ensure that the samples of a merge-shared node are not outside the picture boundary. During the parsing phase, if an ancestor node satisfies criteria (1) and (2) but has some samples outside the picture boundary, then this ancestor node is not a merge-shared node, and we go ahead and find a merge-shared node for its child CU.

図３５に、Ｔｙｐｅ－１とＴｙｐｅ－２の定義の違いの一例を示す。本例において、親ノードは、３つの子ＣＵに３分割される。親ノードのサイズは１２８である。Ｔｙｐｅ－１定義の場合、３つの子ＣＵは別々のマージ共有ノードである。しかし、Ｔｙｐｅ－２定義の場合、親ノードはマージ共有ノードである。 Figure 35 shows an example of the difference between Type-1 and Type-2 definitions. In this example, the parent node is split into three child CUs. The size of the parent node is 128. In the Type-1 definition, the three child CUs are separate merge-shared nodes. However, in the Type-2 definition, the parent node is a merge-shared node.

提案した共用マージ候補リストアルゴリズムは、並進マージ（マージモードおよびトライアングルマージモードを含む、履歴に基づく候補もサポートされる）およびサブブロックに基づくマージモードをサポートする。すべての種類のマージモードにおいて、共有マージ候補リストアルゴリズムの挙動は基本的に同じに見え、マージ共有ノードが葉ＣＵであるように見せるだけの候補をマージ共有ノードに生成する。それには２つの大きな利点がある。第１の利点は、マージモードのための並列処理を有効にすることであり、第２の利点は、すべての葉ＣＵのすべての計算をマージ共有ノードに共有することである。そのため、ハードウェアコーデックのためのすべてのマージモードのハードウェアコストを大幅に低減することができる。提案した共有マージ候補リストアルゴリズムにより、エンコーダとデコーダはマージモードの並列符号化に容易に対応でき、マージモードのサイクルバジェット問題を軽減する。 The proposed shared merge candidate list algorithm supports translational merge (including merge mode and triangle merge mode, history-based candidates are also supported) and subblock-based merge modes. In all kinds of merge modes, the behavior of the shared merge candidate list algorithm basically looks the same, generating candidates in the merge share node that only make the merge share node look like a leaf CU. It has two major advantages. The first advantage is that it enables parallel processing for merge modes, and the second advantage is that it shares all the computations of all leaf CUs in the merge share node. Therefore, it can greatly reduce the hardware cost of all merge modes for hardware codecs. The proposed shared merge candidate list algorithm allows the encoder and decoder to easily accommodate parallel encoding of merge modes, and alleviates the cycle budget problem of merge modes.

２．３．７タイル群 2.3.7 Tile groups

ＪＶＥＴ－Ｌ０６８６では、タイルグループに代えるためスライスが削除され、ＨＥＶＣ構文要素ｓｌｉｃｅ＿ａｄｄｒｅｓｓがタイルグループの最初のタイルのアドレスとしてｔｉｌｅ＿ｇｒｏｕｐ＿ｈｅａｄｅｒ内のｔｉｌｅ＿ｇｒｏｕｐ＿ａｄｄｒｅｓｓに置き換えられる（ピクチャ内に複数のタイルがある場合）。 In JVET-L0686, slices are removed in favor of tile groups, and the HEVC syntax element slice_address is replaced with the tile_group_address in the tile_group_header as the address of the first tile in the tile group (if there are multiple tiles in the picture).

３．本明細書に開示される実施形態が解決しようとする課題の例 3. Examples of problems that the embodiments disclosed herein aim to solve

現在のＨＥＶＣ設計は、動き情報をよりよく符号化するために、現在のブロックの近傍のブロック（現在のブロックの隣）の相関をとることができる。しかしながら、近傍のブロックが、異なる動き軌跡を有する異なる対象に対応する可能性がある。この場合、その近傍のブロックからの予測は効率的ではない。 Current HEVC design can correlate the neighboring blocks of the current block (next to the current block) to better code the motion information. However, the neighboring blocks may correspond to different objects with different motion trajectories. In this case, prediction from the neighboring blocks is not efficient.

非隣接ブロックの動き情報からの予測は、全ての動き情報（一般的には４×４レベル）をキャッシュに記憶するコストをかけることになり、付加的な符号化利得をもたらし、ハードウェア実装の複雑性を大幅に増大させる。 Prediction from motion information of non-adjacent blocks incurs the cost of storing all the motion information (typically at 4x4 level) in a cache, which provides no additional coding gain and significantly increases the complexity of the hardware implementation.

４．いくつかの例 4. Some examples

既存の実装形態の欠点を克服するために、様々な実施形態において、ブロックの動き情報を予測するために、少なくとも１つの動き候補が記憶された１つ以上のテーブル（例えばルックアップテーブル）を使用するＬＵＴに基づく動きベクトル予測技術を実装し、より高い符号化効率を有する映像符号化を提供することができる。ルックアップテーブルは、ブロックの動き情報を予測するために動き候補を含める際に使用できるテーブルの一例であり、他の実装形態も可能である。各ＬＵＴは、それぞれが対応する動き情報に関連付けられた１つ以上の動き候補を含んでもよい。動き候補の動き情報は、予測方向、参照インデックス／ピクチャ、動きベクトル、ＬＩＣフラグ、アフィンフラグ、ＭＶＤ（ＭｏｔｉｏｎＶｅｃｔｏｒＤｅｒｉｖａｔｉｏｎ）精度、および／またはＭＶＤ値の一部または全部を含んでもよい。動き情報は、動き情報がどこに由来しているかを示すために、ブロック位置情報をさらに含んでもよい。 To overcome the shortcomings of existing implementations, various embodiments may implement a LUT-based motion vector prediction technique that uses one or more tables (e.g., lookup tables) in which at least one motion candidate is stored to predict motion information of a block, providing video coding with higher coding efficiency. The lookup table is one example of a table that may be used to include motion candidates to predict motion information of a block, and other implementations are possible. Each LUT may include one or more motion candidates, each associated with corresponding motion information. The motion information of a motion candidate may include some or all of a prediction direction, a reference index/picture, a motion vector, a LIC flag, an affine flag, a Motion Vector Derivation (MVD) accuracy, and/or a MVD value. The motion information may further include block position information to indicate where the motion information originates.

開示される技術に基づいたＬＵＴに基づく動きベクトル予測は、既存のおよび将来の映像符号化規格の両方を向上させることができ、様々な実施形態のために以下の例で解明される。ＬＵＴは、履歴データ（例えば、既に処理されたブロック）に基づいて符号化／復号化処理を行うことを可能にするため、ＬＵＴに基づく動きベクトル予測は、ＨＭＶＰＨｉｓｔｏｒｙ－ｂａｓｅｄＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）法と呼ぶこともできる。ＬＵＴに基づく動きベクトル予測方法において、以前に符号化されたブロックからの動き情報を有する１つまたは複数のテーブルは、符号化／復号化処理の間、維持される。ＬＵＴに記憶されたこれらの動き候補をＨＭＶＰ候補と称する。１つのブロックの符号化／復号化の間、ＬＵＴにおける関連付けられた動き情報を動き候補リスト（例えば、マージ／ＡＭＶＰ候補リスト）に追加して、１つのブロックを符号化／復号化
した後に、ＬＵＴを使用してもよい。更新されたＬＵＴは、その後、後続のブロックを符号化するために用いられる。このように、ＬＵＴにおける動き候補の更新は、ブロックの符号化／復号化の順に基づく。以下の例は、一般的な概念を説明するための例であると考えられるべきである。これらの例は狭い意味で解釈されるべきではない。さらに、これらの例は、任意の方法で組み合わせることができる。 The LUT-based motion vector prediction based on the disclosed technology can improve both existing and future video coding standards, and is elucidated in the following examples for various embodiments. Since the LUT enables the encoding/decoding process to be performed based on history data (e.g., already processed blocks), the LUT-based motion vector prediction can also be called HMVP History-based Motion Vector Prediction (HMVP) method. In the LUT-based motion vector prediction method, one or more tables with motion information from previously coded blocks are maintained during the encoding/decoding process. These motion candidates stored in the LUT are called HMVP candidates. During the encoding/decoding of a block, the associated motion information in the LUT may be added to a motion candidate list (e.g., merge/AMVP candidate list) to use the LUT after encoding/decoding a block. The updated LUT is then used to code the subsequent block. In this way, the update of the motion candidates in the LUT is based on the encoding/decoding order of the blocks. The following examples should be considered as examples to explain the general concept. These examples should not be interpreted in a narrow sense. Moreover, these examples can be combined in any way.

いくつかの実施形態において、１つのブロックの動き情報を予測するために、少なくとも１つの動き候補が記憶された１つ以上のルックアップテーブルを用いてもよい。実施形態は、動き候補を用いて、ルックアップテーブルに記憶された動き情報のセットを示すことができる。従来のＡＭＶＰまたはマージモードの場合、実施形態では、動き情報を記憶するためにＡＭＶＰまたはマージ候補を使用してもよい。 In some embodiments, one or more lookup tables in which at least one motion candidate is stored may be used to predict motion information for a block. An embodiment may use a motion candidate to indicate a set of motion information stored in the lookup table. In the case of a conventional AMVP or merge mode, an embodiment may use an AMVP or merge candidate to store the motion information.

以下の実施例は、一般的な概念を説明する。 The following examples illustrate the general concept.

ルックアップテーブルの例 Lookup table example

例Ａ１：各ルックアップテーブルは、各候補がその動き情報に関連付けられた１つ以上の動き候補を含んでもよい。
ａ．動き候補の動き情報は、ここでは、予測方向、参照インデックス／ピクチャ、動きベクトル、ＬＩＣフラグ、アフィンフラグ、ＭＶＤ精度、ＭＶＤ値の一部または全部を含んでもよい。
ｂ．動き情報は、動き情報がどこに由来しているかを示すために、ブロック位置情報および／またはブロック形状をさらに含んでもよい。 Example A1: Each lookup table may contain one or more motion candidates, with each candidate associated with its motion information.
The motion information of a motion candidate here may include some or all of the following: prediction direction, reference index/picture, motion vector, LIC flag, affine flag, MVD precision, MVD value.
b. The motion information may further include block position information and/or block shape to indicate where the motion information comes from.

ＬＵＴの選択 Selecting a LUT

例Ｂ１：１つのブロックを符号化する場合、１つのルックアップテーブルからの動き候補の一部または全部を順にチェックすることができる。１つのブロックを符号化する間に１つの動き候補をチェックするとき、この動き候補を動き候補リスト（例えば、ＡＭＶＰ、マージ候補リスト）に加えてもよい。例Ｂ２：ルックアップテーブルの選択は、ブロックの位置に依存してもよい。 Example B1: When encoding a block, some or all of the motion candidates from a lookup table may be checked in sequence. When a motion candidate is checked while encoding a block, this motion candidate may be added to a motion candidate list (e.g., AMVP, merge candidate list). Example B2: The selection of the lookup table may depend on the position of the block.

ルックアップテーブルの使用法 How to use a lookup table

例Ｃ１：チェック対象のルックアップテーブルにおける動き候補の総数は、予め規定されてもよい。 Example C1: The total number of motion candidates in the lookup table to be checked may be predefined.

例Ｃ２：１つのルックアップテーブルに含まれる１つ以上の動き候補は、１つのブロックによって直接継承されてもよい。
ａ．それらをマージモード符号化に使用してもよい。すなわち、マージ候補リスト導出処理において動き候補をチェックしてもよい。
ｂ．これらは、アフィンマージモード符号化に使用してもよい。
ｉ．アフィンフラグが１である場合、ルックアップテーブルにおける動き候補をアフィンマージ候補として加えることができる。
ｃ．それらは、サブブロックマージモード、アフィンマージモード、トライアングルマージモード、インター－イントラマージモード、ＭＭＶＤ（ＭｅｒｇｅｗｉｔｈＭＶＤ）モードのような他の種類のマージモードに使用してもよい。
ｄ．以下の場合、ルックアップテーブルにおける動き候補のチェックを有効にしてもよい。
ｉ．ＴＭＶＰ候補を挿入した後、マージ候補リストが満杯になっていない。
ｉｉ．空間的マージ候補導出のために特定の空間的に近傍のブロックをチェックした後、マージ候補リストが満杯になっていない。
ｉｉｉ．すべての空間的マージ候補の後、マージ候補リストが満杯になっていない。
ｉｖ．結合双方向予測マージ候補の後、マージ候補リストが満杯になっていない。
ｖ．他の符号化方式（例えば、ＨＥＶＣデザイン、またはＪＥＭデザインのマージ導出処理）からマージ候補リストに入れられた空間的または時間的な（例えば、隣接空間および非隣接空間、ＴＭＶＰ、ＳＴＭＶＰ、ＡＴＭＶＰなどを含む）マージ候補の数が、最大許容のマージ候補から、所与の閾値を引いた数よりも少ない場合。
１．一例において、閾値は、１または０に設定される。
２．代替的に、閾値は、ＳＰＳ／ＰＰＳ／シーケンス、ピクチャ、スライスヘッダ／タイルにおいて信号通知されてもよく、または予め規定されてもよい。
３．代替的に、閾値は、ブロックごとに適応的に変更されてもよい。例えば、それは、ブロックサイズ／ブロック形状／スライスタイプのような符号化されたブロック情報に依存してもよく、および／または、利用可能な空間的または時間的マージ候補の数に依存してもよい。
４．他の例において、既にマージ候補リストに含まれていないある種のマージ候補の数が、最大許容マージ候補から、所与の閾値を引いた数未満である場合。「ある種のマージ候補」は、ＨＥＶＣのような空間的候補であってもよいし、隣接しないマージ候補であってもよい。
ｖｉ．マージ候補リストに動き候補を追加する前に、プルーニングを適用してもよい。本特許明細書に開示されたこの例および他の例の様々な実装形態において、プルーニン
グは、ａ）動き情報と既存のエントリとを一意性のために比較すること、または、ｂ）一意である場合、動き情報をリストに追加すること、またはｃ）一意でない場合、ｃ１）動き情報を追加しない、または、ｃ２）動き情報を追加し、一致した既存のエントリを削除することを含んでもよい。いくつかの実装形態において、テーブルから候補リストに動き候補を追加する際に、プルーニング工程は実行されない。
１．一例において、動き候補は、マージ候補リストの他の符号化方法から利用可能な空間的または時間的（例えば、隣接空間および非隣接空間、ＴＭＶＰ、ＳＴＭＶＰ、ＡＴＭＶＰ等を含む）マージ候補の全部または一部にプルーニングされてもよい。
２．動き候補は、サブブロックに基づく動き候補、例えば、ＡＴＭＶＰ、ＳＴＭＶＰにプルーニングされなくてもよい。
３．一例において、現在の動き候補は、マージ候補リストにおける利用可能な動き候補（現在の動き候補の前に挿入された）の全部または一部にプルーニングされてもよい。
４．動き候補に関連するプルーニング工程の数（例えば、動き候補をマージリストにおける他の候補と比較する必要がある回数）は、利用可能な空間的または時間的マージ候補の数に依存してもよい。例えば、新しい動き候補をチェックする際に、マージリストに利用可能な候補がＭ個ある場合、新しい動き候補を最初のＫ個（Ｋ≦Ｍ）の候補とのみ比較することができる。プルーニング関数が偽を返す（例えば、最初のＫ個の候補のいずれとも同一でない）場合、この新しい動き候補は、Ｍ個の候補のすべてと異なると見なされ、マージ候補リストに追加され得る。一例において、Ｋは、ｍｉｎ（Ｋ，２）に設定される。
５．一例において、新しく付加された動き候補とマージ候補リストにおける最初のＮ個の候補とを比較するだけである。例えば、Ｎ＝３、４または５である。Ｎは、エンコーダからデコーダに信号通知されてもよい。
６．一例において、チェック対象の新しい動き候補は、マージ候補リストにおける最後のＮ個の候補と比較されるのみである。例えば、Ｎ＝３、４または５である。Ｎは、エンコーダからデコーダに信号通知されてもよい。
７．一例において、以前リストに追加された候補をテーブルから選択し、新しい動き候補と比較する方法は、前回追加された候補がどこから導出されたかに依存してもよい。
ａ．一例において、ルックアップテーブルにおける動き候補を、所与の時間的および／または空間的に近傍のブロックから導出された候補と比較してもよい。
ｂ．一例において、ルックアップテーブルにおける動き候補の異なるエントリを、以前追加された異なる候補と比較してもよい（すなわち、異なる位置から導出された）
。
ｅ．隣接／非隣接の空間的または時間的ブロックから導出されるような、他のマージ（またはアフィンマージまたは他のインター符号化方法）候補をチェックする前に、ルックアップテーブルにおける動き候補のチェックを有効にしてもよい。
ｆ．ルックアップテーブルに少なくとも１つの動き候補がある場合、ルックアップテーブルにおける動き候補のチェックを有効にしてもよい。 Example C2: One or more motion candidates contained in one lookup table may be directly inherited by one block.
They may be used for merge mode coding, i.e., motion candidates may be checked in the merge candidate list derivation process.
b. They may be used for affine merge mode encoding.
i. If the affine flag is 1, then the motion candidates in the lookup table can be added as affine merge candidates.
c) They may be used for other kinds of merge modes such as sub-block merge mode, affine merge mode, triangle merge mode, inter-intra merge mode, MMVD (Merge with MVD) mode.
d. Checking of motion candidates in the lookup table may be enabled if:
i. After inserting the TMVP candidate, the merge candidate list is not full.
ii. After checking certain spatially neighboring blocks for spatial merge candidate derivation, the merge candidate list is not full.
iii. After all spatial merge candidates, the merge candidate list is not full.
iv. After the combined bi-predictive merge candidates, the merge candidate list is not full.
v. If the number of spatial or temporal (e.g., including spatially adjacent and non-spatially adjacent, TMVP, STMVP, ATMVP, etc.) merge candidates put into the merge candidate list from other coding schemes (e.g., the merge derivation process of the HEVC design, or the JEM design) is less than the maximum allowed merge candidates minus a given threshold.
1. In one example, the threshold is set to 1 or 0.
2. Alternatively, the threshold may be signaled in the SPS/PPS/sequence, picture, slice header/tile or may be pre-defined.
3. Alternatively, the threshold may be adaptively changed per block, e.g., it may depend on coded block information like block size/block shape/slice type and/or it may depend on the number of available spatial or temporal merging candidates.
4. In another example, the number of certain merge candidates that are not already included in the merge candidate list is less than the maximum allowed merge candidates minus a given threshold. The "certain merge candidates" may be spatial candidates like HEVC or non-adjacent merge candidates.
vi. Pruning may be applied before adding motion candidates to the merge candidate list. In various implementations of this and other examples disclosed in this patent specification, pruning may include a) comparing the motion information with existing entries for uniqueness, or b) adding the motion information to the list if unique, or c) if not unique, c1) not adding the motion information, or c2) adding the motion information and removing the matching existing entry. In some implementations, no pruning step is performed when adding motion candidates from the table to the candidate list.
1. In one example, motion candidates may be pruned to all or a portion of spatial or temporal (e.g., including adjacent and non-adjacent spatial, TMVP, STMVP, ATMVP, etc.) merge candidates available from other coding methods in the merge candidate list.
2. Motion candidates may not be pruned to sub-block based motion candidates, e.g., ATMVP, STMVP.
3. In one example, the current motion candidate may be pruned to all or a portion of the available motion candidates in the merge candidate list (inserted before the current motion candidate).
4. The number of pruning steps associated with a motion candidate (e.g., the number of times a motion candidate needs to be compared with other candidates in the merge list) may depend on the number of available spatial or temporal merge candidates. For example, when checking a new motion candidate, if there are M candidates available in the merge list, the new motion candidate can only be compared with the first K candidates (K≦M). If the pruning function returns false (e.g., not identical to any of the first K candidates), the new motion candidate is considered to be different from all M candidates and can be added to the merge candidate list. In one example, K is set to min(K, 2).
5. In one example, only compare the newly added motion candidate with the first N candidates in the merge candidate list, for example N=3, 4 or 5. N may be signaled from the encoder to the decoder.
6. In one example, the new motion candidate being checked is only compared to the last N candidates in the merge candidate list, for example N=3, 4 or 5. N may be signaled from the encoder to the decoder.
7. In one example, the way in which candidates previously added to the list are selected from the table and compared to the new motion candidates may depend on where the previously added candidates were derived from.
In one example, motion candidates in a look-up table may be compared to candidates derived from given temporal and/or spatial neighboring blocks.
b. In one example, different entries of motion candidates in the lookup table may be compared to different candidates that were previously added (i.e., derived from different positions).
.
e. Checking of motion candidates in a lookup table may be enabled before checking other merge (or affine merge or other inter-coding method) candidates, such as those derived from adjacent/non-adjacent spatial or temporal blocks.
f. If there is at least one motion candidate in the lookup table, checking of motion candidates in the lookup table may be enabled.

例Ｃ３：ルックアップテーブルに含まれる動き候補は、ブロックの動き情報を符号化するための予測因子として用いられてもよい使用してもよい。
ａ．それらをＡＭＶＰモード符号化に使用してもよく、すなわち、ＡＭＶＰ候補リスト導出処理において動き候補をチェックしてもよい。
ｂ．それらは、ＭＶＤの一部のみを符号化するＳＭＶＤ（ＳｙｍｍｅｔｒｉｃＭｏｔｉｏｎＶｅｃｔｏｒＤｉｆｆｅｒｅｎｃｅ）符号化に使用してもよい（例えば、１つの参照ピクチャリストに対して信号通知されたＭＶＤのみであり、別の参照ピクチャリストから導出される）。
ｃ．それらは、ＭＶの一部のみを符号化するＳＭＶ（ＳｙｍｍｅｔｒｉｃＭｏｔｉｏｎＶｅｃｔｏｒ）符号化に使用してもよい（例えば、１つの参照ピクチャリストに対して信号通知されたもののみであり、別の参照ピクチャリストから導出される）。
ｄ．以下の場合、ルックアップテーブルにおける動き候補のチェックを有効にしてもよい。
ｉ．ＴＭＶＰ候補をチェックまたは挿入した後、ＡＭＶＰ候補リストが満杯になっていない。
ｉｉ．ＡＭＶＰ候補リストが、空間的近傍から選択し、プルーニングした後で、ＴＭＶＰ候補を挿入する直前には、満杯になっていない。
ｉｉｉ．上側の近傍のブロックからのＡＭＶＰ候補がスケーリング無しで存在しない場合、および／または、左側の近傍のブロックからのＡＭＶＰ候補がスケーリング無しで存在しない場合。
ｉｖ．特定のＡＭＶＰ候補を挿入した後、ＡＭＶＰ候補リストが満杯になっていない。
ｖ．ＡＭＶＰ候補リストに動き候補を追加する前に、プルーニングを適用してもよい。
ｖｉ．実施例Ｃ２のｖｉ．３および４に記載されたものと同様の規則は、ＡＭＶＰモードに適用されてよい。
ｅ．隣接／非隣接の空間的または時間的ブロックから導出されるような、他のＡＭＶＰ（またはＳＭＶＤ／ＳＭＶ／アフィンインターまたは他のインター符号化方法）候補をチェックする前に、動き候補のチェックを有効にしてもよい。
ｆ．ルックアップテーブルに少なくとも１つの動き候補がある場合、動き候補のチェックを有効にしてもよい。
ｇ．現在の参照ピクチャと同一の参照ピクチャを有する（すなわち、ＰＯＣ（Ｐｉｃｔｕｒｅ－Ｏｒｄｅｒ－Ｃｏｕｎｔ）が同一である）動き候補をチェックする。すなわち、動き候補が現在の参照ピクチャと同一の参照ピクチャを含む場合、対応する動きベクトルは、ＡＭＶＰ候補リスト構成処理を考慮してもよい。
ｉ．代替的に、更に、現在の参照ピクチャとは異なる参照ピクチャを有する動き候補も（スケーリングされたＭＶにて）チェックする。すなわち、動き候補が現在の参照ピクチャとは異なる参照ピクチャを有する場合、対応する動きベクトルは、ＡＭＶＰ候補リスト構築処理を考慮してもよい。
ｉｉ．代替的に、まず、現在の参照ピクチャと同一の参照ピクチャを有するすべての動き候補をチェックし、次に、現在の参照ピクチャとは異なる参照ピクチャを有する動き候補をチェックする。すなわち、同一の参照ピクチャを有する動き候補に対して、より高い優先順位を割り当てる。
ｉｉｉ．代替的に、動き候補は、マージにおいて同様にチェックされる。
ｉｖ．１つの動き候補が双方向予測候補である場合、まず参照ピクチャリストＸの参照ピクチャ（例えば、参照ピクチャの参照ピクチャインデックスまたはピクチャ順序カウンタ）をチェックし、次に現在の参照対象ピクチャリストがＸである場合、参照ピクチャリストＹ（Ｙ！＝Ｘ、例えば、Ｙ＝１－Ｘ）の参照ピクチャをチェックしてもよい。
ｖ．代替的に、１つの動き候補が双方向予測候補である場合、まず参照ピクチャリストＹ（Ｙ！＝Ｘ，例えば、Ｙ＝１－Ｘ）の参照ピクチャ（例えば、参照ピクチャの参照ピクチャインデックスまたはピクチャ順序カウンタ）をチェックし、次に現在の参照対象ピクチャリストがＸである場合、参照ピクチャリストＸの参照ピクチャをチェックしてもよい。
ｖｉ．代替的に、参照ピクチャリストの参照ピクチャが、すべてのチェック対象の動き候補に関連付けられたＹ（Ｙ！＝Ｘ、例えば、Ｙ＝１－Ｘ）である前に、参照ピクチャリストの参照ピクチャが、すべてのチェック対象の動き候補に関連付けられたＸであるかどうかをチェックしてもよい。 Example C3: The motion candidates contained in the lookup table may be used as predictors for encoding the motion information of the block.
They may be used for AMVP mode coding, i.e., motion candidates may be checked in the AMVP candidate list derivation process.
b. They may be used for Symmetric Motion Vector Difference (SMVD) coding, which codes only a part of the MVD (e.g. only the MVD signaled for one reference picture list and derived from another reference picture list).
c. They may be used for Symmetric Motion Vector (SMV) coding, which codes only a part of the MVs (e.g. only those signaled for one reference picture list and derived from another reference picture list).
d. Checking of motion candidates in the lookup table may be enabled if:
i. After checking or inserting a TMVP candidate, the AMVP candidate list is not full.
ii. The AMVP candidate list is not full after selecting and pruning from spatial neighborhoods and immediately before inserting the TMVP candidate.
iii. If no AMVP candidates from the upper neighboring block exist without scaling, and/or no AMVP candidates from the left neighboring block exist without scaling.
iv. The AMVP candidate list is not full after inserting a particular AMVP candidate.
v. Before adding motion candidates to the AMVP candidate list, pruning may be applied.
vi. Rules similar to those described in vi. 3 and 4 of embodiment C2 may be applied to the AMVP mode.
e. Checking of motion candidates may be enabled before checking other AMVP (or SMVD/SMV/Affine Inter or other Inter coding methods) candidates, such as those derived from adjacent/non-adjacent spatial or temporal blocks.
f. If there is at least one motion candidate in the lookup table, motion candidate checking may be enabled.
g. Check for motion candidates that have the same reference picture as the current reference picture (i.e., have the same Picture-Order-Count (POC)). That is, if a motion candidate contains the same reference picture as the current reference picture, the corresponding motion vector may be considered for the AMVP candidate list construction process.
Alternatively, also check (in scaled MV) for motion candidates that have a different reference picture than the current reference picture, i.e. if a motion candidate has a different reference picture than the current reference picture, the corresponding motion vector may be considered for the AMVP candidate list building process.
Alternatively, first check all motion candidates that have the same reference picture as the current reference picture, and then check motion candidates that have a different reference picture from the current reference picture, i.e. assign a higher priority to motion candidates that have the same reference picture.
iii. Alternatively, motion candidates are checked in the merge as well.
iv. If one motion candidate is a bi-directional prediction candidate, it may first check the reference picture in reference picture list X (e.g., the reference picture index or picture order counter of the reference picture), and then check the reference picture in reference picture list Y (Y!=X, e.g., Y=1-X) if the current referenced picture list is X.
v. Alternatively, if one motion candidate is a bi-prediction candidate, one may first check the reference pictures (e.g., the reference picture index or picture order counter of the reference picture) in reference picture list Y (Y!=X, e.g., Y=1-X), and then check the reference pictures in reference picture list X if the current referenced picture list is X.
Alternatively, a reference picture in the reference picture list may be checked to see if it is an X associated with all the motion candidates to be checked before the reference picture in the reference picture list is an Y (Y!=X, e.g., Y=1-X) associated with all the motion candidates to be checked.

例Ｃ４：ルックアップテーブルにおける動き候補のチェック順序は、以下のように規定される（Ｋ（Ｋ≧１）個の動き候補をチェックすることができるとする）：
ａ．ルックアップテーブルにおける最後のＫ個の動き候補。
ｂ．最初のＫ％Ｌ候補であって、Ｌは、Ｋ≧Ｌである場合のルックアップテーブルのサイズである。
ｃ．Ｋ≧Ｌである場合、ルックアップテーブルにおけるすべての候補（Ｌ個の候補）。
ｄ．代替的に、さらに、動き候補インデックスの降順に基づいてもよい。
ｅ．代替的に、さらに、動き候補インデックスの昇順に基づいてもよい。
ｆ．代替的に、動き候補に関連付けられた位置と現在のブロックの距離などの候補情報に基づいて、Ｋ個の動き候補を選択する。
ｉ．一例において、Ｋ個の最も近い動き候補を選択する。
ｉｉ．一例において、距離を算出する際に、候補情報は、ブロックの形状をさらに考慮されてもよい。
ｇ．一例において、Ｌ個の候補を含むテーブルからの動き候補のＫのチェック順序は、次のように規定されてもよい：ａ_０、ａ_０＋Ｔ_０、ａ_０＋Ｔ_０＋Ｔ_１、ａ_０＋Ｔ_０＋Ｔ_１＋Ｔ_２、．．．ａ_０＋Ｔ_０＋Ｔ_１＋Ｔ_２＋．．．＋Ｔ_Ｋ－１のインデックスを持つ候補を順に選択し、ａ_０とＴ_ｉ（ｉは０．．．Ｋ－１である）は整数値である。
ｉ．一例において、ａ_０は、０（すなわち、テーブルにおける動き候補の最初のエントリ）に設定される。代替的に、ａ_０は（Ｋ－Ｌ／Ｋ）に設定される。演算「／」は、結果をゼロに切り捨てる整数除算として規定される。代替的に、ａ_０は、０とＬ／Ｋとの間の任意の整数に設定される。
１．代替的に、ａ_０の値は、現在のブロックおよび近傍のブロックの符号化情報に依存してもよい。
ｉｉ．一例において、すべての間隔Ｔ_ｉ（ｉは０…Ｋ－１である）は同じであり、例えばＬ／Ｋである。演算「／」は、結果をゼロに切り捨てる整数除算として規定される。
ｉｉｉ．一例において、（Ｋ，Ｌ，ａ_０，Ｔ_ｉ）は、（４，１６，０，４）、または（４，１２，０，３）、または（４，８，０，１）、または（４，１６，３，４）、または（４，１２，２，３）、または（４，８，１，２）に設定される。Ｔ_ｉはすべてのｉについて同じである。
ｉｖ．このような方法は、ＫがＬよりも小さい場合にのみ適用されてもよい。
ｖ．代替的に、Ｋが閾値以上である場合、例Ｃ４のｃ部を適用してもよい。閾値は、Ｌとして規定されてもよく、またはＫに依存してもよく、またはブロックごとに適応的に変更されてもよい。一例において、閾値は、ルックアップテーブルから新しい動き候補を追加する前のリストにおける利用可能な動き候補の数に依存してもよい。
ｈ．一例において、Ｌ個の候補を含むテーブルからの動き候補のＫのチェック順序は、次のように規定されてもよい：ａ_０、ａ_０－Ｔ_０、ａ_０－Ｔ_０－Ｔ_１、ａ_０－Ｔ_０－Ｔ_１－Ｔ_２、．．．ａ_０－Ｔ_０－Ｔ_１－Ｔ_２－．．．－Ｔ_Ｋ－１のインデックスを持つ候補を順に選択し、ａ_０とＴ_ｉ（ｉは０．．．Ｋ－１である）は整数値である。
ｉ．一例において、ａ_０は、Ｌ－１（すなわち、テーブルにおける動き候補の最後のエントリ）に設定される。代替的に、ａ_０は、Ｌ－１－Ｌ／ＫとＬ－１との間の任意の整数に設定される。
ｉｉ．一例において、すべての間隔Ｔ_ｉ（ｉは０…Ｋ－１である）は同じであり、例えばＬ／Ｋである。
ｉｉｉ．一例において、（Ｋ，Ｌ，ａ_０，Ｔ_ｉ）は、（４，１６，Ｌ－１，４）、または（４，１２，Ｌ－１，３）、または（４，８，Ｌ－１，１）、または（４，１６，Ｌ－４，４）、または（４，１２，Ｌ－３，３）、または（４，８，Ｌ－２，２）に設定される。Ｔ_ｉはすべてのｉについて同じである。
ｉｖ．このような方法は、ＫがＬよりも小さい場合にのみ適用されてもよい。
ｖ．代替的に、更に、Ｋが閾値以上である場合、例Ｃ４のｃ部を適用してもよい。閾値は、Ｌとして規定されてもよく、またはＫに依存してもよく、またはブロックごとに適応的に変更されてもよい。一例において、閾値は、ルックアップテーブルから新しい動き候補を追加する前のリストにおける利用可能な動き候補の数に依存してもよい。
ｉ．ルックテーブルから動き候補を選択する数および／または方法は、符号化情報、例えばブロックサイズ／ブロック形状に依存してもよい。
ｉ．一例において、より小さいブロックサイズの場合、最後のＫ個の動き候補を選択する代わりに、（最後から始まらない）他のＫ個の動き候補を選択してもよい。
ｉｉ．一例において、符号化情報は、ＡＭＶＰモードであってもよいし、マージモードであってもよい。
ｉｉｉ．一例において、符号化情報は、アフィンモードまたは非アフィンＡＭＶＰモードまたは非アフィンマージモードであってもよい。
ｉｖ．一例において、符号化情報は、アフィンＡＭＶＰ（インター）モード、またはアフィンマージモード、または非アフィンＡＭＶＰモード、または非アフィンマージモードであってもよい。
ｖ．一例において、符号化情報は、ＣＰＲ（ＣｕｒｒｅｎｔＰｉｃｔｕｒｅＲｅｆｅｒｅｎｃｅ）モードであってもよいし、ＣＰＲモードでなくてもよい。
ｖｉ．代替的に、ルックアップテーブルから動き候補を選択する方法は、ルックアップテーブルにおける動き候補の数、および／または、ルックアップテーブルから新しい動き候補を追加する前のリストにおける利用可能な動き候補の数にさらに依存してもよい。
ｊ．一例において、チェック対象のルックアップテーブルにおける利用可能な動き候補の最大数（すなわち、マージ／ＡＭＶＰ候補リストに追加されてもよい）は、ルックアップテーブルにおける利用可能な動き候補の数（Ｎ_{ａｖａｉＭＣｉｎＬＵＴ}によって示され
る）、および／または、追加対象の（これは、予め規定されていてもよいし、信号通知されていてもよい）最大許容動き候補の数（ＮＵＭ_{ｍａｘＭＣ}によって示される）、および／または、ルックアップテーブルから候補をチェックする前の候補リストにおける利用可能な動き候補の数（Ｎ_{ａｖａｉＣ}によって示される）に依存していてもよい。
ｉ．一例において、チェック対象のルックアップテーブルにおける動き候補の最大数は、（Ｎ_{ａｖａｉＭＣｉｎＬＵＴ}，ＮＵＭ_{ｍａｘＭＣ}，Ｎ_{ａｖａｉＣ}）の最小値に設定さ
れる。
ｉｉ．代替的に、チェック対象のルックアップテーブルにおける動き候補の最大数は、（Ｎ_{ａｖａｉＭＣｉｎＬＵＴ}，ＮＵＭ_{ｍａｘＭＣ}－Ｎ_{ａｖａｉＣ}）の最小値に設定され
る。
ｉｉｉ．一例において、Ｎ_{ａｖａｉＣ}は、空間的または時間的（隣接および／または非隣接）な近傍のブロックから導出された挿入候補の数を示す。代替的に、サブブロック候補（ＡＭＴＶＰ、ＳＴＭＶＰなど）の数は、Ｎ_{ａｖａｉＣ}にカウントされない。
ｉｖ．ＮＵＭ_{ｍａｘＭＣ}は、符号化モードに依存してもよく、例えば、マージモードおよびＡＭＶＰモードにおいて、ＮＵＭ_{ｍａｘＭＣ}は異なる値に設定されてもよい。一例において、マージモードの場合、ＮＵＭ_{ｍａｘＭＣ}は、４、６、８、１０などに設定してもよく、ＡＭＶＰモードの場合、ＮＵＭ_{ｍａｘＭＣ}は、１、２、４などに設定してもよい。
ｖ．代替的に、ＮＵＭ_{ｍａｘＭＣ}は、ブロックサイズ、ブロック形状、スライスタイプなどのような他の符号化情報に依存してもよい。
ｋ．異なるルックアップテーブルのチェック順序は、次のサブセクションのルックアップテーブルの使用法で規定されている。
ｌ．一旦、マージ／ＡＭＶＰ候補リストが、最大許容候補数に達すると、このチェック処理は終了する。
ｍ．一旦、マージ／ＡＭＶＰ候補リストが、最大許容候補数から閾値（Ｔｈ）を減算した値に達すると、このチェック処理は終了する。一例において、Ｔｈは、例えば、１、２、または３など、正の整数値として予め規定されてもよい。代替的に、Ｔｈは、ブロックごとに適応的に変更されてもよい。代替的に、Ｔｈは、ＳＰＳ／ＰＰＳ／スライスヘッダ等において信号通知されてもよい。代替的に、Ｔｈは、さらに、ブロック形状／ブロックサイズ／符号化モードなどに依存してもよい。代替的に、Ｔｈは、ＬＵＴからの動き候補を追加する前の利用可能な候補の数に依存してもよい。
ｎ．代替的に、追加された動き候補の数が最大許容動き候補数に達すると、それは終了する。最大許容動き候補数は、信号通知されてもよく、または予め規定されてもよい。代替的に、最大許容動き候補数は、ブロック形状／ブロックサイズ／符号化モード等にさらに依存してもよい。
ｏ．テーブルサイズを示す１つのシンタックス要素ならびにチェック可能な動き候補の数（すなわち、Ｋ＝Ｌ）は、ＳＰＳ、ＰＰＳ、スライスヘッダ、タイルヘッダにおいて信号通知されてもよい。 Example C4: The checking order of motion candidates in the lookup table is defined as follows (assuming that K (K≧1) motion candidates can be checked):
The last K motion candidates in the lookup table.
b. The first K % L candidates, where L is the size of the lookup table, where K>L.
c. If K≧L, then all candidates in the lookup table (L candidates).
d. Alternatively, it may also be based on descending order of motion candidate index.
e. Alternatively, it may also be based on the ascending order of the motion candidate index.
f) Alternatively, select the K motion candidates based on candidate information such as the distance of the current block to the position associated with the motion candidate.
i. In one example, select the K closest motion candidates.
ii. In one example, the candidate information may further take into account the shape of the block when calculating the distance.
g. In one example, the checking order of K motion candidates from a table containing L candidates may be defined as follows: _a0 , _a0 + _T0 , _a0 + _T0 + _T1 , a0+ _T0 + _T1 + _T2 , ... _a0 + _T0 + _T1 + _T2 ₊ ...+ _TK-1 , selecting candidates with indices _a0 and T _i (i=0...K-1) in order, where a0 and T i are integer values.
i. In one example, _a0 is set to 0 (i.e., the first entry of the motion candidates in the table). Alternatively, _a0 is set to (K-L/K). The operation "/" is defined as integer division that truncates the result towards zero. Alternatively, _a0 is set to any integer between 0 and L/K.
1. Alternatively, the value of _a0 may depend on the coding information of the current block and neighboring blocks.
In one example, all the intervals T _i (i=0...K-1) are the same, say L/K. The operation "/" is defined as integer division that truncates the result towards zero.
In one example, (K, L, _a0 , _Ti ) is set to (4, 16, 0, 4), or (4, 12, 0, 3), or (4, 8, 0, 1), or (4, 16, 3, 4), or (4, 12, 2, 3), or (4, 8, 1, 2), where _Ti is the same for all i.
iv. Such methods may only be applied if K is less than L.
v. Alternatively, if K is greater than or equal to a threshold, part c of example C4 may be applied. The threshold may be defined as L, or may depend on K, or may be adaptively changed on a block-by-block basis. In one example, the threshold may depend on the number of available motion candidates in the list before adding new motion candidates from the lookup table.
h. In one example, the checking order of K motion candidates from a table containing L candidates may be defined as follows: _a0 , _a0 - _T0 , _a0 - _T0 - _T1 , _a0 - _T0 - _T1 - _T2 , ... _a0 - _T0 - _T1 - _T2 -...- _TK-1 , selecting candidates with indices _a0 and T _i (i is 0...K-1) in order.
In one example, a ₀ is set to L−1 (i.e., the last entry of the motion candidates in the table). Alternatively, a ₀ is set to any integer between L−1-L/K and L−1.
ii. In one example, all the intervals T _i (i=0...K-1) are the same, say L/K.
In one example, (K, L, a ₀ , T _i ) is set to (4, 16, L-1, 4), or (4, 12, L-1, 3), or (4, 8, L-1, 1), or (4, 16, L-4, 4), or (4, 12, L-3, 3), or (4, 8, L-2, 2), where _{T i} is the same for all i.
iv. Such methods may only be applied if K is less than L.
v. Alternatively, part c of example C4 may also be applied if K is greater than or equal to a threshold. The threshold may be defined as L, or may depend on K, or may be adaptively changed on a block-by-block basis. In one example, the threshold may depend on the number of available motion candidates in the list before adding new motion candidates from the lookup table.
i. The number and/or manner of selecting motion candidates from the look-table may depend on the coding information, e.g., block size/block shape.
i. In one example, for smaller block sizes, instead of selecting the last K motion candidates, one may select the other K motion candidates (not starting from the end).
ii. In one example, the coding information may be in AMVP mode or in merge mode.
iii. In one example, the coding information may be in affine mode or non-affine AMVP mode or non-affine merge mode.
iv. In one example, the coding information may be in affine AMVP (inter) mode, or affine merge mode, or non-affine AMVP mode, or non-affine merge mode.
v. In one example, the encoded information may or may not be in Current Picture Reference (CPR) mode.
vi. Alternatively, the method of selecting a motion candidate from the lookup table may further depend on the number of motion candidates in the lookup table and/or the number of available motion candidates in the list before adding a new motion candidate from the lookup table.
j. In one example, the maximum number of available motion candidates in the lookup table to be checked (i.e., that may be added to the merge/AMVP candidate list) may depend on the number of available motion candidates in the lookup table (indicated by N _avaiMCinLUT ) and/or the maximum number of allowed motion candidates to be added (which may be predefined or signaled) (indicated by NUM _maxMC ) and/or the number of available motion candidates in the candidate list before checking candidates from the lookup table (indicated by N _avaiC ).
i. In one example, the maximum number of motion candidates in the lookup table to be checked is set to the minimum of (N _avaiMCinLUT , NUM _maxMC , N _avaiC ).
ii. Alternatively, the maximum number of motion candidates in the lookup table to be checked is set to the minimum of (N _avaiMCinLUT , NUM _maxMC −N _avaiC ).
In one example, N _avaiC indicates the number of insertion candidates derived from spatially or temporally (adjacent and/or non-adjacent) neighboring blocks. Alternatively, the number of sub-block candidates (such as AMTVP, STMVP, etc.) is not counted in N _avaiC .
iv. _{NUM_maxMC} may depend on the coding mode, e.g., in merge mode and AMVP mode, _{NUM_maxMC} may be set to different values. In one example, for merge mode, _{NUM_maxMC} may be set to 4, 6, 8, 10, etc., and for AMVP mode, _{NUM_maxMC} may be set to 1, 2, 4, etc.
v. Alternatively, _{NUM_maxMC} may depend on other coding information such as block size, block shape, slice type, etc.
k. The order in which the different lookup tables are checked is specified in the next subsection, Lookup Table Usage.
l. Once the merge/AMVP candidate list reaches the maximum allowed number of candidates, this checking process ends.
m. Once the merge/AMVP candidate list reaches the maximum allowed number of candidates minus a threshold (Th), the checking process ends. In one example, Th may be predefined as a positive integer value, e.g., 1, 2, or 3. Alternatively, Th may be adaptively changed per block. Alternatively, Th may be signaled in the SPS/PPS/slice header, etc. Alternatively, Th may further depend on the block shape/block size/coding mode, etc. Alternatively, Th may depend on the number of available candidates before adding the motion candidates from the LUT.
Alternatively, it terminates when the number of added motion candidates reaches the maximum allowed number of motion candidates. The maximum allowed number of motion candidates may be signaled or predefined. Alternatively, the maximum allowed number of motion candidates may further depend on block shape/block size/coding mode etc.
o. One syntax element indicating the table size as well as the number of motion candidates that can be checked (i.e., K=L) may be signaled in the SPS, PPS, slice header, tile header.

いくつかの実装形態において、ルックアップテーブルにおける動き候補は他の候補を導出するために用いられてもよく、導出された候補はブロックを符号化するために用いられてもよい。 In some implementations, the motion candidates in the lookup table may be used to derive other candidates, and the derived candidates may be used to encode the block.

いくつかの実装形態において、１つのブロックの動き情報符号化のためのルックアップテーブルの使用の有効／無効は、ＳＰＳ、ＰＰＳ、スライスヘッダ、タイルヘッダ、ＣＴＵ、ＣＴＢ、ＣＵ、またはＰＵ、複数のＣＴＵ／ＣＴＢ／ＣＵ／ＰＵを覆う領域において信号通知されてもよい。 In some implementations, the enable/disable of the use of a lookup table for motion information coding of a block may be signaled in the SPS, PPS, slice header, tile header, CTU, CTB, CU, or PU, an area covering multiple CTUs/CTBs/CUs/PUs.

いくつかの実装形態において、ルックアップテーブルからの予測を適用するかどうかは、さらに符号化情報に依存してもよい。１つのブロックに適用しないと推測される場合、予測の指示の追加の信号通知はスキップされる。代替的に、１つのブロックに適用しないと推測される場合、ルックアップテーブルの動き候補にアクセスする必要はなく、関連する動き候補のチェックは省略される。 In some implementations, whether to apply a prediction from the lookup table may further depend on the coding information. If it is inferred not to apply to a block, the additional signaling of the prediction indication is skipped. Alternatively, if it is inferred not to apply to a block, there is no need to access the motion candidates in the lookup table and the check of the associated motion candidates is omitted.

いくつかの実装形態において、以前符号化されたフレーム／スライス／タイルにおけるルックアップテーブルの動き候補を使用して、異なるフレーム／スライス／タイルにおけるブロックの動き情報を予測してもよい。
ａ．一例において、現在のブロックの参照ピクチャに関連付けられたルックアップテーブルのみを、現在のブロックを符号化するために利用してもよい。
ｂ．一例において、現在のブロックを符号化するために、現在のブロックの同じスライスタイプおよび／または同じ量子化パラメータを有するピクチャに関連付けられたルックアップテーブルのみを利用してもよい。 In some implementations, motion candidates in a look-up table in a previously coded frame/slice/tile may be used to predict motion information of a block in a different frame/slice/tile.
In one example, only the look-up table associated with the reference picture of the current block may be utilized to code the current block.
b. In one example, only look-up tables associated with pictures having the same slice type and/or the same quantization parameter of the current block may be utilized to code the current block.

ルックアップテーブルの更新 Update lookup table

動き情報を有するブロックを符号化した後（すなわち、ＩｎｔｒａＢＣモード、インター符号化モード）に、１つ以上のルックアップテーブルを更新してもよい。 After encoding a block with motion information (i.e., IntraBC mode, inter coding mode), one or more lookup tables may be updated.

上述したすべての例および実装形態において、ルックアップテーブルは、符号化情報、または以前符号化されたブロックからの符号化情報から導出された情報を復号化順に示す。
ａ．ルックアップテーブルは、並進動き情報、またはアフィン動き情報、またはアフィンモデルパラメータ、またはイントラモード情報、または照明補償情報等を含んでもよい。
ｂ．代替的に、ルックアップテーブルは、並進動き情報、またはアフィン動き情報、またはアフィンモデルパラメータ、またはイントラモード情報、または照明補償情報等のような情報を少なくとも２種類含んでもよい。 In all of the above examples and implementations, the lookup table indicates coded information, or information derived from coded information from previously coded blocks, in decoding order.
The lookup table may contain translational motion information, or affine motion information, or affine model parameters, or intra-mode information, or lighting compensation information, etc.
b. Alternatively, the lookup table may contain at least two types of information, such as translational motion information, or affine motion information, or affine model parameters, or intra-mode information, or lighting compensation information, etc.

追加の例示的な実施形態 Additional exemplary embodiments

以前符号化されたブロックの動き情報としてＨＭＶＰ候補を規定する、ＨＭＶＰ（Ｈｉｓｔｏｒｙ－ｂａｓｅｄＭＶＰ）方法が提案される。符号化／復号化処理中、複数のＨＭＶＰ候補を有するテーブルが維持される。新しいスライスに遭遇した場合、テーブルは空になる。インター符号化されたブロックがあるときはいつでも、関連する動き情報を新しいＨＭＶＰ候補としてテーブルの最後のエントリに加える。全体の符号化フローを図３０に示す。 A History-based MVP (HMVP) method is proposed, which defines HMVP candidates as motion information of previously coded blocks. During the coding/decoding process, a table with multiple HMVP candidates is maintained. When a new slice is encountered, the table is emptied. Whenever there is an inter-coded block, the associated motion information is added to the last entry of the table as a new HMVP candidate. The overall coding flow is shown in Figure 30.

一例において、テーブルサイズはＬ（例えば、Ｌ＝１６または６、または４４）に設定され、これは、Ｌ個までのＨＭＶＰ候補をテーブルに追加することができることを示す。 In one example, the table size is set to L (e.g., L=16 or 6 or 44), indicating that up to L HMVP candidates can be added to the table.

１つの実施形態（例１１．ｇ．ｉに対応する）において、以前符号化されたブロックからのＨＭＶＰ候補がＬ個よりも多く存在する場合、テーブルが常に最新の以前符号化されたＬ個の動き候補を含むように、先入れ先出し（ＦＩＦＯ：Ｆｉｒｓｔ－Ｉｎ－Ｆｉｒｓｔ－Ｏｕｔ）規則が適用される。図３１は、ＦＩＦＯ規則を適用してＨＭＶＰ候補を除去し、提案される方法で使用されるテーブルに新しいものを追加する例を示す。 In one embodiment (corresponding to Example 11.g.i), if there are more than L HMVP candidates from previously coded blocks, a First-In-First-Out (FIFO) rule is applied so that the table always contains the most recent L previously coded motion candidates. Figure 31 shows an example of applying the FIFO rule to remove HMVP candidates and add new ones to the table used in the proposed method.

別の実施形態（発明１１．ｇ．ｉｉｉに対応する）において、新しい動き候補を追加するときはいつでも（例えば、現在のブロックがインター符号化され、非アフィンモードであるなど）、まず、冗長性チェック処理を適用し、ＬＵＴに同じまたは類似した動き候補があるかどうかを識別する。 In another embodiment (corresponding to invention 11.g.iii), whenever a new motion candidate is added (e.g., the current block is inter-coded and in non-affine mode), we first apply a redundancy check process to identify whether there is a same or similar motion candidate in the LUT.

いくつかの例を以下に示す。 Some examples are shown below:

図３２Ａは、新しい動き候補を追加する前に、ＬＵＴが満杯であった場合の例を示す。 Figure 32A shows an example where the LUT is full before adding new motion candidates.

図３２Ｂは、新しい動き候補を追加する前には、ＬＵＴが満杯でなかった場合の例を示す。 Figure 32B shows an example where the LUT was not full before adding new motion candidates.

図３２Ａおよび図３２Ａは、ともに、冗長性除去に基づくＬＵＴ更新方法（１つの冗長性動き候補を除去する）の例を示す。 Figures 32A and 32B both show an example of a LUT update method based on redundancy elimination (removing one redundant motion candidate).

図３３Ａおよび図３３Ｂは、冗長性除去に基づくＬＵＴ更新方法（複数の冗長性動き候補を除去する、図では２つの候補を示す）の２つの場合の実装例を示す。 Figures 33A and 33B show two example implementations of the redundancy removal based LUT update method (removing multiple redundant motion candidates, two candidates are shown in the figure).

図３３Ａは、新しい動き候補を追加する前に、ＬＵＴが満杯であった場合の例を示す。 Figure 33A shows an example where the LUT is full before adding new motion candidates.

図３３Ｂは、新しい動き候補を追加する前に、ＬＵＴが満杯でなかった場合の例を示す。 Figure 33B shows an example where the LUT was not full before adding new motion candidates.

ＨＭＶＰ候補は、マージ候補リスト構築処理において使用され得る。ＴＭＶＰ候補の後に、テーブルにおける最後のエントリから最初のエントリ（または最後のＫ０のＨＭＶＰ、例えば、Ｋ０＝１６または６）までのすべてのＨＭＶＰ候補を挿入する。ＨＭＶＰ候補に対してプルーニングを適用する。利用可能なマージ候補の総数が信号通知された最大許容マージ候補に達すると、マージ候補リスト構築処理を終了する。代替的に、加算された動き候補の総数が、所与の値に達すると、ＬＵＴからの動き候補の取り出しを終了する。 HMVP candidates may be used in the merge candidate list construction process. After the TMVP candidate, insert all HMVP candidates from the last entry in the table to the first entry (or the last K0 HMVP, e.g., K0=16 or 6). Apply pruning to the HMVP candidates. End the merge candidate list construction process when the total number of available merge candidates reaches the signaled maximum allowed merge candidates. Alternatively, end the extraction of motion candidates from the LUT when the total number of added motion candidates reaches a given value.

同様に、ＨＭＶＰ候補は、ＡＭＶＰ候補リスト構築処理において使用されてもよい。ＴＭＶＰ候補の後に、テーブルにおける最後のＫ１個のＨＭＶＰ候補の動きベクトルを挿入する。ＡＭＶＰ対象参照ピクチャと同じ参照ピクチャを有するＨＭＶＰ候補のみを用いて、ＡＭＶＰ候補リストを構築する。ＨＭＶＰ候補に対してプルーニングを適用する。一例において、Ｋ１は４に設定される。 Similarly, HMVP candidates may be used in the AMVP candidate list construction process. Insert the motion vectors of the last K1 HMVP candidates in the table after the TMVP candidates. Construct the AMVP candidate list using only HMVP candidates that have the same reference picture as the AMVP target reference picture. Apply pruning to the HMVP candidates. In one example, K1 is set to 4.

図２８は、映像処理装置２８００のブロック図である。装置２８００は、本明細書に記載の方法の１つ以上を実装するために使用してもよい。装置２８００は、スマートフォン、タブレット、コンピュータ、ＩｏＴ（ＩｎｔｅｒｎｅｔｏｆＴｈｉｎｇｓ）受信機等により実装されてもよい。装置２８００は、１つ以上のプロセッサ２８０２と、１つ以上のメモリ２８０４と、映像処理ハードウェア２８０６と、を含んでもよい。１つまたは複数のプロセッサ２８０２は、本明細書に記載される１つ以上の方法を実装するように構成されてもよい。１または複数のメモリ２８０４は、本明細書で説明される方法および技術を実装するために使用されるデータおよびコードを記憶するために使用してもよい。映像処理ハードウェア２８０６は、本明細書に記載される技術をハードウェア回路にて実装するために用いられてよい。 28 is a block diagram of a video processing device 2800. The device 2800 may be used to implement one or more of the methods described herein. The device 2800 may be implemented by a smartphone, a tablet, a computer, an Internet of Things (IoT) receiver, etc. The device 2800 may include one or more processors 2802, one or more memories 2804, and video processing hardware 2806. The one or more processors 2802 may be configured to implement one or more of the methods described herein. The one or more memories 2804 may be used to store data and codes used to implement the methods and techniques described herein. The video processing hardware 2806 may be used to implement the techniques described herein in a hardware circuit.

図２９は、映像復号化方法２９００の例のフローチャートである。方法２９００は、テーブルを維持すること（２９０２）を含み、各テーブルは、動き候補のセットを含み、各動き候補は、対応する動き情報に関連付けられる。方法２９００は、さらに、第１の映像ブロックと、第１の映像ブロックを含む映像のビットストリーム表現との間で変換（２９０４）を行うことを含み、変換を行うことは、動き候補のセットのうちの少なくとも一部を予測因子として使用して第１の映像ブロックの動き情報を処理することを含む。 29 is a flowchart of an example video decoding method 2900. The method 2900 includes maintaining (2902) tables, each table including a set of motion candidates, each motion candidate associated with corresponding motion information. The method 2900 further includes converting (2904) between the first video block and a bitstream representation of the video including the first video block, the converting including processing the motion information of the first video block using at least a portion of the set of motion candidates as predictors.

方法２９００に対して、いくつかの実施形態において、動き情報は、予測方向、参照ピクチャインデックス、動きベクトル値、強度補償フラグ、アフィンフラグ、動きベクトル差精度または動きベクトル差分値のうち少なくとも１つを含む。さらに、動き情報は、動き情報がどこから来ているかを示すために、ブロック位置情報をさらに含んでもよい。いくつかの実施形態において、映像ブロックはＣＵまたはＰＵであり、映像の一部は１つ以上の映像スライスまたは１つ以上の映像ピクチャに対応してよい。 For method 2900, in some embodiments, the motion information includes at least one of a prediction direction, a reference picture index, a motion vector value, an intensity compensation flag, an affine flag, a motion vector difference precision, or a motion vector difference value. Additionally, the motion information may further include block location information to indicate where the motion information comes from. In some embodiments, the video block is a CU or a PU, and the portion of the video may correspond to one or more video slices or one or more video pictures.

いくつかの実施形態において、各ＬＵＴは、関連するカウンタを含み、カウンタは、映像の部分の最初においてゼロ値に初期化され、映像の部分における各符号化された映像領域ごとに増加される。映像領域は、符号化ツリーユニット、符号化ツリーブロック、符号化ユニット、符号化ブロックまたは予測ユニットのうちの１つを含む。いくつかの実施形態において、カウンタは、対応するＬＵＴに対して、対応するＬＵＴから除去された動き候補の数を示す。いくつかの実施形態において、動き候補のセットはすべてのＬＵＴに対して同じサイズを有していてもよい。いくつかの実施形態において、映像の一部は１つの映像スライスに対応し、ＬＵＴの数はＮ＊Ｐに等しく、Ｎは１つの復号化スレッド当たりのＬＵＴを表す整数であり、Ｐは１つの映像スライスにおける最大符号化ユニットの行の数またはタイルの数を表す整数である。方法２９００のさらなる詳細は、第４章に提供される実施例および以下に列挙される実施例に記載される。 In some embodiments, each LUT includes an associated counter, which is initialized to a zero value at the beginning of the portion of the video and is incremented for each coded video region in the portion of the video. The video region includes one of a coding tree unit, a coding tree block, a coding unit, a coding block, or a prediction unit. In some embodiments, the counter indicates, for a corresponding LUT, the number of motion candidates that have been removed from the corresponding LUT. In some embodiments, the set of motion candidates may have the same size for all LUTs. In some embodiments, the portion of the video corresponds to a video slice, and the number of LUTs is equal to N*P, where N is an integer representing the number of LUTs per decoding thread, and P is an integer representing the number of rows or tiles of the largest coding unit in a video slice. Further details of method 2900 are described in the examples provided in Chapter 4 and in the examples listed below.

上述した方法／技術の特徴および実施形態を以下に説明する。 Features and embodiments of the above-mentioned methods/techniques are described below.

１．テーブルを維持することであって、各テーブルは、動き候補のセットを含み、各動き候補は、対応する動き情報に関連付けられる、ことと、第１の映像ブロックと、第１の映像ブロックを含む映像のビットストリーム表現との間で変換を行うことであって、変換を行うことは、動き候補のセットのうちの少なくとも一部を予測因子として使用して第１の映像ブロックの動き情報を処理することを含む、ことと、を有する映像処理方法。 1. A video processing method comprising: maintaining tables, each table including a set of motion candidates, each motion candidate being associated with corresponding motion information; and converting between a first video block and a bitstream representation of a video including the first video block, the converting including processing the motion information of the first video block using at least a portion of the set of motion candidates as predictors.

２．前記テーブルは、前記第１の映像ブロックの前に復号化された、以前復号化された映像ブロックから導出した動き候補を含む、第１項に記載の方法。 2. The method of claim 1, wherein the table includes motion candidates derived from previously decoded video blocks that were decoded before the first video block.

３．前記変換を行うことは、動き候補の前記セットの少なくとも一部を使用して、ＡＭＶＰ（ＡｄｖａｎｃｅｄＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）候補リスト導出処理を行うこと含む、第１項に記載の方法。 3. The method of claim 1, wherein performing the transformation includes performing an Advanced Motion Vector Prediction (AMVP) candidate list derivation process using at least a portion of the set of motion candidates.

４．前記ＡＭＶＰ候補リスト導出処理は、１つ以上のテーブルからの動き候補をチェックすることを含む、第３項に記載の方法。 4. The method of claim 3, wherein the AMVP candidate list derivation process includes checking motion candidates from one or more tables.

５．前記変換を行うことは、動き候補をチェックすることを含み、前記チェックされた動き候補に関連付けられた動きベクトルは、前記第１の映像ブロックの前記動きベクトルを符号化するための動きベクトル予測因子として使用する、第１～４項のいずれか１項に記載の方法。 5. The method according to any one of claims 1 to 4, wherein performing the transformation includes checking motion candidates, and a motion vector associated with the checked motion candidate is used as a motion vector predictor for encoding the motion vector of the first video block.

６．チェックされた動き候補に関連付けられた動きベクトルは、前記ＡＭＶＰ動き候補リストに追加される、第４項に記載の方法。 6. The method of claim 4, wherein the motion vector associated with the checked motion candidate is added to the AMVP motion candidate list.

７．前記変換を行うことは、規則に基づいて動き候補の少なくとも一部をチェックすることを含む、第１項に記載の方法。 7. The method of claim 1, wherein performing the transformation includes checking at least a portion of the motion candidates based on a rule.

８．前記規則は、ＴＭＶＰ（ＴｅｍｐｏｒａｌＭｏｔｉｏｎＶｅｃｔｏｒＰｒｅｄｉｃｔｉｏｎ）候補をチェックした後、ＡＭＶＰ候補リストが満杯でない場合、前記チェックを有効にする、第７項に記載の方法。 8. The method of claim 7, wherein the rule enables the check if the AMVP candidate list is not full after checking the TMVP (Temporal Motion Vector Prediction) candidates.

９．前記規則は、空間的近傍から選択し、プルーニングした後、ＴＭＶＰ候補を挿入する前に、ＡＭＶＰ候補リストが満杯でない場合、前記チェックを有効にすることを有効にする、第７項に記載の方法。 9. The method of claim 7, wherein the rule enables the check if the AMVP candidate list is not full after selecting and pruning from spatial neighborhoods and before inserting the TMVP candidate.

１０．前記規則は、ｉ）上側の近傍のブロックからのＡＭＶＰ候補がスケーリング無しで存在しない場合、または、ｉｉ）左側の近傍のブロックからのＡＭＶＰ候補がスケーリング無しで存在しない場合に、前記チェックを有効にする、第７項に記載の方法。 10. The method of claim 7, wherein the rule enables the check if i) no AMVP candidates from the upper neighboring block exist without scaling, or ii) no AMVP candidates from the left neighboring block exist without scaling.

１１．規則は、ＡＭＶＰ候補リストに動き候補を追加する前に、プルーニングを適用する場合、前記チェックを有効にする、第７項に記載の方法。 11. The method of claim 7, wherein the rule enables the check if pruning is applied before adding a motion candidate to the AMVP candidate list.

１２．現在の参照ピクチャと同一の参照ピクチャを有する動き候補をチェックする、第１項に記載の方法。 12. The method of claim 1, checking for motion candidates that have the same reference picture as the current reference picture.

１３．現在の参照ピクチャと異なる参照ピクチャを有する動き候補をさらにチェックする、第１２項に記載の方法。 13. The method of claim 12, further checking motion candidates that have a different reference picture than the current reference picture.

１４．同一の参照ピクチャを有する動き候補をチェックすることは、異なる参照ピクチャを有する動き候補をチェックすることに先立って実行される、第１３項に記載の方法。 14. The method of claim 13, wherein checking motion candidates with the same reference picture is performed prior to checking motion candidates with different reference pictures.

１５．動き候補から動きベクトルをテーブルに追加する前に、プルーニング工程を含むＡＭＶＰ候補リスト構成処理をさらに含む、第１項に記載の方法。 15. The method of claim 1, further comprising an AMVP candidate list construction process including a pruning step prior to adding motion vectors from the motion candidates to the table.

１６．前記プルーニング工程は、動き候補を、ＡＭＶＰ候補リストにおける利用可能な動き候補の少なくとも一部と比較することを含む、第１５項に記載の方法。 16. The method of claim 15, wherein the pruning step includes comparing the motion candidates with at least a portion of the available motion candidates in an AMVP candidate list.

１７．前記プルーニング工程は、複数の工程を含み、その数は、複数の空間的または時間的ＡＭＶＰ候補の関数である、第１５項に記載の方法。 17. The method of claim 15, wherein the pruning step includes multiple steps, the number of which is a function of the number of spatial or temporal AMVP candidates.

１８．前記複数の工程は、ＡＭＶＰ候補リストにおいてＭ個の候補が利用可能である場合に、前記プルーニングが、Ｋ個のＡＭＶＰ候補にのみ適用され、Ｋ≦Ｍであり、ＫおよびＭは整数である、第１７項に記載の方法。 18. The method of claim 17, wherein the steps are such that if M candidates are available in the AMVP candidate list, the pruning is applied to only K AMVP candidates, where K≦M and K and M are integers.

１９．前記変換を行うことは、いくつかの動きベクトルの差分を用いて、ＳＭＶＤ（ＳｙｍｍｅｔｒｉｃＭｏｔｉｏｎＶｅｃｔｏｒＤｉｆｆｅｒｅｎｃｅ）処理を行うことを含む、第１項に記載の方法。 19. The method according to claim 1, wherein the conversion includes performing Symmetric Motion Vector Difference (SMVD) processing using differences between several motion vectors.

２０．前記変換を行うことは、いくつかの動きベクトルを用いて、ＳＭＶ（ＳｙｍｍｅｔｒｉｃＭｏｔｉｏｎＶｅｃｔｏｒ）処理を行うことを含む、第１項に記載の方法。 20. The method according to claim 1, wherein the conversion includes performing SMV (Symmetric Motion Vector) processing using several motion vectors.

２１．前記規則は、あるＡＭＶＰ候補を挿入した後に、ＡＭＶＰ候補リストが満杯でない場合、前記チェックを有効とする、第７項に記載の方法。 21. The method of claim 7, wherein the rule enables the check if the AMVP candidate list is not full after inserting an AMVP candidate.

２２．前記テーブルにおける動き候補のチェックを有効とすることを更に有し、前記チェックは、空間的または時間的ブロックから導出した他の候補をチェックする前に有効にされ、他の候補は、ＡＭＶＰ候補、ＳＭＶＤ候補、ＳＭＶ候補、またはアフィンインター候補を含む、第１項に記載の方法。 22. The method of claim 1, further comprising enabling checking of motion candidates in the table, the checking being enabled before checking other candidates derived from spatial or temporal blocks, the other candidates including AMVP candidates, SMVD candidates, SMV candidates, or affine inter candidates.

２３．テーブルにおける動き候補のチェックを有効とすることを更に有し、前記チェックは、前記テーブルに少なくとも１つの動き候補がある場合に有効になる、第１項に記載の方法。 23. The method of claim 1, further comprising enabling a check of motion candidates in a table, the check being enabled if there is at least one motion candidate in the table.

２４．双方向予測候補である動き候補の場合、第２の参照ピクチャリストの参照ピクチャがチェックされる前に、第１の参照ピクチャリストの参照ピクチャがチェックされ、前記第１の参照ピクチャリストは現在の参照対象ピクチャリストである、第１項に記載の方法。 24. The method of claim 1, in which, for a motion candidate that is a bidirectional prediction candidate, a reference picture in a first reference picture list is checked before a reference picture in a second reference picture list is checked, and the first reference picture list is a current reference target picture list.

２５．双方向予測候補である動き候補の場合、第２の参照ピクチャリストの参照ピクチャがチェックされる前に、第１の参照ピクチャリストの参照ピクチャがチェックされ、前記第２の参照ピクチャリストは現在の参照対象ピクチャリストである、第１または２項に記載の方法。 25. The method according to claim 1 or 2, in which, for a motion candidate that is a bidirectional prediction candidate, a reference picture in a first reference picture list is checked before a reference picture in a second reference picture list is checked, and the second reference picture list is a current reference target picture list.

２６．第１の参照ピクチャリストの参照ピクチャは、第２の参照ピクチャリストの参照ピクチャの前にチェックされる、第１項に記載の方法。 26. The method of claim 1, wherein a reference picture in a first reference picture list is checked before a reference picture in a second reference picture list.

２７．前記変換を行うことは、前記第１の映像ブロックから前記ビットストリーム表現を生成することを含む、第１項に記載の方法。 27. The method of claim 1, wherein performing the conversion includes generating the bitstream representation from the first video block.

２８．前記変換を行うことは、前記第１の映像ブロックをビットストリーム表現から生成することを含む、第１項に記載の方法。 28. The method of claim 1, wherein performing the conversion includes generating the first video block from a bitstream representation.

２９．動き候補は、予測方向、参照ピクチャインデックス、動きベクトル値、強度補償フラグ、アフィンフラグ、動きベクトル差精度、または動きベクトル差分値のうち少なくとも１つを含む動き情報に関連付けられる、第１～２８項のいずれか１項に記載の方法。 29. The method according to any one of claims 1 to 28, wherein the motion candidates are associated with motion information including at least one of a prediction direction, a reference picture index, a motion vector value, an intensity compensation flag, an affine flag, a motion vector difference precision, or a motion vector differential value.

３０．１つの動き候補は、イントラ符号化されたブロックに用いられるイントラ予測モードに関連付けられる、第１～２９項のいずれか１項に記載の方法。 30. The method of any one of claims 1 to 29, wherein one motion candidate is associated with an intra prediction mode used for an intra-coded block.

３１．１つの動き候補は、ＩＣ符号化されたブロックに用いられる複数のＩＣ（ＩｌｌｕｍｉｎａｔｉｏｎＣｏｍｐｅｎｓａｔｉｏｎ）パラメータに関連付けられる、第１～２９項のいずれか１項に記載の方法。 31. A method according to any one of claims 1 to 29, in which one motion candidate is associated with multiple IC (Illumination Compensation) parameters used for an IC-coded block.

３２．１つの動き候補は、フィルタリング処理に用いられるフィルタパラメータに関連付けられる、第１～２９項のいずれか１項に記載の方法。 32. A method according to any one of claims 1 to 29, in which one motion candidate is associated with a filter parameter used in the filtering process.

３３．前記変換に基づいて、１つ以上のテーブルを更新することをさらに含む、第１～２９項のいずれか１項に記載の方法。 33. The method of any one of claims 1 to 29, further comprising updating one or more tables based on the transformation.

３４．１つ以上のテーブルを前記更新することは、前記変換を行った後、前記第１の映像ブロックの動き情報に基づいて１つ以上のテーブルを更新することを含む、第３３項に記載の方法。 34. The method of claim 33, wherein updating one or more tables includes updating one or more tables based on motion information of the first video block after performing the transformation.

３５．前記更新されたテーブルに基づいて、前記映像の後続の映像ブロックと前記映像のビットストリーム表現との間で変換を行うことをさらに含む、第３４項に記載の方法。 35. The method of claim 34, further comprising converting between subsequent video blocks of the video and a bitstream representation of the video based on the updated table.

３６．プロセッサと、命令が記憶された非一時的メモリとを備える装置であって、前記命令は、前記プロセッサにより実行された際に、前記プロセッサに、第１項～３５項のいずれか一項に記載の方法を実施させる装置。 36. An apparatus comprising a processor and a non-transitory memory having instructions stored therein, the instructions, when executed by the processor, causing the processor to perform a method according to any one of paragraphs 1 to 35.

３７．第１～３５項のいずれか１項に記載の方法を実行するためのプログラムコードを含む、非一時的なコンピュータ可読媒体に記憶されたコンピュータプログラム製品。 37. A computer program product stored on a non-transitory computer-readable medium, comprising program code for executing the method according to any one of claims 1 to 35.

以上、説明の目的で本開示の技術の特定の実施形態を説明したが、本発明の範囲から逸脱することなく様々な修正が可能であることは、理解されるであろう。従って、本開示の技術は、添付の特許請求の範囲による場合を除き、限定されない。 While specific embodiments of the disclosed technology have been described above for purposes of illustration, it will be understood that various modifications may be made without departing from the scope of the invention. Accordingly, the disclosed technology is not to be limited except as by the appended claims.

本明細書に記載された開示された、およびその他の実施形態、モジュール、および機能操作の実装形態は、本明細書に開示された構造およびその構造的等価物を含め、デジタル電子回路、またはコンピュータソフトウェア、ファームウェア、若しくはハードウェアで実施されてもよく、またはそれらの１つ以上の組み合わせで実施してもよい。開示された、およびその他の実施形態は、１つ以上のコンピュータプログラム製品、すなわち、データ処理装置によって実装されるため、またはデータ処理装置の操作を制御するために、コンピュータ可読媒体上に符号化されたコンピュータプログラム命令の１つ以上のモジュールとして実施することができる。このコンピュータ可読媒体は、機械可読記憶装置、機械可読記憶基板、記憶装置、機械可読伝播信号をもたらす物質の組成物、またはこれらの１つ以上の組み合わせであってもよい。「データ処理装置」という用語は、例えば、プログラマブルプロセッサ、コンピュータ、または複数のプロセッサ、若しくはコンピュータを含む、データを処理するためのすべての装置、デバイス、および機械を含む。この装置は、ハードウェアの他に、当該コンピュータプログラムの実行環境を作るコード、例えば、プロセッサファームウェア、プロトコルスタック、データベース管理システム、オペレーティングシステム、またはこれらの１つ以上の組み合わせを構成するコードを含んでもよい。伝播信号は、人工的に生成した信号、例えば、機械で生成した電気、光、または電磁信号であり、適切な受信装置に送信するための情報を符号化するために生成される。 Implementations of the disclosed and other embodiments, modules, and functional operations described herein, including the structures disclosed herein and their structural equivalents, may be implemented in digital electronic circuitry, or computer software, firmware, or hardware, or in one or more combinations thereof. The disclosed and other embodiments may be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a computer-readable medium for implementation by or for controlling the operation of a data processing apparatus. The computer-readable medium may be a machine-readable storage device, a machine-readable storage substrate, a storage device, a composition of matter that provides a machine-readable propagated signal, or one or more combinations thereof. The term "data processing apparatus" includes all apparatus, devices, and machines for processing data, including, for example, a programmable processor, a computer, or multiple processors, or computers. In addition to hardware, the apparatus may include code that creates an environment for the execution of the computer program, such as code that constitutes a processor firmware, a protocol stack, a database management system, an operating system, or one or more combinations thereof. A propagated signal is an artificially generated signal, for example a machine-generated electrical, optical, or electromagnetic signal, that is generated to encode information for transmission to an appropriate receiving device.

コンピュータプログラム（プログラム、ソフトウェア、ソフトウェアアプリケーション、スクリプト、またはコードとも呼ばれる）は、コンパイルされた言語または解釈された言語を含む任意の形式のプログラミング言語で記述することができ、それは、スタンドアロンプログラムとして、またはコンピューティング環境で使用するのに適したモジュール、コンポーネント、サブルーチン、または他のユニットとして含む任意の形式で展開することができる。コンピュータプログラムは、必ずしもファイルシステムにおけるファイルに対応するとは限らない。プログラムは、他のプログラムまたはデータを保持するファイルの一部（例えば、マークアップ言語文書に格納された１つ以上のスクリプト）に記録されていてもよいし、当該プログラム専用の単一のファイルに記憶されていてもよいし、複数の調整ファイル（例えば、１つ以上のモジュール、サブプログラム、またはコードの一部を格納するファイル）に記憶されていてもよい。１つのコンピュータプログラムを、１つのサイトに位置する１つのコンピュータ、または複数のサイトに分散され通信ネットワークによって相互接続される複数のコンピュータで実行させるように展開可能である。 A computer program (also called a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file in a file system. A program may be recorded as part of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), may be stored in a single file dedicated to the program, or may be stored in multiple coordinating files (e.g., files that store one or more modules, subprograms, or portions of code). A computer program can be deployed to run on one computer located at one site, or on multiple computers distributed across multiple sites and interconnected by a communications network.

本明細書に記載されたプロセスおよびロジックフローは、入力データ上で動作し、出力を生成することによって機能を実行するための１つ以上のコンピュータプログラムを実行する１つ以上のプログラマブルプロセッサによって行うことができる。プロセスおよびロジックフローはまた、特別目的のロジック回路、例えば、ＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）またはＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）によって実行することができ、装置はまた、特別目的のロジック回路として実装することができる。 The processes and logic flows described herein may be performed by one or more programmable processors executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows may also be performed by, and devices may also be implemented as, special purpose logic circuits, such as Field Programmable Gate Arrays (FPGAs) or Application Specific Integrated Circuits (ASICs).

コンピュータプログラムの実行に適したプロセッサは、例えば、汎用および専用マイクロプロセッサの両方、並びに任意の種類のデジタルコンピュータの任意の１つ以上のプロセッサを含む。一般的に、プロセッサは、リードオンリーメモリまたはランダムアクセスメモリまたはその両方から命令およびデータを受信する。コンピュータの本質的な要素は、命令を実行するためのプロセッサと、命令およびデータを記憶するための１つ以上の記憶装置とである。一般的に、コンピュータは、データを記憶するための１つ以上の大容量記憶デバイス、例えば、磁気、光磁気ディスク、または光ディスクを含んでもよく、またはこれらの大容量記憶デバイスからデータを受信するか、またはこれらにデータを転送するように動作可能に結合されてもよい。しかしながら、コンピュータは、このようなデバイスを有する必要はない。コンピュータプログラム命令およびデータを記憶するのに適したコンピュータ可読媒体は、あらゆる形式の不揮発性メモリ、媒体、および記憶装置を含み、例えば、ＥＰＲＯＭ、ＥＥＰＲＯＭ、フラッシュ記憶装置、磁気ディスク、例えば内部ハードディスクまたはリムーバブルディスク、光磁気ディスク、およびＣＤ－ＲＯＭおよびＤＶＤ－ＲＯＭディスク等の半導体記憶装置を含む。プロセッサおよびメモリは、専用ロジック回路によって補完されてもよく、または専用ロジック回路に組み込まれてもよい。 Processors suitable for executing computer programs include, for example, both general purpose and special purpose microprocessors, as well as any one or more processors of any kind of digital computer. Typically, a processor receives instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more storage devices for storing instructions and data. Typically, a computer may include one or more mass storage devices, e.g., magnetic, magneto-optical, or optical disks, for storing data, or may be operatively coupled to receive data from or transfer data to these mass storage devices. However, a computer need not have such devices. Computer-readable media suitable for storing computer program instructions and data include all forms of non-volatile memory, media, and storage devices, including, for example, EPROM, EEPROM, flash storage devices, magnetic disks, e.g., internal hard disks or removable disks, magneto-optical disks, and semiconductor storage devices such as CD-ROM and DVD-ROM disks. The processor and memory may be supplemented by, or incorporated in, special purpose logic circuitry.

この特許明細書は多くの詳細を含むが、これらは、任意の発明の範囲または特許請求の範囲を限定するものと解釈されるべきではなく、むしろ、特定の発明の特定の実施形態に特有であり得る特徴の説明と解釈されるべきである。本特許明細書において別の実施形態の文脈で説明されている特定の特徴は、１つの例において組み合わせて実装してもよい。逆に、単一の例の文脈で説明された様々な特徴は、複数の実施形態において別個にまたは任意の適切なサブコンビネーションで実装してもよい。さらに、特徴は、特定の組み合わせで作用するものとして上記に記載され、最初にそのように主張されていてもよいが、主張された組み合わせからの１つ以上の特徴は、場合によっては、組み合わせから抜粋されることができ、主張された組み合わせは、サブ組み合わせまたはサブ組み合わせのバリエーションに向けられてもよい。 While this patent specification contains many details, these should not be construed as limiting the scope of any invention or the scope of the claims, but rather as descriptions of features that may be specific to particular embodiments of a particular invention. Certain features described in this patent specification in the context of another embodiment may be implemented in combination in one example. Conversely, various features described in the context of a single example may be implemented in multiple embodiments separately or in any suitable subcombination. Furthermore, although features may be described above as acting in a particular combination and initially claimed as such, one or more features from a claimed combination may, in some cases, be extracted from the combination, and the claimed combination may be directed to a subcombination or a variation of the subcombination.

同様に、動作は図面において特定の順番で示されているが、これは、所望の結果を達成するために、このような動作が示された特定の順番でまたは連続した順番で実行されること、または示された全ての操作が実行されることを必要とするものと理解されるべきではない。また、本特許明細書に記載されている例における様々なシステムの構成要素の分離は、全ての実施形態においてこのような分離を必要とするものと理解されるべきではない。 Similarly, although operations are shown in a particular order in the figures, this should not be understood as requiring that such operations be performed in the particular order or sequential order shown, or that all of the operations shown be performed, to achieve desired results. Additionally, the separation of various system components in the examples described in this patent specification should not be understood as requiring such separation in all embodiments.

いくつかの実装形態および例のみが記載されており、この特許明細書に記載され図示されている内容に基づいて、他の実施形態、拡張および変形が可能である。 Only some implementations and examples have been described, and other embodiments, extensions and variations are possible based on what is described and illustrated in this patent specification.

Claims

constructing a motion candidate list for a current video block, during said constructing, at least one motion candidate in a table is selectively checked in order after checking at least one block each having a predefined relative position with respect to the current video block;
determining motion information for the current video block using the motion candidate list;
coding the current video block based on the determined motion information; and
having
the table includes one or more motion candidates derived from one or more previously coded video blocks that were coded before the current video block;
A method of video processing, wherein an arrangement of the motion candidates in the table is based on an order of addition of the motion candidates to the table.

The method of claim 1, wherein the at least one block comprises a temporal block in a different picture than the picture that contains the current video block.

The method of claim 1 or 2, wherein the at least one block comprises a given spatial block located in the same picture as the picture containing the current video block.

the at least one block includes a last spatial block of a plurality of spatial blocks checked in sequence;
The method of claim 1 , wherein the spatial blocks are located in the same picture as a picture containing the current video block.

The method of any one of claims 1 to 4, wherein the motion candidate list is a motion vector prediction list.

The method of claim 5, further comprising adding a motion vector of the at least one motion candidate of the table to the motion vector prediction list.

The method of claim 5 or 6, wherein checking the at least one motion candidate in the table includes checking reference picture information of the motion candidate.

The method of any one of claims 1 to 4, wherein the motion candidate list is a merge candidate list.

The method of claim 8, further comprising adding the at least one motion candidate from the table to the merge candidate list.

The method of claim 9, wherein the at least one motion candidate in the table is checked before checking other merge candidates associated with the spatial or temporal block.

The method according to any one of claims 1 to 10, wherein the motion vector of the at least one motion candidate of the table or the at least one motion candidate is added to the motion candidate list after all of the at least one motion candidate derived from the at least one block.

The method of any one of claims 1 to 11, wherein the predefined relative position of the at least one block is different for different objects to be coded.

If at least one condition is met, at least one motion candidate in the table is checked;
13. The method of claim 1, wherein the at least one condition is based on a current number of motion candidates in the motion candidates table after checking the at least one block or a current number of motion candidates in the table.

The method of claim 13, wherein the at least one condition further includes the current number of motion candidates in the motion candidate list not reaching a particular value.

The method of claim 14, wherein the particular value is associated with a maximum allowable number of motion candidates in the motion candidate list.

The method of any one of claims 13 to 15, wherein the at least one condition includes the current number of motion candidates in the table being greater than zero.

The method of any one of claims 1 to 16, wherein the motion candidates of the table are associated with motion information including at least one of a prediction direction, a reference picture index, a motion vector value, an intensity compensation flag, an affine flag, a motion vector differential precision, a motion vector differential value, or a filtering parameter.

The method of any one of claims 1 to 17, further comprising updating the table based on the motion information of the current block.

The method of any one of claims 1 to 18, wherein one or more motion candidates in the table are deleted due to adding a new motion candidate to the table if the table is full before the new motion candidate is added to the table.

The method of any one of claims 1 to 19, wherein the current block is coded in a non-affine inter mode or an affine inter mode.

21. The method of claim 1, further comprising adding a motion vector of the at least one motion candidate of the table to the motion candidate list based on a result of the checking.

The method of any one of claims 1 to 21, wherein the coding includes decoding the current video block from the video block.

The method of any one of claims 1 to 21, wherein the coding comprises encoding the current video block into a bitstream representation.

1. An apparatus for coding video data having a processor and a non-transitory memory having instructions, comprising:
The instructions, when executed by the processor, cause the processor to:
constructing a motion candidate list for a current video block of a video, during said constructing, at least one motion candidate in a table is selectively checked in order after checking at least one block each having a predefined relative position with respect to the current video block;
determining motion information for the current video block using the motion candidate list;
coding the current video block based on the determined motion information; and
Run the command,
the table includes one or more motion candidates derived from one or more previously coded video blocks that were coded before the current video block;
An apparatus, wherein an ordering of the motion candidates in the table is based on an order of addition of the motion candidates to the table.

The processor:
constructing a motion candidate list for a current video block of a video, during said constructing, at least one motion candidate in a table is selectively checked in order after checking at least one block each having a predefined relative position with respect to the current video block;
determining motion information for the current video block using the motion candidate list;
coding the current video block based on the determined motion information; and
Run the command,
the table includes one or more motion candidates derived from one or more previously coded video blocks that were coded before the current video block;
A non-transitory computer-readable storage medium having stored thereon instructions, wherein an ordering of the motion candidates in the table is based on an order of addition of the motion candidates to the table.

1. A method for storing a video bitstream, comprising:
constructing a motion candidate list for a current video block of the image, during said construction at least one motion candidate in a table being checked selectively in sequence after checking at least one block each having a predefined relative position with respect to the current video block;
determining motion information for the current video block using the motion candidate list;
encoding the current video block into the bitstream based on the determined motion information; and
storing the bitstream on a non- transitory computer-readable recording medium;
the table includes one or more motion candidates derived from one or more previously coded video blocks that were coded before the current video block;
A method according to claim 1, wherein an ordering of the motion candidates in the table is based on an order of addition of the motion candidates to the table.