JPWO2015141696A1

JPWO2015141696A1 - Image decoding apparatus, image encoding apparatus, and prediction apparatus

Info

Publication number: JPWO2015141696A1
Application number: JP2016508750A
Authority: JP
Inventors: 知宏猪飼; 健史筑波
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2014-03-18
Filing date: 2015-03-17
Publication date: 2017-04-13
Also published as: WO2015141696A1

Abstract

DBBP has the problem that three of its processes impose a large processing load: the interpolation processing that generates two interpolated images, the partition mode derivation processing that derives a partition mode from a depth image, and the synthesis processing that combines the two interpolated images according to the segmentation. Disclosed is a depth-based block prediction image generation device comprising a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images into one motion-compensated image, wherein the image interpolation unit generates the two motion-compensated images by bilinear prediction. Also disclosed are a depth-based block prediction image generation device that derives the partition mode from the pixels at the four corners of a depth block; one that performs the synthesis by selecting, at each pixel of the block, one of the two interpolated images; and one that does not perform bi-prediction.

Description

The present invention relates to an image decoding device, an image encoding device, and a prediction device.

Among multi-view image coding techniques, disparity-predictive coding, which reduces the amount of information by predicting the disparity between images when images of a plurality of viewpoints are encoded, and decoding methods corresponding to that coding method have been proposed. A vector representing the disparity between viewpoint images is called a displacement vector. A displacement vector is a two-dimensional vector having a horizontal element (x component) and a vertical element (y component), and is calculated for each block, a block being a region obtained by dividing one image. Images of a plurality of viewpoints are generally acquired using cameras placed at the respective viewpoints. In multi-view coding, each viewpoint image is encoded as a separate layer among a plurality of layers. A method of encoding a moving image composed of a plurality of layers is generally called scalable coding or hierarchical coding. Scalable coding achieves high coding efficiency by performing prediction between layers. The layer that serves as the reference and is encoded without inter-layer prediction is called the base layer, and the other layers are called enhancement layers. Scalable coding in which the layers are composed of viewpoint images is called view scalable coding. In that case, the base layer is also called the base view and an enhancement layer is also called a non-base view. Furthermore, scalable coding in which, in addition to view scalability, the layers are composed of texture layers (image layers) and depth layers (distance image layers) is called three-dimensional scalable coding.

In addition to view scalable coding, scalable coding includes spatial scalable coding (a low-resolution picture is processed as the base layer and a high-resolution picture as the enhancement layer) and SNR scalable coding (a picture with low image quality is processed as the base layer and a picture with high image quality as the enhancement layer). In scalable coding, for example, a base layer picture may be used as a reference picture when an enhancement layer picture is encoded.

Non-Patent Document 1 describes a technique called depth-based block partitioning (DBBP), in which partition information (a segmentation) is derived from a depth image and, using the segmentation as a mask, one prediction image is synthesized from two interpolated images. Because DBBP derives the segmentation by region division based on depth pixels, it allows highly flexible partitioning that is not limited to rectangular shapes (2N×2N, 2N×N, 2N×nU, 2N×nD, N×2N, nL×2N, nR×2N).

Non-Patent Document 2 describes a technique that unifies the disparity vector (DV) used for depth pixel reference in view synthesis prediction (VSP) and in DBBP: both VSP and DBBP use the neighboring-block-based disparity vector (NBDV) before depth refinement.

Non-Patent Document 1: F. Jager, J. Konieczny, and G. Cordara, “CE3: Results on Depth-based Block Partitioning (DBBP)”, JCT3V-G0106, JCT-3V 7th Meeting: San Jose, USA, 11 Jan. - 17 Jan. 2013 (published January 3, 2014)
Non-Patent Document 2: M. W. Park, J. Y. Lee, B. Choi, Y. Cho, C. Kim, “Disparity Vector for DBBP in 3D-HEVC”, JCT3V-H0070, JCT-3V 8th Meeting: Valencia, ES, 29 March - 4 April 2014 (published March 21, 2014)

Non-Patent Document 1 has the problem that VSP and DBBP use different partitioning methods, which complicates implementation.

Non-Patent Document 2 can unify the disparity vector (DV) used for depth pixel reference in VSP and DBBP, but has the problem that coding efficiency decreases.

One aspect of the present invention is a depth-based block prediction image generation device comprising a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images into one motion-compensated image, wherein the image interpolation unit generates the two motion-compensated images by bilinear prediction.

One aspect of the present invention is a depth-based block prediction image generation device comprising a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images into one motion-compensated image, the device further comprising a depth partition mode derivation unit that derives a partition mode from the depth image, wherein the depth partition mode derivation unit derives the partition mode from the pixels at the four corners of a depth block.

In one aspect of the present invention, the depth partition mode derivation unit derives the partition mode from a comparison between the upper-left and lower-right depth pixels and a comparison between the upper-right and lower-left depth pixels.

One aspect of the present invention comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images into one motion-compensated image, and a depth partition mode derivation unit that derives a partition mode, wherein the depth partition mode derivation unit derives a partition mode of either 2N×N or N×2N.
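The four-corner derivation described above can be sketched as follows. This is a minimal illustration, not the patent's normative procedure: the function name `derive_partition_mode`, the use of `<` for the comparisons, and the mapping from the comparison results to 2N×N or N×2N are assumptions. The text only states that the partition (division) mode is derived from the upper-left/lower-right and upper-right/lower-left comparisons and that the result is 2N×N or N×2N.

```python
def derive_partition_mode(depth, x0, y0, size):
    """Return '2NxN' or 'Nx2N' for the size x size depth block at (x0, y0).

    depth is a 2-D list indexed as depth[y][x]. Only the four corner
    pixels of the block are read.
    """
    top_left = depth[y0][x0]
    top_right = depth[y0][x0 + size - 1]
    bottom_left = depth[y0 + size - 1][x0]
    bottom_right = depth[y0 + size - 1][x0 + size - 1]

    # Comparison 1: upper-left against lower-right.
    diag_main = top_left < bottom_right
    # Comparison 2: upper-right against lower-left.
    diag_anti = top_right < bottom_left

    # Assumed mapping (hypothetical): if both comparisons agree, the depth
    # changes mainly from top to bottom, so split horizontally (2NxN);
    # otherwise it changes mainly from left to right, so split vertically.
    return "2NxN" if diag_main == diag_anti else "Nx2N"
```

Because only four pixels are read regardless of block size, the cost of the derivation does not grow with the block, which matches the stated aim of reducing the processing load of partition mode derivation.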

In one aspect of the present invention, in the depth-based block prediction image generation device, the segmentation derivation unit derives segmentation information that takes the value 0 or 1 for each pixel, and the image synthesis unit performs the synthesis by selecting, at each pixel of the block, one of the two interpolated images.
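A minimal sketch of this per-pixel selection is shown below. The function name `synthesize_prediction` and the convention that segmentation value 0 selects the first image and 1 selects the second are assumptions made for illustration; the text only says that the segmentation is 0 or 1 per pixel and that one of the two interpolated images is selected at each pixel.

```python
def synthesize_prediction(pred0, pred1, segmentation):
    """Combine two interpolated images into one prediction image.

    pred0, pred1 and segmentation are 2-D lists of identical shape.
    Assumed convention: segmentation 0 -> take pred0, 1 -> take pred1.
    No blending or filtering is applied; each output pixel is a copy of
    exactly one input pixel.
    """
    return [
        [p1 if seg else p0 for p0, p1, seg in zip(row0, row1, seg_row)]
        for row0, row1, seg_row in zip(pred0, pred1, segmentation)
    ]
```

Pure selection needs no multiplications or rounding per pixel, which is how this variant keeps the synthesis step cheap.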

One aspect of the present invention is an image decoding device comprising the depth-based block prediction image generation device and a DBBP flag decoding unit, wherein the depth-based block prediction image generation device performs DBBP prediction when the DBBP flag is 1.

One aspect of the present invention is an image decoding device comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images into one motion-compensated image, and a partition mode derivation unit that derives a partition mode; the view synthesis prediction means comprises a partitioning unit that performs partitioning based on the depth image and a depth motion vector derivation unit that derives a motion vector from the depth image; and the partition mode derivation unit and the partitioning unit share a common partition mode derivation unit.

One aspect of the present invention is an image decoding device comprising depth-based block prediction image generation means and a merge mode parameter derivation unit, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images into one motion-compensated image; the image decoding device further comprises a DBBP flag decoding unit; and the merge mode parameter derivation unit converts bi-prediction to uni-prediction when the DBBP flag is 1.
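The conversion from bi-prediction to uni-prediction can be sketched as below. The `MergeCandidate` type, its field names, and the choice to keep the L0 motion and drop L1 are all assumptions for illustration; the text only states that bi-prediction is converted to uni-prediction when the DBBP flag is 1.

```python
from dataclasses import dataclass, replace
from typing import Tuple


@dataclass(frozen=True)
class MergeCandidate:
    """Hypothetical container for merge-mode prediction parameters."""
    pred_flag_l0: bool
    pred_flag_l1: bool
    mv_l0: Tuple[int, int]
    mv_l1: Tuple[int, int]


def restrict_to_uni_prediction(cand, dbbp_flag):
    """Return cand unchanged unless DBBP is on and cand is bi-predictive.

    Assumption: when both lists are used, the L0 motion is kept and the
    L1 motion is discarded.
    """
    if dbbp_flag == 1 and cand.pred_flag_l0 and cand.pred_flag_l1:
        return replace(cand, pred_flag_l1=False, mv_l1=(0, 0))
    return cand
```

Since DBBP already generates two motion-compensated images per block (one per segment), forbidding bi-prediction caps the number of interpolations per block at two instead of four.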

One aspect of the present invention is an image decoding device comprising depth-based block prediction image generation means and an inter prediction parameter decoding unit, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images into one motion-compensated image; the image decoding device further comprises a DBBP flag decoding unit; and the inter prediction parameter decoding unit does not decode a value indicating bi-prediction as the inter prediction identifier when the DBBP flag is 1.

One aspect of the present invention is an image encoding device comprising the depth-based block prediction image generation device and a DBBP flag encoding unit, wherein the depth-based block prediction image generation device performs DBBP prediction when the DBBP flag is 1.

One aspect of the present invention is an image encoding device comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images into one motion-compensated image, and a partition mode derivation unit that derives a partition mode; the view synthesis prediction means comprises a partitioning unit that performs partitioning based on the depth and a depth motion vector derivation unit that derives a motion vector from the depth image; and the partition mode derivation unit and the partitioning unit share a common partition mode derivation unit.

One aspect of the present invention is an image encoding device comprising depth-based block prediction image generation means and a merge mode parameter derivation unit, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images into one motion-compensated image; the image encoding device further comprises a DBBP flag encoding unit that encodes the DBBP flag; and the merge mode parameter derivation unit converts bi-prediction to uni-prediction when the DBBP flag is 1.

One aspect of the present invention is an image decoding device comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images into one motion-compensated image, and a partition mode derivation unit that derives a partition mode; the view synthesis prediction means comprises a partitioning unit that performs partitioning based on the depth image and a depth motion vector derivation unit that derives a motion vector from the depth image; and the disparity vector used to derive the position of the depth image referenced by the segmentation derivation unit and the partition mode derivation unit of the depth-based block prediction image generation means and the disparity vector used to derive the position of the depth image in the partitioning unit and the depth motion vector derivation unit of the view synthesis prediction means are a common disparity vector.

One aspect of the present invention is an image encoding device comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation derivation unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images into one motion-compensated image, and a partition mode derivation unit that derives a partition mode; the view synthesis prediction means comprises a partitioning unit that performs partitioning based on the depth image and a depth motion vector derivation unit that derives a motion vector from the depth image; and the disparity vector used to derive the position of the depth image referenced by the segmentation derivation unit and the partition mode derivation unit of the depth-based block prediction image generation means and the disparity vector used to derive the position of the depth image in the partitioning unit and the depth motion vector derivation unit of the view synthesis prediction means are a common disparity vector.

According to the present invention, sharing the block partitioning processing between DBBP and VSP simplifies implementation. Also according to the present invention, sharing the disparity used by DBBP and VSP simplifies implementation.

A block diagram showing the configuration of the DBBP prediction unit 3095 according to the present embodiment.
A schematic diagram showing the configuration of the image transmission system according to the embodiment of the present invention.
A diagram showing the hierarchical structure of the data of the encoded stream according to the present embodiment.
A diagram showing partition mode patterns, in which (a) to (h) show the partition shapes for the partition modes 2N×2N, 2N×N, 2N×nU, 2N×nD, N×2N, nL×2N, nR×2N, and N×N, respectively.
A conceptual diagram showing an example of a reference picture list.
A conceptual diagram showing examples of reference pictures.
A schematic diagram showing the configuration of the image decoding device 31 according to the present embodiment.
A schematic diagram showing the configuration of the inter prediction parameter decoding unit 303 according to the present embodiment.
A schematic diagram showing the configuration of the merge mode parameter derivation unit 3036 according to the present embodiment.
A schematic diagram showing the configuration of the AMVP prediction parameter derivation unit 3032 according to the present embodiment.
A diagram showing an example of a merge candidate list.
A diagram showing the positions of the neighboring blocks referenced by spatial merge candidates.
A diagram showing the depth reference positions used in deriving the partition mode and the split flag according to the present embodiment.
A diagram showing the configuration of the VSP merge candidate derivation unit 30374 (VSP prediction unit 30374) of the present embodiment.
A schematic diagram showing the configuration of the inter prediction parameter decoding control unit 303 according to the present embodiment.
A schematic diagram showing the configuration of the inter prediction image generation unit 309 according to the present embodiment.
A schematic diagram showing the configuration of the residual prediction unit 3092 according to the present embodiment.
A conceptual diagram of residual prediction according to the present embodiment (motion vector case).
A conceptual diagram of residual prediction according to the present embodiment (disparity vector case).
A syntax table relating to the DBBP flag dbbp_flag of the present embodiment.
A block diagram showing the configuration of the DBBP prediction unit 3095C according to the present embodiment.
A diagram explaining the image synthesis unit 30953 according to the present embodiment.
A schematic diagram showing the configuration of the inter prediction parameter decoding control unit 3031A according to the present embodiment.
A diagram explaining the derivation of inter_pred_flag in the inter prediction parameter decoding control unit 3031A according to the present embodiment.
A block diagram showing the configuration of the merge mode parameter derivation unit 3036A according to the present embodiment.
A block diagram showing the configuration of the image encoding device 11 according to the present embodiment.
A schematic diagram showing the configuration of the inter prediction parameter encoding unit 112 according to the present embodiment.
A block diagram showing the configuration of a modification of the DBBP prediction unit 3095C according to the present embodiment.
A flowchart explaining the operation of the image decoding device 31 and the image encoding device 11 configured so that the VSP prediction unit 30374 and the DBBP prediction unit 3095 use a common displacement vector.
A diagram showing the data flow of an example in which the VSP prediction unit 30374 and the DBBP prediction unit 3095 use a common displacement vector.
A diagram showing the data flow of an example in which the VSP prediction unit 30374 and the DBBP prediction unit 3095C use a common displacement vector.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.

FIG. 2 is a schematic diagram showing the configuration of the image transmission system 1 according to the present embodiment.

The image transmission system 1 is a system that transmits codes obtained by encoding a plurality of layer images and displays images obtained by decoding the transmitted codes. The image transmission system 1 includes an image encoding device 11, a network 21, an image decoding device 31, and an image display device 41.

A signal T representing a plurality of layer images (also called texture images) is input to the image encoding device 11. A layer image is an image viewed or captured at a certain resolution and from a certain viewpoint. When view scalable coding is performed, in which a three-dimensional image is encoded using a plurality of layer images, each of the layer images is called a viewpoint image. Here, a viewpoint corresponds to the position of an imaging device or to an observation point. For example, the plurality of viewpoint images are images captured by left and right imaging devices facing the subject. The image encoding device 11 encodes each of these signals to generate an encoded stream Te (encoded data). The encoded stream Te is described in detail later. A viewpoint image is a two-dimensional image (planar image) observed from a certain viewpoint. It is represented, for example, by a luminance value or a color signal value for each pixel arranged in a two-dimensional plane. Hereinafter, one viewpoint image, or a signal representing it, is called a picture. When spatial scalable coding is performed using a plurality of layer images, the layer images consist of a low-resolution base layer image and a high-resolution enhancement layer image. When SNR scalable coding is performed using a plurality of layer images, the layer images consist of a base layer image with low image quality and an enhancement layer image with high image quality. View scalable coding, spatial scalable coding, and SNR scalable coding may be combined arbitrarily. The present embodiment deals with the encoding and decoding of images that include, as the plurality of layer images, at least a base layer image and an image other than the base layer image (an enhancement layer image). For two layers that are in a reference relationship (dependency relationship) with respect to images or coding parameters, the image that is referenced is called the first layer image and the image that references it is called the second layer image. For example, when there is an enhancement layer image (other than the base layer) that is encoded with reference to the base layer, the base layer image is treated as the first layer image and the enhancement layer image as the second layer image. Examples of enhancement layer images include images of viewpoints other than the base view and depth images.

A depth image (also called a depth map or distance image) is an image signal consisting of a signal value (pixel value) for each pixel arranged in a two-dimensional plane, where each signal value (called a "depth value" or simply "depth") corresponds to the distance from the viewpoint (an imaging device or the like) to the subject or background contained in the captured space. The pixels of the depth image correspond to the pixels of the viewpoint image. The depth map therefore serves as a cue for representing the three-dimensional captured space by using the viewpoint image, which is the reference image signal obtained by projecting the captured space onto a two-dimensional plane.

The network 21 transmits the encoded stream Te generated by the image encoding device 11 to the image decoding device 31. The network 21 is the Internet, a wide area network (WAN), a local area network (LAN), or a combination of these. The network 21 is not necessarily limited to a bidirectional communication network and may be a unidirectional or bidirectional communication network that transmits broadcast waves such as terrestrial digital broadcasting or satellite broadcasting. The network 21 may also be replaced by a storage medium on which the encoded stream Te is recorded, such as a DVD (Digital Versatile Disc) or a BD (Blu-ray Disc).

The image decoding device 31 decodes each of the encoded streams Te transmitted by the network 21 and generates a plurality of decoded layer images Td (decoded viewpoint images Td).

The image display device 41 displays all or part of the plurality of decoded layer images Td generated by the image decoding device 31. For example, in view scalable coding, a three-dimensional image (stereoscopic image) or a free-viewpoint image is displayed when all of the images are displayed, and a two-dimensional image is displayed when only part of them is displayed. The image display device 41 includes a display device such as a liquid crystal display or an organic EL (electroluminescence) display. In spatial scalable coding and SNR scalable coding, when the image decoding device 31 and the image display device 41 have high processing capability, a high-quality enhancement layer image is displayed; when they have only lower processing capability, a base layer image is displayed, which does not require processing and display capabilities as high as those needed for the enhancement layer.

<Structure of Encoded Stream Te>
Before describing the image encoding device 11 and the image decoding device 31 according to the present embodiment in detail, the data structure of the encoded stream Te, which is generated by the image encoding device 11 and decoded by the image decoding device 31, will be described.

FIG. 3 is a diagram illustrating the hierarchical structure of data in the encoded stream Te. The encoded stream Te illustratively includes a sequence and a plurality of pictures constituting the sequence. (a) to (f) of FIG. 3 respectively show a sequence layer that defines the sequence SEQ, a picture layer that defines a picture PICT, a slice layer that defines a slice S, a slice data layer that defines slice data, a coding tree layer that defines the coding tree units included in the slice data, and a coding unit layer that defines the coding units (CU) included in a coding tree.

(Sequence layer)
The sequence layer defines a set of data that the image decoding device 31 refers to in order to decode the sequence SEQ to be processed (hereinafter also referred to as the target sequence). As shown in (a) of FIG. 3, the sequence SEQ includes a video parameter set VPS (Video Parameter Set), a sequence parameter set SPS (Sequence Parameter Set), a picture parameter set PPS (Picture Parameter Set), pictures PICT, and supplemental enhancement information SEI (Supplemental Enhancement Information). Here, the value shown after # indicates the layer ID. FIG. 3 shows an example in which encoded data of #0 and #1, that is, layer 0 and layer 1, exist, but the types and number of layers are not limited to this.

The video parameter set VPS defines, for a moving image composed of a plurality of layers, a set of encoding parameters common to a plurality of moving images, as well as sets of encoding parameters related to the plurality of layers included in the moving image and to the individual layers.

The sequence parameter set SPS defines a set of encoding parameters that the image decoding device 31 refers to in order to decode the target sequence. For example, the width and height of the pictures are defined.

The picture parameter set PPS defines a set of encoding parameters that the image decoding device 31 refers to in order to decode each picture in the target sequence. Examples include a reference value of the quantization width used for decoding a picture (pic_init_qp_minus26) and a flag indicating whether weighted prediction is applied (weighted_pred_flag). A plurality of PPSs may exist. In that case, one of the plurality of PPSs is selected for each picture in the target sequence.

(Picture layer)
The picture layer defines a set of data that the image decoding device 31 refers to in order to decode the picture PICT to be processed (hereinafter also referred to as the target picture). As shown in (b) of FIG. 3, the picture PICT includes slices S0 to S(NS-1), where NS is the total number of slices included in the picture PICT.

Hereinafter, when there is no need to distinguish the slices S0 to S(NS-1) from one another, the subscripts of the reference signs may be omitted. The same applies to other subscripted data included in the encoded stream Te described below.

(Slice layer)
The slice layer defines a set of data that the image decoding device 31 refers to in order to decode the slice S to be processed (also referred to as the target slice). As shown in (c) of FIG. 3, the slice S includes a slice header SH and slice data SDATA.

The slice header SH includes a group of encoding parameters that the image decoding device 31 refers to in order to determine the decoding method for the target slice. The slice type designation information (slice_type), which designates the slice type, is one example of an encoding parameter included in the slice header SH.

The slice types that can be designated by the slice type designation information include (1) an I slice, which uses only intra prediction in encoding, (2) a P slice, which uses unidirectional prediction or intra prediction in encoding, and (3) a B slice, which uses unidirectional prediction, bidirectional prediction, or intra prediction in encoding.

The slice header SH may also include a reference (pic_parameter_set_id) to the picture parameter set PPS included in the sequence layer.

(Slice data layer)
The slice data layer defines a set of data that the image decoding device 31 refers to in order to decode the slice data SDATA to be processed. As shown in (d) of FIG. 3, the slice data SDATA includes coding tree blocks (CTB: Coding Tree Block). A CTB is a fixed-size block (for example, 64×64) constituting a slice, and may also be called a largest coding unit (LCU: Largest Coding Unit).

(Coding tree layer)
As shown in (e) of FIG. 3, the coding tree layer defines a set of data that the image decoding device 31 refers to in order to decode the coding tree block to be processed. A coding tree unit is divided by recursive quadtree partitioning. The nodes of the tree structure obtained by the recursive quadtree partitioning are called a coding tree. An intermediate node of the quadtree is a coding tree unit (CTU: Coding Tree Unit), and the coding tree block itself is also defined as the topmost CTU. A CTU includes a split flag (split_flag). When split_flag is 1, the CTU is split into four coding tree units CTU. When split_flag is 0, the coding tree unit CTU is divided into four coding units (CU: Coding Unit). A coding unit CU is a terminal node of the coding tree layer and is not divided any further in this layer. The coding unit CU is the basic unit of the encoding process.

When the size of the coding tree block CTB is 64×64 pixels, the size of a coding unit CU can be any of 64×64 pixels, 32×32 pixels, 16×16 pixels, and 8×8 pixels.
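The list of CU sizes above can be reproduced with a short sketch. It assumes that each quadtree split halves the width and height of a block and that 8×8 is the smallest CU (taken from the list above); the function name and the `min_cu_size` parameter are illustrative and do not appear in the specification.

```python
def possible_cu_sizes(ctb_size, min_cu_size=8):
    """List the square CU side lengths reachable by repeatedly halving the CTB side.

    Assumption: one quadtree split halves both width and height, and
    min_cu_size is the smallest permitted CU side.
    """
    sizes = []
    size = ctb_size
    while size >= min_cu_size:
        sizes.append(size)
        size //= 2  # one quadtree split halves the block side
    return sizes


print(possible_cu_sizes(64))  # [64, 32, 16, 8]
```

For a 64×64 CTB this gives the four sizes named in the text.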

(Coding unit layer)
As shown in (f) of FIG. 3, the coding unit layer defines a set of data that the image decoding device 31 refers to in order to decode the coding unit to be processed. Specifically, a coding unit consists of a CU header CUH, a prediction unit, a transform tree, and a CU header CUF. The CU header CUH specifies, among other things, whether the coding unit uses intra prediction or inter prediction. The CU header CUH also includes a residual prediction index iv_res_pred_weight_idx, which indicates the weight the coding unit uses for residual prediction (or whether residual prediction is performed), and an illumination compensation flag ic_flag, which indicates whether illumination compensation prediction is used. The coding unit is the root of a prediction unit (PU) and a transform tree (TT). The CU header CUF is placed between the prediction unit and the transform tree, or after the transform tree.

In the prediction unit, the coding unit is divided into one or more prediction blocks, and the position and size of each prediction block are defined. In other words, the prediction blocks are one or more non-overlapping regions that make up the coding unit. The prediction unit includes the one or more prediction blocks obtained by this division.

Prediction processing is performed for each prediction block. Hereinafter, a prediction block, which is the unit of prediction, is also referred to as a prediction unit. More precisely, because prediction is performed per color component, in the following a block for a single color component, such as a luminance prediction block or a chrominance prediction block, is called a prediction block, and the blocks of the multiple color components together (the luminance prediction block and the chrominance prediction blocks) are called a prediction unit. A block whose color component index cIdx (colour_component Idx) is 0 is a luminance block (luminance prediction block), usually denoted L or Y, and blocks whose cIdx is 1 or 2 are the Cb and Cr chrominance blocks (chrominance prediction blocks), respectively.

Broadly speaking, there are two kinds of partitioning in the prediction unit: one for intra prediction and one for inter prediction. Intra prediction is prediction within the same picture, and inter prediction is prediction performed between different pictures (for example, between display times or between layer images).

In the case of intra prediction, the partitioning methods are 2N×2N (the same size as the coding unit) and N×N.

In the case of inter prediction, the partitioning method is encoded as the partition mode part_mode in the encoded data. When the size of the target CU is 2N×2N pixels, the partition modes that can be specified by part_mode comprise the following eight patterns in total: four symmetric splittings, namely 2N×2N pixels, 2N×N pixels, N×2N pixels, and N×N pixels, and four asymmetric motion partitions (AMP), namely 2N×nU pixels, 2N×nD pixels, nL×2N pixels, and nR×2N pixels. Here, N = 2^m (m is an arbitrary integer of 1 or more). Hereinafter, a prediction block whose partition mode is an asymmetric partition is also called an AMP block. Because the number of partitions is 1, 2, or 4, a CU contains from one to four PUs. These PUs are denoted PU0, PU1, PU2, and PU3 in order.

FIGS. 4(a) to 4(h) specifically illustrate the positions of the PU partition boundaries in the CU for each partition mode.

FIG. 4(a) shows the 2N×2N partition mode, in which the CU is not partitioned. FIGS. 4(b) and 4(e) show the partition shapes when the partition mode is 2N×N and N×2N, respectively. FIG. 4(h) shows the partition shape when the partition mode is N×N.

FIGS. 4(c), 4(d), 4(f), and 4(g) show the partition shapes for the asymmetric partitions (AMP) 2N×nU, 2N×nD, nL×2N, and nR×2N, respectively.

In FIGS. 4(a) to 4(h), the number assigned to each region is the identification number of that region, and the regions are processed in the order of these identification numbers. That is, the identification numbers represent the scan order of the regions.

For prediction blocks in the case of inter prediction, seven of the eight partition modes above are defined, namely all except N×N (FIG. 4(h)).

The specific value of N is determined by the size of the CU to which the PU belongs, and the specific values of nU, nD, nL, and nR are determined according to the value of N. For example, a CU of 32×32 pixels can be divided into inter prediction blocks of 32×32 pixels, 32×16 pixels, 16×32 pixels, 32×8 pixels, 32×24 pixels, 8×32 pixels, and 24×32 pixels.
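The partition geometry described above can be sketched as a small lookup. This is an illustration under assumptions: the short side of an asymmetric partition is taken to be one quarter of the CU side, which is inferred from the 32×8 and 32×24 blocks in the 32×32 example; the function name and the string labels for the modes are hypothetical.

```python
def pu_sizes(cu_size, part_mode):
    """Return the (width, height) of each PU, in scan order, for a square CU.

    Assumption: asymmetric partitions split the CU side at one quarter,
    as suggested by the 32x8 / 32x24 blocks of a 32x32 CU.
    """
    s = cu_size       # 2N
    h = cu_size // 2  # N
    q = cu_size // 4  # short side of an asymmetric partition (assumed)
    table = {
        "2Nx2N": [(s, s)],
        "2NxN": [(s, h), (s, h)],
        "Nx2N": [(h, s), (h, s)],
        "NxN": [(h, h)] * 4,
        "2NxnU": [(s, q), (s, s - q)],
        "2NxnD": [(s, s - q), (s, q)],
        "nLx2N": [(q, s), (s - q, s)],
        "nRx2N": [(s - q, s), (q, s)],
    }
    return table[part_mode]


for mode in ["2Nx2N", "2NxN", "Nx2N", "2NxnU", "2NxnD", "nLx2N", "nRx2N"]:
    print(mode, pu_sizes(32, mode))
```

In every mode the PU areas add up to the CU area, which is a convenient sanity check on the table.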

In the transform tree, the coding unit is divided into one or more transform blocks, and the position and size of each transform block are defined. In other words, the transform blocks are one or more non-overlapping regions that make up the coding unit. The transform tree includes the one or more transform blocks obtained by this division.

Partitioning in the transform tree is done either by assigning a region of the same size as the coding unit as a transform block, or by recursive quadtree partitioning, as with the partitioning of the tree block described above.

Transform processing is performed for each transform block. Hereinafter, a transform block, which is the unit of transformation, is also referred to as a transform unit (TU).

(Prediction parameters)
The predicted image of a prediction unit is derived from the prediction parameters associated with the prediction unit. The prediction parameters are either prediction parameters for intra prediction or prediction parameters for inter prediction. The prediction parameters for inter prediction (inter prediction parameters) are described below. The inter prediction parameters consist of the prediction use flags predFlagL0 and predFlagL1, the reference picture indexes refIdxL0 and refIdxL1, and the vectors mvL0 and mvL1. The prediction use flags predFlagL0 and predFlagL1 indicate whether the reference picture lists called the L0 list and the L1 list, respectively, are used; the corresponding reference picture list is used when the value is 1. In this specification, for a "flag indicating whether or not XX", 1 means XX holds and 0 means XX does not hold, and in logical negation, logical product, and the like, 1 is treated as true and 0 as false (the same applies hereinafter). In actual devices and methods, however, other values may be used as the true and false values. The case where two reference picture lists are used, that is, (predFlagL0, predFlagL1) = (1, 1), corresponds to bi-prediction, and the case where one reference picture list is used, that is, (predFlagL0, predFlagL1) = (1, 0) or (predFlagL0, predFlagL1) = (0, 1), corresponds to uni-prediction. The information of the prediction use flags can also be expressed by the inter prediction identifier inter_pred_idc described later. Normally, the prediction use flags are used in the predicted image generation unit and the prediction parameter memory described later, while the inter prediction identifier inter_pred_idc is used when the information on which reference picture list is used is decoded from the encoded data.

The syntax elements for deriving the inter prediction parameters included in the encoded data include, for example, the partition mode part_mode, the merge flag merge_flag, the merge index merge_idx, the inter prediction identifier inter_pred_idc, the reference picture index refIdxLX, the prediction vector flag mvp_LX_flag, and the difference vector mvdLX. LX is a notation used when L0 prediction and L1 prediction are not distinguished; replacing LX with L0 or L1 distinguishes the parameter for the L0 list from the parameter for the L1 list (the same applies hereinafter). For example, refIdxL0 is the reference picture index used for L0 prediction, refIdxL1 is the reference picture index used for L1 prediction, and refIdx (refIdxLX) is the notation used when refIdxL0 and refIdxL1 are not distinguished.

(Example of reference picture list)
Next, an example of the reference picture list will be described. A reference picture list is a sequence of reference pictures stored in the reference picture memory 306. FIG. 5 is a conceptual diagram showing an example of the reference picture list RefPicListX. In the reference picture list RefPicListX, the five rectangles arranged in a horizontal row each represent a reference picture. The symbols P1, P2, Q0, P3, and P4, shown in order from left to right, denote the respective reference pictures. The P in P1 and the like indicates viewpoint P, and the Q in Q0 indicates a viewpoint Q different from viewpoint P. The subscripts of P and Q indicate the picture order number POC. The downward arrow directly below refIdxLX indicates that the reference picture index refIdxLX is an index that refers to the reference picture Q0 in the reference picture memory 306.

(Example of reference pictures)
Next, an example of the reference pictures used when deriving a vector will be described. FIG. 6 is a conceptual diagram illustrating an example of reference pictures. In FIG. 6, the horizontal axis indicates display time and the vertical axis indicates viewpoint. The rectangles in FIG. 6, arranged in two rows and three columns (six in total), each represent a picture. Of the six rectangles, the one in the second column from the left in the lower row represents the picture to be decoded (target picture), and the remaining five represent reference pictures. The reference picture Q0, indicated by the upward arrow from the target picture, has the same display time as the target picture but a different viewpoint (view ID). The reference picture Q0 is used in displacement prediction based on the target picture. The reference picture P1, indicated by the leftward arrow from the target picture, is a past picture at the same viewpoint as the target picture. The reference picture P2, indicated by the rightward arrow from the target picture, is a future picture at the same viewpoint as the target picture. The reference picture P1 or P2 is used in motion prediction based on the target picture.

(Inter prediction identifier and prediction use flags)
The inter prediction identifier inter_pred_idc and the prediction use flags predFlagL0 and predFlagL1 can be converted into each other using the following expressions:
inter_pred_idc = (predFlagL1 << 1) + predFlagL0
predFlagL0 = inter_pred_idc & 1
predFlagL1 = inter_pred_idc >> 1
Here, >> denotes a right shift and << denotes a left shift. Therefore, either the prediction use flags predFlagL0 and predFlagL1 or the inter prediction identifier inter_pred_idc may be used as the inter prediction parameter. In the following, a determination that uses the prediction use flags predFlagL0 and predFlagL1 can equally be made with the inter prediction identifier inter_pred_idc instead. Conversely, a determination that uses the inter prediction identifier inter_pred_idc can equally be made with the prediction use flags predFlagL0 and predFlagL1 instead.
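The three expressions above can be checked with a short round-trip sketch. The function names are illustrative only; the arithmetic follows the expressions as written.

```python
def to_inter_pred_idc(pred_flag_l0, pred_flag_l1):
    """Pack the two prediction use flags into inter_pred_idc."""
    return (pred_flag_l1 << 1) + pred_flag_l0


def to_pred_flags(inter_pred_idc):
    """Unpack inter_pred_idc into (predFlagL0, predFlagL1)."""
    return inter_pred_idc & 1, inter_pred_idc >> 1


# Round trip over the uni-prediction and bi-prediction combinations.
for flags in [(1, 0), (0, 1), (1, 1)]:
    idc = to_inter_pred_idc(*flags)
    print(flags, "->", idc, "->", to_pred_flags(idc))
```

With these expressions, L0 uni-prediction, L1 uni-prediction, and bi-prediction map to the values 1, 2, and 3, respectively.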

(Merge mode and AMVP prediction)
There are two methods for decoding (encoding) prediction parameters: merge mode and AMVP (Adaptive Motion Vector Prediction) mode. The merge flag merge_flag is a flag that distinguishes between them. In both merge mode and AMVP mode, the prediction parameters of the target PU are derived using the prediction parameters of already processed blocks. Merge mode uses the already derived prediction parameters as they are, without including the prediction use flag predFlagLX (inter prediction identifier inter_pred_idc), the reference picture index refIdxLX, or the vector mvLX in the encoded data. AMVP mode includes the inter prediction identifier inter_pred_idc, the reference picture index refIdxLX, and the vector mvLX in the encoded data. The vector mvLX is encoded as a prediction vector flag mvp_LX_flag, which indicates a prediction vector, and a difference vector mvdLX.
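The difference between the two modes can be sketched as follows. This is a simplified illustration, not the decoder of this embodiment: it assumes that the vector is recovered as the selected prediction vector plus the difference vector (mvLX = mvpLX + mvdLX), which is the usual meaning of a difference vector but is not stated in this passage, and the candidate lists and dictionary keys are hypothetical.

```python
def derive_merge(merge_candidates, merge_idx):
    """Merge mode: reuse an already derived parameter set as is."""
    return dict(merge_candidates[merge_idx])


def derive_amvp(mvp_candidates, mvp_lx_flag, mvd_lx, ref_idx_lx, inter_pred_idc):
    """AMVP mode: parameters come from the encoded data; the vector is
    assumed to be the selected prediction vector plus the difference vector."""
    mvp = mvp_candidates[mvp_lx_flag]
    mv = (mvp[0] + mvd_lx[0], mvp[1] + mvd_lx[1])
    return {"inter_pred_idc": inter_pred_idc, "refIdxLX": ref_idx_lx, "mvLX": mv}


# Hypothetical candidates taken from already processed blocks.
merge_list = [{"inter_pred_idc": 1, "refIdxLX": 0, "mvLX": (4, -2)}]
print(derive_merge(merge_list, 0))
print(derive_amvp([(4, -2), (0, 0)], mvp_lx_flag=0, mvd_lx=(1, 3),
                  ref_idx_lx=2, inter_pred_idc=1))
```

In merge mode only an index is signalled, whereas AMVP mode signals the identifier, the reference index, and the two vector components separately.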

The inter prediction identifier inter_pred_idc is data indicating the type and number of reference pictures, and takes one of the values Pred_L0, Pred_L1, and Pred_BI. Pred_L0 and Pred_L1 indicate that a reference picture stored in the reference picture list called the L0 list or the L1 list, respectively, is used; both indicate that one reference picture is used (uni-prediction). Prediction using the L0 list and prediction using the L1 list are called L0 prediction and L1 prediction, respectively. Pred_BI indicates that two reference pictures are used (bi-prediction), namely one reference picture stored in the L0 list and one stored in the L1 list. The prediction vector flag mvp_LX_flag is an index indicating a prediction vector, and the reference picture index refIdxLX is an index indicating a reference picture stored in a reference picture list. The merge index merge_idx is an index indicating which of the prediction parameter candidates (merge candidates) derived from already processed blocks is used as the prediction parameters of the prediction unit (target block).

(Motion vector and displacement vector)
The vector mvLX may be a motion vector or a displacement vector (disparity vector). A motion vector is a vector indicating the positional shift between the position of a block in a picture of a certain layer at a certain display time and the position of the corresponding block in a picture of the same layer at a different display time (for example, an adjacent discrete time). A displacement vector is a vector indicating the positional shift between the position of a block in a picture of a certain layer at a certain display time and the position of the corresponding block in a picture of a different layer at the same display time. A picture of a different layer may be, for example, a picture from a different viewpoint or a picture with a different resolution. In particular, a displacement vector corresponding to pictures from different viewpoints is called a disparity vector. In the following description, when motion vectors and displacement vectors are not distinguished, they are simply called the vector mvLX. The prediction vector and the difference vector for the vector mvLX are called the prediction vector mvpLX and the difference vector mvdLX, respectively. Whether the vector mvLX and the difference vector mvdLX are motion vectors or displacement vectors is determined using the reference picture index refIdxLX associated with the vectors.
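One plausible way to make that determination is sketched below. The exact criterion is not given in this passage, so the rule used here (the vector is a displacement vector when the picture selected by refIdxLX belongs to a different layer from the target picture) is an assumption, as are the `Picture` record and the function name. The list mirrors the P1, P2, Q0, P3, P4 example of FIG. 5.

```python
from collections import namedtuple

# Hypothetical record: the layer (viewpoint) and picture order number of a picture.
Picture = namedtuple("Picture", ["layer", "poc"])


def vector_kind(ref_pic_list, ref_idx_lx, target):
    """Classify mvLX from the picture that refIdxLX selects (assumed rule)."""
    ref = ref_pic_list[ref_idx_lx]
    return "displacement" if ref.layer != target.layer else "motion"


ref_pic_list = [Picture("P", 1), Picture("P", 2), Picture("Q", 0),
                Picture("P", 3), Picture("P", 4)]
target = Picture("P", 0)

print(vector_kind(ref_pic_list, 2, target))  # refers to Q0
print(vector_kind(ref_pic_list, 0, target))  # refers to P1
```

Under this assumed rule, an index pointing at Q0 yields a displacement vector, while an index pointing at any of the P pictures yields a motion vector.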

(Configuration of image decoding device)
Next, the configuration of the image decoding device 31 according to the present embodiment will be described. FIG. 7 is a schematic diagram illustrating the configuration of the image decoding device 31 according to the present embodiment. The image decoding device 31 includes an entropy decoding unit 301, a prediction parameter decoding unit 302, a reference picture memory (reference image storage unit, frame memory) 306, a prediction parameter memory (prediction parameter storage unit, frame memory) 307, a predicted image generation unit 308, an inverse quantization / inverse DCT unit 311, an addition unit 312, and a depth DV derivation unit 351 (not shown).

The prediction parameter decoding unit 302 includes an inter prediction parameter decoding unit 303 and an intra prediction parameter decoding unit 304. The predicted image generation unit 308 includes an inter predicted image generation unit 309 and an intra predicted image generation unit 310.

The entropy decoding unit 301 performs entropy decoding on the encoded stream Te input from the outside, and separates and decodes the individual codes (syntax elements). The separated codes include prediction information for generating a predicted image and residual information for generating a difference image.

The entropy decoding unit 301 outputs some of the separated codes to the prediction parameter decoding unit 302. These are, for example, the prediction mode PredMode, partition mode part_mode, merge flag merge_flag, merge index merge_idx, inter prediction identifier inter_pred_idc, reference picture index refIdxLX, prediction vector flag mvp_LX_flag, difference vector mvdLX, residual prediction index iv_res_pred_weight_idx, and illumination compensation flag ic_flag. Which codes are decoded is controlled based on instructions from the prediction parameter decoding unit 302. The entropy decoding unit 301 outputs the quantized coefficients to the inverse quantization / inverse DCT unit 311. These quantized coefficients are obtained in the encoding process by applying a DCT (Discrete Cosine Transform) to the residual signal and quantizing the result. The entropy decoding unit 301 outputs the depth DV conversion table DepthToDisparityB to the depth DV derivation unit 351. The depth DV conversion table DepthToDisparityB is a table for converting a pixel value of the depth image into a disparity indicating the displacement between viewpoint images. Each element DepthToDisparityB[ d ] can be obtained from the slope cp_scale, the offset cp_off, and the slope precision cp_precision by the following equations.

log2Div = BitDepthY - 1 + cp_precision
offset = ( cp_off << BitDepthY ) + ( ( 1 << log2Div ) >> 1 )
scale = cp_scale
DepthToDisparityB[ d ] = ( scale * d + offset ) >> log2Div

The parameters cp_scale, cp_off, and cp_precision are decoded from a parameter set in the encoded data for each referenced viewpoint. BitDepthY denotes the bit depth of the pixel values of the luminance signal and takes, for example, the value 8.
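As an illustration only, the table derivation above can be sketched in Python. The function name, the assumption that d ranges over all values from 0 to 2^BitDepthY - 1, and the parameter values in the usage lines are hypothetical and are not taken from the specification or from any bitstream.

```python
def build_depth_to_disparity_table(cp_scale, cp_off, cp_precision, bit_depth_y=8):
    """Build DepthToDisparityB for one referenced viewpoint.

    Follows the four equations in the text. The table length
    (one entry per possible depth sample value) is an assumption.
    """
    log2_div = bit_depth_y - 1 + cp_precision
    offset = (cp_off << bit_depth_y) + ((1 << log2_div) >> 1)
    scale = cp_scale
    return [(scale * d + offset) >> log2_div for d in range(1 << bit_depth_y)]


# Hypothetical camera parameters, chosen only to exercise the formula.
table = build_depth_to_disparity_table(cp_scale=64, cp_off=2, cp_precision=1)
print(table[0], table[128], table[255])
```

The term ( 1 << log2Div ) >> 1 adds half of the divisor before the right shift, so the result is rounded rather than truncated.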

The prediction parameter decoding unit 302 receives some of the codes from the entropy decoding unit 301 as input. It decodes the prediction parameters corresponding to the prediction mode indicated by the prediction mode PredMode, which is one of those codes. The prediction parameter decoding unit 302 outputs the prediction mode PredMode and the decoded prediction parameters to the prediction parameter memory 307 and the predicted image generation unit 308.

Based on the codes input from the entropy decoding unit 301, the inter prediction parameter decoding unit 303 decodes the inter prediction parameters with reference to the prediction parameters stored in the prediction parameter memory 307. The inter prediction parameter decoding unit 303 outputs the decoded inter prediction parameters to the predicted image generation unit 308 and also stores them in the prediction parameter memory 307. Details of the inter prediction parameter decoding unit 303 will be described later.

Based on the codes input from the entropy decoding unit 301, the intra prediction parameter decoding unit 304 decodes the intra prediction parameters with reference to the prediction parameters stored in the prediction parameter memory 307. An intra prediction parameter is a parameter used in the process of predicting a picture block within one picture, for example, the intra prediction mode IntraPredMode. The intra prediction parameter decoding unit 304 outputs the decoded intra prediction parameters to the predicted image generation unit 308 and also stores them in the prediction parameter memory 307.

The reference picture memory 306 stores the decoded picture block recSamples generated by the addition unit 312 at the position of the decoded picture block.

The prediction parameter memory 307 stores the prediction parameters at predetermined positions for each picture and block to be decoded. Specifically, the prediction parameter memory 307 stores the inter prediction parameters decoded by the inter prediction parameter decoding unit 303, the intra prediction parameters decoded by the intra prediction parameter decoding unit 304, and the prediction mode PredMode separated by the entropy decoding unit 301. The stored inter prediction parameters include, for example, the prediction use flag predFlagLX, the reference picture index refIdxLX, and the vector mvLX.

The predicted image generation unit 308 receives the prediction mode PredMode and the prediction parameters from the prediction parameter decoding unit 302. It also reads a reference picture from the reference picture memory 306. The predicted image generation unit 308 generates a predicted picture block predSamples (predicted image) in the prediction mode indicated by the prediction mode PredMode, using the input prediction parameters and the read reference picture.

Here, when the prediction mode PredMode indicates the inter prediction mode, the inter predicted image generation unit 309 generates the predicted picture block predSamples by inter prediction, using the inter prediction parameters input from the inter prediction parameter decoding unit 303 and the read reference picture. The predicted picture block predSamples corresponds to a prediction unit PU. As described above, a PU is a part of a picture consisting of a plurality of pixels that serves as the unit of prediction processing, that is, the target block on which prediction processing is performed at one time.

For each reference picture list RefPicListLX whose prediction use flag predFlagLX is 1, the inter predicted image generation unit 309 reads from the reference picture memory 306 the reference picture block that lies, within the reference picture RefPicListLX[ refIdxLX ] indicated by the reference picture index refIdxLX, at the position indicated by the vector mvLX relative to the prediction unit. The inter predicted image generation unit 309 performs motion compensation on the read reference picture block to generate a predicted picture block predSamplesLX. It further generates the predicted picture block predSamples by weighted prediction from the predicted picture blocks predSamplesL0 and predSamplesL1 derived from the reference pictures of the respective reference picture lists, and outputs it to the addition unit 312.

When the prediction mode PredMode indicates the intra prediction mode, the intra predicted image generation unit 310 performs intra prediction using the intra prediction parameters input from the intra prediction parameter decoding unit 304 and the read reference picture. Specifically, the intra predicted image generation unit 310 reads from the reference picture memory 306 those reference picture blocks that belong to the picture being decoded and, among the blocks already processed, lie within a predetermined range from the prediction unit. The predetermined range is, for example, the range of the left, upper-left, upper, and upper-right adjacent blocks, and differs depending on the intra prediction mode.

The intra predicted image generation unit 310 performs prediction on the read reference picture blocks in the prediction mode indicated by the intra prediction mode IntraPredMode to generate the predicted picture block predSamples, and outputs it to the addition unit 312.

The inverse quantization / inverse DCT unit 311 inversely quantizes the quantized coefficients input from the entropy decoding unit 301 to obtain DCT coefficients. It performs an inverse DCT (Inverse Discrete Cosine Transform) on the obtained DCT coefficients to calculate a decoded residual signal, and outputs the calculated decoded residual signal to the addition unit 312.

The addition unit 312 adds, pixel by pixel, the predicted picture block predSamples input from the inter predicted image generation unit 309 and the intra predicted image generation unit 310 and the signal values resSamples of the decoded residual signal input from the inverse quantization / inverse DCT unit 311, to generate the decoded picture block recSamples. The addition unit 312 outputs the generated decoded picture block recSamples to the reference picture memory 306. The decoded picture blocks are integrated for each picture. Loop filters such as a deblocking filter and an adaptive offset filter are applied to the decoded picture. The decoded picture is output to the outside as a decoded layer image Td.
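A minimal Python sketch of the pixel-by-pixel addition performed by the addition unit 312 follows. The text only states that predSamples and resSamples are added for each pixel; clipping the sum to the valid sample range, the function name, and the list-of-rows block layout are assumptions made for this illustration.

```python
def reconstruct_block(pred_samples, res_samples, bit_depth=8):
    """Return recSamples = predSamples + resSamples, pixel by pixel.

    Clipping to [0, 2**bit_depth - 1] is an assumption of this sketch;
    the text does not state how out-of-range sums are handled.
    """
    max_val = (1 << bit_depth) - 1
    return [
        [min(max(p + r, 0), max_val) for p, r in zip(pred_row, res_row)]
        for pred_row, res_row in zip(pred_samples, res_samples)
    ]


# 2x2 example block with made-up sample values.
rec_samples = reconstruct_block([[100, 250], [3, 7]], [[5, 10], [-8, 1]])
print(rec_samples)
```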

(Configuration of inter prediction parameter decoding unit)
Next, the configuration of the inter prediction parameter decoding unit 303 will be described. FIG. 8 is a schematic diagram illustrating the configuration of the inter prediction parameter decoding unit 303 according to the present embodiment. The inter prediction parameter decoding unit 303 includes an inter prediction parameter decoding control unit 3031, an AMVP prediction parameter derivation unit 3032, an addition unit 3035, a merge mode parameter derivation unit 3036, and a displacement derivation unit 30363.

The inter prediction parameter decoding control unit 3031 instructs the entropy decoding unit 301 to decode the codes (syntax elements) related to inter prediction, and extracts the codes (syntax elements) included in the encoded data, for example, the partition mode part_mode, merge flag merge_flag, merge index merge_idx, inter prediction identifier inter_pred_idc, reference picture index refIdxLX, prediction vector flag mvp_LX_flag, difference vector mvdLX, residual prediction index iv_res_pred_weight_idx, illumination compensation flag ic_flag, and DBBP flag dbbp_flag. When the inter prediction parameter decoding control unit 3031 is said to extract a syntax element, this means that it instructs the entropy decoding unit 301 to decode that syntax element and reads the syntax element from the encoded data.

When the merge flag merge_flag is 1, that is, when the prediction unit is in merge mode, the inter prediction parameter decoding control unit 3031 extracts the merge index merge_idx from the encoded data. The inter prediction parameter decoding control unit 3031 outputs the extracted residual prediction index iv_res_pred_weight_idx, illumination compensation flag ic_flag, and merge index merge_idx to the merge mode parameter derivation unit 3036.

When the merge flag merge_flag is 0, that is, when the prediction block is in AMVP prediction mode, the inter prediction parameter decoding control unit 3031 uses the entropy decoding unit 301 to extract the inter prediction identifier inter_pred_idc, the reference picture index refIdxLX, the prediction vector flag mvp_LX_flag, and the difference vector mvdLX from the encoded data. The inter prediction parameter decoding control unit 3031 outputs the prediction use flag predFlagLX derived from the extracted inter prediction identifier inter_pred_idc and the reference picture index refIdxLX to the AMVP prediction parameter derivation unit 3032 and the predicted image generation unit 308, and also stores them in the prediction parameter memory 307. The inter prediction parameter decoding control unit 3031 outputs the extracted prediction vector flag mvp_LX_flag to the AMVP prediction parameter derivation unit 3032 and outputs the extracted difference vector mvdLX to the addition unit 3035.

The inter prediction parameter decoding control unit 3031 decodes the DBBP flag dbbp_flag from the encoded data when the partition mode PartMode has a specific value. Otherwise, that is, when dbbp_flag is not included in the encoded data, dbbp_flag is inferred to be 0. FIG. 20 is a syntax table relating to the DBBP flag dbbp_flag of the present embodiment. The inter prediction parameter decoding control unit 3031 decodes cu_skip_flag, pred_mode, part_mode, and dbbp_flag shown at SE1001 to SE1004 in the figure. Here, cu_skip_flag is a flag indicating whether the target CU is skipped. In the case of skip, PartMode is limited to 2N×2N and decoding of the partition mode part_mode is omitted. The partition mode part_mode decoded from the encoded data is set to the partition mode PartMode. In this example, the inter prediction parameter decoding control unit 3031 decodes dbbp_flag when the partition mode PartMode (= part_mode) is 2N×N, but dbbp_flag may also be decoded when the partition mode has another value. The DBBP flag may also be derived by a method different from the above.
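The conditional decoding of dbbp_flag described above can be sketched in Python as follows. This is not the syntax table of FIG. 20: the callback `read`, the string constants for the partition modes, and the returned dictionary are hypothetical, and pred_mode is left out of the sketch because the text does not describe how it interacts with the other elements.

```python
PART_2Nx2N = "PART_2Nx2N"  # hypothetical label for the 2N x 2N partition mode
PART_2NxN = "PART_2NxN"    # hypothetical label for the 2N x N partition mode


def decode_cu_partition_syntax(read, dbbp_part_modes=(PART_2NxN,)):
    """Decode cu_skip_flag, part_mode, and dbbp_flag for one CU.

    read(name) is a hypothetical callback standing in for a request to
    the entropy decoding unit 301 to decode one syntax element.
    """
    cu_skip_flag = read("cu_skip_flag")
    if cu_skip_flag:
        part_mode = PART_2Nx2N         # part_mode is not decoded for a skipped CU
    else:
        part_mode = read("part_mode")  # decoded value is set to PartMode
    if part_mode in dbbp_part_modes:
        dbbp_flag = read("dbbp_flag")
    else:
        dbbp_flag = 0                  # not in the encoded data, so inferred as 0
    return {"cu_skip_flag": cu_skip_flag, "PartMode": part_mode, "dbbp_flag": dbbp_flag}


# Usage with a dictionary standing in for already-decoded syntax elements.
elements = {"cu_skip_flag": 0, "part_mode": PART_2NxN, "dbbp_flag": 1}
print(decode_cu_partition_syntax(elements.__getitem__))
```

Passing a different tuple as dbbp_part_modes models the remark that dbbp_flag may also be decoded for other partition modes.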

The inter prediction parameter decoding control unit 3031 also outputs, to the inter predicted image generation unit 309, the displacement vector (NBDV) derived during inter prediction parameter derivation and the VSP mode flag VspModeFlag, which indicates whether view synthesis prediction is performed.

FIG. 9 is a schematic diagram illustrating the configuration of the merge mode parameter derivation unit 3036 according to the present embodiment. The merge mode parameter derivation unit 3036 includes a merge candidate derivation unit 30361, a merge candidate selection unit 30362, and a bi-prediction restriction unit 30363. The merge candidate derivation unit 30361 includes a merge candidate storage unit 303611, an extended merge candidate derivation unit 30370, and a basic merge candidate derivation unit 30380.

The merge candidate storage unit 303611 stores the merge candidates input from the extended merge candidate derivation unit 30370 and the basic merge candidate derivation unit 30380 in the merge candidate list mergeCandList. Each merge candidate comprises a prediction use flag predFlagLX, a vector mvLX, a reference picture index refIdxLX, a VSP mode flag VspModeFlag, a displacement vector MvDisp, and a layer ID RefViewIdx. In the merge candidate storage unit 303611, indices are assigned to the merge candidates stored in the merge candidate list mergeCandList according to a predetermined rule.

FIG. 11 shows examples of the merge candidate list mergeCandList derived by the merge candidate derivation unit 30361. FIG. 11(a) shows the merge candidates derived by the merge candidate storage unit 303611 in the base layer (the layer with layer ID nal_unit_layer = 0). Leaving aside the pruning process, which closes up the order when two merge candidates have the same prediction parameters, the candidates are arranged in merge index order as spatial merge candidate (A1), spatial merge candidate (B1), spatial merge candidate (B0), spatial merge candidate (A0), and spatial merge candidate (B2). The names in parentheses are nicknames of the merge candidates; for a spatial merge candidate, the nickname corresponds to the position of the reference block used for derivation. Combined merge candidates and zero merge candidates follow, but they are omitted in FIG. 11. These merge candidates, namely the spatial merge candidates, temporal merge candidate, combined merge candidates, and zero merge candidates, are derived by the basic merge candidate derivation unit 30380.

FIG. 11(b) shows the merge candidates derived by the merge candidate storage unit 303611 in an enhancement layer (a layer with layer ID nal_unit_layer != 0), which is a layer other than the base layer. In merge index order, the candidates are texture merge candidate (T), inter-view merge candidate (IvMC), spatial merge candidate (A1), spatial merge candidate (B1), spatial merge candidate (B0), displacement merge candidate (IvDC), VSP merge candidate (VSP), spatial merge candidate (A0), spatial merge candidate (B2), motion shift merge candidate (IvMCShift), displacement shift merge candidate (IvDCShift), and temporal merge candidate (Col). The names in parentheses are nicknames of the merge candidates. Combined merge candidates and zero merge candidates follow, but they are omitted in FIG. 11. The texture merge candidate (T), inter-view merge candidate (IvMC), displacement merge candidate (IvDC), VSP merge candidate (VSP), motion shift merge candidate (IvMCShift), and displacement shift merge candidate (IvDCShift) are derived in the extended merge candidate derivation unit 30370.
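The ordering described for FIG. 11 can be illustrated with a short Python sketch. It is a simplification under stated assumptions: the candidate parameters are modelled as plain tuples, the function and constant names are hypothetical, and the pruning here drops any candidate equal to one already in the list, whereas the text does not say which pairs of candidates the pruning process actually compares.

```python
BASE_LAYER_ORDER = ["A1", "B1", "B0", "A0", "B2"]
ENHANCEMENT_LAYER_ORDER = [
    "T", "IvMC", "A1", "B1", "B0", "IvDC", "VSP",
    "A0", "B2", "IvMCShift", "IvDCShift", "Col",
]


def build_merge_cand_list(available, order):
    """Return [(nickname, params), ...]; the list position is the merge index.

    available maps a nickname to its prediction parameters (any comparable
    value); nicknames missing from the mapping are treated as unavailable.
    """
    merge_cand_list = []
    for name in order:
        params = available.get(name)
        if params is None:
            continue
        if any(params == existing for _, existing in merge_cand_list):
            continue  # simplified pruning: skip exact duplicates
        merge_cand_list.append((name, params))
    return merge_cand_list


# Made-up candidates: B1 duplicates A1 and is pruned.
cands = {"A1": (1, (4, 0), 0), "B1": (1, (4, 0), 0), "B0": (1, (8, -4), 0)}
print([name for name, _ in build_merge_cand_list(cands, BASE_LAYER_ORDER)])
```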

FIG. 12 is a diagram illustrating the positions of the adjacent blocks referred to by the spatial merge candidates. A0, A1, B0, B1, and B2 each correspond to a position shown in FIG. 12. When the upper-left coordinates of the prediction unit are (xPb, yPb) and the width and height of the prediction unit are nPbW and nPbH, the positions of the adjacent blocks are as follows.

A0: ( xPb - 1, yPb + nPbH )
A1: ( xPb - 1, yPb + nPbH - 1 )
B0: ( xPb + nPbW, yPb - 1 )
B1: ( xPb + nPbW - 1, yPb - 1 )
B2: ( xPb - 1, yPb - 1 )
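For illustration, the five neighbour positions listed above can be computed with the following Python sketch. The function name and the dictionary return type are choices made for this example only.

```python
def spatial_neighbour_positions(x_pb, y_pb, n_pb_w, n_pb_h):
    """Return the sample positions of A0, A1, B0, B1, and B2 for a prediction unit
    with upper-left corner (x_pb, y_pb), width n_pb_w, and height n_pb_h."""
    return {
        "A0": (x_pb - 1, y_pb + n_pb_h),       # below the bottom-left corner
        "A1": (x_pb - 1, y_pb + n_pb_h - 1),   # left of the bottom-left sample
        "B0": (x_pb + n_pb_w, y_pb - 1),       # right of the top-right corner
        "B1": (x_pb + n_pb_w - 1, y_pb - 1),   # above the top-right sample
        "B2": (x_pb - 1, y_pb - 1),            # above-left of the top-left sample
    }


# Example: an 8x8 prediction unit whose upper-left sample is at (16, 32).
print(spatial_neighbour_positions(16, 32, 8, 8))
```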
The extended merge candidate derivation unit 30370 includes an inter-layer merge candidate derivation unit 30371 (inter-view merge candidate derivation unit 30371), a displacement merge candidate derivation unit 30373, and a VSP merge candidate derivation unit 30374 (VSP prediction unit 30374). An extended merge candidate is a merge candidate different from the basic merge candidates described later, and includes at least one of the texture merge candidate (T), inter-view merge candidate (IvMC), displacement merge candidate (IvDC), VSP merge candidate (VSP), motion shift merge candidate (IvMCShift), and displacement shift merge candidate (IvDCShift).

(Texture merge candidate)
The inter-layer merge candidate derivation unit 30371 derives the texture merge candidate (T), the inter-view merge candidate (IvMC), and the motion shift merge candidate (IvMCShift). These merge candidates are derived by selecting, from a reference picture of another layer (for example, the base layer or base view) having the same POC as the target picture, the block corresponding to the prediction unit, and reading the prediction parameters of that block, such as its motion vector, from the prediction parameter memory 307.

The texture merge candidate (T) is derived in the inter-layer merge candidate derivation unit 30371 when the target picture is a depth picture. The texture merge candidate (T) is derived by identifying a reference block in a depth picture having the same view ID as the target picture and reading the motion vector of the reference block.
The coordinates (xRef, yRef) of the reference block are derived by the following equations, where (xPb, yPb) are the upper-left coordinates of the prediction unit and nPbW and nPbH are its width and height.

xRefFull = xPb + ( ( nPbW - 1 ) >> 1 )
yRefFull = yPb + ( ( nPbH - 1 ) >> 1 )
xRef = Clip3( 0, PicWidthInSamplesL - 1, ( xRefFull >> 3 ) << 3 )
yRef = Clip3( 0, PicHeightInSamplesL - 1, ( yRefFull >> 3 ) << 3 )

Here, PicWidthInSamplesL and PicHeightInSamplesL denote the width and height of the picture, respectively, and the function Clip3( x, y, z ) clips z to the range from x to y inclusive and returns the clipped result.

When the motion vector of the reference block is textMvLX, the motion vector mvLXT of the texture merge candidate is derived by the following equations.

mvLXT[ 0 ] = ( textMvLX[ xRef ][ yRef ][ 0 ] + 2 ) >> 2
mvLXT[ 1 ] = ( textMvLX[ xRef ][ yRef ][ 1 ] + 2 ) >> 2

For the texture merge candidate, prediction parameters may also be assigned in units of sub-blocks obtained by further dividing the prediction unit.
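The texture merge candidate equations can be tried out with the Python sketch below. The function names are hypothetical and the motion vector is passed as a plain pair instead of being looked up in a textMvLX array; otherwise the arithmetic follows the equations in the text.

```python
def clip3(x, y, z):
    """Clip z to the inclusive range [x, y]."""
    return max(x, min(y, z))


def texture_ref_position(x_pb, y_pb, n_pb_w, n_pb_h, pic_w, pic_h):
    """Reference block position (xRef, yRef) for the texture merge candidate."""
    x_ref_full = x_pb + ((n_pb_w - 1) >> 1)
    y_ref_full = y_pb + ((n_pb_h - 1) >> 1)
    # ( v >> 3 ) << 3 snaps the coordinate down to a multiple of 8.
    x_ref = clip3(0, pic_w - 1, (x_ref_full >> 3) << 3)
    y_ref = clip3(0, pic_h - 1, (y_ref_full >> 3) << 3)
    return x_ref, y_ref


def texture_merge_mv(text_mv):
    """mvLXT: add 2 and shift right by 2, per component."""
    return ((text_mv[0] + 2) >> 2, (text_mv[1] + 2) >> 2)


print(texture_ref_position(16, 32, 16, 8, 1024, 768))
print(texture_merge_mv((5, -7)))
```

Python's >> on negative integers is an arithmetic shift, which is the behaviour this sketch assumes for the >> operator in the equations.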

(Inter-view merge candidate)
The inter-view merge candidate is derived in the inter-layer merge candidate derivation unit 30371 by reading prediction parameters such as a motion vector from a reference block of the reference picture ivRefPic, which is identified by the displacement vector derivation unit 352 described later, has the same POC as the target picture, and has a different view ID (refViewIdx). This process is called the temporal inter-view motion candidate derivation process. As the temporal inter-view motion candidate derivation process, the inter-layer merge candidate derivation unit 30371 first derives the reference coordinates (xRef, yRef) by the following equations, where (xPb, yPb) are the upper-left coordinates of the block, nPbW and nPbH are the width and height of the block, and (mvDisp[ 0 ], mvDisp[ 1 ]) is the displacement vector derived by the displacement vector derivation unit 352.

xRefFull = xPb + ( nPbW >> 1 ) + ( ( mvDisp[ 0 ] + 2 ) >> 2 )
yRefFull = yPb + ( nPbH >> 1 ) + ( ( mvDisp[ 1 ] + 2 ) >> 2 )
xRef = Clip3( 0, PicWidthInSamplesL - 1, ( xRefFull >> 3 ) << 3 )
yRef = Clip3( 0, PicHeightInSamplesL - 1, ( yRefFull >> 3 ) << 3 )
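A minimal Python sketch of the reference coordinate derivation for the inter-view merge candidate is given below. The function names are hypothetical; the displacement vector is passed as a pair (mvDisp[ 0 ], mvDisp[ 1 ]), and the example values are made up.

```python
def clip3(x, y, z):
    """Clip z to the inclusive range [x, y]."""
    return max(x, min(y, z))


def inter_view_ref_position(x_pb, y_pb, n_pb_w, n_pb_h, mv_disp, pic_w, pic_h):
    """Reference coordinates (xRef, yRef) in the inter-view reference picture."""
    # Block centre plus the displacement vector, rounded and shifted right by 2.
    x_ref_full = x_pb + (n_pb_w >> 1) + ((mv_disp[0] + 2) >> 2)
    y_ref_full = y_pb + (n_pb_h >> 1) + ((mv_disp[1] + 2) >> 2)
    # Snap to a multiple of 8, then keep the position inside the picture.
    x_ref = clip3(0, pic_w - 1, (x_ref_full >> 3) << 3)
    y_ref = clip3(0, pic_h - 1, (y_ref_full >> 3) << 3)
    return x_ref, y_ref


print(inter_view_ref_position(64, 32, 16, 16, (-30, 0), 1024, 768))
```

Compared with the texture merge candidate, the only difference in this sketch is the starting point: the block centre is taken as nPbW >> 1 and nPbH >> 1, and the displacement vector is added before snapping and clipping.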
Next, the inter-layer merge candidate derivation unit 30371 performs the temporal inter-view motion candidate derivation process in a temporal inter-view motion candidate derivation unit 303711 (not shown).

The temporal inter-view motion candidate derivation unit 303711 derives the reference block position (xRef, yRef) from the block coordinates (xPb, yPb), the block width and height nPbW and nPbH, and the block displacement vector mvDisp by the above process, and then derives the vector of the temporal inter-view motion candidate by referring to the vector of the prediction unit on the reference picture ivRefPic located at the reference block position (xRef, yRef). First, let (xIvRefPb, yIvRefPb) be the upper-left coordinates of the prediction unit (luma prediction block) on the reference picture ivRefPic that contains the coordinates indicated by the reference block position (xRef, yRef), and let refPicListLYIvRef, predFlagLYIvRef[ x ][ y ], mvLYIvRef[ x ][ y ], and refIdxLYIvRef[ x ][ y ] be the reference picture list, prediction list flag, vector, and reference picture index of the prediction unit on the reference picture ivRefPic, respectively.

テンポラルインタービュー動き候補導出部３０３７１１は、予測利用フラグpredFlagLYIvRef[ xIvRefPb ][ yIvRefPb ]が１の場合には、０から参照ピクチャリスト要素数−１（num_ref_idx_lX_active_minus1）のインデックスiについて、参照ピクチャivRefPic上の予測ユニットのＰＯＣであるPicOrderCnt( refPicListLYIvRef[ refIdxLYIvRef[ xIvRefPb ][ yIvRefPb ] ])と対象予測ユニットの参照ピクチャのＰＯＣであるPicOrderCnt( RefPicListLX[ i ] )が等しいか否かを判定し、等しい場合（すなわちmvLYIvRef[ xIvRefPb ][ yIvRefPb ]が変位ベクトルである場合に）、予測可能フラグavailableFlagLXInterView、ベクトルmvLXInterView、参照ピクチャインデックスrefIdxLXを以下の式により導出する。 When the prediction usage flag predFlagLYIvRef [xIvRefPb] [yIvRefPb] is 1, the temporal inter-view motion candidate derivation unit 3037111 performs prediction on the reference picture ivRefPic for the index i from 0 to the reference picture list element number −1 (num_ref_idx_lX_active_minus1). It is determined whether or not PicOrderCnt (RefPicListLYIvRef [refIdxLYIvRef [xIvRefPb] [yIvRefPb]]) that is the unit POC is equal to PicOrderCnt (RefPicListLX [i]) that is the POC of the reference picture of the target prediction unit (that is, mvLYIRef). When [xIvRefPb] [yIvRefPb] is a displacement vector), the predictable flag availableFlagLXInterView, the vector mvLXInterView, and the reference picture index refIdxLX are derived by the following equations.

availableFlagLXInterView = 1
mvLXInterView = mvLYIvRef[ xIvRefPb ][ yIvRefPb ]
refIdxLX = i
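The POC comparison above can be sketched as follows. This is a minimal illustration rather than the patent's implementation: the function name, the argument layout, and the choice to stop at the first matching index i are assumptions not stated in the text.

```python
def derive_interview_candidate(pred_flag, mv_iv_ref, poc_iv_ref, ref_pic_list_pocs):
    """Return (availableFlagLXInterView, mvLXInterView, refIdxLX).

    pred_flag         -- predFlagLYIvRef[xIvRefPb][yIvRefPb]
    mv_iv_ref         -- mvLYIvRef[xIvRefPb][yIvRefPb] as an (x, y) tuple
    poc_iv_ref        -- POC of the picture referenced by the PU on ivRefPic
    ref_pic_list_pocs -- POCs of RefPicListLX[0 .. num_ref_idx_lX_active_minus1]
    """
    if pred_flag != 1:
        return 0, None, -1
    for i, poc in enumerate(ref_pic_list_pocs):
        if poc == poc_iv_ref:
            # Assumption: the first matching index is used.
            return 1, mv_iv_ref, i
    return 0, None, -1
```

For example, with ref_pic_list_pocs = [4, 8, 12] and poc_iv_ref = 8, the candidate inherits the vector and receives refIdxLX = 1.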
That is, when the reference picture referred to by the target prediction unit is the same as the reference picture referred to by the prediction unit on the reference picture ivRefPic, the temporal inter-view motion candidate derivation unit 303711 derives the vector mvLXInterView and the reference picture index refIdxLX using the prediction parameters of the prediction unit on ivRefPic. For an inter-view merge candidate, prediction parameters may also be assigned in units of sub-blocks obtained by further dividing the prediction unit. For example, when the width and height of the prediction unit are nPbW and nPbH and the minimum sub-block size is SubPbSize, the sub-block width nSbW and height nSbH are derived by the following equations.

nSbW = nPbW / SubPbSize <= 1 ? nPbW : SubPbSize
nSbH = nPbH / SubPbSize <= 1 ? nPbH : SubPbSize
Subsequently, the temporal inter-view motion candidate derivation unit 303711 described above derives, for each sub-block, the vector spMvLX[xBlk][yBlk], the reference picture index spRefIdxLX[xBlk][yBlk], and the prediction usage flag spPredFlagLX[xBlk][yBlk].

Here, (xBlk, yBlk) are the relative coordinates of the sub-block within the prediction unit (coordinates relative to the upper-left coordinates of the prediction unit), taking integer values from 0 to (nPbW / nSbW - 1) and from 0 to (nPbH / nSbH - 1), respectively. When the coordinates of the prediction unit are (xPb, yPb) and the relative coordinates of the sub-block within the prediction unit are (xBlk, yBlk), the coordinates of the sub-block in the picture are (xPb + xBlk * nSbW, yPb + yBlk * nSbH).

The temporal inter-view motion candidate derivation process is performed in units of sub-blocks, with the in-picture coordinates of the sub-block (xPb + xBlk * nSbW, yPb + yBlk * nSbH) and the sub-block width nSbW and height nSbH supplied as the inputs (xPb, yPb), nPbW, and nPbH of the temporal inter-view motion candidate derivation unit 303711.
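The sub-block layout described above can be sketched as follows. The function names are illustrative only, and integer division is assumed for the "/" operator in the equations.

```python
def sub_block_size(n_pb_w, n_pb_h, sub_pb_size):
    """Derive (nSbW, nSbH) from the prediction unit size and SubPbSize."""
    n_sb_w = n_pb_w if n_pb_w // sub_pb_size <= 1 else sub_pb_size
    n_sb_h = n_pb_h if n_pb_h // sub_pb_size <= 1 else sub_pb_size
    return n_sb_w, n_sb_h


def sub_block_positions(x_pb, y_pb, n_pb_w, n_pb_h, sub_pb_size):
    """List the in-picture upper-left coordinates of every sub-block."""
    n_sb_w, n_sb_h = sub_block_size(n_pb_w, n_pb_h, sub_pb_size)
    return [
        (x_pb + x_blk * n_sb_w, y_pb + y_blk * n_sb_h)
        for y_blk in range(n_pb_h // n_sb_h)   # yBlk = 0 .. nPbH / nSbH - 1
        for x_blk in range(n_pb_w // n_sb_w)   # xBlk = 0 .. nPbW / nSbW - 1
    ]
```

Each returned coordinate pair, together with (nSbW, nSbH), would then be passed to the per-block derivation in place of (xPb, yPb) and (nPbW, nPbH).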

For any sub-block whose availability flag availableFlagLXInterView is 0 in the above process, the temporal inter-view motion candidate derivation unit 303711 derives the vector spMvLX, the reference picture index spRefIdxLX, and the prediction usage flag spPredFlagLX corresponding to the sub-block from the vector mvLXInterView, the reference picture index refIdxLXInterView, and the prediction usage flag availableFlagLXInterView of the inter-view merge candidate by the following equations.

spMvLX[ xBlk ][ yBlk ] = mvLXInterView
spRefIdxLX[ xBlk ][ yBlk ] = refIdxLXInterView
spPredFlagLX[ xBlk ][ yBlk ] = availableFlagLXInterView
Note that xBlk and yBlk are sub-block addresses taking values from 0 to (nPbW / nSbW - 1) and from 0 to (nPbH / nSbH - 1), respectively. The vector mvLXInterView, the reference picture index refIdxLXInterView, and the prediction usage flag availableFlagLXInterView of the inter-view merge candidate are derived by performing the temporal inter-view motion candidate derivation process with (xPb + (nPbW / nSbW / 2) * nSbW, yPb + (nPbH / nSbH / 2) * nSbH) as the reference block coordinates.

(Motion shift merge candidate)
The motion shift merge candidate is likewise derived in the inter-layer merge candidate derivation unit 30371 by reading prediction parameters such as a motion vector from a reference block of a picture that has the same POC as the target picture and a different view ID, as identified by the displacement vector derivation unit 352. Given the upper-left coordinates (xPb, yPb) of the prediction unit, its width nPbW and height nPbH, and the displacement vector (mvDisp[0], mvDisp[1]) derived by the displacement vector derivation unit 352, the reference block coordinates (xRef, yRef) are derived by the following equations.

xRefFull = xPb + ( nPbW >> 1 ) + ( ( mvDisp[ 0 ] + nPbW * 2 + 4 + 2 ) >> 2 )
yRefFull = yPb + ( nPbH >> 1 ) + ( ( mvDisp[ 1 ] + nPbH * 2 + 4 + 2 ) >> 2 )
xRef = Clip3( 0, PicWidthInSamplesL - 1, ( xRefFull >> 3 ) << 3 )
yRef = Clip3( 0, PicHeightInSamplesL - 1, ( yRefFull >> 3 ) << 3 )
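As a worked illustration of the equations above, the following sketch computes the reference block position. The function names are illustrative, and Clip3(lo, hi, v) is assumed to clamp v to the range [lo, hi]. Python's >> is an arithmetic shift, which is assumed to match the intended behaviour for negative displacement components.

```python
def clip3(lo, hi, v):
    """Clamp v to the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))


def motion_shift_ref_pos(x_pb, y_pb, n_pb_w, n_pb_h, mv_disp, pic_w, pic_h):
    """Return (xRef, yRef) for the motion shift merge candidate."""
    x_ref_full = x_pb + (n_pb_w >> 1) + ((mv_disp[0] + n_pb_w * 2 + 4 + 2) >> 2)
    y_ref_full = y_pb + (n_pb_h >> 1) + ((mv_disp[1] + n_pb_h * 2 + 4 + 2) >> 2)
    # Align to the 8x8 grid, then keep the position inside the picture.
    x_ref = clip3(0, pic_w - 1, (x_ref_full >> 3) << 3)
    y_ref = clip3(0, pic_h - 1, (y_ref_full >> 3) << 3)
    return x_ref, y_ref
```

For a 16x16 prediction unit at (64, 32) with mvDisp = (-10, 0), xRefFull is 79 and yRefFull is 49, which align to (72, 48).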
(Displacement merge candidate)
The displacement merge candidate derivation unit 30373 derives a displacement merge candidate (IvDC) and a shift displacement merge candidate (IvDCShift) from the displacement vector input from the displacement vector derivation unit 352. As the displacement merge candidate (IvDC), the displacement merge candidate derivation unit 30373 generates a vector whose horizontal component is the horizontal component mvDisp[0] of the input displacement vector (mvDisp[0], mvDisp[1]) and whose vertical component is 0, by the following equations.

mvL0IvDC[ 0 ] = DepthFlag ? ( mvDisp[ 0 ] + 2 ) >> 2 : mvDisp[ 0 ]
mvL0IvDC[ 1 ] = 0
Here, DepthFlag is a variable that is 1 in the case of depth.

The displacement merge candidate derivation unit 30373 outputs the generated vector and the reference picture index refIdxLX of the layer image pointed to by the displacement vector (for example, the index of the base layer image having the same POC as the picture being decoded) to the merge candidate storage unit 303611 as a merge candidate.

As the shift displacement merge candidate (IvDCShift), the displacement merge candidate derivation unit 30373 derives a merge candidate whose vector is the displacement merge candidate shifted in the horizontal direction, by the following equations.

mvLXIvDCShift[ 0 ] = mvL0IvDC[ 0 ] + 4
mvLXIvDCShift[ 1 ] = mvL0IvDC[ 1 ]
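The two displacement-based candidates can be sketched together as follows. The function name and the tuple representation of vectors are illustrative assumptions; the arithmetic follows the equations for mvL0IvDC and mvLXIvDCShift.

```python
def displacement_merge_vectors(mv_disp, depth_flag):
    """Return (IvDC vector, IvDCShift vector) as (x, y) tuples."""
    # For depth pictures the horizontal component is rounded and shifted right by 2.
    x = (mv_disp[0] + 2) >> 2 if depth_flag else mv_disp[0]
    iv_dc = (x, 0)                          # vertical component is always 0
    iv_dc_shift = (iv_dc[0] + 4, iv_dc[1])  # shifted horizontally by 4
    return iv_dc, iv_dc_shift
```

With mvDisp = (9, 5), the texture case gives (9, 0) and (13, 0), while the depth case gives (2, 0) and (6, 0).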
(VSP merge candidate)
The VSP merge candidate derivation unit 30374 (hereinafter, VSP prediction unit 30374) derives a VSP (View Synthesis Prediction) merge candidate. The VSP prediction unit 30374 divides the prediction unit into a plurality of sub-blocks (sub-prediction units) and sets the vector mvLX, the reference picture index refIdxLX, and the view ID RefViewIdx for each sub-block. The VSP prediction unit 30374 outputs the derived VSP merge candidate to the merge candidate storage unit 303611.

FIG. 14 is a block diagram showing the relationship between the VSP prediction unit 30374 and other units. The VSP prediction unit 30374 operates using the split flag horSplitFlag derived by the split flag derivation unit 353 and the displacement vector disparitySamples derived by the depth DV derivation unit 351.

A partitioning unit (not shown) of the VSP prediction unit 30374 determines the sub-block size by selecting either a horizontally long rectangle (here, 8x4) or a vertically long rectangle (here, 4x8) according to the split flag horSplitFlag derived by the split flag derivation unit 353. Specifically, the sub-block width nSubBlkW and height nSubBlkH are set using the following equations.

nSubBlkW = horSplitFlag ? 8 : 4
nSubBlkH = horSplitFlag ? 4 : 8
For each sub-block of the derived sub-block size, a depth vector derivation unit (not shown) of the VSP prediction unit 30374 derives the vector mvLX[] by setting the horizontal component mvLX[0] to the motion vector disparitySamples[] derived by the depth DV derivation unit 351 and the vertical component mvLX[1] to 0, and derives the prediction parameters of the VSP merge candidate.
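The VSP sub-block setup can be sketched as below. How disparitySamples is indexed per sub-block is not spelled out in this passage, so the sketch takes a hypothetical callable disparity_at(x, y) that returns the disparity for the sub-block whose upper-left corner is at (x, y) relative to the prediction unit; that callable and the dictionary output are assumptions for illustration.

```python
def vsp_sub_block_vectors(n_pb_w, n_pb_h, hor_split_flag, disparity_at):
    """Map each sub-block's relative upper-left corner to its vector mvLX."""
    n_sub_blk_w = 8 if hor_split_flag else 4
    n_sub_blk_h = 4 if hor_split_flag else 8
    vectors = {}
    for y in range(0, n_pb_h, n_sub_blk_h):
        for x in range(0, n_pb_w, n_sub_blk_w):
            # Horizontal component from depth, vertical component fixed to 0.
            vectors[(x, y)] = (disparity_at(x, y), 0)
    return (n_sub_blk_w, n_sub_blk_h), vectors
```

An 8x8 prediction unit therefore yields two 8x4 sub-blocks when horSplitFlag is 1 and two 4x8 sub-blocks otherwise.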

The VSP prediction unit 30374 may also control whether to add the VSP merge candidate to the merge candidate list mergeCandList according to the residual prediction index iv_res_pred_weight_idx and the illumination compensation flag ic_flag input from the inter prediction parameter decoding control unit 3031. Specifically, the VSP prediction unit 30374 may add the VSP merge candidate to the merge candidate list mergeCandList only when the residual prediction index iv_res_pred_weight_idx is 0 and the illumination compensation flag ic_flag is 0.

The basic merge candidate derivation unit 30380 includes a spatial merge candidate derivation unit 30381, a temporal merge candidate derivation unit 30382, a combined merge candidate derivation unit 30383, and a zero merge candidate derivation unit 30384. A basic merge candidate is a merge candidate used in the base layer, that is, a merge candidate used in non-scalable HEVC (for example, the HEVC Main profile), and includes at least a spatial merge candidate or a temporal merge candidate.

The spatial merge candidate derivation unit 30381 reads the prediction parameters (prediction usage flag predFlagLX, vector mvLX, reference picture index refIdxLX) stored in the prediction parameter memory 307 according to a predetermined rule, and derives the read prediction parameters as spatial merge candidates. The prediction parameters read are those of each adjacent block, that is, each block within a predetermined range from the prediction unit (for example, all or some of the blocks touching the lower-left, upper-left, and upper-right corners of the prediction unit). The derived spatial merge candidates are stored in the merge candidate storage unit 303611.

The spatial merge candidate derivation unit 30381 sets the VSP mode flag mergeCandIsVspFlag of each derived merge candidate by inheriting the VSP mode flag VspModeFlag of the adjacent block. That is, when the VSP mode flag VspModeFlag of the adjacent block is 1, the VSP mode flag mergeCandIsVspFlag of the corresponding spatial merge candidate is set to 1; otherwise, mergeCandIsVspFlag is set to 0.

For the merge candidates derived by the temporal merge candidate derivation unit 30382, the combined merge candidate derivation unit 30383, and the zero merge candidate derivation unit 30384 described below, the VSP mode flag VspModeFlag is set to 0.

The temporal merge candidate derivation unit 30382 reads, from the prediction parameter memory 307, the prediction parameters of the block in the reference image that contains the lower-right coordinates of the prediction unit, and uses them as a merge candidate. The reference image may be specified, for example, using the collocated picture col_ref_idx specified in the slice header and the reference picture index refIdxLX specified by RefPicListX[col_ref_idx] from the reference picture list RefPicListX. The derived merge candidate is stored in the merge candidate storage unit 303611.

The combined merge candidate derivation unit 30383 derives a combined merge candidate by combining the vectors and reference picture indexes of two different merge candidates that have already been derived and stored in the merge candidate storage unit 303611, using them as the L0 and L1 vectors, respectively. The derived merge candidate is stored in the merge candidate storage unit 303611.

The zero merge candidate derivation unit 30384 derives merge candidates whose reference picture index refIdxLX is i and whose vector mvLX has X and Y components both equal to 0, until the number of derived merge candidates reaches the maximum. The value of i indicating the reference picture index refIdxLX is assigned in order starting from 0. The derived merge candidates are stored in the merge candidate storage unit 303611.
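The zero merge candidate filling can be sketched as follows. The dictionary layout and the parameter max_num_merge_cand are illustrative assumptions; the sketch simply increments i from 0 and does not model any limit on i imposed by the number of available reference pictures, which this passage does not describe.

```python
def fill_zero_merge_candidates(merge_cand_list, max_num_merge_cand):
    """Append zero-vector candidates until the list holds max_num_merge_cand entries."""
    filled = list(merge_cand_list)
    i = 0  # reference picture index, assigned in order from 0
    while len(filled) < max_num_merge_cand:
        filled.append({"refIdxLX": i, "mvLX": (0, 0)})
        i += 1
    return filled
```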

From the merge candidates stored in the merge candidate storage unit 303611, the merge candidate selection unit 30362 selects, as the inter prediction parameters of the target PU, the merge candidate assigned the index corresponding to the merge index merge_idx input from the inter prediction parameter decoding control unit 3031. That is, when the merge candidate list is mergeCandList, the prediction parameters indicated by mergeCandList[merge_idx] are selected and output to the bi-prediction restriction unit 30363.

When an inter-view merge candidate is selected as the merge candidate, the merge candidate selection unit 30362 sets the sub-block motion compensation flag subPbMotionFlag to 1. The merge candidate selection unit 30362 may also set subPbMotionFlag to 1 when the VSP mode flag vspModeFlag of the merge candidate is 1. Otherwise, subPbMotionFlag is set to 0.

Under bi-prediction restriction condition 1 below, the bi-prediction restriction unit 30363 converts bi-prediction (predFlagL0 = 1 and predFlagL1 = 1) into uni-prediction by setting the L1 reference picture index refIdxL1 and the L1 prediction usage flag predFlagL1 as follows.
refIdxL1 = -1, predFlagL1 = 0

Bi-prediction restriction condition 1: the selected prediction parameters indicate bi-prediction (predFlagL0 = 1 and predFlagL1 = 1) and the size of the prediction unit is smaller than a predetermined size (the sum of the width nOrigPbW and the height nOrigPbH of the prediction unit is equal to 12).
The bi-prediction restriction unit 30363 stores the selected merge candidate in the prediction parameter memory 307 and outputs it to the predicted image generation unit 308.
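Bi-prediction restriction condition 1 can be sketched as follows. The dictionary representation of the prediction parameters and the function name are assumptions made for illustration.

```python
def restrict_bi_prediction(params, n_orig_pb_w, n_orig_pb_h):
    """Convert bi-prediction to uni-prediction (L0 only) for 8x4 and 4x8 units."""
    is_bi = params["predFlagL0"] == 1 and params["predFlagL1"] == 1
    if is_bi and n_orig_pb_w + n_orig_pb_h == 12:
        # Drop the L1 side; the L0 parameters are kept unchanged.
        return {**params, "refIdxL1": -1, "predFlagL1": 0}
    return dict(params)
```

A width-plus-height sum of 12 corresponds to 8x4 and 4x8 prediction units; larger units pass through unchanged.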

predSamplesLX´[x][y] = predSamplesLX[x][y]
FIG. 10 is a schematic diagram illustrating the configuration of the AMVP prediction parameter derivation unit 3032 according to the present embodiment. The AMVP prediction parameter derivation unit 3032 includes a vector candidate derivation unit 3033, a prediction vector selection unit 3034, and an inter prediction identifier derivation unit 3035. The vector candidate derivation unit 3033 reads vectors stored in the prediction parameter memory 307 based on the reference picture index refIdx and generates a vector candidate list mvpListLX. The reference block is a block at a predetermined position relative to the position of the prediction unit (for example, a block touching the lower-left or upper-right corner of the prediction unit, or a temporally adjacent block).

From the vector candidates mvpListLX derived by the vector candidate derivation unit 3033, the prediction vector selection unit 3034 selects the vector mvpListLX[mvp_LX_flag] indicated by the prediction vector flag mvp_LX_flag input from the inter prediction parameter decoding control unit 3031 as the prediction vector mvpLX. The prediction vector selection unit 3034 outputs the selected prediction vector mvpLX to the addition unit 3035.

The addition unit 3035 calculates the vector mvLX by adding the prediction vector mvpLX input from the prediction vector selection unit 3034 and the difference vector mvdLX input from the inter prediction parameter decoding control unit. The addition unit 3035 outputs the calculated vector mvLX to the predicted image generation unit 308.
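The selection and addition steps amount to the following small sketch, in which the function name and the tuple representation of vectors are illustrative assumptions.

```python
def reconstruct_mv(mvp_list_lx, mvp_lx_flag, mvd_lx):
    """Select mvpLX from the candidate list and add the difference vector mvdLX."""
    mvp_lx = mvp_list_lx[mvp_lx_flag]
    return (mvp_lx[0] + mvd_lx[0], mvp_lx[1] + mvd_lx[1])
```

For example, with candidates [(4, -2), (0, 8)], mvp_LX_flag = 1, and mvdLX = (3, -1), the reconstructed vector mvLX is (3, 7).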

FIG. 15 is a block diagram illustrating the configuration of the inter prediction parameter decoding control unit 3031 according to the embodiment of the present invention. As illustrated in FIG. 15, the inter prediction parameter decoding control unit 3031 includes a partition mode decoding unit 30311, an inter prediction identifier decoding unit 30312, a DBBP flag decoding unit 30313, and, not illustrated, a merge flag decoding unit, a merge index decoding unit, a reference picture index decoding unit, a vector candidate index decoding unit, a vector difference decoding unit, a residual prediction index decoding unit, and an illumination compensation flag decoding unit. The partition mode decoding unit, merge flag decoding unit, merge index decoding unit, inter prediction identifier decoding unit, reference picture index decoding unit, vector candidate index decoding unit, and vector difference decoding unit decode the partition mode part_mode, the merge flag merge_flag, the merge index merge_idx, the inter prediction identifier inter_pred_idc, the reference picture index refIdxLX, the prediction vector flag mvp_LX_flag, and the difference vector mvdLX, respectively.

The inter prediction identifier decoding unit 30312 decodes the inter prediction identifier inter_pred_flag, which indicates whether the prediction unit uses L0 prediction (PRED_L0), L1 prediction (PRED_L1), or bi-prediction (PRED_BI).

The residual prediction index decoding unit uses the entropy decoding unit 301 to decode the residual prediction index iv_res_pred_weight_idx from the encoded data when the partition mode PartMode (part_mode) of the coding unit CU is 2Nx2N. Otherwise, the residual prediction index decoding unit sets (infers) iv_res_pred_weight_idx to 0. The residual prediction index decoding unit outputs the decoded residual prediction index iv_res_pred_weight_idx to the merge mode parameter derivation unit 3036 and the inter predicted image generation unit 309. The residual prediction index is a parameter for changing the operation of residual prediction. In the present embodiment, it is an index indicating the weight of residual prediction and takes the value 0, 1, or 2. When iv_res_pred_weight_idx is 0, residual prediction is not performed. Instead of changing the residual prediction weight according to the index, the vector used for residual prediction may be changed. Instead of a residual prediction index, a flag indicating whether residual prediction is performed (residual prediction flag) may also be used.

The illumination compensation flag decoding unit uses the entropy decoding unit 301 to decode the illumination compensation flag ic_flag from the encoded data when the partition mode PartMode is 2Nx2N. Otherwise, the illumination compensation flag decoding unit sets (infers) ic_flag to 0. The illumination compensation flag decoding unit outputs the decoded illumination compensation flag ic_flag to the merge mode parameter derivation unit 3036 and the inter predicted image generation unit 309.
The displacement vector derivation unit 352, the split flag derivation unit 353, and the depth DV derivation unit 351, which are the units used for deriving prediction parameters, are described below in that order.

(Displacement vector derivation unit 352)
The displacement vector derivation unit 352 extracts the displacement vector (hereinafter denoted MvDisp[x][y] or mvDisp[x][y]) of the coding unit to which the target PU belongs (the target CU) from blocks spatially or temporally adjacent to that coding unit. Specifically, using as reference blocks the block Col temporally adjacent to the target CU, the second temporally adjacent block AltCol, the block A1 spatially adjacent on the left, and the block B1 adjacent above, it extracts the prediction flag predFlagLX, the reference picture index refIdxLX, and the vector mvLX of each reference block in that order. When the extracted vector mvLX is a displacement vector, the displacement vector of that adjacent block is output. When the prediction parameters of an adjacent block contain no displacement vector, the prediction parameters of the next adjacent block are read and a displacement vector is derived in the same way. When no displacement vector can be derived from any adjacent block, a zero vector is output as the displacement vector. The displacement vector derivation unit 352 also outputs the reference picture index and the view ID (RefViewIdx[x][y], where (xP, yP) are coordinates) of the block from which the displacement vector was derived.
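The neighbour scan described above can be sketched as follows. The per-block dictionary, the is_disparity field used to mark a displacement vector, and the None values returned for the zero-vector fallback are all assumptions for illustration; the text does not state what reference picture index or view ID accompanies the fallback.

```python
def derive_nbdv(neighbours):
    """Scan neighbours in the order Col, AltCol, A1, B1 and return
    (displacement vector, refIdxLX, RefViewIdx) of the first block
    whose vector is a displacement vector."""
    for blk in neighbours:
        if blk is not None and blk["is_disparity"]:
            return blk["mv"], blk["refIdx"], blk["refViewIdx"]
    # No neighbour provides a displacement vector: output the zero vector.
    return (0, 0), None, None
```

Unavailable neighbours are represented here by None and are skipped.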

The displacement vector obtained above is called NBDV (Neighbour Base Disparity Vector). The displacement vector derivation unit 352 further outputs the obtained displacement vector NBDV to the depth DV derivation unit 351. The depth DV derivation unit 351 derives depth-derived displacement vectors (the displacement array disparitySamples). The depth DV derivation unit 351 updates (refines) the displacement vector by using the displacement vector disparitySamples obtained from the depth as the horizontal component mvLX[0] of the motion vector. The updated displacement vector is called DoNBDV (Depth Orientated Neighbour Base Disparity Vector). The displacement vector derivation unit 352 outputs the displacement vector (DoNBDV) to the inter-layer merge candidate derivation unit 30371, the displacement merge candidate derivation unit, and the view synthesis prediction merge candidate derivation unit. It also outputs the obtained displacement vector (NBDV) to the inter predicted image generation unit 309.

(Displacement vector of the VSP prediction unit 30374 and the DBBP prediction unit 3095)
An image decoding apparatus in which the VSP prediction unit 30374 and the DBBP prediction unit 3095 use a common displacement vector (disparity vector) is described below. As illustrated in FIG. 14, in the partitioning by the split flag derivation unit 353 and in the derivation of the displacement vector array disparitySamples by the depth DV derivation unit 351, the VSP prediction unit 30374 derives the coordinates (xTL, yTL) obtained by shifting the target block by the displacement vector MvDisp and refers to the point on the depth block at coordinates (xTL, yTL). Similarly, in the segmentation unit 30952 and the DBBP division mode derivation unit 30954, the DBBP prediction unit 3095 derives the coordinates (xTL, yTL) obtained by shifting the target block by the displacement vector MvDisp and refers to the point on the depth block at coordinates (xTL, yTL).

Conventionally, the VSP prediction unit 30374 derives the depth block coordinates (xTL, yTL) using the displacement vector NBDV, which has not been updated (refined) by depth reference, as the displacement vector mvDisp, and the depth DV derivation unit 351 then derives the displacement vectors of the sub-blocks. In this case, no depth reference is needed to derive mvDisp, and depth is referenced only when deriving the sub-block displacement vectors, so a single depth reference suffices. In contrast, the DBBP prediction unit 3095 derives the depth block coordinates (xTL, yTL) using the displacement vector DoNBDV, which has been updated by depth reference, as mvDisp, and the segmentation unit 30952 and the DBBP division mode derivation unit 30954 then refer to the depth. In this case, depth is referenced using the displacement vector NBDV to derive mvDisp, and depth is referenced again (depth transfer) using a different displacement vector, DoNBDV, for the segmentation unit 30952 and the DBBP division mode derivation unit 30954. Two depth transfers are therefore required, so the amount of depth image transfer and processing is large. Furthermore, if the VSP prediction unit 30374 and the DBBP prediction unit 3095 perform different depth transfers, the processing cannot be shared, which produces a design imbalance in which one process is complex and the other is simple. Because the overall worst-case complexity is determined by the maximum of the two processes, simplifying only one of them does not reduce the overall complexity. From a design standpoint, it is therefore preferable to allow the same degree of complexity in both, as long as the worst-case complexity can be shared. Accordingly, it is preferable to share the criterion that determines to what extent the updated displacement vector is used in view synthesis prediction and in DBBP, and to use the same displacement vector in both.

In this embodiment, the worst-case processing amount of depth reference is reduced while the processing of the VSP prediction unit 30374 and the DBBP prediction unit 3095 is made common. The worst case arises when depth reference is performed many times for small blocks. Therefore, in this embodiment, when the size of the target block is larger than a predetermined size, the displacement vector updated with reference to the depth (DoNBDV) is used; otherwise, the displacement vector not updated with reference to the depth (NBDV) is used.

More specifically, when the size of the target block is larger than the predetermined size, the VSP prediction unit 30374 uses the displacement vector DoNBDV, updated by depth reference, as the displacement vector mvDisp to derive the coordinates (xTL, yTL) of the depth block, and the depth DV deriving unit 351 derives the displacement vectors of the sub-blocks. Conversely, when the size of the target block is equal to or smaller than the predetermined size, the VSP prediction unit 30374 uses the displacement vector NBDV, not updated by depth reference, as mvDisp to derive the coordinates (xTL, yTL) of the depth block, and the depth DV deriving unit 351 derives the displacement vectors of the sub-blocks. Similarly, when the size of the target block is larger than the predetermined size, the DBBP prediction unit 3095 uses the displacement vector DoNBDV, updated by depth reference, as mvDisp to derive the coordinates (xTL, yTL) of the depth block, and the segmentation unit 30952 and the DBBP division mode deriving unit 30954 refer to the depth. Conversely, when the size of the target block is equal to or smaller than the predetermined size, the DBBP prediction unit 3095 uses the displacement vector NBDV, not updated by depth reference, as mvDisp to derive the coordinates (xTL, yTL) of the depth block, and the segmentation unit 30952 and the DBBP division mode deriving unit 30954 refer to the depth.

FIG. 29 is a flowchart for explaining the operation of the image decoding device 31 and the image encoding device 11 configured so that the VSP prediction unit 30374 and the DBBP prediction unit 3095 use a common displacement vector.

(S3001) It is determined whether the prediction unit that is the target block is larger than a predetermined size. If YES, the processing proceeds to S3002. If NO, the processing proceeds to S3003.

(S3002) When the prediction unit is larger than the predetermined size, the disparity vector DoNBDV, obtained by refinement using the depth image, is used as the disparity vector.

(S3003) When the prediction unit is not larger than the predetermined size, the disparity vector NBDV, obtained without refinement using the depth image, is used as the disparity vector.
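The decision in S3001 to S3003 can be sketched as a single selection function. This is a minimal illustration only: the function name `select_disparity_vector` and the `threshold` parameter are hypothetical, and the size measure (width plus height compared with 16) is taken from the formulas that the text gives next.

```python
def select_disparity_vector(nPbW, nPbH, nbdv, donbdv, threshold=16):
    """Return the disparity vector shared by VSP and DBBP for one block.

    nbdv   : disparity vector not refined by the depth image (NBDV)
    donbdv : disparity vector refined by the depth image (DoNBDV)
    threshold is an illustrative parameter; the text uses 16 or 24.
    """
    # S3001: is the prediction unit larger than the predetermined size?
    if nPbW + nPbH > threshold:
        # S3002: large block, use the depth-refined vector.
        return donbdv
    # S3003: small block, use the unrefined vector.
    return nbdv
```

An 8×8 block (sum 16) therefore keeps NBDV, while a 16×8 block switches to DoNBDV when the threshold is 16.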

More specifically, the image decoding device 31 and the image encoding device 11 perform the following processing. In the partition division of the division flag deriving unit 353 and the depth DV derivation of the depth DV deriving unit 351 included in the VSP prediction unit 30374, as in Formula A1 below, the displacement vector MvRefinedDisp[ xPb ][ yPb ] updated using the depth image is used when the sum of the width nPbW and the height nPbH of the prediction block exceeds a predetermined value (here, 16), and the displacement vector MvDisp[ xPb ][ yPb ] not updated using the depth image is used otherwise. Using the selected vector, the depth DV deriving unit 351 derives the disparity array DisparitySamples from which the sub-block displacement vector mvLX is obtained, and the partition dividing unit performs partition division according to the division flag horSplitFlag derived by the division flag deriving unit 353.

mvLXVSP = nPbW + nPbH > 16 ? MvRefinedDisp[ xPb ][ yPb ] : MvDisp[ xPb ][ yPb ] (Formula A1)
Similarly, in the DBBP prediction unit 3095, the segmentation unit 30952 and the DBBP division mode deriving unit 30954 use the displacement vector MvRefinedDisp[ xTb ][ yTb ] refined using the depth image when the sum of the prediction block width (here, nTbS) and the prediction block height (here, nTbS) exceeds a predetermined value (here, 16), and the displacement vector MvDisp[ xTb ][ yTb ] not refined using the depth image otherwise. The selected vector is used for the segMask derivation in the segmentation unit 30952 and for the derivation of the division mode PartMode in the DBBP division mode deriving unit 30954.

nTbS + nTbS > 16 ? MvRefinedDisp[ xTb ][ yTb ] : MvDisp[ xTb ][ yTb ] (Formula A2)
The predetermined size may be larger than the above. For example, the predetermined size may suitably be set to 24, as in Formulas A1' and A2' below, instead of Formulas A1 and A2.

mvLXVSP = nPbW + nPbH > 24 ? MvRefinedDisp[ xPb ][ yPb ] : MvDisp[ xPb ][ yPb ] (Formula A1')
nTbS + nTbS > 24 ? MvRefinedDisp[ xTb ][ yTb ] : MvDisp[ xTb ][ yTb ] (Formula A2')
FIG. 30 is a diagram illustrating the data flow of an example in which the VSP prediction unit 30374 and the DBBP prediction unit 3095 use a common displacement vector. As already described, the depth DV deriving unit 351 derives the displacement vector MvRefinedDisp[][] updated by depth reference, using depth image #1 read from the reference picture memory 306. Depth image #1 is the depth block determined by the upper-left coordinates of the block and the displacement vector MvDisp[][] before it is updated by depth reference. The switch 354 selects MvRefinedDisp[][] when the block size is larger than the predetermined size and MvDisp[][] otherwise, and sets the selected vector as the displacement vector mvDisp. In view synthesis prediction, the depth DV deriving unit 351 uses mvDisp to derive the disparity array DisparitySamples in sub-block units, and the VSP prediction unit 30374 generates a predicted image by motion/displacement compensation that uses the values in DisparitySamples as horizontal vectors. At this time, the depth DV deriving unit 351 refers to depth image #2, the depth block determined by mvDisp. Also in view synthesis prediction, the division flag deriving unit 353 selects a sub-block size of 8×4 or 4×8 using depth image #2 referenced via mvDisp. When mvDisp is equal to MvDisp[][], the depth block referenced to derive MvRefinedDisp[][] is the same as the depth block referenced to derive the disparity array DisparitySamples and the division flag horSplitFlag, so depth image #2 is equal to depth image #1. A single depth image transfer therefore suffices for the whole series of processes (deriving MvRefinedDisp, DisparitySamples, and horSplitFlag), and two depth transfers are not needed. Conversely, when mvDisp is MvRefinedDisp[][], depth image #2 is a different block from depth image #1, so two depth transfers are necessary.

Similarly, in DBBP prediction, the segmentation unit 30952 and the DBBP division mode deriving unit 30954 derive the segmentation information segMask and the division mode PartMode using the displacement vector mvDisp. At this time, the segmentation unit 30952 and the DBBP division mode deriving unit 30954 refer to depth image #2, the depth block determined by mvDisp. Here again, when mvDisp is equal to MvDisp[][], depth image #2 is equal to depth image #1 and two depth transfers are not needed. Conversely, when mvDisp is MvRefinedDisp[][], depth image #2 differs from depth image #1, so two depth transfers are necessary.
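The transfer count discussed for FIG. 30 can be illustrated with a small sketch. The helper names below are hypothetical, the threshold of 16 follows Formula A1, and treating two depth blocks as identical when their upper-left coordinates coincide is an assumption made here for illustration (the text distinguishes the two cases only by which vector the switch 354 selects).

```python
def depth_block_origin(x, y, mv):
    """Upper-left corner of the depth block addressed by a quarter-sample vector."""
    return (x + ((mv[0] + 2) >> 2), y + ((mv[1] + 2) >> 2))


def count_depth_transfers(x, y, w, h, mv_disp, mv_refined, threshold=16):
    """Count the depth blocks that must be fetched for one prediction block."""
    # Depth image #1: addressed by the unrefined vector, used to derive MvRefinedDisp.
    block1 = depth_block_origin(x, y, mv_disp)
    # Switch 354: refined vector for large blocks, unrefined vector otherwise.
    selected = mv_refined if w + h > threshold else mv_disp
    # Depth image #2: addressed by the selected vector mvDisp.
    block2 = depth_block_origin(x, y, selected)
    # One transfer if both stages read the same block, two otherwise.
    return 1 if block2 == block1 else 2
```

With these assumptions, an 8×8 block always needs a single transfer, while a 16×16 block whose refined vector points elsewhere needs two.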

Although FIG. 30 describes an example that uses the division flag deriving unit 353, the segmentation unit 30952, the DBBP division mode deriving unit 30954, and so on, the units are not limited to these and may be any of the modifications described in this specification. For example, using the DBBP division mode deriving unit 30954C is preferable because the division processing is then shared between view synthesis prediction and DBBP. FIG. 31 shows an example in which the DBBP prediction unit 3095C (DBBP division mode deriving unit 30954C) is used instead of the DBBP prediction unit 3095 (DBBP division mode deriving unit 30954). In this case as well, by switching between MvDisp and MvRefinedDisp according to the size of the target block as described above, the derivation of the disparity array DisparitySamples by the depth DV deriving unit 351 in view synthesis prediction, the derivation of the division flag horSplitFlag by the division flag deriving unit, and the derivation of segMask by the segmentation unit 30952 all use a common displacement vector.

The image decoding device 31 and the image encoding device 11 configured as described above each include depth-based block predicted image generation means (DBBP prediction unit 3095) and view synthesis prediction means. The depth-based block predicted image generation means includes a segmentation deriving unit 30952 that derives segmentation information from a depth image, a DBBP image interpolation unit 30951 that generates two motion-compensated images, an image synthesis unit 30953 that synthesizes the two interpolated images to generate one motion-compensated image, and a DBBP division mode deriving unit 30954 that derives the division mode PartMode. The view synthesis prediction means includes a partition dividing unit that derives the division flag horSplitFlag from a depth image and performs partition division to obtain the sub-block size, and a depth motion DV deriving unit 351 that derives the disparity array DisparitySamples from a depth image to obtain the motion vector mvLX. The disparity vector used to derive the position of the depth image referenced by the segmentation deriving unit 30952 and the DBBP division mode deriving unit 30954 of the depth-based block predicted image generation means, and the disparity vector used by the partition dividing unit and the depth motion DV deriving unit 351 of the view synthesis prediction means to derive the position of the depth image, are a common disparity vector.

In a modification of the above configuration, the image decoding device 31 and the image encoding device 11 each include depth-based block predicted image generation means (DBBP prediction unit 3095C) and view synthesis prediction means. The depth-based block predicted image generation means includes a segmentation deriving unit 30952 that derives segmentation information from a depth image, a DBBP image interpolation unit 30951 that generates two motion-compensated images, an image synthesis unit 30953 that synthesizes the two interpolated images to generate one motion-compensated image, and a DBBP division mode deriving unit 30954C that derives the division mode PartMode. The view synthesis prediction means includes a partition dividing unit that derives the division flag horSplitFlag from a depth image and performs partition division to obtain the sub-block size, and a depth motion DV deriving unit 351 that derives the disparity array DisparitySamples from a depth image to obtain the motion vector mvLX. The disparity vector used to derive the position of the depth image referenced by the segmentation deriving unit 30952 of the depth-based block predicted image generation means, and the disparity vector used by the partition dividing unit and the depth motion DV deriving unit 351 of the view synthesis prediction means to derive the position of the depth image, are a common disparity vector. In addition, the DBBP division mode deriving unit 30954C and the partition dividing unit use the output of the common division flag deriving unit 353 to derive the division mode PartMode and the sub-block size, respectively. Note that the division flag deriving unit 353A or the like may be used instead of the division flag deriving unit 353.

Preferably, the common disparity vector is the disparity vector refined by depth when the block size is larger than a predetermined size, and the disparity vector before refinement by depth when the block size is equal to or smaller than the predetermined size.

More preferably, the common disparity vector is the disparity vector refined by depth when the sum of the width and height of the prediction block is larger than 16, and the disparity vector before refinement by depth otherwise.

Alternatively, the common disparity vector may be the disparity vector refined by depth when the sum of the width and height of the prediction block is larger than 24, and the disparity vector before refinement by depth otherwise.

According to the image decoding device 31 and the image encoding device 11 configured as described above, the VSP prediction unit 30374 and the DBBP prediction unit 3095 use a common displacement vector, which reduces the processing amount of depth transfer and simplifies its implementation.

According to the image decoding device 31 and the image encoding device 11 configured as described above, small blocks of the predetermined size or less do not use a disparity vector that requires refinement by the depth image. Accesses to the depth image therefore decrease, which reduces both the memory bandwidth for transferring the depth image and the processing amount for referencing it.

(Division flag deriving unit 353)
The division flag deriving unit 353 refers to the depth image corresponding to the target block and derives the division flag horSplitFlag. In the following description, the coordinates of the target block input to the division flag deriving unit 353 are (xP, yP), its width and height are nPSW and nPSH, and the displacement vector is mvDisp. The division flag deriving unit 353 refers to the depth image when the width and height of the target block are equal; when they are not equal, it may derive the division flag horSplitFlag without referring to the depth image. Details of the division flag deriving unit 353 are described below.

The division flag deriving unit 353 reads, from the reference picture memory 306, the depth image refDepPels that has the same POC as the decoding target picture and the same view ID as the view ID (RefViewIdx) of the reference picture indicated by the displacement vector mvDisp.

Next, the division flag deriving unit 353 derives the coordinates (xTL, yTL), obtained by shifting the upper-left coordinates (xP, yP) of the target block by the displacement vector mvDisp, by the following formulas.

xTL = xP + ( ( mvDisp[ 0 ] + 2 ) >> 2 )
yTL = yP + ( ( mvDisp[ 1 ] + 2 ) >> 2 )
Here, mvDisp[ 0 ] and mvDisp[ 1 ] are the X component and the Y component of the displacement vector mvDisp, respectively. The derived coordinates (xTL, yTL) are the coordinates, on the depth image refDepPels, of the block corresponding to the target block.
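The coordinate derivation above can be sketched as follows. The function name is hypothetical, and reading the `+ 2` and `>> 2` as rounding a quarter-sample vector to the nearest integer sample is an interpretation, not something the text states explicitly. Python's `>>` on negative integers is an arithmetic (floor) shift, which is the behavior assumed here.

```python
def depth_block_top_left(xP, yP, mvDisp):
    """Shift the block's upper-left corner (xP, yP) by the displacement vector."""
    # Add 2 and shift right by 2: round each component to an integer sample offset
    # (assuming the vector is stored in quarter-sample units).
    xTL = xP + ((mvDisp[0] + 2) >> 2)
    yTL = yP + ((mvDisp[1] + 2) >> 2)
    return xTL, yTL
```

For example, a block at (16, 8) with mvDisp = (5, 0) maps to (17, 8), since (5 + 2) >> 2 is 1.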

The division flag deriving unit 353 sets the flag minSubBlkSizeFlag to 1 by the following formula when the width nPSW or the height nPSH of the target block is not a multiple of 8.

minSubBlkSizeFlag = ( nPSW % 8 != 0 ) || ( nPSH % 8 != 0 )
When the flag minSubBlkSizeFlag is 1, the division flag deriving unit 353 sets horSplitFlag by the following formula: to 1 when the height of the target block is not a multiple of 8 (when nPSH % 8 is nonzero), and to 0 otherwise.

horSplitFlag = ( nPSH % 8 != 0 )
That is, horSplitFlag is set to 1 when the height of the target block is not a multiple of 8 (when nPSH % 8 is nonzero), and to 0 when only the width of the target block is not a multiple of 8 (when nPSW % 8 is nonzero and nPSH % 8 is zero).

The division flag deriving unit 353 derives the sub-block size from depth values, by comparing the four corner points (TL, TR, BL, BR) of the prediction block. When the flag minSubBlkSizeFlag is 0, let refDepPelsP0 be the pixel value of the depth image at the upper-left corner (TL) of the target block, refDepPelsP1 the pixel value at the upper-right corner (TR), refDepPelsP2 the pixel value at the lower-left corner (BL), and refDepPelsP3 the pixel value at the lower-right corner (BR). The unit then determines whether the following conditional expression (horSplitFlag) holds.
horSplitFlag = ( refDepPelsP0 > refDepPelsP3 ) == ( refDepPelsP1 > refDepPelsP2 )
Note that the following formula, in which the direction of the inequality signs is changed, may also be used to derive horSplitFlag.

horSplitFlag = ( refDepPelsP0 < refDepPelsP3 ) == ( refDepPelsP1 < refDepPelsP2 )
The division flag deriving unit 353 outputs horSplitFlag to the DBBP division mode deriving unit 30954C and the VSP prediction unit 30374.
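Taken together, the derivation performed by the division flag deriving unit 353 can be sketched as below. The function name and the flat argument list are illustrative choices; the four corner values are assumed to have already been read from refDepPels.

```python
def hor_split_flag_353(nPSW, nPSH, p0, p1, p2, p3):
    """Derive horSplitFlag from the block shape or from four depth corners.

    p0..p3 are the depth values at TL, TR, BL, BR
    (refDepPelsP0 .. refDepPelsP3 in the text).
    """
    # Width or height not a multiple of 8: decide from the shape alone.
    minSubBlkSizeFlag = (nPSW % 8 != 0) or (nPSH % 8 != 0)
    if minSubBlkSizeFlag:
        return int(nPSH % 8 != 0)
    # Otherwise compare the two diagonals of the depth block.
    return int((p0 > p3) == (p1 > p2))
```

Only the second branch depends on depth values; in the first branch the corner arguments are ignored.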

The division flag deriving unit 353 may alternatively derive the flag as follows. When the width nPSW and the height nPSH of the target block differ, the flag is derived from the width and height of the target block as follows.

If nPSW > nPSH, horSplitFlag = 1
Otherwise, if nPSH > nPSW, horSplitFlag = 0
Otherwise, that is, when the width and height of the target block are equal, the flag is derived with reference to the depth by the following formula.

horSplitFlag = ( refDepPelsP0 > refDepPelsP3 ) == ( refDepPelsP1 > refDepPelsP2 )
Note that the target block of the division flag deriving unit 353 is a prediction unit in the case of view synthesis prediction, and a block whose width and height are equal in the case of DBBP. Because the width and height are equal in the case of DBBP, the above derivation method derives the division flag horSplitFlag by referring to the four corners of the depth image.
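The shape-first variant can be sketched so that the depth image is touched only for square blocks. The callable `read_corners` is a hypothetical stand-in for the depth access; it is assumed to return the four corner values (TL, TR, BL, BR).

```python
def hor_split_flag_by_shape(nPSW, nPSH, read_corners):
    """Derive horSplitFlag, reading depth only when the block is square."""
    if nPSW > nPSH:
        return 1  # wider than tall: no depth access needed
    if nPSH > nPSW:
        return 0  # taller than wide: no depth access needed
    # Square block: fetch the four depth corners and compare the diagonals.
    p0, p1, p2, p3 = read_corners()
    return int((p0 > p3) == (p1 > p2))
```

Passing the depth access as a callable makes the saving explicit: for non-square prediction units the callable is never invoked.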

(Division flag deriving unit 353A)
Hereinafter, the division flag deriving unit 353A, which is a modification of the division flag deriving unit 353, will be described. The division flag deriving unit 353A refers to the depth image corresponding to the target block and derives the division flag horSplitFlag. In the following description, the coordinates of the target block input to the division flag deriving unit 353A are (xP, yP), its width and height are nPSW and nPSH, and the displacement vector is mvDisp.

The division flag deriving unit 353A reads, from the reference picture memory 306, the depth image refDepPels that has the same POC as the decoding target picture and the same view ID as the view ID (RefViewIdx) of the reference picture indicated by the displacement vector mvDisp.

Next, the division flag deriving unit 353A derives the coordinates (xTL, yTL), obtained by shifting the upper-left coordinates (xP, yP) of the target block by the displacement vector mvDisp, using the same formulas as the division flag deriving unit 353.

The division flag deriving unit 353A sets the flag minSubBlkSizeFlag to 1 by the following formula when the width nPSW or the height nPSH of the target block is not a multiple of 8.

minSubBlkSizeFlag = ( nPSW % 8 != 0 ) || ( nPSH % 8 != 0 )
When the flag minSubBlkSizeFlag is 1, the division flag deriving unit 353A sets horSplitFlag to 1 when the height of the target block is not a multiple of 8 (when nPSH % 8 is nonzero), and to 0 otherwise.

When both the width and the height of the target block are multiples of 8, the division flag deriving unit 353A derives the sub-block size from depth values, specifically by comparing three corner points (TL, TR, BL) of the prediction block. When the flag minSubBlkSizeFlag is 0, let a = refDepPelsP0 be the pixel value of the depth image at the upper-left corner (TL) of the target block, b = refDepPelsP1 the pixel value at the upper-right corner (TR), and c = refDepPelsP2 the pixel value at the lower-left corner (BL). When the horizontal absolute difference abs( a - b ) is larger than the vertical absolute difference abs( a - c ), horSplitFlag = 0 is derived, which gives vertically long sub-blocks (4×8). Otherwise, that is, when abs( a - b ) is less than or equal to abs( a - c ), horSplitFlag = 1 is derived, which gives horizontally long sub-blocks (8×4). Specifically, the division flag deriving unit 353A derives horSplitFlag by the following formulas.

a = refDepPelsP0
b = refDepPelsP1
c = refDepPelsP2
horSplitFlag = abs( a - b ) > abs( a - c ) ? 0 : 1
Note that the following formula, in which the handling of equality in the comparison is changed, may also be used to derive horSplitFlag.

horSplitFlag = abs( a - b ) >= abs( a - c ) ? 0 : 1
The division flag deriving unit 353A outputs horSplitFlag to the DBBP division mode deriving unit 30954C and the VSP prediction unit 30374.
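The three-corner variant of unit 353A can be sketched as follows. The function name and the `tie_to_vertical` switch are hypothetical; the switch simply selects between the two formulas given above, which differ only in how equality is handled.

```python
def hor_split_flag_353a(nPSW, nPSH, a, b, c, tie_to_vertical=False):
    """Derive horSplitFlag from three depth corners.

    a, b, c are the depth values at TL, TR, BL
    (refDepPelsP0, refDepPelsP1, refDepPelsP2 in the text).
    """
    # Width or height not a multiple of 8: decide from the shape alone.
    if (nPSW % 8 != 0) or (nPSH % 8 != 0):
        return int(nPSH % 8 != 0)
    horizontal = abs(a - b)  # depth change along the top edge
    vertical = abs(a - c)    # depth change along the left edge
    if tie_to_vertical:
        # Variant formula: ties also select 4x8 (horSplitFlag = 0).
        return 0 if horizontal >= vertical else 1
    # Default formula: ties select 8x4 (horSplitFlag = 1).
    return 0 if horizontal > vertical else 1
```

Compared with unit 353, this reads one fewer depth sample per block and replaces the diagonal comparison with a comparison of two edge gradients.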

(Depth DV deriving unit 351)
The depth DV deriving unit 351 derives, in units of designated blocks (sub-blocks), the disparity array disparitySamples (horizontal vectors), which holds the horizontal components of the depth-derived displacement vectors. The inputs of the depth DV deriving unit 351 are the depth-to-DV conversion table DepthToDisparityB, the block width nBlkW and height nBlkH, the division flag splitFlag, the depth image refDepPels, the coordinates (xTL, yTL) of the corresponding block on the depth image refDepPels, and the view ID refViewIdx. The output is the disparity array disparitySamples (horizontal vectors).

Note that, when the displacement vector is mvDisp, the coordinates (xTL, yTL) of the corresponding block on the depth image refDepPels are derived by the following formulas.

xTL = xP + ( ( mvDisp[ 0 ] + 2 ) >> 2 )
yTL = yP + ( ( mvDisp[ 1 ] + 2 ) >> 2 )
The depth DV deriving unit 351 sets, for each target block, the pixels used to derive the representative depth value maxDep. Specifically, as shown in FIG. 13, when the coordinates of the sub-block relative to the top-left position (xTL, yTL) of the prediction block are (xSubB, ySubB), the left-end X coordinate xP0, the right-end X coordinate xP1, the top-end Y coordinate yP0, and the bottom-end Y coordinate yP1 of the sub-block are obtained by the following equations.

xP0 = Clip3( 0, pic_width_in_luma_samples - 1, xTL + xSubB )
yP0 = Clip3( 0, pic_height_in_luma_samples - 1, yTL + ySubB )
xP1 = Clip3( 0, pic_width_in_luma_samples - 1, xTL + xSubB + nBlkW - 1 )
yP1 = Clip3( 0, pic_height_in_luma_samples - 1, yTL + ySubB + nBlkH - 1 )
Here, pic_width_in_luma_samples and pic_height_in_luma_samples represent the width and height of the picture, respectively.
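As a rough illustration of the coordinate derivation above, the following Python sketch computes (xTL, yTL) from a quarter-pel displacement vector and then the clipped sub-block corner coordinates. The function names and the argument layout are assumptions made for this sketch only; Python's `>>` on negative integers is an arithmetic shift, which is assumed here to match the intended rounding.

```python
def clip3(lo, hi, v):
    """Clip v into the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))


def sub_block_corners(x_p, y_p, mv_disp, x_sub_b, y_sub_b,
                      n_blk_w, n_blk_h, pic_width, pic_height):
    """Return (xP0, yP0, xP1, yP1) for one sub-block on the depth image."""
    # Round the quarter-pel displacement vector to integer precision.
    x_tl = x_p + ((mv_disp[0] + 2) >> 2)
    y_tl = y_p + ((mv_disp[1] + 2) >> 2)
    # Clip the left/top and right/bottom corners to the picture area.
    x_p0 = clip3(0, pic_width - 1, x_tl + x_sub_b)
    y_p0 = clip3(0, pic_height - 1, y_tl + y_sub_b)
    x_p1 = clip3(0, pic_width - 1, x_tl + x_sub_b + n_blk_w - 1)
    y_p1 = clip3(0, pic_height - 1, y_tl + y_sub_b + n_blk_h - 1)
    return x_p0, y_p0, x_p1, y_p1
```

The clipping keeps all four corner samples inside the depth picture even when the displaced block extends past a picture boundary.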

Next, the depth DV deriving unit 351 derives the representative depth value maxDep of the target block. Specifically, the representative depth value maxDep, which is the maximum of the depth image pixel values refDepPels[ xP0 ][ yP0 ], refDepPels[ xP0 ][ yP1 ], refDepPels[ xP1 ][ yP0 ], and refDepPels[ xP1 ][ yP1 ] at the four points at or near the corners of the sub-block, is derived by the following equations.

maxDep = 0
maxDep = Max( maxDep, refDepPels[ xP0 ][ yP0 ] )
maxDep = Max( maxDep, refDepPels[ xP0 ][ yP1 ] )
maxDep = Max( maxDep, refDepPels[ xP1 ][ yP0 ] )
maxDep = Max( maxDep, refDepPels[ xP1 ][ yP1 ] )
Here, the function Max( x, y ) returns x if the first argument x is greater than or equal to the second argument y, and returns y otherwise.
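The representative depth derivation can be sketched as below. The sketch assumes, for illustration only, that the depth image is stored as a nested list indexed as `ref_dep_pels[x][y]`, mirroring the index order used in the equations; the function name is hypothetical.

```python
def representative_depth(ref_dep_pels, x_p0, y_p0, x_p1, y_p1):
    """Return maxDep, the maximum of the four corner depth samples."""
    max_dep = 0
    corners = ((x_p0, y_p0), (x_p0, y_p1), (x_p1, y_p0), (x_p1, y_p1))
    for x, y in corners:
        # Same running maximum as the four Max( ) steps above.
        max_dep = max(max_dep, ref_dep_pels[x][y])
    return max_dep
```

Only four samples are read per sub-block, regardless of the sub-block size.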

Using the representative depth value maxDep, the depth-to-DV conversion table DepthToDisparityB, and the view ID refViewIdx of the layer indicated by the displacement vector (NBDV), the depth DV deriving unit 351 derives the disparity array disparitySamples, which holds the horizontal components of the depth-derived displacement vectors, for each pixel (x, y) in the target block (x takes values from 0 to nBlkW - 1 and y from 0 to nBlkH - 1) by the following equation.

disparitySamples[ x ][ y ] = DepthToDisparityB[ refViewIdx ][ maxDep ]   (Formula A)
The depth DV deriving unit 351 outputs the derived disparity array disparitySamples[] to the displacement vector deriving unit 352 as (the horizontal component of) the displacement vector DoNBDV. The depth DV deriving unit 351 also outputs it to the VSP prediction unit 30374 as (the horizontal component of) a displacement vector.
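Formula A amounts to one table lookup that is then replicated over the block. A minimal Python sketch, assuming the conversion table is a nested list indexed as `depth_to_disparity_b[ref_view_idx][depth]` (the function name and the table contents in any usage are illustrative assumptions):

```python
def derive_disparity_samples(depth_to_disparity_b, ref_view_idx, max_dep,
                             n_blk_w, n_blk_h):
    """Fill an nBlkW x nBlkH array with the disparity looked up for maxDep."""
    # One lookup per block: every pixel shares the same horizontal disparity.
    disparity = depth_to_disparity_b[ref_view_idx][max_dep]
    # Indexed as disparity_samples[x][y], as in Formula A.
    return [[disparity for _ in range(n_blk_h)] for _ in range(n_blk_w)]
```

Because the value is constant within the block, an implementation could store it once per sub-block instead of per pixel; the array form is kept here only to follow the equation.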

(Inter prediction image generation unit 309)
FIG. 16 is a schematic diagram illustrating the configuration of the inter prediction image generation unit 309 according to the present embodiment. The inter prediction image generation unit 309 includes a motion displacement compensation unit 3091, a residual prediction unit 3092, an illuminance compensation unit 3093, a DBBP prediction unit 3095 (depth-based block prediction image generation device 3095), and a weighted prediction unit 3096.

The inter prediction image generation unit 309 performs the following processing in units of sub-blocks when the sub-block motion compensation flag subPbMotionFlag input from the inter prediction parameter decoding unit 303 is 1, and in units of prediction units when subPbMotionFlag is 0. The sub-block motion compensation flag subPbMotionFlag is set to 1 when an inter-view merge candidate is selected as the merge mode or when a VSP merge candidate is selected. The inter prediction image generation unit 309 derives the predicted image predSamples from the prediction parameters by means of the motion displacement compensation unit 3091. In addition, when the residual prediction index iv_res_pred_weight_idx is not 0, the inter prediction image generation unit 309 sets the residual prediction execution flag resPredFlag to 1, indicating that residual prediction is to be performed, and outputs it to the motion displacement compensation unit 3091 and the residual prediction unit 3092. On the other hand, when the residual prediction index iv_res_pred_weight_idx is 0, it sets the residual prediction execution flag resPredFlag to 0 and outputs it to the motion displacement compensation unit 3091 and the residual prediction unit 3092.

The motion displacement compensation unit 3091, the residual prediction unit 3092, the illuminance compensation unit 3093, and the DBBP prediction unit 3095 each derive the L0 motion compensated image predSamplesL0 or the L1 motion compensated image predSamplesL1 in the case of uni-prediction (predFlagL0 = 1 or predFlagL1 = 1), derive both the L0 motion compensated image predSamplesL0 and the L1 motion compensated image predSamplesL1 in the case of bi-prediction (predFlagL0 = 1 and predFlagL1 = 1), and output them to the weighted prediction unit 3096. The weighted prediction unit 3096 derives the predicted image predSamples from one motion compensated image, predSamplesL0 or predSamplesL1, in the case of uni-prediction, and from the two motion compensated images predSamplesL0 and predSamplesL1 in the case of bi-prediction.

(Motion displacement compensation)
The motion displacement compensation unit 3091 generates a motion prediction image predSampleLX based on the prediction use flag predFlagLX, the reference picture index refIdxLX, and the vector mvLX (a motion vector or a displacement vector). The motion displacement compensation unit 3091 generates the predicted image by reading from the reference picture memory 306, and interpolating, the block located at the position shifted by the vector mvLX from the position of the prediction unit in the reference picture specified by the reference picture index refIdxLX. Here, when the vector mvLX is not an integer vector, the predicted image is generated by applying a filter for generating pixels at fractional positions, called a motion compensation filter (or displacement compensation filter). In general, the above processing is called motion compensation when the vector mvLX is a motion vector, and displacement compensation when it is a displacement vector. Here, both are collectively referred to as motion displacement compensation. Hereinafter, the predicted image of L0 prediction is referred to as predSamplesL0, and the predicted image of L1 prediction as predSamplesL1. When the two are not distinguished, they are referred to as predSamplesLX. In the following, an example is described in which residual prediction and illuminance compensation are further applied to the predicted image predSamplesLX obtained by the motion displacement compensation unit 3091; the output images of these processes are also referred to as predicted images predSamplesLX. In the residual prediction and illuminance compensation described below, when the input image and the output image need to be distinguished, the input image is written as predSamplesLX and the output image as predSamplesLX´.

When the residual prediction execution flag resPredFlag is 0, the motion displacement compensation unit 3091 generates the motion compensated image predSamplesLX using a motion compensation filter with 8 taps for the luminance component and 4 taps for the chrominance component. When the residual prediction execution flag resPredFlag is 1, it generates the motion compensated image predSamplesLX using a 2-tap motion compensation filter for both the luminance and chrominance components.
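The dependence of the filter length on the residual prediction flag can be summarized with a small sketch. The function names and the dictionary return type are assumptions made purely for illustration; only the tap counts and the flag rule come from the description.

```python
def residual_prediction_flag(iv_res_pred_weight_idx):
    """resPredFlag is 1 when the residual prediction index is non-zero."""
    return 1 if iv_res_pred_weight_idx != 0 else 0


def motion_compensation_taps(res_pred_flag):
    """Return the filter tap counts used by motion displacement compensation."""
    if res_pred_flag:
        # Residual prediction on: short (bilinear) 2-tap filters.
        return {"luma": 2, "chroma": 2}
    # Residual prediction off: 8-tap luma and 4-tap chroma filters.
    return {"luma": 8, "chroma": 4}
```

For example, `motion_compensation_taps(residual_prediction_flag(2))` selects the 2-tap filters for both components.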

When the sub-block motion compensation flag subPbMotionFlag is 1, the motion displacement compensation unit 3091 performs motion compensation in units of sub-blocks. Specifically, the vector, the reference picture index, and the reference list use flag of the sub-block at coordinates (xCb, yCb) are derived from the following equations.

MvL0[ xCb + x ][ yCb + y ] = subPbMotionFlag ? SubPbMvL0[ xCb + x ][ yCb + y ] : mvL0
MvL1[ xCb + x ][ yCb + y ] = subPbMotionFlag ? SubPbMvL1[ xCb + x ][ yCb + y ] : mvL1
RefIdxL0[ xCb + x ][ yCb + y ] = subPbMotionFlag ? SubPbRefIdxL0[ xCb + x ][ yCb + y ] : refIdxL0
RefIdxL1[ xCb + x ][ yCb + y ] = subPbMotionFlag ? SubPbRefIdxL1[ xCb + x ][ yCb + y ] : refIdxL1
PredFlagL0[ xCb + x ][ yCb + y ] = subPbMotionFlag ? SubPbPredFlagL0[ xCb + x ][ yCb + y ] : predFlagL0
PredFlagL1[ xCb + x ][ yCb + y ] = subPbMotionFlag ? SubPbPredFlagL1[ xCb + x ][ yCb + y ] : predFlagL1
Here, SubPbMvLX, SubPbRefIdxLX, and SubPbPredFlagLX (X is 0 or 1) correspond to subPbMvLX, subPbRefIdxLX, and subPbPredFlagLX described for the inter-layer merge candidate deriving unit 30371.
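The six selections above share one pattern: per-position sub-block parameters are used when subPbMotionFlag is set, otherwise the parameters of the prediction unit. A hedged Python sketch, in which `MotionParams`, the dictionary keyed by position, and the function name are all illustrative assumptions:

```python
from collections import namedtuple

# Hypothetical container for one reference list's motion parameters.
MotionParams = namedtuple("MotionParams", ["mv", "ref_idx", "pred_flag"])


def select_motion_params(sub_pb_motion_flag, sub_pb_params, pu_params, x, y):
    """Pick sub-block parameters at (x, y) or fall back to the PU parameters."""
    if sub_pb_motion_flag:
        # Sub-block mode: parameters are stored per position.
        return sub_pb_params[(x, y)]
    # Otherwise every position uses the prediction unit's parameters.
    return pu_params
```

The same function would be called once per reference list (L0 and L1) in this sketch.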

(Residual prediction)
The residual prediction unit 3092 performs residual prediction when the residual prediction execution flag resPredFlag is 1. When the residual prediction execution flag resPredFlag is 0, the residual prediction unit 3092 outputs the input predicted image predSamplesLX as it is. Residual prediction is performed by estimating the residual of the motion compensated image predSamplesLX generated by motion prediction or displacement prediction and adding it to the predicted image predSamplesLX of the target layer. Specifically, when the prediction unit uses motion prediction, a residual similar to that of the reference layer is assumed to occur in the target layer as well, and the already derived residual of the reference layer is used as the estimate of the residual of the target layer. When the prediction unit uses displacement prediction, the residual between a reference-layer picture and a target-layer picture at a time (POC) different from that of the target picture is used as the estimate of the residual.

Like the motion displacement compensation unit 3091, the residual prediction unit 3092 performs residual prediction in units of sub-blocks when the sub-block motion compensation flag subPbMotionFlag is 1.

FIG. 17 is a block diagram showing the configuration of the residual prediction unit 3092. The residual prediction unit 3092 includes a reference image interpolation unit 30922 and a residual synthesis unit 30923.

When the residual prediction execution flag resPredFlag is 1, the reference image interpolation unit 30922 generates two residual prediction motion compensated images (the corresponding block rpSamplesLX and the reference block rpRefSamplesLX) using the vector mvLX and the residual prediction displacement vector mvDisp input from the inter prediction parameter decoding unit 303 and the reference picture stored in the reference picture memory 306.

The residual prediction unit 3092 derives the inter-view prediction flag ivRefFlag, a flag indicating whether the target block uses motion prediction or displacement prediction, as ( DiffPicOrderCnt( currPic, RefPicListX[ refIdxLX ] ) = = 0 ). Here, DiffPicOrderCnt( X, Y ) denotes the difference between the POCs of picture X and picture Y (the same applies hereinafter). Accordingly, when the difference between the POC of the target picture currPic and the POC of the reference picture RefPicListX[ refIdxLX ], which is indicated by the reference picture index refIdxLX and the reference picture list RefPicListX, is 0, displacement prediction is regarded as applied to the target block and ivRefFlag is set to 1. Otherwise, motion prediction is regarded as applied to the target block and ivRefFlag is set to 0.
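The flag derivation reduces to a POC comparison. The sketch below models pictures only by their POC values, which is a simplification assumed for illustration; the function names are hypothetical.

```python
def diff_pic_order_cnt(poc_a, poc_b):
    """POC difference between two pictures, here given directly as POC values."""
    return poc_a - poc_b


def derive_iv_ref_flag(curr_poc, ref_poc):
    """Return 1 (displacement prediction) when the reference has the same POC."""
    # Same POC means the reference lies in another view at the same time.
    return 1 if diff_pic_order_cnt(curr_poc, ref_poc) == 0 else 0
```

A reference picture with a different POC therefore yields ivRefFlag = 0, that is, ordinary motion prediction.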

FIG. 18 is a diagram for explaining the corresponding block rpSamplesLX and the reference block rpRefSamplesLX when the vector mvLX is a motion vector (when the inter-view prediction flag ivRefFlag is 0). As shown in FIG. 18, the corresponding block for the prediction unit on the target layer is located at the position shifted, from the position of the prediction unit in the picture on the reference layer, by the displacement vector mvDisp, which is a vector indicating the positional relationship between the reference layer and the target layer.

FIG. 19 is a diagram for explaining the corresponding block rpSamplesLX and the reference block rpRefSamplesLX when the vector mvLX is a displacement vector (when the inter-view prediction flag ivRefFlag is 1). As shown in FIG. 19, the corresponding block rpSamplesLX is a block on the reference picture rpPic, which has a different time from the target picture and the same view ID as the target picture. The residual prediction unit 3092 derives mvT, the vector of the prediction unit on the picture mvPicT pointed to by the vector mvLX (= displacement vector mvDisp) of the target block. The corresponding block rpSamplesLX is located at the position shifted by the vector mvT from the position of the prediction unit (target block).

(Derivation of reference pictures for residual prediction)
The residual prediction unit 3092 derives the reference pictures rpPic and rpPicRef, which are referred to when deriving the residual prediction motion compensated images (rpSamplesLX, rpRefSamplesLX), and the vectors mvRp and mvRpRef, which indicate the positions of the reference blocks (the coordinates of the reference blocks relative to the coordinates of the target block).

The residual prediction unit 3092 sets, as rpPic, a picture having the same display time (POC) or the same view ID as the target picture to which the target block belongs.

Specifically, when the target block uses motion prediction (when the inter-view prediction flag ivRefFlag is 0), the residual prediction unit 3092 derives the reference picture rpPic from the condition that the POC of rpPic is equal to PicOrderCntVal, the POC of the target picture, and the view ID of rpPic is equal to the reference view ID RefViewIdx[ xP ][ yP ] of the prediction unit (which differs from the view ID of the target picture). Further, the residual prediction unit 3092 sets the displacement vector MvDisp as the vector mvRp of rpPic.

When the target block uses displacement prediction (when the inter-view prediction flag ivRefFlag is 1), the residual prediction unit 3092 sets the reference picture used to generate the predicted image of the target block as rpPic. That is, when the reference index of the target block is RpRefIdxLY and the reference picture list is RefPicListY, the reference picture rpPic is derived from RefPicListY[ RpRefIdxLY ]. The residual prediction unit 3092 further includes a residual prediction vector deriving unit 30924 (not shown). The residual prediction vector deriving unit 30924 derives mvT, the vector of the prediction unit on the picture that is pointed to by the vector mvLX of the target block (which is equal to the displacement vector MvDisp) and that has the same POC as the target picture and a different view ID, and sets that motion vector mvT as the vector mvRp of rpPic.

Next, the residual prediction unit 3092 sets, as rpPicRef, a reference picture having a different display time (POC) and a different view ID from the target picture.

Specifically, when the target block uses motion prediction (when the inter-view prediction flag ivRefFlag is 0), the residual prediction unit 3092 derives the reference picture rpPicRef from the condition that the POC of rpPicRef is equal to the POC of the reference picture RefPicListY[ RpRefIdxLY ] of the target block, and the view ID of rpPicRef is equal to the view ID RefViewIdx[ xP ][ yP ] of the reference picture of the displacement vector MvDisp. Further, the residual prediction unit 3092 sets, as the vector mvRpRef of rpPicRef, the sum (mvRp + mvLX) of the vector mvRp and the vector mvLX obtained by scaling the motion vector of the prediction block.

When the target prediction unit uses displacement prediction (when the inter-view prediction flag ivRefFlag is 1), the residual prediction unit 3092 derives the reference picture rpPicRef from the condition that the POC of rpPicRef is equal to the POC of the reference picture rpPic, and the view ID of rpPicRef is equal to the view ID RefViewIdx[ xP ][ yP ] of the prediction unit. Further, the residual prediction unit 3092 sets, as the vector mvRpRef of rpPicRef, the sum (mvRp + mvLX) of the motion vector mvLX of the prediction block and the vector mvRp.

That is, in the residual prediction unit 3092, mvRp and mvRpRef are derived as follows.

When the inter-view prediction flag ivRefFlag is 0:
mvRp = MvDisp   Formula (B-1)
mvRpRef = mvRp + mvLX ( = mvLX + MvDisp )   Formula (B-2)
When the inter-view prediction flag ivRefFlag is 1:
mvRp = mvT   Formula (B-3)
mvRpRef = mvRp + mvLX ( = mvLX + mvT )   Formula (B-4)
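Formulas (B-1) to (B-4) can be written as a single function. In this sketch vectors are modelled as (x, y) tuples and the function name is an assumption; it only restates the four formulas.

```python
def derive_residual_vectors(iv_ref_flag, mv_lx, mv_disp, mv_t):
    """Return (mvRp, mvRpRef) following Formulas (B-1) to (B-4)."""
    # (B-1) / (B-3): mvRp is MvDisp for motion prediction, mvT otherwise.
    mv_rp = mv_t if iv_ref_flag else mv_disp
    # (B-2) / (B-4): mvRpRef is always mvRp + mvLX.
    mv_rp_ref = (mv_rp[0] + mv_lx[0], mv_rp[1] + mv_lx[1])
    return mv_rp, mv_rp_ref
```

In both cases mvRpRef combines one inter-view component and one temporal component, so it points to a picture that differs from the target picture in both time and view.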

(Residual prediction vector deriving unit 30924)
The residual prediction vector deriving unit 30924 derives the vector mvT of a prediction unit on a picture different from the target picture. The residual prediction vector deriving unit 30924 takes as inputs the reference picture, the target block coordinates (xP, yP), the target block size nPSW, nPSH, and the vector mvLX, and derives the vector mvT and the view ID from the motion compensation parameters (vector, reference picture index, view ID) of the prediction unit on the reference picture. The residual prediction vector deriving unit 30924 derives the reference coordinates (xRef, yRef), as the center coordinates of the block located at the position shifted by the vector mvLX from the target block on the reference picture specified as an input, by the following equations.

xRef = Clip3( 0, PicWidthInSamplesL - 1, xP + ( nPSW >> 1 ) + ( ( mvDisp[ 0 ] + 2 ) >> 2 ) )
yRef = Clip3( 0, PicHeightInSamplesL - 1, yP + ( nPSH >> 1 ) + ( ( mvDisp[ 1 ] + 2 ) >> 2 ) )
The residual prediction vector deriving unit 30924 derives the vector mvLX and the reference picture index refPicLX of refPU, the prediction unit that contains the reference block coordinates (xRef, yRef). When the target prediction unit uses displacement prediction (DiffPicOrderCnt( currPic, refPic ) is 0) and the reference prediction unit refPU uses motion prediction (DiffPicOrderCnt( refPic, refPicListRefX[ refIdxLX ] ) is other than 0), the vector of refPU is set as mvT and the availability flag availFlagT is set to 1. By the above processing, the vector of a block that uses, as its reference picture, a picture having the same POC as the target picture and a different view ID can be derived as mvT.
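The reference coordinate derivation can be sketched as follows. The function names and argument order are assumptions for illustration; the arithmetic follows the two equations above, with the quarter-pel vector rounded to integer precision and the block centre taken at half the block size.

```python
def clip3(lo, hi, v):
    """Clip v into the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))


def reference_center(x_p, y_p, n_psw, n_psh, mv_disp, pic_width, pic_height):
    """Return (xRef, yRef), the clipped centre of the displaced block."""
    x_ref = clip3(0, pic_width - 1,
                  x_p + (n_psw >> 1) + ((mv_disp[0] + 2) >> 2))
    y_ref = clip3(0, pic_height - 1,
                  y_p + (n_psh >> 1) + ((mv_disp[1] + 2) >> 2))
    return x_ref, y_ref
```

The prediction unit that covers (xRef, yRef) is then the one whose motion parameters are inspected.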

The residual prediction vector deriving unit 30924 derives the vector of a prediction unit on a picture different from the target picture. The residual prediction vector deriving unit 30924 takes as inputs the target block coordinates (xP, yP), the target block size nPbW, nPbH, and the displacement vector mvDisp, and derives the following reference block coordinates (xRef, yRef).

xRef = Clip3( 0, PicWidthInSamplesL - 1, xP + ( nPSW >> 1 ) + ( ( mvDisp[ 0 ] + 2 ) >> 2 ) )
yRef = Clip3( 0, PicHeightInSamplesL - 1, yP + ( nPSH >> 1 ) + ( ( mvDisp[ 1 ] + 2 ) >> 2 ) )
The residual prediction vector deriving unit 30924 derives the vector mvLX and the reference picture index refPicLX of refPU, the prediction unit that contains the reference block coordinates (xRef, yRef). When the target prediction unit uses motion prediction (DiffPicOrderCnt( currPic, refPic ) is other than 0) and the reference prediction unit refPU uses displacement prediction (DiffPicOrderCnt( refPic, refPicListRefX[ refIdxLX ] ) is 0), the availability flag availFlagT is set to 1. As a result, the vector of a block that uses, as its reference picture, a picture having the same POC as the target picture and a different view ID can be derived as mvT.

(Reference image interpolation unit 30922)
The reference image interpolation unit 30922 generates the interpolated image of the reference block rpSamplesLX by setting the above vector mvC as the vector mvLX. The pixel at the position obtained by shifting the coordinates (x, y) of a pixel of the interpolated image by the vector mvLX of the prediction unit is derived by linear interpolation (bilinear interpolation). Considering that the vector mvLX has quarter-pel fractional precision, the reference image interpolation unit 30922 derives the X coordinate xInt and the Y coordinate yInt of the integer-precision pixel R0 corresponding to the case where the coordinates of the pixel of the prediction unit are (xPb, yPb), and the fractional part xFrac of the X component and the fractional part yFrac of the Y component of the vector mvLX, by the following Formula (C-1).
xInt = xPb + ( mvLX[ 0 ] >> 2 )
yInt = yPb + ( mvLX[ 1 ] >> 2 )
xFrac = mvLX[ 0 ] & 3
yFrac = mvLX[ 1 ] & 3
Here, X & 3 is an expression that extracts only the lower 2 bits of X.
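The split of a quarter-pel vector into integer and fractional parts in Formula (C-1) can be illustrated with a short sketch. The function name is an assumption; Python integers behave like two's complement values under `>>` and `&`, so negative components are handled the same way as in the formula.

```python
def split_quarter_pel(x_pb, y_pb, mv_lx):
    """Return (xInt, yInt, xFrac, yFrac) for a quarter-pel vector mvLX."""
    # Arithmetic right shift by 2 gives the integer-pel offset (floor).
    x_int = x_pb + (mv_lx[0] >> 2)
    y_int = y_pb + (mv_lx[1] >> 2)
    # The lower two bits give the quarter-pel phase, 0 to 3.
    x_frac = mv_lx[0] & 3
    y_frac = mv_lx[1] & 3
    return x_int, y_int, x_frac, y_frac
```

For a component of -5, the shift yields -2 and the mask yields 3, so that -2 * 4 + 3 reproduces the original value.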

Next, the reference image interpolation unit 30922 generates the interpolated pixel predPartLX[ x ][ y ], taking into account that the vector mvLX has quarter-pel fractional precision. First, the coordinates of the integer pixels A(xA, yA), B(xB, yB), C(xC, yC), and D(xD, yD) are derived by the following Formula (C-2).
xA = Clip3( 0, picWidthInSamples - 1, xInt )
xB = Clip3( 0, picWidthInSamples - 1, xInt + 1 )
xC = Clip3( 0, picWidthInSamples - 1, xInt )
xD = Clip3( 0, picWidthInSamples - 1, xInt + 1 )
yA = Clip3( 0, picHeightInSamples - 1, yInt )
yB = Clip3( 0, picHeightInSamples - 1, yInt )
yC = Clip3( 0, picHeightInSamples - 1, yInt + 1 )
yD = Clip3( 0, picHeightInSamples - 1, yInt + 1 )
Here, the integer pixel A is the pixel corresponding to the pixel R0, and the integer pixels B, C, and D are the integer-precision pixels adjacent to the right of, below, and to the lower right of the integer pixel A, respectively. The reference image interpolation unit 30922 reads the reference pixels refPicLX[ xA ][ yA ], refPicLX[ xB ][ yB ], refPicLX[ xC ][ yC ], and refPicLX[ xD ][ yD ] corresponding to the integer pixels A, B, C, and D from the reference picture memory 306.

The reference image interpolation unit 30922 then uses the reference pixels refPicLX[ xA ][ yA ], refPicLX[ xB ][ yB ], refPicLX[ xC ][ yC ], and refPicLX[ xD ][ yD ], together with the fractional part xFrac of the X component and the fractional part yFrac of the Y component of the vector mvLX, to derive the interpolated pixel predPartLX[ x ][ y ] by linear interpolation (bilinear interpolation). This interpolated pixel is the pixel at the position shifted from pixel R0 by the fractional part of the vector mvLX. Specifically, it is derived by the following (Formula C-3):
predPartLX[ x ][ y ] = ( refPicLX[ xA ][ yA ] * ( 8 - xFrac ) * ( 8 - yFrac )
+ refPicLX[ xB ][ yB ] * ( 8 - yFrac ) * xFrac
+ refPicLX[ xC ][ yC ] * ( 8 - xFrac ) * yFrac
+ refPicLX[ xD ][ yD ] * xFrac * yFrac ) >> 6
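
The two formulas above can be sketched in Python as follows. This is a minimal illustration, not the apparatus itself: the function names and the list-of-lists picture indexed as ref_pic[x][y] are assumptions made for the example, and xFrac and yFrac are taken as given inputs. Note that the four weights in (Formula C-3) sum to 64 for any xFrac and yFrac, which is why the result is normalized with a right shift by 6.

```python
def clip3(lo, hi, v):
    """Clamp v to the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))


def bilinear_sample(ref_pic, x_int, y_int, x_frac, y_frac):
    """Interpolate one sample from the four integer pixels A, B, C, D.

    ref_pic is a hypothetical picture buffer indexed as ref_pic[x][y];
    x_frac and y_frac are the fractional phases used as weights.
    """
    width, height = len(ref_pic), len(ref_pic[0])
    # (Formula C-2): clamp the four neighbour coordinates to the picture.
    x_a = x_c = clip3(0, width - 1, x_int)
    x_b = x_d = clip3(0, width - 1, x_int + 1)
    y_a = y_b = clip3(0, height - 1, y_int)
    y_c = y_d = clip3(0, height - 1, y_int + 1)
    # (Formula C-3): weighted sum of A, B, C, D, normalized by >> 6.
    acc = (ref_pic[x_a][y_a] * (8 - x_frac) * (8 - y_frac)
           + ref_pic[x_b][y_b] * (8 - y_frac) * x_frac
           + ref_pic[x_c][y_c] * (8 - x_frac) * y_frac
           + ref_pic[x_d][y_d] * x_frac * y_frac)
    return acc >> 6
```

Because the coordinates are clamped before the read, a vector pointing outside the picture simply repeats the nearest edge pixel.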

In the above description, the interpolated pixel is derived by one-step bilinear interpolation using the four pixels around the target pixel. Alternatively, the horizontal linear interpolation and the vertical linear interpolation may be separated, and the residual prediction interpolated image may be generated by two-step linear interpolation.

The reference image interpolation unit 30922 performs the above interpolated pixel derivation process for each pixel in the prediction unit, and takes the set of interpolated pixels as the interpolation block predPartLX. The reference image interpolation unit 30922 outputs the derived interpolation block predPartLX to the residual synthesis unit 30923 as the corresponding block rpSamplesLX.

The reference image interpolation unit 30922 derives the reference block rpRefSamplesLX by performing the same processing as that used to derive the corresponding block rpSamplesLX, except that the displacement vector mvLX is replaced with the vector mvR. The reference image interpolation unit 30922 outputs the reference block rpRefSamplesLX to the residual synthesis unit 30923.

(Residual synthesis unit 30923)
When the residual prediction execution flag resPredFlag is 1, the residual synthesis unit 30923 derives a residual from the difference between the two residual prediction motion compensated images (rpSamplesLX, rpRefSamplesLX), and derives the predicted image by adding this residual to the motion compensated image. Specifically, the residual synthesis unit 30923 derives a corrected predicted image predSamplesLX´ from the predicted image predSamplesLX, the corresponding block rpSamplesLX, the reference block rpRefSamplesLX, and the residual prediction index iv_res_pred_weight_idx. The corrected predicted image predSamplesLX´ is obtained by
predSamplesLX´[x][y] = predSamplesLX[x][y] + ((rpSamplesLX[x][y] - rpRefSamplesLX[x][y]) >> (iv_res_pred_weight_idx - 1))
where x ranges from 0 to the width of the prediction block minus 1, and y ranges from 0 to the height of the prediction block minus 1. When the residual prediction execution flag resPredFlag is 0, the residual synthesis unit 30923 outputs the predicted image predSamplesLX unchanged, as in the following formula.

predSamplesLX´[x][y] = predSamplesLX[x][y]
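
As an illustration of the residual synthesis step, the following Python sketch applies the formula for predSamplesLX´ to whole blocks. The function name, the argument names, and the list-of-lists blocks indexed as block[x][y] are assumptions for this example. The guard on the weight index is also an assumption, added because a negative shift count is undefined, and negative differences are shifted arithmetically, which is how Python's >> operator behaves.

```python
def residual_synthesis(pred, rp, rp_ref, res_pred_flag, weight_idx):
    """Return the corrected prediction predSamplesLX' as a new block.

    pred, rp and rp_ref are equally sized blocks indexed as block[x][y];
    weight_idx stands for iv_res_pred_weight_idx.
    """
    if not res_pred_flag:
        # resPredFlag == 0: the prediction passes through unchanged.
        return [col[:] for col in pred]
    if weight_idx < 1:
        # Assumption: residual prediction is only used with an index >= 1.
        raise ValueError("weight_idx must be >= 1 when res_pred_flag is set")
    shift = weight_idx - 1
    return [
        [p + ((a - b) >> shift) for p, a, b in zip(p_col, a_col, b_col)]
        for p_col, a_col, b_col in zip(pred, rp, rp_ref)
    ]
```

With weight_idx equal to 1 the full residual is added, and with weight_idx equal to 2 the residual is halved before it is added.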
(Illumination compensation)
When the illumination compensation flag ic_flag is 1, the illumination compensation unit 3093 performs illumination compensation on the input predicted image predSamplesLX. When the illumination compensation flag ic_flag is 0, the input predicted image predSamplesLX is output as it is.

(Weighted prediction)
In the case of uni-prediction (predFlagL0 = 1 / predFlagL1 = 0 or predFlagL0 = 0 / predFlagL1 = 1), the weighted prediction unit 3096 derives the predicted image predSamples from the L0 motion compensated image predSamplesL0 or the L1 motion compensated image predSamplesL1. Specifically, the prediction from L0 and the prediction from L1 are derived using the following formulas, respectively.

predSamples[ x ][ y ] = Clip3( 0, ( 1 << bitDepth ) - 1, predSamplesL0[ x ][ y ] * w0 + o0 )
predSamples[ x ][ y ] = Clip3( 0, ( 1 << bitDepth ) - 1, predSamplesL1[ x ][ y ] * w1 + o1 )
Here, w0, w1, o0, and o1 are the weights and offsets encoded in the parameter set, and bitDepth is a value indicating the bit depth. In the case of bi-prediction (predFlagL0 = 1 / predFlagL1 = 1), the weighted prediction unit 3096 generates the predicted image by weighted prediction from the L0 motion compensated image predSamplesL0 and the L1 motion compensated image predSamplesL1.

predSamples[ x ][ y ] = Clip3( 0, ( 1 << bitDepth ) - 1, ( predSamplesL0[ x ][ y ] * w0 + predSamplesL1[ x ][ y ] * w1 +
( ( o0 + o1 + 1 ) << log2Wd ) ) >> ( log2Wd + 1 ) )
Here, w0, w1, o0, o1, and log2Wd are the weights, offsets, and shift value encoded in the parameter set, and bitDepth is a value indicating the bit depth.
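
The weighted prediction formulas can be sketched in Python as below. The function names and the block[x][y] list-of-lists layout are assumptions for illustration. Clipping the bi-predictive result to the sample range is likewise an assumption, made by analogy with the uni-predictive formulas and with the reference to bitDepth in the text.

```python
def clip3(lo, hi, v):
    """Clamp v to the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))


def weighted_uni(pred, w, o, bit_depth):
    """Uni-prediction: scale each sample by w, add offset o, then clip."""
    max_val = (1 << bit_depth) - 1
    return [[clip3(0, max_val, s * w + o) for s in col] for col in pred]


def weighted_bi(pred_l0, pred_l1, w0, w1, o0, o1, log2_wd, bit_depth):
    """Bi-prediction: weighted sum of the L0 and L1 blocks with rounding."""
    max_val = (1 << bit_depth) - 1
    # Combined offset and rounding term, scaled to the weight precision.
    rnd = (o0 + o1 + 1) << log2_wd
    return [
        [clip3(0, max_val, (a * w0 + b * w1 + rnd) >> (log2_wd + 1))
         for a, b in zip(col0, col1)]
        for col0, col1 in zip(pred_l0, pred_l1)
    ]
```

With w0 = w1 = 1 << log2_wd and zero offsets, the bi-predictive case reduces to a rounded average of the two blocks.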

(DBBP prediction)
When the DBBP mode flag dbbp_flag is 1, the DBBP prediction unit 3095 generates the predicted image predSamples by depth-based block partitioning (DBBP). Depth-based block partitioning divides the target block into two regions (region 1 and region 2) based on the segmentation of the depth image corresponding to the target block. It derives an interpolated image for region 1 (hereinafter predSamplesA), an interpolated image for region 2 (hereinafter predSamplesB), and a segmentation indicating the region division, and then combines the two interpolated images according to the segmentation to generate one interpolated image (the predicted image).

In DBBP, because a suitable division is made based on the depth image, the partition mode (PartMode) decoded from the encoded data differs from the division actually applied (the segmentation). Prediction parameters such as motion vectors decoded from the encoded data are used to derive the prediction parameters of subsequent prediction units and of prediction units in other pictures. It is therefore appropriate to store the prediction parameters applied to DBBP using a partition mode that is as close as possible to the actual division (segmentation). Accordingly, the DBBP prediction unit 3095 derives the partition mode PartMode from the depth image in the DBBP partition mode derivation unit 30954 described later, and replaces the partition mode obtained by decoding with the derived partition mode PartMode. The prediction parameters decoded by the inter prediction parameter decoding control unit 3031 are stored in the prediction parameter memory 307 according to the replaced partition mode PartMode. Note that the segmentation denotes a division in units of pixels, whereas the partition mode denotes a division in predetermined rectangular units.

FIG. 1 is a block diagram illustrating the configuration of the DBBP prediction unit 3095 according to the embodiment of the present invention. The DBBP prediction unit 3095 includes a DBBP image interpolation unit 30951, a segmentation unit 30952, an image synthesis unit 30953, and a DBBP partition mode derivation unit 30954. Note that the processing of the segmentation unit 30952 and the DBBP partition mode derivation unit 30954 may be performed by the prediction parameter decoding unit 302 instead of the predicted image generation unit 101.

(DBBP image interpolation unit 30951)
For each reference picture list L0 or L1, the DBBP image interpolation unit 30951 generates two interpolated images (predSamplesA, predSamplesB) by bilinear interpolation based on the two vectors input to the DBBP prediction unit 3095. The bilinear interpolation performed by the DBBP image interpolation unit 30951 is the same as that of the ARP reference image interpolation unit 30922. First, the integer positions xInt, yInt and the phases xFrac, yFrac are derived from the motion vector mvLX by (Formula C-1). Next, the positions of the four pixels refPicLX[ xA ][ yA ], refPicLX[ xB ][ yB ], refPicLX[ xC ][ yC ], and refPicLX[ xD ][ yD ] are derived by (Formula C-2). Finally, the pixel is interpolated from the four points by (Formula C-3) according to the phases xFrac and yFrac. Note that the ARP reference image interpolation unit 30922 and the DBBP image interpolation unit 30951 may share a common image interpolation unit that performs bilinear interpolation.

Note that the two interpolated images are distinguished by partIdx, which takes the value 0 or 1. That is, the DBBP image interpolation unit 30951 derives the interpolated image predSamples for partIdx = 0 or the interpolated image predSamples for partIdx = 1.

The segmentation unit 30952 derives the segmentation information segMask from the depth block corresponding to the target block, which is input from the reference picture memory 306. The depth block corresponding to the target block is the block whose upper-left coordinates are (xP + mvDisp[0], yP + mvDisp[1]) on the depth picture that has the same POC as the picture being decoded and the same view ID as the view ID (RefViewIdx) of the reference picture indicated by the displacement vector MvDisp. Here, (xP, yP) are the coordinates of the target block, and MvDisp is the displacement vector of the target block.

Specifically, the segmentation unit 30952 derives a representative value threshVal of the pixel values of the depth block, and derives segMask[][] as 1 where a pixel value of the depth block is larger than the representative value threshVal and as 0 where it is less than or equal to threshVal.

More specifically, the segmentation unit 30952 derives the representative value threshVal by accumulating the sum sumRefVals of the depth pixels refSamples[ x ][ y ] over each pixel of the depth block (each x, y with x = 0..nCbSL-1, y = 0..nCbSL-1) as in the following formula, and then right-shifting the sum by a value corresponding to the base-2 logarithm of the depth block size.

sumRefVals += refSamples[ x ][ y ]
threshVal = ( sumRefVals >> ( 2 * log2( nTbS ) ) )
For each pixel of the depth block (each x, y with x = 0..nCbSL-1, y = 0..nCbSL-1), the segmentation unit 30952 derives segMask[][] by the following formula, according to whether the depth pixel refSamples[ x ][ y ] exceeds the representative value threshVal.

segMask[ x ][ y ] = ( refSamples[ x ][ y ] > threshVal )
segMask[ x ][ y ] is a block that has the same size as the target block and in which each pixel value is 0 or 1.
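
The threshold and mask derivation can be sketched in Python as follows. The function name and the block[x][y] list-of-lists layout are assumptions for the example, and the sketch assumes a square depth block whose side is a power of two, so that the right shift by twice the base-2 logarithm of the side is a division by the number of pixels.

```python
def derive_seg_mask(ref_samples):
    """Return (thresh_val, seg_mask) for a square depth block.

    ref_samples is indexed as ref_samples[x][y]; its side length is
    assumed to be a power of two.
    """
    n = len(ref_samples)
    if n == 0 or n & (n - 1):
        raise ValueError("block size must be a power of two")
    log2_n = n.bit_length() - 1
    # Sum of all depth samples, then a shift that divides by n * n.
    sum_ref_vals = sum(sum(col) for col in ref_samples)
    thresh_val = sum_ref_vals >> (2 * log2_n)
    # 1 where the depth sample exceeds the mean, 0 elsewhere.
    seg_mask = [[1 if v > thresh_val else 0 for v in col]
                for col in ref_samples]
    return thresh_val, seg_mask
```

For a block that is half near and half far, the mean falls between the two depth levels and the mask separates them cleanly.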

For each reference picture list L0 or L1, the image synthesis unit 30953 derives the interpolated image predSamplesLX from the segMask derived by the segmentation unit 30952 and the two interpolated images (predSamplesA, predSamplesB) derived by the DBBP image interpolation unit 30951. The derived interpolated images predSamplesL0 and predSamplesL1 are output to the weighted prediction unit 3096.

FIG. 22 is a diagram illustrating the image synthesis unit 30953. Based on the segmentation information segMask, the image synthesis unit 30953 selects one of the two interpolated images for each pixel, and then applies filtering to derive the predicted image predSamplesLX (here, PredSamplesDbbp).

Specifically, the interpolated image predSamples is set in PredSamplesDbbp according to the value of partIdx, which distinguishes the two interpolated images, and the value of the segmentation information segMask[x][y].

The image synthesis unit 30953 may select pixels from the two interpolated images based on the upper-left pixel segMask[0][0] of segMask. In this case, the image synthesis unit 30953 selects pixels from the interpolated image of each partIdx according to whether that partIdx differs from segMask[0][0], as follows. The image synthesis unit 30953 derives a flag curSegmentFlag, indicating whether the values of partIdx and segMask[0][0] differ, by the following expression.

curSegmentFlag = ( partIdx != segMask[ 0 ][ 0 ] )
For pixels whose segMask[ x ][ y ] is equal to curSegmentFlag, the interpolated image predSamples of the current partIdx is assigned to the interpolated image predSamplesLX. For example, when partIdx is 0 and the segmentation information segMask[0][0] of the upper-left pixel of the block is 0, curSegmentFlag is ( partIdx != segMask[ 0 ][ 0 ] ) = ( 0 != 0 ) = 0, so the interpolated image predSamples of partIdx = 0 is set in PredSamplesDbbp for each pixel whose segmentation information segMask[ x ][ y ] is 0. When partIdx is 0 and segMask[0][0] is 1, curSegmentFlag is ( 0 != 1 ) = 1, so the interpolated image predSamples of partIdx = 0 is set in PredSamplesDbbp for each pixel whose segmentation information segMask[ x ][ y ] is 1.

In summary, the image synthesis unit 30953 assigns the interpolated image predSamples to the predicted image PredSamplesDbbp according to the segmentation information segMask[x][y] of each pixel, as in the following pseudocode.

for ( y = 0; y < nCbS_L; y++ ) {
  for ( x = 0; x < nCbS_L; x++ ) {
    if ( segMask[ x ][ y ] == ( partIdx != segMask[ 0 ][ 0 ] ) )
      PredSamplesDbbp_L[ x ][ y ] = predSamples_L[ x ][ y ]
    if ( ( x % 2 == 0 ) && ( y % 2 == 0 ) ) {
      PredSamplesDbbp_Cb[ x / 2 ][ y / 2 ] = predSamples_Cb[ x / 2 ][ y / 2 ]
      PredSamplesDbbp_Cr[ x / 2 ][ y / 2 ] = predSamples_Cr[ x / 2 ][ y / 2 ]
    }
  }
}
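
The per-pixel selection can be illustrated with the following Python sketch, which covers the luma assignment only and runs the selection once for each partIdx. The function name and the pred_by_part argument (a pair holding the interpolated images for partIdx 0 and partIdx 1, each indexed as block[x][y]) are hypothetical names introduced for this example.

```python
def compose_dbbp_luma(seg_mask, pred_by_part):
    """Merge two interpolated luma blocks according to seg_mask.

    pred_by_part[0] and pred_by_part[1] are the interpolated images for
    partIdx 0 and partIdx 1; all blocks are indexed as block[x][y].
    """
    n = len(seg_mask)
    out = [[None] * n for _ in range(n)]
    for part_idx in (0, 1):
        # 0 when this partition matches the top-left segment, 1 otherwise.
        cur_segment_flag = int(part_idx != seg_mask[0][0])
        for y in range(n):
            for x in range(n):
                if seg_mask[x][y] == cur_segment_flag:
                    out[x][y] = pred_by_part[part_idx][x][y]
    return out
```

In this sketch the partIdx = 0 image always supplies the segment that contains the upper-left pixel, whichever mask value that pixel has, and every output pixel is written exactly once.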
The image synthesis unit 30953 may further apply a filter to each pixel according to the segmentation information segMask[x][y]. Specifically, the image synthesis unit 30953 refers to the segmentation information cFlag (= segMask[x][y]) of the target pixel, the segmentation information lFlag (= segMask[x-1][y]) of the pixel adjacent on the left, and the segmentation information rFlag (= segMask[x+1][y]) of the pixel adjacent on the right. When they differ (cFlag != lFlag || cFlag != rFlag), it applies a filter with weights 1:2:1 to the left pixel p[x-1][y], the target pixel p[x][y], and the right pixel p[x+1][y]. Next, the image synthesis unit 30953 refers to the segmentation information cFlag (= segMask[x][y]) of the target pixel, the segmentation information tFlag (= segMask[x][y-1]) of the pixel adjacent above, and the segmentation information bFlag (= segMask[x][y+1]) of the pixel adjacent below. When they differ (cFlag != tFlag || cFlag != bFlag), it applies a filter with weights 1:2:1 to the upper pixel p[x][y-1], the target pixel p[x][y], and the lower pixel p[x][y+1]. The pseudocode for this process is as follows.

for ( y = 0; y < nCbS_X; y++ )
  for ( x = 0; x < nCbS_X; x++ ) {
    tFlag = segMask[ n * x ][ Max( 0, n * ( y - 1 ) ) ]
    lFlag = segMask[ Max( 0, n * ( x - 1 ) ) ][ n * y ]
    bFlag = segMask[ n * x ][ Min( n * ( y + 1 ), nCbS_L - 1 ) ]
    rFlag = segMask[ Min( n * ( x + 1 ), nCbS_L - 1 ) ][ n * y ]
    cFlag = segMask[ n * x ][ n * y ]
    filt = p[ x ][ y ]
    if ( ( lFlag || cFlag || rFlag ) && ( !lFlag || !cFlag || !rFlag ) )
      filt = ( p[ Max( 0, x - 1 ) ][ y ] + ( filt << 1 ) + p[ Min( x + 1, nCbS_X - 1 ) ][ y ] ) >> 2
    if ( ( tFlag || cFlag || bFlag ) && ( !tFlag || !cFlag || !bFlag ) )
      filt = ( p[ x ][ Max( 0, y - 1 ) ] + ( filt << 1 ) + p[ x ][ Min( y + 1, nCbS_X - 1 ) ] ) >> 2
    predSamples[ x ][ y ] = filt
  }
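
The boundary filter can be sketched in Python as shown below. The function name and the block[x][y] list-of-lists layout are assumptions for illustration. The factor n is not defined in the surrounding text; the sketch assumes it is the ratio between the luma-sized segMask and the component being filtered (1 for luma, 2 for subsampled chroma). The test "not all three flags equal" used here is equivalent, for binary flags, to the condition written in the pseudocode.

```python
def filter_boundaries(p, seg_mask, n=1):
    """Apply the 1:2:1 filter where the segment changes between neighbours.

    p is the composed block for one component and seg_mask is the
    luma-sized mask, both indexed as block[x][y]. n is assumed to be the
    subsampling factor between seg_mask and p.
    """
    size = len(p)               # nCbS_X
    mask_size = len(seg_mask)   # nCbS_L
    out = [[0] * size for _ in range(size)]
    for y in range(size):
        for x in range(size):
            t = seg_mask[n * x][max(0, n * (y - 1))]
            l = seg_mask[max(0, n * (x - 1))][n * y]
            b = seg_mask[n * x][min(n * (y + 1), mask_size - 1)]
            r = seg_mask[min(n * (x + 1), mask_size - 1)][n * y]
            c = seg_mask[n * x][n * y]
            filt = p[x][y]
            if not (l == c == r):
                # Horizontal 1:2:1 filter across a vertical segment edge.
                filt = (p[max(0, x - 1)][y] + (filt << 1)
                        + p[min(x + 1, size - 1)][y]) >> 2
            if not (t == c == b):
                # Vertical 1:2:1 filter, applied to the horizontal result.
                filt = (p[x][max(0, y - 1)] + (filt << 1)
                        + p[x][min(y + 1, size - 1)]) >> 2
            out[x][y] = filt
    return out
```

The result is written to a separate block, so each filtered value is computed from unfiltered neighbours, as in the pseudocode.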
The DBBP partition mode derivation unit 30954 derives the partition mode partMode from the depth block refSamples corresponding to the target block. Specifically, based on the segmentation information segMask derived from the depth block refSamples, it derives, over each x, y (x = 0..nCbS-1, y = 0..nCbS-1) of the target block, the sums partSum[0], partSum[1], partSum[2], partSum[3], partSum[4], and partSum[5]. These sums correspond to the partition modes PartMode that divide 2N×2N into two parts, namely N×2N (SIZE_Nx2N), 2N×N (SIZE_2NxN), 2N×nU (SIZE_2NxnU), 2N×nD (SIZE_2NxnD), nL×2N (SIZE_nLx2N), and nR×2N (SIZE_nRx2N), and are derived based on the following pseudocode.

for ( y = 0; y < nCbS; y++ )
  for ( x = 0; x < nCbS; x++ ) {
    segFlag = segMask[ x ][ y ]
    partSum[ 0 ][ ( x < ( nCbS >> 1 ) ) ? segFlag : !segFlag ]++
    partSum[ 1 ][ ( y < ( nCbS >> 1 ) ) ? segFlag : !segFlag ]++
    if ( nCbS > 8 ) {
      partSum[ 2 ][ ( y < ( nCbS >> 2 ) ) ? segFlag : !segFlag ]++
      partSum[ 3 ][ ( y < ( ( nCbS >> 2 ) + ( nCbS >> 1 ) ) ) ? segFlag : !segFlag ]++
      partSum[ 4 ][ ( x < ( nCbS >> 2 ) ) ? segFlag : !segFlag ]++
      partSum[ 5 ][ ( x < ( ( nCbS >> 2 ) + ( nCbS >> 1 ) ) ) ? segFlag : !segFlag ]++
    }
  }
Further, the DBBP partition mode derivation unit 30954 finds the p for which partSum[p] is largest over p = 0..5 and derives the corresponding partition mode, where p = 0..5 correspond to SIZE_Nx2N, SIZE_2NxN, SIZE_2NxnU, SIZE_2NxnD, SIZE_nLx2N, and SIZE_nRx2N, respectively. For example, when partSum[p] is largest at p = 0 (partIdc = 0), SIZE_Nx2N is derived as the partition mode.

partIdc = 0
maxPartSum = 0
for ( p = 0; p < 6; p++ )
  for ( i = 0; i < 2; i++ ) {
    if ( partSum[ p ][ i ] > maxPartSum ) {
      maxPartSum = partSum[ p ][ i ]
      partIdc = p
    }
  }
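
The two pseudocode fragments for the partition mode derivation can be combined into the following Python sketch. The function name, the PART_MODES tuple, and the block[x][y] list-of-lists layout are assumptions for illustration. The three-quarter boundary is computed as (nCbS >> 2) + (nCbS >> 1), which is an interpretation of the expression in the pseudocode.

```python
PART_MODES = ("SIZE_Nx2N", "SIZE_2NxN", "SIZE_2NxnU",
              "SIZE_2NxnD", "SIZE_nLx2N", "SIZE_nRx2N")


def derive_part_mode(seg_mask):
    """Pick the rectangular partition that best matches seg_mask.

    seg_mask is a square block of 0/1 values indexed as seg_mask[x][y].
    """
    n = len(seg_mask)
    half, quarter = n >> 1, n >> 2
    # Assumed reading of "nCbS >> 2 + nCbS >> 1": three quarters of n.
    three_quarters = quarter + half
    part_sum = [[0, 0] for _ in range(6)]
    for y in range(n):
        for x in range(n):
            seg = seg_mask[x][y]
            inv = 1 - seg
            part_sum[0][seg if x < half else inv] += 1
            part_sum[1][seg if y < half else inv] += 1
            if n > 8:
                # Asymmetric candidates are only counted for larger blocks.
                part_sum[2][seg if y < quarter else inv] += 1
                part_sum[3][seg if y < three_quarters else inv] += 1
                part_sum[4][seg if x < quarter else inv] += 1
                part_sum[5][seg if x < three_quarters else inv] += 1
    part_idc, max_part_sum = 0, 0
    for p in range(6):
        for i in range(2):
            if part_sum[p][i] > max_part_sum:
                max_part_sum = part_sum[p][i]
                part_idc = p
    return PART_MODES[part_idc]
```

Each counter records how many pixels agree with one labelling of a candidate rectangle split, so a mask that coincides exactly with a candidate reaches the full pixel count for that candidate.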
According to the DBBP prediction unit 3095 configured as above, the DBBP image interpolation unit 30951 generates the two interpolated images used for DBBP synthesis by bilinear prediction. This greatly reduces the amount of processing and data transfer compared with generating the interpolated images of a DBBP prediction unit with the 8-tap or 4-tap filters used in the ordinary motion displacement compensation unit 3091. According to the inventors' estimate, when motion compensation is performed by the motion displacement compensation unit 3091 using 8 taps for luma and 4 taps for chroma, DBBP motion compensation requires, relative to motion compensation for bi-prediction of an 8×8 block, up to 170% of the multiplications, up to 170% of the additions, and up to 119% of the memory bandwidth, which is a very high complexity. In contrast, by using bilinear prediction for DBBP prediction units, the worst-case complexity of DBBP can be reduced to at most 44% in multiplications, at most 26% in additions, and 70% in memory bandwidth. Furthermore, according to the inventors' experiments, the loss in coding efficiency caused by using this bilinear interpolation is 0.00% averaged over eight sequences, so the amount of processing can be reduced without lowering coding efficiency.

According to the DBBP prediction unit 3095 configured as above, the segmentation unit 30952 derives segmentation information segMask that takes the value 0 or 1 for each pixel, and the image synthesis unit 30953 performs synthesis by selecting, at each pixel of the target block, one of the two motion compensated images based on the segmentation information segMask. This reduces the processing of the image synthesis unit 30953 compared with a case in which some pixels are synthesized by weighting the two motion compensated images, for example with a weight of 1/2. It also greatly reduces the processing of the image synthesis unit 30953 compared with synthesis that refers not only to the segmentation information segMask[x][y] corresponding to each pixel (x, y) but also to the segmentation information above, below, left, and right, namely segMask[x][y-1], segMask[x][y+1], segMask[x-1][y], and segMask[x+1][y]. Note that the DBBP partition mode derivation unit 30954 of the DBBP prediction unit 3095 may use different processing. For example, the DBBP partition mode derivation unit 30954A or the DBBP partition mode derivation unit 30954B, described later, may be used.

(DBBP prediction unit 3095A)
A DBBP prediction unit 3095A, which is another configuration of the DBBP prediction unit 3095, is described below. The DBBP prediction unit 3095A includes a DBBP image interpolation unit 30951, a segmentation unit 30952, an image synthesis unit 30953, and a DBBP partition mode derivation unit 30954A. The DBBP prediction unit 3095A basically has the same configuration as the DBBP prediction unit 3095, but includes the DBBP partition mode derivation unit 30954A instead of the DBBP partition mode derivation unit 30954. The DBBP image interpolation unit 30951 and the segmentation unit 30952 have already been described, so their description is omitted.

The DBBP partition mode derivation unit 30954A derives either SIZE_Nx2N or SIZE_2NxN as the partition mode partMode from the depth block refSamples corresponding to the target block. Specifically, based on the segmentation information segMask derived from the depth block refSamples, it derives, over each x, y (x = 0..nCbS-1, y = 0..nCbS-1) of the target block, the sums partSum[0] and partSum[1] corresponding to the partition modes SIZE_Nx2N and SIZE_2NxN, based on the following pseudocode.

for( y = 0; y < nCbS ; y ++ )
for( x = 0; x < nCbS ; x ++ ) {
segFlag = segMask[ x ][ y ]
partSum[ 0 ][ ( x < ( nCbS >> 1 ) ) ? segFlag : !segFlag ]++
partSum[ 1 ][ ( y < ( nCbS >> 1 ) ) ? segFlag : !segFlag ]++
}
さらに、ＤＢＢＰ分割モード導出部３０９５４は、p=0..１についてpartSum[p]が最大となるpを導出し、そのpに対応する分割モードを、p=0..１がSIZE_Nx2N、SIZE_2NxNに対応するとして導出する。for (y = 0; y <nCbS; y ++)
for (x = 0; x <nCbS; x ++) {
segFlag = segMask [x] [y]
partSum [0] [(x <(nCbS >> 1))? segFlag:! segFlag] ++
partSum [1] [(y <(nCbS >> 1))? segFlag:! segFlag] ++
}
Further, the DBBP partition mode deriving unit 30954 derives p that maximizes partSum [p] for p = 0.0.1, and sets the partition mode corresponding to p to SIZE_Nx2N and SIZE_2NxN. Derived as corresponding.

partIdc = 0
maxPartSum = 0
for( p = 0; p < 2; p++ )
for( i = 0; i < 2; i++ ) {
if( partSum[ p ][ i ] > maxPartSum ) {
maxPartSum = partSum[ p ][ i ]
partIdc = p
}
}
Unlike the DBBP prediction unit 3095, the DBBP prediction unit 3095A does not derive asymmetric partitions (AMP partitions: SIZE_2NxnU, SIZE_2NxnD, SIZE_nLx2N, SIZE_nRx2N) as the partition mode, and targets only the two symmetric partition modes N×2N and 2N×N.
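As a rough illustration, the accumulation of partSum and the selection of partIdc described for the DBBP prediction unit 3095A can be combined into one Python function. This is a minimal sketch, not the normative process: the function name derive_part_mode, the list-of-columns layout of seg_mask (indexed seg_mask[x][y]), and the string return values are assumptions made for this example, and the sums are assumed to start at zero.

```python
def derive_part_mode(seg_mask, n_cbs):
    """Return 'SIZE_Nx2N' or 'SIZE_2NxN' for an n_cbs x n_cbs block.

    seg_mask[x][y] is the binary segmentation flag (0 or 1) of pixel (x, y).
    The name and data layout are hypothetical, chosen for this sketch.
    """
    half = n_cbs >> 1
    # part_sum[p][i]: p = 0 tests the left/right (Nx2N) split,
    # p = 1 tests the upper/lower (2NxN) split.
    part_sum = [[0, 0], [0, 0]]
    for y in range(n_cbs):
        for x in range(n_cbs):
            seg_flag = seg_mask[x][y]
            part_sum[0][seg_flag if x < half else 1 - seg_flag] += 1
            part_sum[1][seg_flag if y < half else 1 - seg_flag] += 1

    # Pick the p whose sum is largest; ties keep the first candidate.
    part_idc = 0
    max_part_sum = 0
    for p in range(2):
        for i in range(2):
            if part_sum[p][i] > max_part_sum:
                max_part_sum = part_sum[p][i]
                part_idc = p
    return ("SIZE_Nx2N", "SIZE_2NxN")[part_idc]


# 4x4 mask whose left half is segment 1: agrees with a left/right split.
left_right = [[1] * 4 if x < 2 else [0] * 4 for x in range(4)]
# 4x4 mask whose upper half is segment 1: agrees with an upper/lower split.
top_bottom = [[1, 1, 0, 0] for _ in range(4)]
```

Because the comparison uses a strict greater-than, a block whose mask matches neither split better than the other falls back to SIZE_Nx2N in this sketch.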

According to the DBBP prediction unit 3095A configured as above, the DBBP partition mode deriving unit 30954A uses only N×2N or 2N×N as the partition mode, so the processing amount is reduced compared with the case where AMP partitions are also targeted. That is, since only the sums partSum[ 0 ] and partSum[ 1 ] for the two partition modes need to be derived, the processing amount can be reduced to one third of that required to derive the sums partSum[ 0 ], partSum[ 1 ], partSum[ 2 ], partSum[ 3 ], partSum[ 4 ], and partSum[ 5 ] for six partition modes.

(DBBP prediction unit 3095B)
Hereinafter, a DBBP prediction unit 3095B, which is another configuration of the DBBP prediction unit 3095, will be described. The DBBP prediction unit 3095B basically has the same configuration as the DBBP prediction unit 3095, but includes a DBBP partition mode deriving unit 30954B instead of the DBBP partition mode deriving unit 30954.

The DBBP partition mode deriving unit 30954B determines the partition mode PartMode by referring only to the four corner pixels of the target block, shown in FIG. 13, in the refSamples (hereinafter referred to as refDepPels) corresponding to the target block.

Specifically, the coordinates xP0, xP1, yP0, and yP1 that make up the upper-left coordinates (xP0, yP0), the upper-right coordinates (xP1, yP0), the lower-left coordinates (xP0, yP1), and the lower-right coordinates (xP1, yP1) are derived by the following equations.

xP0 = Clip3( 0, pic_width_in_luma_samples - 1, x_TL )
yP0 = Clip3( 0, pic_height_in_luma_samples - 1, y_TL )
xP1 = Clip3( 0, pic_width_in_luma_samples - 1, x_TL + nPSW - 1 )
yP1 = Clip3( 0, pic_height_in_luma_samples - 1, y_TL + nPSH - 1 )
Further, the split flag horSplitFlag is derived by the following equation from a comparison between the upper-left pixel TL and the lower-right pixel BR ( refDepPels[ xP0 ][ yP0 ] < refDepPels[ xP1 ][ yP1 ] ) and a comparison between the upper-right pixel TR and the lower-left pixel BL ( refDepPels[ xP1 ][ yP0 ] < refDepPels[ xP0 ][ yP1 ] ).

horSplitFlag = ( ( refDepPels[ xP0 ][ yP0 ] < refDepPels[ xP1 ][ yP1 ] ) == ( refDepPels[ xP1 ][ yP0 ] < refDepPels[ xP0 ][ yP1 ] ) )
The DBBP partition mode deriving unit 30954B assigns 2N×N or N×2N according to the split flag horSplitFlag. Specifically, it derives the partition mode PartMode by assigning 2N×N when horSplitFlag is 1 and N×2N when horSplitFlag is 0.
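The four-corner derivation can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions: the function names, the column-major layout ref_dep_pels[x][y], and the string return values are hypothetical, and clip3 is written out here as a plain clamp to the inclusive range.

```python
def clip3(lo, hi, v):
    """Clamp v to the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))


def derive_part_mode_from_corners(ref_dep_pels, x_tl, y_tl, n_psw, n_psh,
                                  pic_width, pic_height):
    """Derive the partition mode from the four corner depth samples.

    ref_dep_pels[x][y] is the depth sample at picture position (x, y).
    All names here are hypothetical, chosen for this sketch.
    """
    x_p0 = clip3(0, pic_width - 1, x_tl)
    y_p0 = clip3(0, pic_height - 1, y_tl)
    x_p1 = clip3(0, pic_width - 1, x_tl + n_psw - 1)
    y_p1 = clip3(0, pic_height - 1, y_tl + n_psh - 1)

    # TL vs. BR and TR vs. BL comparisons.
    tl_lt_br = ref_dep_pels[x_p0][y_p0] < ref_dep_pels[x_p1][y_p1]
    tr_lt_bl = ref_dep_pels[x_p1][y_p0] < ref_dep_pels[x_p0][y_p1]
    hor_split_flag = int(tl_lt_br == tr_lt_bl)
    return "SIZE_2NxN" if hor_split_flag == 1 else "SIZE_Nx2N"


# 8x8 depth pictures stored as columns (depth[x][y]).
# Upper half has larger depth values than the lower half.
top_near = [[200 if y < 4 else 50 for y in range(8)] for x in range(8)]
# Left half has larger depth values than the right half.
left_near = [[200 if x < 4 else 50 for y in range(8)] for x in range(8)]
```

When both diagonal comparisons agree, the depth changes mainly from top to bottom and the sketch returns 2N×N; when they disagree, the depth changes mainly from left to right and it returns N×2N.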

According to the DBBP prediction unit 3095B configured as above, only a limited set of pixels of the depth block is referred to (here, four pixels of the block, namely the pixels at its four corners), so the processing amount is greatly reduced compared with the case where all pixels are referred to. Further, since neither the calculation of a depth representative value nor the derivation of the sum partSum[] for each partition mode is needed, the processing amount can be reduced further.

According to the DBBP prediction unit 3095B configured as above, the partition mode is derived by the simple process of comparing the upper-left and lower-right pixels of the depth and comparing the upper-right and lower-left pixels of the depth, so the processing amount is greatly reduced compared with the case where comparisons are performed on all pixels.

Further, according to the DBBP prediction unit 3095B configured as above, as with the DBBP prediction unit 3095A, only N×2N or 2N×N is used as the partition mode, so the processing amount is reduced compared with the case where AMP partitions are also targeted.

(DBBP prediction unit 3095C)
Hereinafter, a DBBP prediction unit 3095C, which is another configuration of the DBBP prediction unit 3095, will be described. The DBBP prediction unit 3095C basically has the same configuration as the DBBP prediction unit 3095B, but includes a DBBP partition mode deriving unit 30954C instead of the DBBP partition mode deriving unit 30954B.

FIG. 21 is a block diagram illustrating the configuration of the DBBP prediction unit 3095C according to the embodiment of the present invention. The DBBP prediction unit 3095C includes a DBBP image interpolation unit 30951, a segmentation unit 30952, an image synthesis unit 30953, and a DBBP partition mode deriving unit 30954C.

The DBBP partition mode deriving unit 30954C uses the split flag deriving unit 353 to derive a split flag horSplitFlag that takes the value 0 or 1, and derives the partition mode based on horSplitFlag. For example, the DBBP partition mode deriving unit 30954C derives the partition mode by assigning 2N×N when horSplitFlag is 1 and N×2N when horSplitFlag is 0.

An image decoding apparatus including the DBBP prediction unit 3095C and the VSP prediction unit 30374 includes, as the DBBP prediction unit 3095C, a segmentation deriving unit 30952 that derives segmentation information from a depth image, a DBBP image interpolation unit 30951 that generates two motion compensated images, an image synthesis unit 30953 that synthesizes the two interpolated images to generate one motion compensated image, and a DBBP partition mode deriving unit 30954C that derives the partition mode. The VSP prediction unit 30374 includes a partitioning unit that performs partitioning according to depth, and a depth motion DV deriving unit 351 that derives a motion vector from the depth image. Further, the DBBP partition mode deriving unit 30954C and the partitioning unit of the VSP prediction unit 30374 share the common split flag deriving unit 353.

According to the DBBP prediction unit 3095C configured as above, only a limited set of pixels of the depth block is referred to (here, the pixels at the four corners), so the processing amount is greatly reduced compared with the case where all pixels are referred to.

According to the image decoding apparatus including the DBBP prediction unit 3095C configured as above, the DBBP prediction unit 3095C and the VSP prediction unit 30374 use the common split flag deriving unit 353, so the implementation can be simplified compared with the case where the DBBP prediction unit and the VSP prediction unit derive the split by different methods.

According to the DBBP prediction unit 3095C configured as above, the partition mode PartMode is derived by the simple process of comparing the upper-left and lower-right pixels of the depth block and comparing the upper-right and lower-left pixels of the depth block, so the processing amount is greatly reduced compared with the case where comparisons are performed on all pixels.

Further, according to the DBBP prediction unit 3095C configured as above, as with the DBBP prediction unit 3095A, only N×2N or 2N×N is used as the partition mode, so the processing amount is reduced compared with the case where AMP partitions are also output.

Note that the DBBP prediction unit 3095C and the VSP prediction unit 30374 may use a split flag deriving unit 353A instead of the split flag deriving unit 353 as the common split flag deriving unit. In this case as well, according to the image decoding apparatus including the DBBP prediction unit 3095C, the DBBP prediction unit 3095C and the VSP prediction unit 30374 use the common split flag deriving unit 353A, so the implementation can be simplified compared with the case where the DBBP prediction unit and the VSP prediction unit derive the split by different methods.

(Bi-prediction restriction for DBBP)
The image decoding apparatus according to a modification of the present embodiment is configured not to apply bi-prediction in the case of DBBP. The image decoding apparatus of the modification includes an inter prediction parameter decoding control unit 3031A instead of the inter prediction parameter decoding control unit 3031, and a merge mode parameter deriving unit 3036A instead of the merge mode parameter deriving unit 3036. Operations other than those of the inter prediction parameter decoding control unit 3031A and the merge mode parameter deriving unit 3036A are as already described, so their description is omitted.

FIG. 23 is a schematic diagram showing the configuration of the inter prediction parameter decoding control unit 3031A according to the present embodiment. The inter prediction parameter decoding control unit 3031A has the same configuration as the inter prediction parameter decoding control unit 3031, but includes an inter prediction identifier decoding unit 30312A instead of the inter prediction identifier decoding unit 30312.

FIG. 24 is a diagram illustrating the derivation of inter_pred_flag in the inter prediction parameter decoding control unit 3031A. inter_pred_flag is decoded when the slice type is B (bi-prediction is available). FIG. 24(a) shows the values that inter_pred_flag can take, and FIG. 24(b) shows the bit string (binarization) of inter_pred_flag after CABAC decoding. As shown in the figure, when the DBBP flag is 1, the inter prediction parameter decoding control unit 3031A (inter prediction identifier decoding unit 30312A) of the present embodiment does not decode inter_pred_flag = 2 (PRED_BI). That is, when the prediction unit is not of the predetermined size (( nPbW + nPbH ) != 12) and the DBBP flag is 0 (dbbp_flag == 0), it decodes 0 (PRED_L0), 1 (PRED_L1), or 2 (PRED_BI) as inter_pred_flag. Otherwise, that is, when the prediction unit is of the predetermined size (( nPbW + nPbH ) == 12) or the DBBP flag is 1 (dbbp_flag == 1), it decodes 0 (PRED_L0) or 1 (PRED_L1) as inter_pred_flag.
The condition that the prediction unit is of the predetermined size (the sum of the width and height of the prediction unit is 12) corresponds to the bi-prediction restriction condition 1 described above, and the condition that the DBBP flag is 1 corresponds to the following bi-prediction restriction condition 2.

Bi-prediction restriction condition 2: the DBBP flag dbbp_flag is 1.

Further, as shown in FIG. 24(b), when the prediction unit is not of the predetermined size (( nPbW + nPbH ) != 12) and the DBBP flag is 0 (dbbp_flag == 0), the bit strings of inter_pred_flag are 00, 01, and 1, which correspond to 0 (PRED_L0), 1 (PRED_L1), and 2 (PRED_BI), respectively. In this case, the inter prediction parameter decoding control unit 3031A (inter prediction identifier decoding unit 30312A) decodes the bit string 00, 01, or 1 and assigns 0 (PRED_L0), 1 (PRED_L1), or 2 (PRED_BI) to inter_pred_flag. Otherwise, that is, when the prediction unit is of the predetermined size (( nPbW + nPbH ) == 12) or the DBBP flag is 1 (dbbp_flag == 1), the bit strings of inter_pred_flag are 0 and 1, which correspond to 0 (PRED_L0) and 1 (PRED_L1), respectively. In this case, the inter prediction parameter decoding control unit 3031A decodes the bit string 0 or 1 and assigns 0 (PRED_L0) or 1 (PRED_L1) to inter_pred_flag.
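The mapping from bit strings to inter_pred_flag values can be illustrated with a short Python sketch. It assumes the bins have already been produced by the arithmetic decoder and are supplied as a sequence of 0/1 values; the function name decode_inter_pred_flag and this calling convention are hypothetical and are not part of the described apparatus.

```python
PRED_L0, PRED_L1, PRED_BI = 0, 1, 2


def decode_inter_pred_flag(bins, n_pb_w, n_pb_h, dbbp_flag):
    """Map already-decoded bins (iterable of 0/1) to an inter_pred_flag value.

    Hypothetical helper: context modelling and arithmetic decoding are
    outside the scope of this sketch.
    """
    bin_iter = iter(bins)
    # Restriction condition 1 (size) or condition 2 (DBBP flag).
    restricted = (n_pb_w + n_pb_h == 12) or dbbp_flag == 1
    if restricted:
        # Bit strings 0 and 1: PRED_BI cannot be signalled.
        return PRED_L1 if next(bin_iter) else PRED_L0
    if next(bin_iter):
        # Bit string 1.
        return PRED_BI
    # Bit strings 00 and 01.
    return PRED_L1 if next(bin_iter) else PRED_L0
```

Note that the same first bin 1 means PRED_BI in the unrestricted case and PRED_L1 in the restricted case, which is why the restriction has to be evaluated before the bins are interpreted.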

FIG. 25 is a block diagram illustrating the configuration of the merge mode parameter deriving unit 3036A. The merge mode parameter deriving unit 3036A includes a merge candidate deriving unit 30361, a merge candidate selection unit 30362, and a bi-prediction restriction unit 30363A. The means other than part of the operation of the merge mode parameter deriving unit 3036A and the bi-prediction restriction unit 30363A are as already described, so their description is omitted.

The merge mode parameter deriving unit 3036A outputs, to the bi-prediction restriction unit 30363A, the DBBP flag dbbp_flag in addition to the prediction parameters derived by the merge candidate selection unit 30362 and the width nOrigPbW and height nOrigPbH of the prediction unit.

When the bi-prediction restriction condition 1 or the bi-prediction restriction condition 2 described above holds, the bi-prediction restriction unit 30363A converts bi-prediction (predFlagL0 = 1 and predFlagL1 = 1) into uni-prediction by setting the L1 reference picture index refIdxL1 and the L1 prediction use flag predFlagL1 to refIdxL1 = -1 and predFlagL1 = 0.
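The conversion can be sketched in Python as below. The function name restrict_bi_prediction and the tuple return value are assumptions for illustration. The sketch also assumes that the conversion is applied only when the candidate is actually bi-predictive, so that an L1-only candidate is left untouched; the text above does not state this explicitly.

```python
def restrict_bi_prediction(pred_flag_l0, pred_flag_l1, ref_idx_l1,
                           n_orig_pb_w, n_orig_pb_h, dbbp_flag):
    """Return (predFlagL0, predFlagL1, refIdxL1) after the restriction.

    Hypothetical helper for illustration only.
    """
    condition1 = (n_orig_pb_w + n_orig_pb_h) == 12  # restriction condition 1
    condition2 = dbbp_flag == 1                      # restriction condition 2
    # Assumption: only genuinely bi-predictive candidates are converted.
    is_bi_pred = pred_flag_l0 == 1 and pred_flag_l1 == 1
    if is_bi_pred and (condition1 or condition2):
        ref_idx_l1 = -1
        pred_flag_l1 = 0
    return pred_flag_l0, pred_flag_l1, ref_idx_l1
```

For example, a bi-predictive merge candidate in a DBBP block keeps only its L0 motion, while the same candidate in a 16×16 non-DBBP block passes through unchanged.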

According to the bi-prediction restriction unit 30363A configured as above, the interpolated images derived by the DBBP prediction unit 3095 are limited to the uni-prediction case (a reference picture of L0 or L1), so the processing amount and the transfer amount are greatly reduced compared with the case where interpolated images are generated by DBBP prediction for each reference list in bi-prediction.

Further, when both the inter prediction parameter decoding control unit 3031A and the merge mode parameter deriving unit 3036A configured as above are provided, the application of bi-prediction can be completely prohibited in the DBBP prediction performed when dbbp_flag is 1.

(Configuration of image encoding device)
Next, the configuration of the image encoding device 11 according to the present embodiment will be described. FIG. 26 is a block diagram illustrating the configuration of the image encoding device 11 according to the present embodiment. The image encoding device 11 includes a predicted image generation unit 101, a subtraction unit 102, a DCT/quantization unit 103, an entropy encoding unit 104, an inverse quantization/inverse DCT unit 105, an addition unit 106, a prediction parameter memory (prediction parameter storage unit, frame memory) 108, a reference picture memory (reference image storage unit, frame memory) 109, an encoding parameter determination unit 110, and a prediction parameter encoding unit 111. The prediction parameter encoding unit 111 includes an inter prediction parameter encoding unit 112 and an intra prediction parameter encoding unit 113.

For each picture of each viewpoint of the layer image T input from outside, the predicted image generation unit 101 generates a predicted picture block predSamples for each block, a block being a region obtained by dividing the picture. Here, the predicted image generation unit 101 reads a reference picture block from the reference picture memory 109 based on the prediction parameter input from the prediction parameter encoding unit 111. The prediction parameter input from the prediction parameter encoding unit 111 is, for example, a motion vector or a displacement vector. The predicted image generation unit 101 reads the reference picture block located at the position indicated by the motion vector or displacement vector predicted with the encoding prediction unit as the starting point. The predicted image generation unit 101 generates the predicted picture block predSamples from the read reference picture block using one of a plurality of prediction methods, and outputs the generated predicted picture block predSamples to the subtraction unit 102 and the addition unit 106. Since the predicted image generation unit 101 operates in the same way as the predicted image generation unit 308 already described, details of the generation of the predicted picture block predSamples are omitted.

To select a prediction method, the predicted image generation unit 101 selects, for example, the prediction method that minimizes an error value based on the difference between the per-pixel signal values of a block included in the layer image and the per-pixel signal values of the corresponding predicted picture block predSamples. The method of selecting the prediction method is not limited to this.

When the picture to be encoded is a base view picture, the plurality of prediction methods are intra prediction, motion prediction, and merge mode. Motion prediction is, among the inter prediction described above, prediction between display times. Merge mode is prediction that uses the same reference picture block and prediction parameters as an already encoded block located within a predetermined range from the prediction unit. When the picture to be encoded is a picture other than a base view picture, the plurality of prediction methods are intra prediction, motion prediction, merge mode (including view synthesis prediction), and displacement prediction. Displacement prediction (disparity prediction) is, among the inter prediction described above, prediction between images of different layers (images of different viewpoints). Displacement prediction (disparity prediction) may be performed with or without additional prediction (residual prediction and illumination compensation).

When intra prediction is selected, the predicted image generation unit 101 outputs a prediction mode PredMode indicating the intra prediction mode used to generate the predicted picture block predSamples to the prediction parameter encoding unit 111.

When motion prediction is selected, the predicted image generation unit 101 stores the motion vector mvLX used to generate the predicted picture block predSamples in the prediction parameter memory 108 and outputs it to the inter prediction parameter encoding unit 112. The motion vector mvLX indicates the vector from the position of the encoding prediction unit to the position of the reference picture block used to generate the predicted picture block predSamples. The information indicating the motion vector mvLX may include information indicating a reference picture (for example, a reference picture index refIdxLX and a picture order count POC) and may represent a prediction parameter. The predicted image generation unit 101 also outputs a prediction mode PredMode indicating the inter prediction mode to the prediction parameter encoding unit 111.

When displacement prediction is selected, the predicted image generation unit 101 stores the displacement vector used to generate the predicted picture block predSamples in the prediction parameter memory 108 and outputs it to the inter prediction parameter encoding unit 112. The displacement vector dvLX indicates the vector from the position of the encoding prediction unit to the position of the reference picture block used to generate the predicted picture block predSamples. The information indicating the displacement vector dvLX may include information indicating a reference picture (for example, a reference picture index refIdxLX and a view ID view_id) and may represent a prediction parameter. The predicted image generation unit 101 also outputs a prediction mode PredMode indicating the inter prediction mode to the prediction parameter encoding unit 111.

When merge mode is selected, the predicted image generation unit 101 outputs a merge index merge_idx indicating the selected reference picture block to the inter prediction parameter encoding unit 112. The predicted image generation unit 101 also outputs a prediction mode PredMode indicating merge mode to the prediction parameter encoding unit 111.

In the merge mode described above, when the VSP mode flag VspModeFlag indicates that view synthesis prediction is to be performed, the predicted image generation unit 101 performs view synthesis prediction in the VSP prediction unit 30374 included in the predicted image generation unit 101, as already described. Further, in motion prediction, displacement prediction, and merge mode, when the residual prediction execution flag resPredFlag indicates that residual prediction is to be performed, the predicted image generation unit 101 performs residual prediction in the residual prediction unit 3092 included in the predicted image generation unit 101, as already described.

The subtraction unit 102 subtracts, pixel by pixel, the signal values of the predicted picture block predSamples input from the predicted image generation unit 101 from the signal values of the corresponding block of the layer image T input from outside, and generates a residual signal. The subtraction unit 102 outputs the generated residual signal to the DCT/quantization unit 103 and the encoding parameter determination unit 110.

The DCT/quantization unit 103 performs a DCT on the residual signal input from the subtraction unit 102 and calculates DCT coefficients. The DCT/quantization unit 103 quantizes the calculated DCT coefficients to obtain quantized coefficients, and outputs the obtained quantized coefficients to the entropy encoding unit 104 and the inverse quantization/inverse DCT unit 105.

The entropy encoding unit 104 receives the quantized coefficients from the DCT/quantization unit 103 and the encoding parameters from the encoding parameter determination unit 110. The input encoding parameters include, for example, codes such as the reference picture index refIdxLX, the prediction vector flag mvp_LX_flag, the difference vector mvdLX, the prediction mode PredMode, the merge index merge_idx, the residual prediction index iv_res_pred_weight_idx, and the illumination compensation flag ic_flag.

The entropy encoding unit 104 entropy-encodes the input quantized coefficients and encoding parameters to generate an encoded stream Te, and outputs the generated encoded stream Te to the outside.

The inverse quantization/inverse DCT unit 105 inversely quantizes the quantized coefficients input from the DCT/quantization unit 103 to obtain DCT coefficients. The inverse quantization/inverse DCT unit 105 performs an inverse DCT on the obtained DCT coefficients to calculate a decoded residual signal, and outputs the calculated decoded residual signal to the addition unit 106 and the encoding parameter determination unit 110.

The addition unit 106 adds, pixel by pixel, the signal values of the predicted picture block predSamples input from the predicted image generation unit 101 and the signal values of the decoded residual signal input from the inverse quantization/inverse DCT unit 105, and generates a reference picture block. The addition unit 106 stores the generated reference picture block in the reference picture memory 109.

The prediction parameter memory 108 stores the prediction parameters generated by the prediction parameter encoding unit 111 at positions predetermined for each picture and block to be encoded.

The reference picture memory 109 stores the reference picture blocks generated by the addition unit 106 at positions predetermined for each picture and block to be encoded.

The encoding parameter determination unit 110 selects one set from among a plurality of sets of encoding parameters. An encoding parameter is one of the prediction parameters described above or a parameter to be encoded that is generated in relation to those prediction parameters. The predicted image generation unit 101 generates a predicted picture block predSamples using each of these sets of encoding parameters.

The encoding parameter determination unit 110 calculates, for each of the plurality of sets, a cost value indicating the amount of information and the encoding error. The cost value is, for example, the sum of the code amount and the value obtained by multiplying the squared error by a coefficient λ. The code amount is the information amount of the encoded stream Te obtained by entropy-encoding the quantization error and the encoding parameters. The squared error is the sum over pixels of the squares of the residual values of the residual signal calculated by the subtraction unit 102. The coefficient λ is a preset real number greater than zero. The encoding parameter determination unit 110 selects the set of encoding parameters that minimizes the calculated cost value. As a result, the entropy encoding unit 104 outputs the selected set of encoding parameters to the outside as the encoded stream Te and does not output the sets that were not selected.
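The selection rule described above can be illustrated with a minimal Python sketch. The class `ParamSet`, the function names, and the numeric values are hypothetical and are not part of the embodiment; only the cost formula (code amount plus λ times squared error) and the minimum-cost selection follow the description.

```python
from dataclasses import dataclass


@dataclass
class ParamSet:
    """Hypothetical container for one candidate set of encoding parameters."""
    name: str
    code_amount: float    # information amount of the resulting stream
    squared_error: float  # sum over pixels of squared residual values


def cost_value(candidate, lam):
    # Cost = code amount + lambda * squared error, as stated in the text.
    return candidate.code_amount + lam * candidate.squared_error


def select_param_set(candidates, lam):
    # The text requires lambda to be a preset real number greater than zero.
    if lam <= 0:
        raise ValueError("lambda must be greater than zero")
    return min(candidates, key=lambda c: cost_value(c, lam))


candidates = [ParamSet("a", 100.0, 50.0), ParamSet("b", 80.0, 70.0)]
best_low_lambda = select_param_set(candidates, 0.5)   # costs: a=125, b=115
best_high_lambda = select_param_set(candidates, 2.0)  # costs: a=200, b=220
```

A larger λ weights distortion more heavily, so the candidate with the smaller squared error wins even though it needs more bits.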

The prediction parameter encoding unit 111 derives the prediction parameters used when generating a predicted picture, based on the parameters input from the predicted image generation unit 101, and encodes the derived prediction parameters to generate a set of encoding parameters. The prediction parameter encoding unit 111 outputs the generated set of encoding parameters to the entropy encoding unit 104.

The prediction parameter encoding unit 111 stores, in the prediction parameter memory 108, the prediction parameters corresponding to the set selected by the encoding parameter determination unit 110 from among the generated sets of encoding parameters.

The prediction parameter encoding unit 111 operates the inter prediction parameter encoding unit 112 when the prediction mode PredMode input from the predicted image generation unit 101 indicates the inter prediction mode. The prediction parameter encoding unit 111 operates the intra prediction parameter encoding unit 113 when the prediction mode PredMode indicates the intra prediction mode.

The inter prediction parameter encoding unit 112 derives inter prediction parameters based on the prediction parameters input from the encoding parameter determination unit 110. As its configuration for deriving inter prediction parameters, the inter prediction parameter encoding unit 112 includes the same configuration as that with which the inter prediction parameter decoding unit 303 derives inter prediction parameters. The configuration of the inter prediction parameter encoding unit 112 will be described later.

The intra prediction parameter encoding unit 113 determines the intra prediction mode IntraPredMode indicated by the prediction mode PredMode input from the encoding parameter determination unit 110 as a set of inter prediction parameters.

(Configuration of inter prediction parameter encoding unit)
Next, the configuration of the inter prediction parameter encoding unit 112 will be described. The inter prediction parameter encoding unit 112 is the means corresponding to the inter prediction parameter decoding unit 303. FIG. 27 is a schematic diagram illustrating the configuration of the inter prediction parameter encoding unit 112 according to the present embodiment. The inter prediction parameter encoding unit 112 includes a merge mode parameter deriving unit 1121, an AMVP prediction parameter deriving unit 1122, a subtraction unit 1123, and an inter prediction parameter encoding control unit 1126.

The merge mode parameter deriving unit 1121 has the same configuration as the merge mode parameter deriving unit 3036 described above (see FIG. 9).

The AMVP prediction parameter deriving unit 1122 has the same configuration as the AMVP prediction parameter deriving unit 3032 described above (see FIG. 10).

The subtraction unit 1123 subtracts the prediction vector mvpLX input from the AMVP prediction parameter deriving unit 1122 from the vector mvLX input from the encoding parameter determination unit 110 to generate a difference vector mvdLX. The difference vector mvdLX is output to the inter prediction parameter encoding control unit 1126.
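The subtraction above can be written out as a short Python sketch. Representing vectors as (x, y) tuples and the function names are assumptions made only for illustration; the inverse operation is included to show that mvLX is recoverable from mvpLX and mvdLX.

```python
def derive_mvd(mv_lx, mvp_lx):
    # mvdLX = mvLX - mvpLX, component by component.
    return (mv_lx[0] - mvp_lx[0], mv_lx[1] - mvp_lx[1])


def reconstruct_mv(mvp_lx, mvd_lx):
    # Inverse operation: mvLX = mvpLX + mvdLX.
    return (mvp_lx[0] + mvd_lx[0], mvp_lx[1] + mvd_lx[1])


mvd = derive_mvd((5, -3), (2, 1))
mv_back = reconstruct_mv((2, 1), mvd)
```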

The inter prediction parameter encoding control unit 1126 instructs the entropy encoding unit 104 to encode the codes (syntax elements) related to inter prediction, and thereby encodes the codes (syntax elements) included in the encoded data, for example the division mode part_mode, the merge flag merge_flag, the merge index merge_idx, the inter prediction identifier inter_pred_idc, the reference picture index refIdxLX, the prediction vector flag mvp_LX_flag, and the difference vector mvdLX.

The inter prediction parameter encoding control unit 1126 includes a residual prediction index encoding unit 10311, an illuminance compensation flag encoding unit 10312, a merge index encoding unit, a vector candidate index encoding unit, a division mode encoding unit, a merge flag encoding unit, an inter prediction identifier encoding unit, a reference picture index encoding unit, and a vector difference encoding unit. The division mode encoding unit, the merge flag encoding unit, the merge index encoding unit, the inter prediction identifier encoding unit, the reference picture index encoding unit, the vector candidate index encoding unit, and the vector difference encoding unit encode, respectively, the division mode part_mode, the merge flag merge_flag, the merge index merge_idx, the inter prediction identifier inter_pred_idc, the reference picture index refIdxLX, the prediction vector flag mvp_LX_flag, and the difference vector mvdLX.

The residual prediction index encoding unit 10311 encodes the residual prediction index iv_res_pred_weight_idx to indicate whether or not residual prediction is performed.

The illuminance compensation flag encoding unit 10312 encodes the illuminance compensation flag ic_flag to indicate whether or not illuminance compensation is performed.

When the prediction mode PredMode input from the predicted image generation unit 101 indicates the merge mode, the inter prediction parameter encoding control unit 1126 outputs the merge index merge_idx input from the encoding parameter determination unit 110 to the entropy encoding unit 104 and causes it to be encoded.

When the prediction mode PredMode input from the predicted image generation unit 101 indicates the inter prediction mode, the inter prediction parameter encoding control unit 1126 performs the following processing.

The inter prediction parameter encoding control unit 1126 integrates the reference picture index refIdxLX and the prediction vector flag mvp_LX_flag input from the encoding parameter determination unit 110 with the difference vector mvdLX input from the subtraction unit 1123. The inter prediction parameter encoding control unit 1126 outputs the integrated code to the entropy encoding unit 104 and causes it to be encoded. The inter prediction parameter encoding control unit 1126 also includes a DBBP flag encoding unit (not shown) for the DBBP flag dbbp_flag.

The predicted image generation unit 101 is the means corresponding to the predicted image generation unit 308 described above, and its processing for generating a predicted image from prediction parameters is the same.

In the present embodiment, the predicted image generation unit 101, like the predicted image generation unit 308, includes the residual synthesis unit 30923 described above. That is, residual prediction is not performed when the size of the target block (prediction block) is equal to or smaller than a predetermined size. In addition, the predicted image generation unit 101 of the present embodiment performs residual prediction only when the division mode part_mode of the coding unit CU is 2N×2N. In other cases, processing is performed with the residual prediction index iv_res_pred_weight_idx treated as 0. Likewise, the residual prediction index encoding unit 10311 of the present embodiment encodes the residual prediction index iv_res_pred_weight_idx only when the division mode part_mode of the coding unit CU is 2N×2N.

The image encoding device including the residual prediction unit 3092 is an image encoding device including a residual prediction index encoding unit that encodes a residual prediction index. When the division mode of the coding unit containing the target block is 2N×2N, it encodes the residual prediction index; otherwise it does not encode the residual prediction index. It performs residual prediction when the residual prediction index is other than 0.
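The signalling condition above can be sketched in Python as follows. This is an illustrative sketch only: the function names, the string constant used for the division mode, and the `write` callback standing in for the entropy encoding unit are assumptions, not part of the embodiment.

```python
def encode_res_pred_index(part_mode, iv_res_pred_weight_idx, write):
    """Return the effective residual prediction index after signalling."""
    if part_mode == "PART_2Nx2N":
        # Only 2Nx2N coding units carry the index in the stream.
        write(iv_res_pred_weight_idx)
        return iv_res_pred_weight_idx
    # Otherwise nothing is written and the index is treated as 0.
    return 0


def residual_prediction_enabled(effective_idx):
    # Residual prediction is performed when the index is other than 0.
    return effective_idx != 0


written = []
idx_2nx2n = encode_res_pred_index("PART_2Nx2N", 1, written.append)
idx_nx2n = encode_res_pred_index("PART_Nx2N", 1, written.append)
```

Because the decoder applies the same condition, it can infer an index of 0 whenever the division mode is not 2N×2N without reading anything from the stream.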

(DBBP prediction)
The predicted image generation unit 101 of the image encoding device 11 of the present embodiment includes a DBBP prediction unit 3095. The operation of the DBBP prediction unit 3095 has already been described in detail and is omitted here. The DBBP prediction unit 3095 performs depth-based block prediction when 1 is encoded as the DBBP flag dbbp_flag by the DBBP flag encoding unit described above.

According to the image encoding device including the DBBP prediction unit 3095 configured as above, the DBBP image interpolation unit 30951 generates the two interpolated images by bilinear prediction, which greatly reduces the processing amount and the transfer amount.

According to the image encoding device including the DBBP prediction unit 3095 configured as above, the segmentation unit 30952 derives segmentation information segMask that takes the value 0 or 1 for each pixel, and the image synthesis unit 30953 synthesizes the result by selecting, at each pixel of the target block, one of the two motion-compensated images based on the segmentation information segMask. This reduces the processing of the image synthesis unit 30953 compared with the case where some pixels are synthesized by weighting the two motion-compensated images, for example with a weight of 1/2 each.

Furthermore, according to the image encoding device including the DBBP prediction unit 3095 configured as above, the processing of the image synthesis unit 30953 is greatly reduced compared with the case where synthesis at each pixel (x, y) refers not only to the corresponding segmentation information segMask[x][y] but also to the segmentation information of the pixels above, below, left, and right, namely segMask[x][y-1], segMask[x][y+1], segMask[x-1][y], and segMask[x+1][y].
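The per-pixel selection described above can be sketched in Python as follows. The function name, the use of nested lists indexed as [y][x] (the text writes segMask[x][y]), and the choice of which image corresponds to mask value 0 are assumptions made for illustration.

```python
def combine_by_selection(pred_a, pred_b, seg_mask):
    """Pick pred_b where seg_mask is 1 and pred_a where it is 0."""
    height = len(seg_mask)
    width = len(seg_mask[0])
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            # Only the mask value at (x, y) itself is consulted;
            # no neighbouring mask values and no weighted blending.
            row.append(pred_b[y][x] if seg_mask[y][x] == 1 else pred_a[y][x])
        out.append(row)
    return out


pred_a = [[10, 10], [10, 10]]
pred_b = [[90, 90], [90, 90]]
seg_mask = [[0, 1], [1, 0]]
result = combine_by_selection(pred_a, pred_b, seg_mask)
```

Each output sample costs one mask lookup and one selection, with no multiplications, rounding, or neighbourhood tests.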

(Modification of DBBP)
The predicted image generation unit 101 of the image encoding device 11 may include any one of the DBBP prediction unit 3095A, the DBBP prediction unit 3095B, and the DBBP prediction unit 3095C instead of the DBBP prediction unit 3095.

According to the image encoding device including any of the DBBP prediction units 3095A to 3095C configured as above, only a limited set of pixels of the depth block (here, the four corner pixels) is referred to, which greatly reduces the processing amount compared with referring to all pixels.

According to the image encoding device including any of the DBBP prediction units 3095A to 3095C configured as above, the division mode is derived by the simple process of comparing the upper-left and lower-right pixels of the depth block and comparing its upper-right and lower-left pixels, which greatly reduces the processing amount compared with performing comparisons on all pixels.
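A four-corner derivation of this kind might look like the Python sketch below. The passage above states only that two diagonal comparisons are used; the way they are combined here (agreement of the two comparisons selects 2N×N, disagreement selects N×2N) is an assumption for illustration, as are the function name and the string constants.

```python
def derive_part_mode_from_corners(depth):
    """Derive a division mode from the four corner samples of a depth block."""
    h = len(depth)
    w = len(depth[0])
    top_left, top_right = depth[0][0], depth[0][w - 1]
    bottom_left, bottom_right = depth[h - 1][0], depth[h - 1][w - 1]
    # Two comparisons only, regardless of block size.
    cmp_main_diagonal = top_left < bottom_right
    cmp_anti_diagonal = top_right < bottom_left
    # Assumed combination rule: both diagonals ordered the same way
    # suggests a top/bottom depth change, hence a 2NxN split.
    if cmp_main_diagonal == cmp_anti_diagonal:
        return "PART_2NxN"
    return "PART_Nx2N"


top_bottom_edge = [[10, 10], [100, 100]]   # depth changes vertically
left_right_edge = [[10, 100], [10, 100]]   # depth changes horizontally
mode_a = derive_part_mode_from_corners(top_bottom_edge)
mode_b = derive_part_mode_from_corners(left_right_edge)
```

The cost is constant in the block size, which is the point made in the text: four sample reads and two comparisons instead of a pass over every depth pixel.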

Furthermore, according to the image encoding device including the DBBP prediction unit 3095B or the DBBP prediction unit 3095C configured as above, only N×2N or 2N×N is used as the division mode, as with the DBBP prediction unit 3095A, which reduces the processing amount compared with the case where AMP divisions are also handled.

According to the image encoding device including the DBBP prediction unit 3095C configured as above, the DBBP prediction unit 3095C and the VSP prediction unit 30374 use a common division mode deriving unit 3096, which simplifies implementation compared with the case where the DBBP prediction unit and the VSP prediction unit derive the division by different methods.

(DBBP bi-prediction restriction)
The image encoding device 11 according to a modification of the present embodiment is configured not to apply bi-prediction in the case of DBBP. Specifically, it includes an inter prediction parameter encoding unit 103A (not shown) instead of the inter prediction parameter encoding unit 103, and a merge mode parameter encoding unit 1036A (not shown) and a merge mode parameter deriving unit 3036A instead of the merge mode parameter deriving unit 3036. The inter prediction parameter encoding unit 103A is the means corresponding to the inter prediction parameter decoding control unit 3031A described above.

When the DBBP flag is 1, the inter prediction parameter encoding unit 103A does not encode inter_pred_flag = 2 (PRED_BI). That is, when the prediction unit is not of the predetermined size ((nPbW + nPbH) != 12) and the DBBP flag is 0 (dbbp_flag == 0), the inter prediction parameter encoding unit 103A of the present embodiment encodes 0 (PRED_L0), 1 (PRED_L1), or 2 (PRED_BI) as inter_pred_flag. Otherwise, that is, when the prediction unit is of the predetermined size ((nPbW + nPbH) == 12) or the DBBP flag is 1 (dbbp_flag == 1), it encodes 0 (PRED_L0) or 1 (PRED_L1) as inter_pred_flag.

As for the bit strings, when the prediction unit is not of the predetermined size ((nPbW + nPbH) != 12) and the DBBP flag is 0 (dbbp_flag == 0), the inter prediction parameter encoding unit 103A encodes 00, 01, or 1 as the bit string of inter_pred_flag. Otherwise, that is, when the prediction unit is of the predetermined size ((nPbW + nPbH) == 12) or the DBBP flag is 1 (dbbp_flag == 1), the inter prediction parameter encoding unit 103A encodes 0 or 1 as the bit string of inter_pred_flag.
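The restricted signalling can be illustrated with the following Python sketch. It treats the restricted case as the complement of the unrestricted condition (prediction unit with nPbW + nPbH equal to 12, or dbbp_flag equal to 1). The assignment of the bit strings 00, 01, 1 to PRED_L0, PRED_L1, PRED_BI in that order is an assumption based on the order in which the text lists them, and the function names are hypothetical.

```python
PRED_L0, PRED_L1, PRED_BI = 0, 1, 2


def bi_prediction_allowed(n_pb_w, n_pb_h, dbbp_flag):
    # Bi-prediction may be signalled only for prediction units that are
    # not of the predetermined size and only when DBBP is off.
    return (n_pb_w + n_pb_h) != 12 and dbbp_flag == 0


def binarize_inter_pred_flag(value, n_pb_w, n_pb_h, dbbp_flag):
    if bi_prediction_allowed(n_pb_w, n_pb_h, dbbp_flag):
        return {PRED_L0: "00", PRED_L1: "01", PRED_BI: "1"}[value]
    if value == PRED_BI:
        raise ValueError("PRED_BI cannot be encoded in the restricted case")
    # Restricted case: a single bit distinguishes L0 from L1.
    return {PRED_L0: "0", PRED_L1: "1"}[value]


bits_bi = binarize_inter_pred_flag(PRED_BI, 16, 16, 0)
bits_l1_small = binarize_inter_pred_flag(PRED_L1, 8, 4, 0)
bits_l0_dbbp = binarize_inter_pred_flag(PRED_L0, 16, 16, 1)
```

In the restricted case no bit string exists for PRED_BI, so a conforming stream cannot signal bi-prediction together with dbbp_flag equal to 1.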

In addition, because the image encoding device of the present embodiment includes the merge mode parameter deriving unit 3036A, under bi-prediction restriction condition 1 or bi-prediction restriction condition 2 described above it converts bi-prediction into uni-prediction by setting the L1 reference picture index refIdxL1 and the L1 prediction use flag predFlagL1 to refIdxL1 = -1 and predFlagL1 = 0.
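The conversion can be sketched in Python as follows. The dictionary layout and function name are hypothetical, and the boolean `restricted` stands in for bi-prediction restriction condition 1 or 2, whose definitions are given elsewhere in the description and are not reproduced here.

```python
def convert_bi_to_uni(params, restricted):
    """Return merge parameters with L1 disabled when the restriction applies."""
    out = dict(params)
    is_bi = out["predFlagL0"] == 1 and out["predFlagL1"] == 1
    if restricted and is_bi:
        # Drop the L1 part: refIdxL1 = -1 and predFlagL1 = 0.
        out["refIdxL1"] = -1
        out["predFlagL1"] = 0
    return out


bi_candidate = {"predFlagL0": 1, "refIdxL0": 0, "predFlagL1": 1, "refIdxL1": 2}
converted = convert_bi_to_uni(bi_candidate, restricted=True)
untouched = convert_bi_to_uni(bi_candidate, restricted=False)
```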

That is, the image encoding device of the present embodiment includes the DBBP prediction unit 3095 and the merge mode parameter deriving unit 3036A. The DBBP prediction unit 3095 includes a segmentation deriving unit 30952 that derives segmentation information from a depth image, a DBBP image interpolation unit 30951 that generates two motion-compensated images, and an image synthesis unit 30953 that synthesizes the two interpolated images to generate one motion-compensated image. The image encoding device further includes a DBBP flag encoding unit (not shown) that encodes the DBBP flag, and the merge mode parameter deriving unit 3036A converts bi-prediction into uni-prediction when the DBBP flag is 1.

According to the image encoding unit including the merge mode parameter deriving unit 3036A configured as above, the interpolated images derived by the DBBP prediction unit 3095 are limited to the uni-prediction case (a reference picture of L0 or L1, that is, predFlagL0 is 1 or predFlagL1 is 1). Compared with the case where interpolated images could be generated by DBBP prediction for each list in bi-prediction, this greatly reduces the worst-case processing amount and transfer amount.

Similarly, according to the image encoding unit including the inter prediction parameter encoding unit 103A configured as above, when the DBBP flag dbbp_flag is 1, a uni-prediction value (PRED_L0 or PRED_L1) is encoded as the inter prediction identifier inter_pred_idc and the bi-prediction value PRED_BI is not encoded, so bi-prediction is prohibited in the case of DBBP prediction. Compared with the case where interpolated images could be generated by DBBP prediction for each list in bi-prediction, this greatly reduces the worst-case processing amount and transfer amount.

When both the inter prediction parameter encoding unit 103A and the merge mode parameter deriving unit 3036A configured as above are provided, the application of bi-prediction in the DBBP prediction performed when dbbp_flag is 1 can be prohibited completely.

Note that part of the image encoding device 11 and the image decoding device 31 in the embodiment described above, for example, the predicted image generation unit 101, the DCT / quantization unit 103, the entropy encoding unit 104, the inverse quantization / inverse DCT unit 105, the encoding parameter determination unit 110, the prediction parameter encoding unit 111, the entropy decoding unit 301, the prediction parameter decoding unit 302, the predicted image generation unit 308, and the inverse quantization / inverse DCT unit 311, may be realized by a computer. In that case, a program for realizing this control function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read into a computer system and executed. The "computer system" here is a computer system built into either the image encoding device 11 or the image decoding device 31, and includes an OS and hardware such as peripheral devices.
The "computer-readable recording medium" refers to portable media such as flexible disks, magneto-optical disks, ROMs, and CD-ROMs, and to storage devices such as hard disks built into computer systems. The "computer-readable recording medium" may further include media that hold a program dynamically for a short time, such as a communication line used when a program is transmitted over a network such as the Internet or over a communication line such as a telephone line, and media that hold a program for a certain period, such as the volatile memory inside the computer system serving as the server or client in that case. The program may realize only part of the functions described above, or may realize those functions in combination with a program already recorded in the computer system.

Part or all of the image encoding device 11 and the image decoding device 31 in the embodiment described above may be realized as an integrated circuit such as an LSI (Large Scale Integration). Each functional block of the image encoding device 11 and the image decoding device 31 may be made into an individual processor, or some or all of the blocks may be integrated into one processor. The method of circuit integration is not limited to LSI; a dedicated circuit or a general-purpose processor may be used. If advances in semiconductor technology give rise to an integrated circuit technology that replaces LSI, an integrated circuit based on that technology may be used.

One embodiment of the present invention has been described in detail above with reference to the drawings. However, the specific configuration is not limited to the one described, and various design changes and the like can be made without departing from the gist of the present invention.

(Additional notes)
The present invention can also be expressed as follows.

One aspect of the present invention is a depth-based block predicted image generation device including a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that synthesizes the two interpolated images to generate one motion-compensated image, the device further including a depth division mode deriving unit that derives a division mode from the depth image, wherein the depth division mode deriving unit derives a division mode of 2N×N or N×2N.

One aspect of the present invention is a depth-based block predicted image generation device including a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that synthesizes the two interpolated images to generate one motion-compensated image, the device further including a depth division mode deriving unit that derives a division mode from the depth image, wherein the segmentation deriving unit derives segmentation information that takes the value 0 or 1 for each pixel, and the image synthesis unit performs synthesis by selecting one of the two interpolated images at each pixel of the block.

In one aspect of the present invention, the depth division mode deriving unit derives the division mode from a comparison between the upper-left and lower-right pixels of the depth block corresponding to the target block and a comparison between the upper-right and lower-left pixels of that depth block.

In one aspect of the present invention, the division mode deriving unit derives, from the depth block corresponding to the target block, a horizontal absolute difference from two depth pixels whose vertical coordinates are equal and a vertical absolute difference from two depth pixels whose horizontal coordinates are equal, and derives the division mode by dividing into vertically long partitions when the horizontal absolute difference is larger than the vertical absolute difference and into horizontally long partitions otherwise. The partition dividing unit divides into 4×8 sub-blocks when the horizontal absolute difference is larger than the vertical absolute difference, and into horizontally long 8×4 sub-blocks otherwise.
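The absolute-difference rule above can be sketched in Python as follows. The choice of which corner pixels form each pair (upper-left with upper-right for the horizontal difference, upper-left with lower-left for the vertical difference) is an assumption; the text requires only that the two pixels of each pair share a vertical or a horizontal coordinate. The function name and the (width, height) return convention are likewise hypothetical.

```python
def derive_sub_block_size(depth):
    """Return the sub-block size as (width, height): (4, 8) or (8, 4)."""
    h = len(depth)
    w = len(depth[0])
    # Same row (equal vertical coordinate): horizontal absolute difference.
    hor_abs_diff = abs(depth[0][0] - depth[0][w - 1])
    # Same column (equal horizontal coordinate): vertical absolute difference.
    ver_abs_diff = abs(depth[0][0] - depth[h - 1][0])
    if hor_abs_diff > ver_abs_diff:
        return (4, 8)  # vertically long sub-blocks
    return (8, 4)      # horizontally long sub-blocks, including ties


vertical_edge = [[10, 100], [10, 100]]
horizontal_edge = [[10, 10], [100, 100]]
flat = [[50, 50], [50, 50]]
size_a = derive_sub_block_size(vertical_edge)
size_b = derive_sub_block_size(horizontal_edge)
size_c = derive_sub_block_size(flat)
```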

One aspect of the present invention is an image decoding device including depth-based block predicted image generation means and a merge mode parameter deriving unit, wherein the depth-based block predicted image generation means includes a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that synthesizes the two interpolated images to generate one motion-compensated image, the image decoding device further includes a DBBP flag decoding unit, and the merge mode parameter deriving unit converts bi-prediction into uni-prediction when the DBBP flag is 1.

In one aspect of the present invention, in an image encoding apparatus comprising depth-based block prediction image generation means and a merge mode parameter deriving unit, the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images to generate one motion-compensated image. The image encoding apparatus further comprises a DBBP flag encoding unit, and the merge mode parameter deriving unit converts uni-prediction to bi-prediction when the DBBP flag is 1.

In one aspect of the present invention, the common disparity vector is the disparity vector refined by depth when the block size is larger than a predetermined size, and the disparity vector before refinement by depth when the block size is equal to or smaller than the predetermined size.

In one aspect of the present invention, the common disparity vector is the disparity vector refined by depth when the sum of the width and height of the prediction block is greater than 16, and the disparity vector before refinement by depth otherwise.

In one aspect of the present invention, the common disparity vector is the disparity vector refined by depth when the sum of the width and height of the prediction block is greater than 24, and the disparity vector before refinement by depth otherwise.
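The size-dependent choice of the common disparity vector in the preceding aspects can be sketched as below. The function and parameter names are hypothetical, and disparity vectors are represented as plain `(x, y)` tuples for illustration; only the thresholds 16 and 24 on the sum of width and height come from the text.

```python
def select_common_disparity_vector(width, height, refined_dv, unrefined_dv, threshold=16):
    """Pick the disparity vector shared by DBBP and view synthesis prediction.

    The depth-refined vector is used only when width + height exceeds the
    threshold (16 in one aspect, 24 in another); smaller blocks fall back
    to the vector before depth refinement.
    """
    if width + height > threshold:
        return refined_dv
    return unrefined_dv
```

With the threshold of 16, an 8×8 block (sum 16) uses the unrefined vector while a 16×8 block uses the refined one; with the threshold of 24, the 16×8 block also falls back to the unrefined vector.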

The present invention can be suitably applied to an image decoding apparatus that decodes encoded data in which image data is encoded, and to an image encoding apparatus that generates encoded data in which image data is encoded. It can also be suitably applied to the data structure of encoded data that is generated by the image encoding apparatus and referred to by the image decoding apparatus.

DESCRIPTION OF SYMBOLS
1 ... Image transmission system
11 ... Image encoding apparatus
101 ... Prediction image generation unit
102 ... Subtraction unit
103 ... DCT/quantization unit
10311 ... Residual prediction index encoding unit
10312 ... Illuminance compensation flag encoding unit
104 ... Entropy encoding unit
105 ... Inverse quantization/inverse DCT unit
106 ... Addition unit
108 ... Prediction parameter memory (frame memory)
109 ... Reference picture memory (frame memory)
110 ... Encoding parameter determination unit
111 ... Prediction parameter encoding unit
112 ... Inter prediction parameter encoding unit
1121 ... Merge mode parameter deriving unit
1122 ... AMVP prediction parameter deriving unit
1123 ... Subtraction unit
1126 ... Inter prediction parameter encoding control unit
113 ... Intra prediction parameter encoding unit
21 ... Network
31 ... Image decoding apparatus
301 ... Entropy decoding unit
302 ... Prediction parameter decoding unit
303 ... Inter prediction parameter decoding unit
3031, 3031A ... Inter prediction parameter decoding control unit
30311 ... Division mode decoding unit
30312, 30312A ... Inter prediction identifier decoding unit
30313 ... DBBP flag decoding unit
3032 ... AMVP prediction parameter deriving unit
3035 ... Addition unit
3036 ... Merge mode parameter deriving unit
30361 ... Merge candidate deriving unit
303611 ... Merge candidate storage unit
30362 ... Merge candidate selection unit
30363, 30363A ... Bi-prediction restriction unit
30370 ... Extended merge candidate deriving unit
30371 ... Inter-layer merge candidate deriving unit
30373 ... Displacement merge candidate deriving unit
30374 ... VSP merge candidate deriving unit (VSP prediction unit, view synthesis prediction means, partition dividing unit, depth vector deriving unit)
30380 ... Basic merge candidate deriving unit
30381 ... Spatial merge candidate deriving unit
30382 ... Temporal merge candidate deriving unit
30383 ... Combined merge candidate deriving unit
30384 ... Zero merge candidate deriving unit
304 ... Intra prediction parameter decoding unit
306 ... Reference picture memory (frame memory)
307 ... Prediction parameter memory (frame memory)
308 ... Prediction image generation unit
309 ... Inter prediction image generation unit
3091 ... Motion displacement compensation unit
3092 ... Residual prediction unit
30922 ... Reference image interpolation unit
30923 ... Residual synthesis unit
30924 ... Residual prediction vector deriving unit
3093 ... Illuminance compensation unit
3095, 3095A, 3095B, 3095C ... DBBP prediction unit (depth-based block prediction image generation apparatus)
30951 ... DBBP image interpolation unit (image interpolation unit, image interpolation means)
30952 ... Segmentation unit
30953 ... Image synthesis unit
30954, 30954A, 30954B, 30954C ... DBBP division mode deriving unit (depth division mode deriving means)
3096 ... Weighted prediction unit
310 ... Intra prediction image generation unit
311 ... Inverse quantization/inverse DCT unit
312 ... Addition unit
351 ... Depth DV deriving unit
352 ... Displacement vector deriving unit
353 ... Division mode deriving unit
354 ... Switch
41 ... Image display apparatus

Claims

An image decoding apparatus comprising depth-based block prediction image generation means and a merge mode parameter deriving unit, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images to generate one motion-compensated image; the image decoding apparatus further comprises a DBBP flag decoding unit; and the merge mode parameter deriving unit converts bi-prediction to uni-prediction when the DBBP flag is 1.
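The conversion from bi-prediction to uni-prediction recited in this claim could look like the sketch below. The `MergeCandidate` fields, the function name, and the decision to keep the L0 motion and discard L1 are assumptions made for illustration; the claim states only that bi-prediction is converted to uni-prediction when the DBBP flag is 1.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class MergeCandidate:
    """Hypothetical container for the prediction parameters of a merge candidate."""
    pred_flag_l0: bool
    pred_flag_l1: bool
    mv_l0: tuple = (0, 0)
    mv_l1: tuple = (0, 0)


def restrict_bi_prediction(candidate, dbbp_flag):
    """Return a uni-predicted copy of a bi-predicted candidate when DBBP is on."""
    is_bi_predicted = candidate.pred_flag_l0 and candidate.pred_flag_l1
    if dbbp_flag and is_bi_predicted:
        # Assumption: keep list L0 and drop list L1.
        return replace(candidate, pred_flag_l1=False, mv_l1=(0, 0))
    return candidate
```

Restricting each DBBP partition to one reference list halves the number of interpolations needed, which matches the processing-load concern stated in the abstract.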

An image decoding apparatus comprising depth-based block prediction image generation means and an inter prediction parameter decoding unit, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images to generate one motion-compensated image; the image decoding apparatus further comprises a DBBP flag decoding unit; and the inter prediction parameter decoding unit does not decode a value indicating bi-prediction as the inter prediction identifier when the DBBP flag is 1.

An image encoding apparatus comprising depth-based block prediction image generation means and a merge mode parameter deriving unit, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, and an image synthesis unit that combines the two interpolated images to generate one motion-compensated image; the image encoding apparatus further comprises a DBBP flag encoding unit that encodes a DBBP flag; and the merge mode parameter deriving unit converts uni-prediction to bi-prediction when the DBBP flag is 1.

A depth-based block prediction image generation apparatus comprising: a segmentation deriving unit that derives segmentation information from a depth image; an image interpolation unit that generates two motion-compensated images; an image synthesis unit that combines the two interpolated images to generate one motion-compensated image; and a depth division mode deriving unit that derives a division mode, wherein the depth division mode deriving unit derives a 2N×N or N×2N division mode.

A depth-based block prediction image generation apparatus comprising: a segmentation deriving unit that derives segmentation information from a depth image; an image interpolation unit that generates two motion-compensated images; and an image synthesis unit that combines the two interpolated images to generate one motion-compensated image, wherein the image interpolation unit generates the two motion-compensated images by bilinear prediction.
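As an illustration of bilinear prediction, the sketch below interpolates one sample at a fractional position from its four integer-position neighbours. The quarter-sample precision, the border clamping, the rounding, and all names are assumptions for illustration; the claim states only that the two motion-compensated images are generated by bilinear prediction.

```python
FRAC_BITS = 2            # assumption: quarter-sample motion vector precision
SCALE = 1 << FRAC_BITS   # 4 fractional positions per integer sample


def bilinear_sample(ref, x_q, y_q):
    """Interpolate ref at (x_q, y_q), given in quarter-sample units.

    ref is a 2-D list indexed as ref[y][x]; coordinates are assumed to be
    non-negative and to start inside the picture.
    """
    height, width = len(ref), len(ref[0])
    x0, frac_x = x_q >> FRAC_BITS, x_q & (SCALE - 1)
    y0, frac_y = y_q >> FRAC_BITS, y_q & (SCALE - 1)
    x1 = min(x0 + 1, width - 1)    # clamp the right neighbour at the border
    y1 = min(y0 + 1, height - 1)   # clamp the lower neighbour at the border

    # Two-tap horizontal blends on the upper and lower rows.
    top = (SCALE - frac_x) * ref[y0][x0] + frac_x * ref[y0][x1]
    bottom = (SCALE - frac_x) * ref[y1][x0] + frac_x * ref[y1][x1]

    # Two-tap vertical blend, then normalise with rounding.
    total = (SCALE - frac_y) * top + frac_y * bottom
    shift = 2 * FRAC_BITS
    return (total + (1 << (shift - 1))) >> shift
```

A two-tap filter per direction needs far fewer reference samples and multiplications than the longer separable filters normally used for motion compensation, which is the kind of reduction in interpolation cost the abstract targets.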

A depth-based block prediction image generation apparatus comprising: a segmentation deriving unit that derives segmentation information from a depth image; an image interpolation unit that generates two motion-compensated images; an image synthesis unit that combines the two interpolated images to generate one motion-compensated image; and a depth division mode deriving unit that derives a division mode, wherein the depth division mode deriving unit derives the division mode from the pixels at the four corners of a depth block.

The depth-based block prediction image generation apparatus according to claim 5 or 6, wherein the depth division mode deriving unit derives the division mode from a comparison between the upper-left pixel and the lower-right pixel of the depth block corresponding to the target block and a comparison between the upper-right pixel and the lower-left pixel of that depth block.
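One way the two diagonal comparisons could be combined into a division mode is sketched below. The specific rule (agreement of the two comparisons selects 2N×N, disagreement selects N×2N) and the function name are assumptions for illustration; the claim states only that the mode is derived from these two comparisons.

```python
def derive_division_mode_from_corners(depth):
    """Return '2NxN' or 'Nx2N' from the four corner pixels of a depth block.

    depth is a 2-D list indexed as depth[y][x].
    """
    height, width = len(depth), len(depth[0])
    upper_left = depth[0][0]
    upper_right = depth[0][width - 1]
    lower_left = depth[height - 1][0]
    lower_right = depth[height - 1][width - 1]

    # Assumed rule: if both diagonals say "upper is larger" (or both say it
    # is not), the depth changes mainly from top to bottom, so the block is
    # split into upper and lower halves; otherwise into left and right halves.
    if (upper_left > lower_right) == (upper_right > lower_left):
        return "2NxN"
    return "Nx2N"
```

Only four depth samples are read per block, regardless of the block size.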

The depth-based block prediction image generation apparatus according to any one of claims 4 to 7, wherein the segmentation deriving unit derives segmentation information that takes the value 0 or 1 for each pixel, and the image synthesis unit performs the synthesis by selecting, at each pixel of the block, one of the two interpolated images.
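Synthesis by per-pixel selection can be sketched as follows. The function name and the convention that a segmentation value of 0 selects the first image and 1 selects the second are assumptions for illustration.

```python
def synthesize_by_selection(pred0, pred1, segmentation):
    """Build one prediction block by picking, per pixel, pred0 or pred1.

    All three arguments are 2-D lists of equal size; segmentation holds 0 or 1.
    Assumption: 0 selects pred0 and 1 selects pred1.
    """
    return [
        [p1 if seg else p0 for p0, p1, seg in zip(row0, row1, seg_row)]
        for row0, row1, seg_row in zip(pred0, pred1, segmentation)
    ]
```

Pure selection involves no weighting or filtering across the segment boundary, so each output pixel costs a single comparison.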

An image decoding apparatus comprising the depth-based block prediction image generation apparatus according to any one of claims 4 to 8 and a DBBP flag decoding unit, wherein the depth-based block prediction image generation apparatus performs DBBP prediction when the DBBP flag is 1.

An image decoding apparatus comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images to generate one motion-compensated image, and a division mode deriving unit that derives a division mode; the view synthesis prediction means comprises a partition dividing unit that performs partition division from a depth image and a depth motion vector deriving unit that derives a motion vector from the depth image; and the division mode deriving unit and the partition dividing unit comprise a common division mode deriving unit.

The image decoding apparatus as described, wherein the depth division mode deriving unit derives the division mode from a comparison between the upper-left pixel and the lower-right pixel of the depth block corresponding to the target block and a comparison between the upper-right pixel and the lower-left pixel of that depth block.

The image decoding apparatus according to claim 9, wherein the division mode deriving unit derives, from the depth block corresponding to the target block, a horizontal absolute difference from two depth pixels whose vertical coordinates are equal and a vertical absolute difference from two depth pixels whose horizontal coordinates are equal, and derives the division mode by splitting the block into vertically long partitions when the horizontal absolute difference is larger than the vertical absolute difference and into horizontally long partitions otherwise; and the partition dividing unit divides the block into 4×8 sub-blocks when the horizontal absolute difference is larger than the vertical absolute difference and into horizontally long 8×4 sub-blocks otherwise.

An image encoding apparatus comprising the depth-based block prediction image generation apparatus according to any one of claims 3 to 7 and a DBBP flag encoding unit, wherein the depth-based block prediction image generation apparatus performs DBBP prediction when the DBBP flag is 1.

An image encoding apparatus comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images to generate one motion-compensated image, and a division mode deriving unit that derives a division mode; the view synthesis prediction means comprises a partition dividing unit that performs partition division from a depth image and a depth motion vector deriving unit that derives a motion vector from the depth image; and the division mode deriving unit and the partition dividing unit comprise a common division mode deriving unit.

An image decoding apparatus comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images to generate one motion-compensated image, and a division mode deriving unit that derives a division mode; the view synthesis prediction means comprises a partition dividing unit that performs partition division from a depth image and a depth motion vector deriving unit that derives a motion vector from the depth image; and the disparity vector used to derive the position of the depth image referred to by the segmentation deriving unit and the division mode deriving unit of the depth-based block prediction image generation means, and the disparity vector used to derive the position of the depth image in the partition dividing unit and the depth motion vector deriving unit of the view synthesis prediction means, are a common disparity vector.

The image decoding apparatus according to claim 15, wherein the common disparity vector is the disparity vector refined by depth when the block size is larger than a predetermined size, and the disparity vector before refinement by depth when the block size is equal to or smaller than the predetermined size.

The image decoding apparatus according to claim 16, wherein the common disparity vector is the disparity vector refined by depth when the sum of the width and height of the prediction block is greater than 16, and the disparity vector before refinement by depth otherwise.

The image decoding apparatus according to claim 17, wherein the common disparity vector is the disparity vector refined by depth when the sum of the width and height of the prediction block is greater than 24, and the disparity vector before refinement by depth otherwise.

An image encoding apparatus comprising depth-based block prediction image generation means and view synthesis prediction means, wherein the depth-based block prediction image generation means comprises a segmentation deriving unit that derives segmentation information from a depth image, an image interpolation unit that generates two motion-compensated images, an image synthesis unit that combines the two interpolated images to generate one motion-compensated image, and a division mode deriving unit that derives a division mode; the view synthesis prediction means comprises a partition dividing unit that performs partition division from a depth image and a depth motion vector deriving unit that derives a motion vector from the depth image; and the disparity vector used to derive the position of the depth image referred to by the segmentation deriving unit and the division mode deriving unit of the depth-based block prediction image generation means, and the disparity vector used to derive the position of the depth image in the partition dividing unit and the depth motion vector deriving unit of the view synthesis prediction means, are a common disparity vector.