WO2015143603A1 - An improved method for temporal motion vector prediction in video coding - Google Patents
An improved method for temporal motion vector prediction in video coding Download PDFInfo
- Publication number
- WO2015143603A1 WO2015143603A1 PCT/CN2014/073949 CN2014073949W WO2015143603A1 WO 2015143603 A1 WO2015143603 A1 WO 2015143603A1 CN 2014073949 W CN2014073949 W CN 2014073949W WO 2015143603 A1 WO2015143603 A1 WO 2015143603A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- collocated
- poc
- same
- tmvp
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
Definitions
- the invention relates generally to video processing.
- the presented invention relates to temporal motion vector prediction in video coding.
- TMVP temporal motion vector prediction
- MVP motion vector predictor
- the collocated picture is a temporal reference picture for the current picture.
- the collocated picture can be derived implicitly at decoder as specified in H.264/AVC, or it can be signaled explicitly from the encoder to the decoder as specified in H.265/HEVC.
- the collocated block and the current block are at the same relative position in the collocated picture and in the current picture respectively.
- the motion vector (MV) in the collocated block is named as collocated MV.
- TMVP is derived by scaling the collocated MV according to the time distances.
- Time distance (or named POC distance, POC difference) for a MV specifies the time distance between the picture the MV belonging to and the picture the MV pointing to.
- POC represents picture order count.
- the time distance for the collocated MV is available since the collocated MV has been determined when the collocated picture is encoded/ decoded.
- the time distance for the TMVP is determined by determining the reference picture of the TMVP.
- the reference for TMVP is signaled explicitly for a block coded with advanced motion vector prediction (AMVP) mode, while the reference for TMVP is determined implicitly for a block coded with merge mode.
- AMVP advanced motion vector prediction
- collocated block possesses only one MV, it should be treated as the collocated MV directly. If the collocated block possesses more than one MV, e.g., two MVs with one from reference list 0 and the other from reference list 1, the collocated MV should be selected as follows as specified in clause 8.5.3.2.8 in HEVC.
- the collocated MV should be chosen as the one in the reference list the same as the reference list in which the reference for TMVP is.
- collocated MV should be chosen as the one in the reference list with the same value as collocated from lO flag, which is signaled in slice header.
- tMVP (cMV * tD )l cD.
- HEVC High Efficiency Video Coding
- a motion compression is applied to reduce the storage required by MVs.
- a 16: 1 compression is applied after a picture is encoded /decoded.
- MVC Multi-view video coding
- 3DVC 3D video coding
- SVC scalable video coding
- an inter-view reference picture can be inserted into the reference list as a special reference picture for a picture on a dependent view.
- a low-quality reference picture can be inserted into the reference list as a special reference picture for a picture on a high-quality layer.
- an up-sampled low-resolution reference picture can be inserted into the reference list as a special reference picture for a picture on a high-resolution layer.
- Fig. 1 is a diagram illustrating TMVP in video coding
- Fig. 2 is a diagram illustrating to use the interview reference picture as the collocated picture in MVC or 3DVC;
- Fig. 3 is a diagram illustrating to use the corresponding texture picture as the collocated picture in MVC or 3DVC with depth;
- Fig. 4 is a diagram illustrating to use the inter-quality-layer picture as the collocated picture in SVC;
- Fig. 5 is a diagram illustrating to use the inter-spatial-layer picture as the collocated picture in SVC;
- Fig. 6 is a diagram illustrating a general paradigm for virtual collocated picture.
- the judgment 'If DiffPicOrderCnt( aPic, currPic ) is less than or equal to 0 for every picture aPic in every reference picture list of the current slice' is not executed if the POC of the current picture and that of the collocated picture are the same.
- the collocated MV should be chosen as the one in the reference list the same as the reference list in which the reference for TMVP is, if the collocated block possesses more than one MV, and the POC of the current picture and that of the collocated picture are the same. For example, if the POC of the current picture and that of the collocated picture are the same, the reference for TMVP is in the reference list 0, and the collocated block possesses two MVs from reference list 0 and reference list 1 respectively, then the MV from reference list 0 should be chosen as the collocated MV.
- the reference for TMVP is in the reference list 1, and the collocated block possesses two MVs from reference list 0 and reference list 1 respectively, then the MV from reference list 1 should be chosen as the collocated MV.
- the POC of the current picture and that of the collocated picture are the same in the following cases:
- the inter- view reference picture is treated as the collocated picture used in TMVP in MVC, such as H.264 MVC extension or MV-HEVC, or 3DVC such as 3D-HEVC.as depicted in Fig. 2.
- the corresponding texture picture is treated as the collocated picture used in TMVP to code a depth picture in MVC, such as H.264 MVC extension or MV-HEVC, or 3DVC such as 3D-HEVC with depth coding as depicted in Fig. 3.
- MVC such as H.264 MVC extension or MV-HEVC
- 3DVC such as 3D-HEVC with depth coding as depicted in Fig. 3.
- the inter-quality-layer reference picture is treated as the collocated picture used in TMVP to code a picture in SVC, such as H.264 SVC extension or HEVC SVC extension as depicted in Fig. 4.
- the upsampled or scaled inter-spatial-layer reference picture is treated as the collocated picture used in TMVP to code a picture in SVC, such as H.264 SVC extension or HEVC SVC extension as depicted in Fig. 5.
- the proposed collocated MV selection method does not change the CU level design of HEVC. Only slice level or picture level changes are required.
- an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein.
- An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein.
- DSP Digital Signal Processor
- the invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA).
- processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention.
- the software code or firmware codes may be developed in different programming languages and different format or style.
- the software code may also be compiled for different target platform.
- different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
A method is proposed to improve the derivation of temporal motion vector prediction. Some special methods are further proposed for MVC,SVC,and3DVC.
Description
AN IMPROVED METHOD FOR TEMPORAL MOTION VECTOR
PREDICTION IN VIDEO CODING
FIELD OF INVENTION
The invention relates generally to video processing. In particular, the presented invention relates to temporal motion vector prediction in video coding.
BACKGROUND OF THE INVENTION
In video coding, temporal motion vector prediction (TMVP) is utilized as a kind of motion prediction. In TMVP, the motion vector predictor (MVP) is derived from the motion information of a collocated block in a collocated picture as depicted in Fig. 1.
The collocated picture is a temporal reference picture for the current picture. The collocated picture can be derived implicitly at decoder as specified in H.264/AVC, or it can be signaled explicitly from the encoder to the decoder as specified in H.265/HEVC.
The collocated block and the current block are at the same relative position in the collocated picture and in the current picture respectively. The motion vector (MV) in the collocated block is named as collocated MV.
TMVP is derived by scaling the collocated MV according to the time distances. Time distance (or named POC distance, POC difference) for a MV specifies the time distance between the picture the MV belonging to and the picture the MV pointing to. Herein POC represents picture order count.
The time distance for the collocated MV is available since the collocated MV has been determined when the collocated picture is encoded/ decoded. The time distance for the TMVP is determined by determining the reference picture of the TMVP. In HEVC, the reference for TMVP is signaled explicitly for a block coded with advanced motion vector prediction (AMVP) mode, while the reference for TMVP is determined implicitly for a block coded with merge mode.
If the collocated block possesses only one MV, it should be treated as the collocated MV directly. If the collocated block possesses more than one MV, e.g., two MVs with one from reference list 0 and the other from reference list 1, the collocated MV should be selected as follows as specified in clause 8.5.3.2.8 in HEVC.
1. If DiffPicOrderCnt( aPic, currPic ) is less than or equal to 0 for every picture aPic in every reference picture list of the current slice, the collocated MV should be chosen as the one
in the reference list the same as the reference list in which the reference for TMVP is.
2. Otherwise, the collocated MV should be chosen as the one in the reference list with the same value as collocated from lO flag, which is signaled in slice header.
Suppose the time distance of the collocated MV, cMV, is cD and the time distance of the TMVP, tMVP, is tD, then tMVP should be calculated as tMVP = (cMV * tD )l cD. In practical implementation such as HEVC, some more sophisticated methods are used to guarantee the division can be done with limited bit-width. In the example of Fig. 1, the time distance of the collocated MV is 2 and the time distance of the TMVP is 1, thus tMVP should be equal to cMV/2.
A motion compression is applied to reduce the storage required by MVs. In H.265/HEVC, a 16: 1 compression is applied after a picture is encoded /decoded.
There are several extensions for video coding, such as Multi-view video coding (MVC), 3D video coding (3DVC) and scalable video coding (SVC). In MVC and 3DVC, an inter-view reference picture can be inserted into the reference list as a special reference picture for a picture on a dependent view. In SVC, a low-quality reference picture can be inserted into the reference list as a special reference picture for a picture on a high-quality layer. Still in SVC, an up-sampled low-resolution reference picture can be inserted into the reference list as a special reference picture for a picture on a high-resolution layer.
Since these special reference pictures are in the reference list, it is possible to treat the special reference picture as the collocated picture for the current picture when the special reference picture is available. In such a case, the POC of the current picture and that of the collocated picture will be the same. The collocated MV selection method specified in clause 8.5.3.2.8 in HEVC will be inefficient. SUMMARY OF THE INVENTION
In light of the previously described problems, methods are proposed to improve the MV selection method for TMVP in some cases.
Other aspects and features of the invention will become apparent to those with ordinary skill in the art upon review of the following descriptions of specific embodiments.
BRIEF DESCRIPTION OF THE DRAWINGS
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
Fig. 1 is a diagram illustrating TMVP in video coding;
Fig. 2 is a diagram illustrating to use the interview reference picture as the collocated picture in MVC or 3DVC;
Fig. 3 is a diagram illustrating to use the corresponding texture picture as the collocated picture in MVC or 3DVC with depth;
Fig. 4 is a diagram illustrating to use the inter-quality-layer picture as the collocated picture in SVC;
Fig. 5 is a diagram illustrating to use the inter-spatial-layer picture as the collocated picture in SVC;
Fig. 6 is a diagram illustrating a general paradigm for virtual collocated picture.
DETAILED DESCRIPTION
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
It is proposed to select the collocated MV from the reference list 0 or reference list 1 in a different way, if the collocated block possesses more than one MV, and the POC of the current picture and that of the collocated picture are the same.
In one embodiment, the judgment 'If DiffPicOrderCnt( aPic, currPic ) is less than or equal to 0 for every picture aPic in every reference picture list of the current slice' is not executed if the POC of the current picture and that of the collocated picture are the same.
In another embodiment, the collocated MV should be chosen as the one in the reference list the same as the reference list in which the reference for TMVP is, if the collocated block possesses more than one MV, and the POC of the current picture and that of the collocated picture are the same. For example, if the POC of the current picture and that of the collocated picture are the same, the reference for TMVP is in the reference list 0, and the collocated block possesses two MVs from reference list 0 and reference list 1 respectively, then the MV from reference list 0 should be chosen as the collocated MV. In another example, if the POC of the current picture and that of the collocated picture are the same, the reference for TMVP is in the reference list 1, and the collocated block possesses two MVs from reference list 0 and reference list 1 respectively, then the MV from reference list 1 should be chosen as the collocated MV.
In still another embodiment, the POC of the current picture and that of the collocated picture are the same in the following cases:
1. The inter- view reference picture is treated as the collocated picture used in TMVP in
MVC, such as H.264 MVC extension or MV-HEVC, or 3DVC such as 3D-HEVC.as depicted in Fig. 2.
2. The corresponding texture picture is treated as the collocated picture used in TMVP to code a depth picture in MVC, such as H.264 MVC extension or MV-HEVC, or 3DVC such as 3D-HEVC with depth coding as depicted in Fig. 3.
3. The inter-quality-layer reference picture is treated as the collocated picture used in TMVP to code a picture in SVC, such as H.264 SVC extension or HEVC SVC extension as depicted in Fig. 4.
4. The upsampled or scaled inter-spatial-layer reference picture is treated as the collocated picture used in TMVP to code a picture in SVC, such as H.264 SVC extension or HEVC SVC extension as depicted in Fig. 5.
5. A virtual or a created which holds the same POC as the current picture is treated as the collocated picture used in TMVP as depicted in Fig. 6.
In still another embodiment, the proposed collocated MV selection method does not change the CU level design of HEVC. Only slice level or picture level changes are required.
The methods described above can be used in a video encoder as well as in a video decoder. Embodiments of disparity vector derivation methods according to the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware codes may be developed in different programming languages and different format or style. The software code may also be compiled for different target platform. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. To the contrary, it is intended to cover various modifications and
similar arrangements (as would be apparent to those skilled in the art). Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.
Claims
1. A method of obtaining temporal motion vector prediction (TMVP), comprising:
selecting a collocated motion vector (MV) according to properties of a current picture and a collocated reference picture; and
applying encoding or decoding based on the selected collocated MV.
2. The method as claimed in claim 1, wherein the collocated MV from reference list 0 or reference list 1 is selected in different ways depending on whether a picture order count (POC) of the current picture and that of the collocated picture are the same or not.
3. The method as claimed in claim 1, wherein a judgment 'If DiffPicOrderCnt( aPic, currPic ) is less than or equal to 0 for every picture aPic in every reference picture list of a current slice' is not executed if a POC of the current picture and that of the collocated picture are the same.
4. The method as claimed in claim 1, wherein the collocated MV is chosen as the one in a same reference list as the one in which the reference for TMVP is, if the collocated block possesses more than one MV, and a POC of the current picture and that of the collocated picture are the same.
5. The method as claimed in claim 2, wherein the POC of the current picture and that of the collocated picture are the same if an inter- view reference picture is treated as the collocated picture used in TMVP in multi-view video coding (MVC) or three-dimensional video coding (3DVC).
6. The method as claimed in claim 2, wherein the POC of the current picture and that of the collocated picture are the same if a corresponding texture picture is treated as the collocated picture used in TMVP to code a depth picture in MVC or 3DVC with depth coding.
7. The method as claimed in claim 2, wherein the POC of the current picture and that of the collocated picture are the same if an inter-quality-layer reference picture is treated as the collocated picture used in TMVP to code a picture in scalable video coding (SVC).
8. The method as claimed in claim 2, wherein the POC of the current picture and that of the collocated picture are the same if an upsampled or scaled inter-spatial-layer reference picture is treated as the collocated picture used in TMVP to code a picture in SVC.
9. The method as claimed in claim 2, wherein the POC of the current picture and that of the collocated picture are the same if a virtual or a created which holds the same POC as the current picture is treated as the collocated picture used in TMVP.
10. The method as claimed in claim 2, wherein the method does not change the coding unit (CU) level design of HEVC, and only slice level or picture level changes are required.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2014/073949 WO2015143603A1 (en) | 2014-03-24 | 2014-03-24 | An improved method for temporal motion vector prediction in video coding |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2014/073949 WO2015143603A1 (en) | 2014-03-24 | 2014-03-24 | An improved method for temporal motion vector prediction in video coding |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2015143603A1 true WO2015143603A1 (en) | 2015-10-01 |
Family
ID=54193847
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2014/073949 WO2015143603A1 (en) | 2014-03-24 | 2014-03-24 | An improved method for temporal motion vector prediction in video coding |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2015143603A1 (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019178721A1 (en) * | 2018-03-19 | 2019-09-26 | Qualcomm Incorporated | Improvements to advanced temporal motion vector prediction |
| WO2020140242A1 (en) * | 2019-01-03 | 2020-07-09 | 北京大学 | Video processing method and apparatus |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2013168407A1 (en) * | 2012-05-09 | 2013-11-14 | Panasonic Corporation | Method of performing motion vector prediction, encoding and decoding methods, and apparatuses thereof |
| WO2014005467A1 (en) * | 2012-07-03 | 2014-01-09 | Mediatek Singapore Pte. Ltd. | Method and apparatus of inter-view motion vector prediction and disparity vector prediction in 3d video coding |
| WO2014011652A2 (en) * | 2012-07-09 | 2014-01-16 | Qualcomm Incorporated | Temporal motion vector prediction in video coding extensions |
| US20140064374A1 (en) * | 2012-08-29 | 2014-03-06 | Vid Scale, Inc. | Method and apparatus of motion vector prediction for scalable video coding |
-
2014
- 2014-03-24 WO PCT/CN2014/073949 patent/WO2015143603A1/en active Application Filing
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2013168407A1 (en) * | 2012-05-09 | 2013-11-14 | Panasonic Corporation | Method of performing motion vector prediction, encoding and decoding methods, and apparatuses thereof |
| WO2014005467A1 (en) * | 2012-07-03 | 2014-01-09 | Mediatek Singapore Pte. Ltd. | Method and apparatus of inter-view motion vector prediction and disparity vector prediction in 3d video coding |
| WO2014011652A2 (en) * | 2012-07-09 | 2014-01-16 | Qualcomm Incorporated | Temporal motion vector prediction in video coding extensions |
| US20140064374A1 (en) * | 2012-08-29 | 2014-03-06 | Vid Scale, Inc. | Method and apparatus of motion vector prediction for scalable video coding |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019178721A1 (en) * | 2018-03-19 | 2019-09-26 | Qualcomm Incorporated | Improvements to advanced temporal motion vector prediction |
| CN111869214A (en) * | 2018-03-19 | 2020-10-30 | 高通股份有限公司 | Improvements to Advanced Temporal Motion Vector Prediction |
| US12081780B2 (en) | 2018-03-19 | 2024-09-03 | Qualcomm Incorporated | Advanced temporal motion vector prediction |
| WO2020140242A1 (en) * | 2019-01-03 | 2020-07-09 | 北京大学 | Video processing method and apparatus |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2015109598A1 (en) | Methods for motion parameter hole filling | |
| EP3130147B1 (en) | Methods of block vector prediction and decoding for intra block copy mode coding | |
| US10306225B2 (en) | Method of texture merging candidate derivation in 3D video coding | |
| WO2016165617A1 (en) | Method and apparatus for deriving temporal motion vector prediction | |
| WO2016008161A1 (en) | Temporal derived bi-directional motion vector predictor | |
| WO2015100710A1 (en) | Existence of inter-view reference picture and availability of 3dvc coding tools | |
| US9992494B2 (en) | Method of depth based block partitioning | |
| WO2015003383A1 (en) | Methods for inter-view motion prediction | |
| JP2015514342A5 (en) | ||
| EP3123718A1 (en) | Method of 3d or multi-view video coding including view synthesis prediction | |
| WO2016008157A1 (en) | Methods for motion compensation using high order motion model | |
| EP3025498A1 (en) | Method of deriving default disparity vector in 3d and multiview video coding | |
| WO2014106388A1 (en) | Method and device for determining temporal motion vector predictor | |
| EP2965520A1 (en) | Method of reference view selection for 3d video coding | |
| WO2014166109A1 (en) | Methods for disparity vector derivation | |
| WO2015006922A1 (en) | Methods for residual prediction | |
| WO2015143603A1 (en) | An improved method for temporal motion vector prediction in video coding | |
| US9883205B2 (en) | Method of inter-view residual prediction with reduced complexity in three-dimensional video coding | |
| WO2014106327A1 (en) | Method and apparatus for inter-view residual prediction in multiview video coding | |
| WO2013159326A1 (en) | Inter-view motion prediction in 3d video coding | |
| WO2015139205A1 (en) | An improved method for collocated picture in video coding | |
| US9843821B2 (en) | Method of inter-view advanced residual prediction in 3D video coding | |
| WO2015188332A1 (en) | A default vector selection method for block copy vector prediction in video compression | |
| WO2015135137A1 (en) | A method of motion information sharing in multi-view and 3d video coding | |
| WO2015103747A1 (en) | Motion parameter hole filling |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14887610 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 14887610 Country of ref document: EP Kind code of ref document: A1 |