P
US11265540B2ActiveUtilityPatentIndex 84

Apparatus and method for applying artificial neural network to image encoding or decoding

Assignee: SK TELECOM CO LTDPriority: Feb 23, 2018Filed: Oct 6, 2020Granted: Mar 1, 2022
Est. expiryFeb 23, 2038(~11.6 yrs left)· nominal 20-yr term from priority
Inventors:NA TAE YOUNGLEE SUN-YOUNGSHIN JAE-SEOBSON SE-HOONKIM HYO SONGLIM JEONG-YEON
G06N 3/045G06N 3/0464G06N 3/09H04N 19/117H04N 19/86H04N 19/176G06N 5/04H04N 19/157H04N 19/124G06N 3/084H04N 19/82G06N 3/04G06N 3/08
84
PatentIndex Score
5
Cited by
11
References
15
Claims

Abstract

The present disclosure relates to video encoding or decoding and, more specifically, to an apparatus and a method for applying an artificial neural network (ANN) to video encoding or decoding. The apparatus and the method of the present disclosure are characterized by applying a CNN-based filter to a first picture and at least one of a quantization parameter map and a block partition map to output a second picture.

Claims

exact text as granted — not AI-modified
What is claimed is: 
     
       1. A video decoding method using a convolutional neural network (CNN)-based filter, the method comprising:
 obtaining an input data, the input data including pixel data of a first reconstructed picture region which is partitioned into a plurality of coding units, and a quantization parameter map and a block partition map which are associated with the first reconstructed picture region, the first reconstructed picture region having been reconstructed from a bitstream of a video data, wherein the quantization parameter map is a two dimensional array for representing quantization parameters for the respective coding units constituting the first reconstructed picture region, and the block partition map is a two dimensional array for representing boundaries between the coding units in the first reconstructed picture region; and 
 providing the neural network based filter with the input data to obtain a second picture region that the first reconstructed picture region is filtered with the neural network based filter, wherein the neural network based filter has filter coefficients which have been trained with training data including pixel data of sample picture regions, and quantization parameter maps and block partition maps associated with the sample picture regions. 
 
     
     
       2. The method of  claim 1 , wherein the quantization parameter map is constructed at the same resolution as the first reconstructed picture region, and is filled with quantization parameters for the coding units constituting the first reconstructed picture region. 
     
     
       3. The method of  claim 1 ,
 wherein the input data includes a block mode map which indicates an encoding mode for each of the coding units constituting the first reconstructed picture region. 
 
     
     
       4. The method of  claim 1 , wherein the block partition map represents pixels indicating boundary of the coding block and pixels indicating an inner region of the coding block as different values. 
     
     
       5. The method of  claim 4 , wherein, in the block partition map, a number of pixels indicating the boundary of the coding block is depending on at least one of a size of the coding block, a value of a quantization parameter, an encoding mode, a number of pixels to be updated, and a number of pixels to be referred to for filtering. 
     
     
       6. The method of  claim 4 , wherein, in the block partition map, the pixels indicating the boundary of the coding block have different values depending on at least one of a size of the coding block, a value of a quantization parameter, a coding mode, a number of pixels to be updated, and a number of pixels to be referred to for filtering. 
     
     
       7. The method of  claim 1 , wherein the filter coefficients of the neural network based filter are received from a video encoding apparatus. 
     
     
       8. A video decoding apparatus using a neural network based filter, the apparatus comprising:
 an input unit configured to receive an input data, the input data including pixel data of a first reconstructed picture region, and a quantization parameter map and a block partition map which are associated with the first reconstructed picture region, the first reconstructed picture region having been reconstructed from a bitstream of a video data, wherein the quantization parameter map is a two dimensional array for representing quantization parameters for the respective coding units constituting the first reconstructed picture region, and the block partition map is a two dimensional array for representing boundaries between the coding units in the first reconstructed picture region; 
 a filter unit configured to apply the neural network based filter to the input data; and 
 an output unit configured to output a second picture region obtained from an output of the neural network based filter, the second picture region being a filtered picture region of the first reconstructed picture region, wherein the neural network based filter has filter coefficients which have been trained with training data including pixel data of sample picture regions, and quantization parameter maps and block partition maps associated with the sample picture regions. 
 
     
     
       9. The apparatus of  claim 8 , wherein the quantization parameter map is constructed at the same resolution as the first reconstructed picture, and is filled with quantization parameters for the coding units constituting the first reconstructed picture region. 
     
     
       10. The apparatus of  claim 8 , wherein the input data includes a block mode map which indicates an encoding mode for each of the coding units constituting the first reconstructed picture region. 
     
     
       11. The apparatus of  claim 10 , wherein the training data includes block mode maps associated with the sample picture regions. 
     
     
       12. The apparatus of  claim 8 , wherein the block partition map represents pixels indicating a boundary of a coding block and pixels indicating an inner region of the coding block with different values. 
     
     
       13. The apparatus of  claim 12 , wherein, in the block partition map, a number of pixels indicating the boundary of the coding block is depending on at least one of a size of the coding block, a value of a quantization parameter, an encoding mode, a number of pixels to be updated, and a number of pixels to be referred to for filtering. 
     
     
       14. The apparatus of  claim 12 , wherein, in the block partition map, the pixels indicating the boundary of the coding block have different values depending on at least one of a size of the coding block, a value of a quantization parameter, a coding mode, a number of pixels to be updated, and a number of pixels to be referred to for filtering. 
     
     
       15. The apparatus of  claim 8 , wherein the filter coefficients of the neural network based filter are received from a video encoding apparatus.

Cited by (0)

No later patents cite this yet.

References (0)

No backward citations on record.