US12200318B2 · Active · Utility · PatentIndex 62

Method and apparatus for encapsulating annotated region in ISOBMFF tracks

Assignee: CANON KK
Priority: Jun 24, 2020
Filed: Jun 3, 2021
Granted: Jan 14, 2025
Est. expiry: Jun 24, 2040 (~14 yrs left) · nominal 20-yr term from priority
Inventors: DENOUAL FRANCK, MAZE FRÉDÉRIC, RUELLAN HERVÉ
Classifications: H04N 21/84, G06T 7/60, G06V 2201/07, G06V 10/25, G06T 7/50, H04N 21/8456, H04N 21/2353, G11B 27/322, G06F 16/783, H04N 21/8153, H04N 21/85406, H04N 21/235, H04N 21/845, H04N 21/854
PatentIndex Score: 62 · Cited by: 0 · References: 25 · Claims: 13

Abstract

A method of generating a media file, the method comprising: generating a first track of media data samples; generating at least one annotated region, the annotated region being associated with a geometry of the annotated region, the annotated region being associated with at least one region annotation describing the annotated region; wherein the annotated region is associated with the first track of media data samples; and generating a media file including the first track and the at least one annotated region, associated geometry and associated region annotation.
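
As an illustration only, and not part of the patent text, the generating method summarized above can be sketched as a small in-memory model. Every name here (`RegionGeometry`, `Track`, `MediaFile`, `build_annotated_file`) is hypothetical, the rectangular geometry is one assumed shape among others, and the `cdsc` reference type and the `vide`/`meta` handler codes are assumptions about how an implementation might link a metadata track to the media track it describes. No real ISOBMFF boxes are written.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class RegionGeometry:
    # Hypothetical geometry: an anchor point plus a rectangular shape, in pixels.
    x: int
    y: int
    width: int
    height: int


@dataclass
class Track:
    track_id: int
    handler: str                                   # assumed codes: "vide" or "meta"
    samples: list = field(default_factory=list)
    # Track references: reference type -> IDs of the referenced tracks.
    references: Dict[str, List[int]] = field(default_factory=dict)


@dataclass
class MediaFile:
    tracks: List[Track] = field(default_factory=list)
    annotations: List[str] = field(default_factory=list)   # shared annotation table


def build_annotated_file(frames, regions):
    """Build the model from frames and, per frame, a list of (geometry, text) pairs."""
    media = Track(track_id=1, handler="vide", samples=list(frames))
    # The metadata track points at the media track through a track reference.
    meta = Track(track_id=2, handler="meta", references={"cdsc": [media.track_id]})
    out = MediaFile(tracks=[media, meta])
    for frame_regions in regions:
        sample = []
        for geometry, text in frame_regions:
            if text not in out.annotations:
                out.annotations.append(text)
            # Each geometry carries the index of the annotation describing it.
            sample.append((geometry, out.annotations.index(text)))
        meta.samples.append(sample)
    return out


# Usage: two frames, with one region in the first and two in the second.
example = build_annotated_file(
    [b"frame0", b"frame1"],
    [
        [(RegionGeometry(10, 20, 64, 48), "face")],
        [(RegionGeometry(12, 22, 64, 48), "face"), (RegionGeometry(0, 0, 8, 8), "logo")],
    ],
)
```

Storing each annotation once and referring to it by index is a design choice of this sketch; it lets several regions share a single annotation.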

Claims

exact text as granted — not AI-modified
The invention claimed is:

1. A method of generating a media file, the method comprising:
  generating a first track of media data samples;
  generating a second track of timed metadata, the second track being associated with the first track using a track reference and comprising at least one region geometry stored in a sample,
  wherein the at least one region geometry describes a geometry of a region in a media data sample of the first track,
  wherein the at least one region geometry is associated with at least one region annotation describing the region in the media data sample of the first track; and
  generating a media file including the first track, the second track and associated region annotation.
     
2. The method of claim 1, wherein:
  the region geometry comprises the location of an anchor point and the shape of the region.

3. The method of claim 1, wherein:
  the region annotations are stored as properties of a group of samples of the second track.

4. The method of claim 1, wherein the method further comprises:
  generating a third track of samples, each sample being a region annotation associated with a region geometry of the second track, the third track being associated with the second track using a track reference.

5. The method of claim 1, wherein:
  region annotations are stored as properties in a property container in a meta data part of the media file.

6. The method of claim 5, wherein:
  the at least one region geometry is associated with the region annotations through group of samples in the second track.

7. The method of claim 5, wherein:
  the association between region annotations and the at least one region geometry is stored in a sample group description entry indicating the index of the region annotation in the property container, the sample group description entry being associated with a group of samples in the second track.

8. The method according to claim 1, wherein region annotations comprise textual annotations.

9. The method according to claim 1, wherein region annotations comprise alternative representations of the region.

10. A non-transitory computer-readable storage medium storing instructions of a computer program for implementing a method according to claim 1.
     
11. A device for generating a media file, the device comprising a processor configured for:
  generating a first track of media data samples;
  generating a second track of timed metadata, the second track being associated with the first track using a track reference and comprising at least one region geometry stored in a sample,
  wherein the at least one region geometry describes a geometry of a region in a media data sample of the first track,
  wherein the at least one region geometry is associated with at least one region annotation describing the region in the media data sample of the first track; and
  generating a media file including the first track, the second track and associated region annotation.
     
12. A method of processing a media file, the method comprising:
  obtaining a media file including:
    a first track of media data samples; and
    a second track of timed metadata, the second track being associated with the first track using a track reference and comprising at least one region geometry stored in a sample,
  wherein the at least one region geometry describes a geometry of a region in a media data sample of the first track,
  wherein the at least one region geometry is associated with at least one region annotation describing the region in the media data sample of the first track;
  obtaining media data samples from the media file;
  obtaining a region geometry associated with media data samples, associated geometry and associated region annotations;
  processing the obtained media data and information for outputting a video based on the obtained media data and information.
     
13. A device for processing a media file, the device comprising a processor configured for:
  obtaining a media file including:
    a first track of media data samples; and
    a second track of timed metadata, the second track being associated with the first track using a track reference and comprising at least one region geometry stored in a sample,
  wherein the at least one region geometry describes a geometry of a region in a media data sample of the first track,
  wherein the at least one region geometry is associated with at least one region annotation describing the region in the media data sample of the first track;
  obtaining media data samples from the media file;
  obtaining a region geometry associated with media data samples, associated geometry and associated region annotations;
  processing the obtained media data and information for outputting a video based on the obtained media data and information.
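
Illustrative note, not part of the claims as granted: the processing side described in claims 12 and 13 might be sketched as below. The dictionary layout, the function name `regions_for_sample`, the `cdsc` reference type, and the one-to-one alignment of metadata samples with media samples by index are all assumptions made for this sketch. A real reader would parse ISOBMFF boxes and align samples by decoding time.

```python
def regions_for_sample(media_file, media_track_id, sample_index):
    """Return (geometry, annotation) pairs describing one media sample.

    Assumed layout: media_file is a dict with a "tracks" list and an
    "annotations" list; each metadata sample is a list of
    (geometry, annotation_index) pairs.
    """
    results = []
    for track in media_file["tracks"]:
        # Keep only tracks whose track reference points at the media track.
        referenced = track.get("references", {}).get("cdsc", [])
        if media_track_id not in referenced:
            continue
        # Assumption: metadata sample i describes media sample i.
        if sample_index >= len(track["samples"]):
            continue
        for geometry, annotation_index in track["samples"][sample_index]:
            results.append((geometry, media_file["annotations"][annotation_index]))
    return results


# Usage: one video track and one metadata track that references it.
media_file = {
    "tracks": [
        {"id": 1, "samples": [b"frame0", b"frame1"]},
        {
            "id": 2,
            "references": {"cdsc": [1]},
            # Geometry here is an assumed (x, y, width, height) tuple.
            "samples": [[((10, 20, 64, 48), 0)], []],
        },
    ],
    "annotations": ["face"],
}
overlay = regions_for_sample(media_file, 1, 0)
```

A renderer could then draw each returned geometry over the decoded frame and label it with the annotation before outputting the video.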

Cited by (0)

No later patents cite this yet.

References (0)

No backward citations on record.