US7840410B2ExpiredUtilityPatentIndex 62
Audio coding based on block grouping

Assignee: DOLBY LAB LICENSING CORPPriority: Jan 20, 2004Filed: Jan 19, 2005Granted: Nov 23, 2010
Est. expiryJan 20, 2024(expired)· nominal 20-yr term from priority
Inventors:FELLERS MATTHEW CONRAD VINTON MARK STUART BAUER CLAUS DAVIDSON GRANT ALLEN
G10L 19/032G10L 25/60G10L 19/02
PatentIndex Score
Cited by
References
Claims
Abstract

Blocks of audio information are arranged in groups that share encoding control parameters to reduce the amount of side information needed to convey the control parameters in an encoded signal. The configuration of groups that reduces the distortion of the encoded audio information may be determined by any of several techniques that search for an optimal or near optimal solution. The techniques include an exhaustive search, a fast optimal search and a greed merge, which allow the search technique to tradeoff the reduction in distortion against the bit rate of the encoded signal and/or the computational complexity of the search technique.
Claims

exact text as granted — not AI-modified
1. A method for processing blocks of audio information arranged in frames, each block having content representing a respective time interval of audio information, wherein the method comprises:
 (a) receiving an input signal conveying the blocks of audio information; 
 (b) obtaining two or more measures of quality such that:
 (1) each set in a plurality of sets of groups of the blocks in a respective frame has an associated measure of quality, 
 (2) each group has one or more blocks, 
 (3) each set of groups includes all blocks in the respective frame and no block is included in more than one group in each set, and 
 (4) the measure of quality represents excellence in results obtainable by processing each block in a respective group according to one or more control parameters associated with the respective group; 
 
 (c) analyzing the measures of quality to identify a selected set of groups having a minimum number of groups such that a measure of processing performance obtained at least in part from the associated measure of quality is higher than a threshold; and 
 (d) processing each group of blocks in the selected set of groups according to the associated one or more control parameters to generate an output signal representing contents of the input signal and representing the associated control parameters for each group in the selected set. 
 
     
     
       2. The method of  claim 1  wherein the blocks comprise time-domain samples of audio information. 
     
     
       3. The method of  claim 1  wherein the blocks comprise frequency-domain coefficients of audio information. 
     
     
       4. The method of  claim 1  wherein at least one pair of blocks in the groups having more than one block have content representing audio information in time intervals that are adjacent to one another or overlap one another. 
     
     
       5. The method of  claim 1  that comprises:
 obtaining two or more measures of cost, each measure of cost affiliated with a set of groups of blocks, wherein the measure of cost represents an amount of resources needed to process the blocks in the affiliated set according to the associated of control parameters; 
 
       wherein the measure of processing performance is obtained in part from the measure of cost affiliated with the selected set. 
     
     
       6. The method of  claim 5  wherein the measures of cost are responsive to amounts of data needed to represent the control parameters in the encoded signal. 
     
     
       7. The method of  claim 5  wherein the measures of cost are responsive to amounts of computational resources needed to process the blocks of audio information. 
     
     
       8. The method of  claim 1  wherein the analyzing is performed in one or more iterations of an iterative process to determine one or more sets of groups that are not candidates for the selected set and excludes analyzing these one or more sets in subsequent iterations of the process. 
     
     
       9. The method of  claim 1  wherein the selected set is identified by an iterative process that comprises:
 determining a second measure of processing performance for pairs of groups in an initial set of groups; 
 merging the pair of groups having a highest second measure of processing performance to form a revised set of groups provided that the highest second measure of processing performance is greater than a threshold, and determining the second measure of processing performance for pairs of groups in the revised set of groups; and 
 continuing the merging until no pair of groups in the revised set of groups has a second measure of processing performance that is greater than the threshold, wherein the revised set of groups is the selected set. 
 
     
     
       10. The method of  claim 1  wherein a respective frame has a number of blocks equal to N and the analyzing of the measures of quality comprises:
 iterating a value p from 1 to N, where p is the number of groups of blocks in a frame; 
 identifying for each value of p at least some of the sets of groups that have the measure of processing performance that is higher than the threshold; and 
 analyzing at least some of the identified sets of groups to determine the selected set of groups that maximizes the measure of processing performance among the sets of groups that are analyzed. 
 
     
     
       11. The method of  claim 1  wherein each block in the respective frame comprises spectral coefficients and the measure of processing performance for a particular set of groups represents a measure of error energy between the spectral coefficients in the respective frame for the particular set of groups and the spectral coefficients in the respective frame with each block in its own group. 
     
     
       12. The method of  claim 1  wherein the measure of processing performance is responsive to a total number of bits available to represent a respective frame of blocks. 
     
     
       13. An apparatus for processing blocks of audio information arranged in frames, each block having content representing a respective time interval of audio information, wherein the method comprises:
 means for receiving an input signal conveying the blocks of audio information; 
 means for obtaining two or more measures of quality such that:
 (1) each set in a plurality of sets of groups of the blocks in a respective frame has an associated measure of quality, 
 (2) each group has one or more blocks, 
 (3) each set of groups includes all blocks in the respective frame and no block is included in more than one group in each set, and 
 (4) the measure of quality represents excellence in results obtainable by processing each block in a respective group according to one or more control parameters associated with the respective group; 
 
 means for analyzing the measures of quality to identify a selected set of groups having a minimum number of groups such that a measure of processing performance obtained at least in part from the associated measure of quality is higher than a threshold; and 
 means for processing each group of blocks in the selected set of groups according to the associated one or more control parameters to generate an output signal representing contents of the input signal and representing the associated control parameters for each group in the selected set. 
 
     
     
       14. The apparatus of  claim 13  wherein the blocks comprise time-domain samples of audio information. 
     
     
       15. The apparatus of  claim 13  wherein the blocks comprise frequency-domain coefficients of audio information. 
     
     
       16. The apparatus of  claim 13  wherein at least one pair of blocks in the groups having more than one block have content representing audio information in time intervals that are adjacent to one another or overlap one another. 
     
     
       17. The apparatus of  claim 13  that comprises:
 means for obtaining two or more measures of cost, each measure of cost affiliated with a set of groups of blocks, wherein the measure of cost represents an amount of resources needed to process the blocks in the affiliated set according to the associated control parameters; 
 
       wherein the measure of processing performance is obtained in part from the measure of cost affiliated with the selected set. 
     
     
       18. The apparatus of  claim 17  wherein the measures of cost are responsive to amounts of data needed to represent the control parameters in the encoded signal. 
     
     
       19. The apparatus of  claim 17  wherein the measures of cost are responsive to amounts of computational resources needed to process the blocks of audio information. 
     
     
       20. The apparatus of  claim 13  wherein the means for analyzing iteratively analyzes to determine one or more sets of groups that are not candidates for the selected set and excludes analyzing these one or more sets in subsequent iterations. 
     
     
       21. The apparatus of  claim 13  wherein the means for analyzing performs its analysis by:
 determining a second measure of processing performance for pairs of groups in an initial set of groups; 
 merging the pair of groups having a highest second measure of processing performance to form a revised set of groups provided that the highest second measure of processing performance is greater than a threshold, and determining the second measure of processing performance for pairs of groups in the revised set of groups; and 
 continuing the merging until no pair of groups in the revised set of groups has a second measure of processing performance that is greater than the threshold, wherein the revised set of groups is the selected set. 
 
     
     
       22. The apparatus of  claim 13  wherein a respective frame has a number of blocks equal to N and the analyzing of the measures of quality comprises:
 iterating a value p from 1 to N, where p is the number of groups of blocks in a frame; 
 identifying for each value of p at least some of the sets of groups that have the measure of processing performance that is higher than the threshold; and 
 analyzing at least some of the identified sets of groups to determine the selected set of groups that maximizes the measure of processing performance among the sets of groups that are analyzed. 
 
     
     
       23. The apparatus of  claim 13  wherein each block in the respective frame comprises spectral coefficients and the measure of processing performance for a particular set of groups represents a measure of error energy between the spectral coefficients in the respective frame for the particular set of groups and the spectral coefficients in the respective frame with each block in its own group. 
     
     
       24. The apparatus of  claim 13  wherein the measure of processing performance is responsive to a total number of bits available to represent a respective frame of blocks. 
     
     
       25. A computer-readable storage medium recording a program of instructions that is executable by a device to perform a method for processing blocks of audio information arranged in frames, each block having content representing a respective time interval of audio information, wherein the method comprises:
 (a) receiving an input signal conveying the blocks of audio information; 
 (b) obtaining two or more measures of quality such that:
 (1) each set in a plurality of sets of groups of the blocks in a respective frame has an associated measure of quality, 
 (2) each group has one or more blocks, 
 (3) each set of groups includes all blocks in the respective frame and no block is included in more than one group in each set, and 
 (4) the measure of quality represents excellence in results obtainable by processing each block in a respective group according to one or more control parameters associated with the respective group; 
 
 (c) analyzing the measures of quality to identify a selected set of groups having a minimum number of groups such that a measure of processing performance obtained at least in part from the associated measure of quality is higher than a threshold; and 
 (d) processing each group of blocks in the selected set of groups according to the associated one or more control parameters to generate an output signal representing contents of the input signal and representing the associated control parameters for each group in the selected set. 
 
     
     
       26. The medium of  claim 25  wherein the blocks comprise time-domain samples of audio information. 
     
     
       27. The medium of  claim 25  wherein the blocks comprise frequency-domain coefficients of audio information. 
     
     
       28. The medium of  claim 25  wherein at least one pair of blocks in the groups having more than one block have content representing audio information in time intervals that are adjacent to one another or overlap one another. 
     
     
       29. The medium of  claim 25  wherein the method comprises:
 obtaining two or more measures of cost, each measure of cost affiliated with a set of groups of blocks, wherein the measure of cost represents an amount of resources needed to process the blocks in the affiliated set according to the associated control parameters; 
 
       wherein the measure of processing performance is obtained in part from the measure of cost affiliated with the selected set. 
     
     
       30. The medium of  claim 29  wherein the measures of cost are responsive to amounts of data needed to represent the control parameters in the encoded signal. 
     
     
       31. The medium of  claim 29  wherein the measures of cost are responsive to amounts of computational resources needed to process the blocks of audio information. 
     
     
       32. The medium of  claim 25  wherein the analyzing is performed in one or more iterations of an iterative process to determine one or more sets of groups that are not candidates for the selected set and excludes analyzing these one or more sets in subsequent iterations of the process. 
     
     
       33. The medium of  claim 25  wherein the selected set is identified by an iterative process that comprises:
 determining a second measure of processing performance for pairs of groups in an initial set of groups; 
 merging the pair of groups having a highest second measure of processing performance to form a revised set of groups provided that the highest second measure of processing performance is greater than a threshold, and determining the second measure of processing performance for pairs of groups in the revised set of groups; and 
 continuing the merging until no pair of groups in the revised set of groups has a second measure of processing performance that is greater than the threshold, wherein the revised set of groups is the selected set. 
 
     
     
       34. The medium of  claim 25  wherein a respective frame has a number of blocks equal to N and the analyzing of the measures of quality comprises:
 iterating a value p from 1 to N, where p is the number of groups of blocks in a frame; 
 identifying for each value of p at least some of the sets of groups that have the measure of processing performance that is higher than the threshold; and 
 analyzing at least some of the identified sets of groups to determine the selected set of groups that maximizes the measure of processing performance among the sets of groups that are analyzed. 
 
     
     
       35. The medium of  claim 25  wherein each block in the respective frame comprises spectral coefficients and the measure of processing performance for a particular set of groups represents a measure of error energy between the spectral coefficients in the respective frame for the particular set of groups and the spectral coefficients in the respective frame with each block in its own group. 
     
     
       36. The medium  claim 25  wherein the measure of processing performance is responsive to a total number of bits available to represent a respective frame of blocks.
Cited by (0)

No later patents cite this yet.
References (0)

No backward citations on record.