P
US9100756B2ActiveUtilityPatentIndex 83

Microphone occlusion detector

Assignee: APPLE INCPriority: Jun 8, 2012Filed: Dec 14, 2012Granted: Aug 4, 2015
Est. expiryJun 8, 2032(~5.9 yrs left)· nominal 20-yr term from priority
Inventors:DUSAN SORIN VYEH DAVID TLINDAHL ARAM MKANARIS ALEXANDER
H04R 29/00H04R 29/004H04R 2410/05H04R 3/005H04R 2499/11
83
PatentIndex Score
13
Cited by
27
References
25
Claims

Abstract

Digital signal processing microphone occlusion detection is described that can be used with a noise suppression system that uses two types of noise estimators, including a more aggressive one based on two audio signals (such as for non-stationary noises) and a less aggressive one based on one audio signal (such as for stationary noises). Decisions are made on how to select or combine the outputs of the noise estimators into a usable noise estimate, based on an occlusion function. The occlusion detection may alternatively be used to trigger an alert to users of multi-microphone audio processing systems, such as smart phones, headsets, laptops and tablet computers. Other embodiments are also described and claimed.

Claims

exact text as granted — not AI-modified
What is claimed is: 
     
       1. An electronic system for audio noise processing and for noise reduction, using a plurality of microphones, comprising:
 a first noise estimator to process a first audio signal from a first one of the microphones, and generate a first noise estimate; 
 a second noise estimator to process the first audio signal, and a second audio signal from a second one of the microphones, in parallel with the first noise estimator, and generate a second noise estimate; 
 a combiner-selector to receive the first and second noise estimates, and to generate an output noise estimate using the first and second noise estimates; and 
 a microphone occlusion detector to process the first and second audio signals including to band pass filter the first and second audio signals and to generate a microphone occlusion signal using the processed first and second audio signals, wherein the microphone occlusion signal represents a measure of how severely or how likely it is that one of the microphones is occluded, and 
 wherein the combiner-selector is to generate its output noise estimate also based on the occlusion signal. 
 
     
     
       2. The system of  claim 1  wherein the combiner-selector selects the first noise estimate for its output noise estimate, and not the second noise estimate, when the occlusion signal indicates that the second one of the microphones is substantially occluded. 
     
     
       3. The system of  claim 1  wherein the occlusion detector computes a power or energy ratio (PR) or a magnitude ratio (MR) of band pass filtered versions of the first and second audio signals, and evaluates an occlusion function at the computed PR or MR. 
     
     
       4. The system of  claim 3  wherein the occlusion detector is to band pass filter the first and second audio signals over a pass band of about 2000Hz-4000Hz. 
     
     
       5. The system of  claim 3  wherein when the PR or MR is greater than a threshold the occlusion function has a fixed value indicating substantial occlusion, and when the PR or MR is less than the threshold the occlusion function has a different fixed value that indicates no substantial occlusion. 
     
     
       6. The system of  claim 3  wherein the PR is computed using the formula
   PR=pow1 t −pow2 t  
 
   where 
   pow1 t =10*log 10{[summation of frame_mic1( i )*frame_mic1( i )]/ N},    
   pow2 t =10*log 10{[summation of frame_mic2( i )*frame_mic2( i )]/ N}   
 
       and where frame_mic 1  and frame_mic 2  include samples from i=1 to i=N of band pass filtered versions of the signals from the first and second microphones, or it is computed in the frequency domain by summation over a range of frequency bins. 
     
     
       7. The system of  claim 6  wherein the PR is computed in dB. 
     
     
       8. The system of  claim 1  wherein the occlusion detector further computes one or more of the following and uses them to generate the occlusion signal: absolute power of the second audio signal over a given time frame and over a range of frequency bins within the given time frame; the output noise estimate per frequency bin; and a VAD decision per frequency bin. 
     
     
       9. The system of  claim 6  wherein the occlusion function is proportional to a logistic function C/(1+A*exp(−B*PR)) where A, B and C are scalar coefficients that define the slope, position and final magnitude of the logistic function. 
     
     
       10. A microphone occlusion detector comprising:
 means for processing first and second audio signals that are from first and second microphones, respectively, including means for band pass filtering the signals; and 
 means for evaluating a microphone occlusion function that represents a measure of how severely or how likely it is that the second microphone is occluded, using the processed first and second audio signals. 
 
     
     
       11. The occlusion detector of  claim 10  wherein the processing means computes a power ratio (PR) or magnitude ratio (MR) of band pass filtered versions of the first and second audio signals in a pass band of about 2000Hz-4000Hz, and the occlusion function is a function of the PR or MR. 
     
     
       12. The occlusion detector of  claim 11  wherein the occlusion function takes on a high value that indicates occlusion or greater occlusion when the PR or MR is greater than a threshold, and the occlusion function takes on a low value that indicates no occlusion or lesser occlusion when the PR or MR is less than the threshold. 
     
     
       13. The occlusion detector of  claim 11  wherein the power ratio is computed using the formula
   PR=pow1 t −pow2 t  
 
   where 
   pow1 t =10*log10{[summation of frame_mic1( i )*frame_mic1( i )]/ N},    
   pow2 t =10*log 10{[summation of frame_mic2( i )*frame_mic2( i )]/ N}   
 
       where frame_mic 1  and frame_mic 2  include samples from i=1 to i=N of band pass filtered versions of audio signals from the first microphone and the second microphone, or is computed in the frequency domain by summing power spectrum bins from start to stop where start and stop are the frequency bins that define the boundaries of the band pass filter pass band. 
     
     
       14. The occlusion detector of  claim 10  wherein the occlusion function is proportional to a logistic function C/(1+A*exp(−B*PR)) where A, B and C are scalar coefficients that define the slope, position and final magnitude of the logistic function. 
     
     
       15. The occlusion detector of  claim 14  further comprising:
 smoothing means for smoothing the logistic function. 
 
     
     
       16. A microphone occlusion detector that uses multiple occlusion component signals, comprising:
 means for processing first and second audio signals that are from first and second microphones, respectively, including means for band pass filtering the signals; 
 means for evaluating a first occlusion component function that represents a measure of how severely or how likely it is that the second microphone is occluded during speech activity, using the band pass filtered first and second audio signals; 
 means for evaluating a second occlusion component function that represents a measure of how severely or how likely it is that the second microphone is occluded during no speech activity, using the band pass filtered first and second audio signals; and 
 means for selecting between the first and second occlusion component functions, 
 wherein in a no speech condition, the second component function is selected but not the first component function, and 
 in a no speech condition where the level of background noise is at a high level, the first component function is selected but not the second compound function. 
 
     
     
       17. The occlusion detector of  claim 16  wherein the audio signal processing means comprises a voice activity detector that indicates said no speech condition. 
     
     
       18. The occlusion detector of  claim 16  wherein the audio signal processing means comprises means for computing absolute power of the second audio signal to indicate the level of background noise is at a low level. 
     
     
       19. The occlusion detector of  claim 16  wherein the audio signal processing means comprises a background noise estimator. 
     
     
       20. The occlusion detector of  claim 16  wherein each of the first and second occlusion component functions is a logistic function, each being a function of a power ratio (PR) or magnitude ratio (MR) of the first and second audio signals. 
     
     
       21. The occlusion detector of  claim 20  wherein an inflection point of the first component function is at a lower PR or MR value than that of the second component function. 
     
     
       22. A method for detecting occlusion of a microphone, comprising:
 processing band pass filtered versions of first and second audio signals that are from first and second microphones, respectively, to compute a power ratio (PR) or a magnitude ratio (MR) of the band pass filtered first and second signals; and 
 evaluating an occlusion function, being a measure of how occluded the second microphone is, as a function of PR or MR, wherein the occlusion function is one of
 a) a curve that abruptly indicates substantial occlusion when the PR or MR is greater than a threshold, and abruptly indicates no substantial occlusion when the PR or MR is smaller than the threshold, 
 b) a curve that gradually indicates increasing occlusion when the PR or MR is greater than a threshold, and abruptly indicates no substantial occlusion when the PR or MR is smaller than the threshold, and 
 c) a logistic function. 
 
 
     
     
       23. The method of  claim 22  wherein the occlusion function is the logistic function, the method further comprising smoothing the logistic function. 
     
     
       24. The method of  claim 22  wherein the logistic function is smoothed using an exponential filter. 
     
     
       25. The method of  claim 22  further comprising:
 generating a noise estimate from the first audio signal and not the second audio signal, responsive to the evaluated occlusion function indicating more occlusion or occlusion present; and 
 generating the noise estimate from both the first and second audio signals responsive to the evaluated occlusion function indicating less occlusion or occlusion absent.

Cited by (0)

No later patents cite this yet.

References (0)

No backward citations on record.