P
US10043535B2ActiveUtilityPatentIndex 62

Method and device for spectral expansion for an audio signal

Assignee: STATON TECHIYA LLCPriority: Jan 15, 2013Filed: Jan 15, 2014Granted: Aug 7, 2018
Est. expiryJan 15, 2033(~6.5 yrs left)· nominal 20-yr term from priority
Inventors:USHER JOHNELLIS DAN
H04R 5/033H04R 1/028G10L 21/0388H04R 3/00H04R 1/1083H04R 2420/07
62
PatentIndex Score
1
Cited by
79
References
20
Claims

Abstract

A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal. Other embodiments are disclosed.

Claims

exact text as granted — not AI-modified
What is claimed is: 
     
       1. A method for automatically expanding a spectral bandwidth of an audio signal comprising the steps of:
 a first training step of generating a mapping matrix, by a digital signal processor, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, the first training step consisting of simultaneously recording a sentence by an ambient microphone and by an ear canal microphone located in an earphone, wherein the ambient microphone captures a reference wideband signal and the digital signal processor performs a frequency transform on the wideband signal, wherein the ear canal microphone captures a reference narrowband signal and the digital signal processor performs a frequency transform on the narrowband signal into a plurality of bands, wherein the mapping matrix is based on an analysis of the frequency transform on the wideband signal and the frequency transform on the narrowband signal into a plurality of bands; 
 generating, by the digital signal processor, an energy envelope analysis of an input narrowband audio signal; 
 generating, by the digital signal processor, a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis; 
 high-pass filtering, by the digital signal processor, the resynthesized noise signal; and 
 summing, by the digital signal processor, the high-pass filtered resynthesized noise signal with the input narrowband audio signal to automatically expand the spectral bandwidth of the input narrowband audio signal to produce a summed signal with spectral enhancement; 
 reproducing by a loudspeaker the summed signal with spectral enhancement. 
 
     
     
       2. The method of  claim 1 , wherein the reference wideband and narrowband signals consist of a simultaneous recording of a phonetically balanced sentence. 
     
     
       3. The method of  claim 1 , where the input narrowband audio signal is taken from the ear-canal microphone within the occluded ear canal and located in the earphone. 
     
     
       4. The method of  claim 1 , where the input narrowband audio signal is taken from a received speech audio signal in a speech telecommunications system. 
     
     
       5. The method of  claim 1 , where the summed signal is directed to a speech telecommunications system. 
     
     
       6. The method of  claim 1 , where the summed signal is directed to a voice controlled device. 
     
     
       7. The method of  claim 1 , where the mapping matrix is generated from a least squares fit analysis of the reference wideband and reference narrowband signals. 
     
     
       8. The method of  claim 1 , where the mapping matrix is generated by a linear regression model, where the input reference wideband signals and reference narrowband signals are first converted to a frequency domain representation using a Fast Fourier Transform, and secondly each frequency band envelope is converted to a decibel domain representation to perform a linear prediction. 
     
     
       9. A non-transitory computer readable medium containing instructions for spectral enhancement, the execution of the instructions by one or more processors of a computer system causing the one or more processors to perform operations comprising:
 generating a mapping matrix, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating of the mapping matrix consisting of simultaneously recording of a sentence by an ambient microphone and by an ear canal microphone, wherein the ambient microphone captures a reference wideband signal and the digital signal processor performs a frequency transform on the wideband signal, wherein the ear canal microphone captures a reference narrowband signal and the digital signal processor performs a frequency transform on the narrowband signal into a plurality of bands, wherein the mapping matrix is based on an analysis of the frequency transform on the wideband signal and the frequency transform on the narrowband signal into a plurality of bands; 
 generating an energy envelope analysis of an input narrowband audio signal; 
 generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis; 
 high-pass filtering the resynthesized noise signal; 
 summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal to produce a summed signal with spectral enhancement; and 
 reproducing by a loudspeaker the summed signal with spectral enhancement. 
 
     
     
       10. A system for automatically expanding the spectral bandwidth of an audio signal comprising:
 one or more processors; 
 a memory having instructions and being operatively coupled to the one or more processors, the instructions when executed by the one or more processors performs the operations of:
 generating a mapping matrix, consisting of simultaneously recording of a sentence by an ambient microphone and by an ear canal microphone, wherein the ambient microphone captures a reference wideband signal and the digital signal processor performs a frequency transform on the wideband signal, wherein the ear canal microphone captures a reference narrowband signal and the digital signal processor performs a frequency transform on the narrowband signal into a plurality of bands, wherein the mapping matrix is based on an analysis of the frequency transform on the wideband signal and the frequency transform on the narrowband signal into a plurality of bands; 
 generating an energy envelope analysis of an input narrowband audio signal; 
 generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis to provide a resynthesized noise signal; 
 high-pass filtering the resynthesized noise signal to provide a high-pass filtered resynthesized noise signal; and 
 summing the high-pass filtered resynthesized noise signal with the input narrowband audio signal to provide a wideband signal to produce a summed signal with spectral enhancement; 
 reproducing by a loudspeaker the summed signal with spectral enhancement. 
 
 
     
     
       11. The system of  claim 10 , wherein the mapping matrix is a transformation matrix predicting high frequency energy from a low frequency energy envelope. 
     
     
       12. The system of  claim 10 , wherein the summing provides the wideband signal as the output to the loudspeaker for reproducing the audio signal with an expanded spectral bandwidth. 
     
     
       13. The system of  claim 12 , where the mapping matrix is generated from a least squares fit analysis of the reference wideband signal and reference narrowband signal. 
     
     
       14. The system of  claim 12 , where the mapping matrix is generated by a linear regression model, where the input reference wideband signals and reference narrowband signals are first converted to a frequency domain representation providing frequency band envelopes, and secondly each frequency band envelope is converted to a decibel domain representation. 
     
     
       15. The system of  claim 10 , wherein the system further includes a smart watch having one or more microphones. 
     
     
       16. The system of  claim 10 , wherein the system further includes a set of smart glasses having one or more microphones. 
     
     
       17. The system of  claim 10 , wherein the system further includes a mobile phone having one or more microphones. 
     
     
       18. The system of  claim 10 , wherein the system further includes a hearing aid having one or more microphones. 
     
     
       19. The system of  claim 10 , wherein the system further includes steering wheel having one or more microphones. 
     
     
       20. The system of  claim 10 , wherein the system is formed in an earpiece device.

Cited by (0)

No later patents cite this yet.

References (0)

No backward citations on record.