USH2189HExpiredUtilityPatentIndex 70

SQL enhancements to support text queries on speech recognition results of audio data

Assignee: ORACLE INT CORPPriority: Oct 21, 2002Filed: Feb 11, 2003Granted: May 1, 2007

Est. expiryOct 21, 2022(expired)· nominal 20-yr term from priority

Inventors:RAO VISHAL CHOPRA RAJIV

G06F 16/685G10L 15/26

PatentIndex Score

Cited by

References

Claims

Abstract

A system, method, computer program product, and application program interface for indexing data relating to results of speech recognition in a database management system provides the capability to perform simple and efficient searches on audio speech data with reduced development effort. An application program interface for indexing data relating to results of speech recognition in a database management system comprises an indextype operable to support text queries on speech recognition results, an interface operable to provide interaction with an index of the indextype, and a format adapter interface a format adapter that the index creation activity will invoke to extract relevant information from a proprietary speech recognition format.

Claims

exact text as granted — not AI-modified

1. A method for indexing data relating to results of speech recognition in a database management system, comprising the steps of:
receiving speech recognition results at the database management system, the speech recognition results having a first format;
converting the first format of the speech recognition results to a second format; and
generating an index of the speech recognition results in the database management system.

2. The method of claim 1 , wherein the converting step comprises the steps of:
parsing the speech recognition results in the first format;
extracting from the speech recognition results text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result; and
generate speech recognition results in the second format using the extracted text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result.

3. The method of claim 2 , wherein the second format is a standardized format.

4. The method of claim 3 , wherein the first format is a proprietary format.

5. The method of claim 2 , wherein the generating step comprises the steps of:
generating an index using the extracted speech recognition results, including the text data representing the recognized speech, the information relating to a confidence in each speech recognition result, and the timestamp information indicating a location of each portion of a speech recognition result in the database management system; and
storing the extracted information.

6. The method of claim 5 , wherein the second format is a standardized format.

7. The method of claim 6 , wherein the first format is a proprietary format.

8. A system for indexing data relating to results of speech recognition in a database management system comprising:
a processor operable to execute computer program instructions;
a memory operable to store computer program instructions executable by the processor; and
computer program instructions stored in the memory and executable to perform the steps of:
receiving speech recognition results at the database management system, the speech recognition results having a first format;
converting the first format of the speech recognition results to a second format; and
generating an index of the speech recognition results in the database management system.

9. The system of claim 8 , wherein the converting step comprises the steps of:
parsing the speech recognition results in the first format;
extracting from the speech recognition results text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result; and
generate speech recognition results in the second format using the extracted text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result.

10. The system of claim 9 , wherein the second format is a standardized format.

11. The system of claim 10 , wherein the first format is a proprietary format.

12. The system of claim 9 , wherein the generating step comprises the steps of:
generating an index using the extracted speech recognition results, including the text data representing the recognized speech, the information relating to a confidence in each speech recognition result, and the timestamp information indicating a location of each portion of a speech recognition result in the database management system; and
storing the extracted information.

13. The system of claim 12 , wherein the second format is a standardized format.

14. The system of claim 13 , wherein the first format is a proprietary format.

15. A computer program product for indexing data relating to results of speech recognition in a database management system comprising:
a computer readable medium;
computer program instructions, recorded on the computer readable medium, executable by a processor, for performing the steps of
receiving speech recognition results at the database management system, the speech recognition results having a first format;
converting the first format of the speech recognition results to a second format;
generating an index of the speech recognition results in the database management system.

16. The computer program product of claim 15 , wherein the converting step comprises the steps of:
parsing the speech recognition results in the first format; extracting from the speech recognition results text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result; and
generate speech recognition results in the second format using the extracted text data representing the recognized speech, information relating to a confidence in each speech recognition result, and tirnestamp information indicating a location of each portion of a speech recognition result.

17. The computer program product of claim 16 , wherein the second format is a standardized format.

18. The computer program product of claim 17 , wherein the first format is a proprietary format.

19. The computer program product of claim 16 , wherein the generating step comprises the steps of:
generating an index using the extracted speech recognition results, including the text data representing the recognized speech, the information relating to a confidence in each speech recognition result, and the timestamp information indicating a location of each portion of a speech recognition result in the database management system; and
storing the extracted information.

20. The computer program product of claim 19 , wherein the second format is a standardized format.

21. The computer program product of claim 20 , wherein the first format is a proprietary format.

22. An application program interface for indexing data relating to results of speech recognition in a database management system comprising:
an indextype operable to support text queries on speech recognition results;
an interface operable to provide interaction with an index of the indextype; and
a format adapter interface operable to invoke a format adapter for converting speech recognition results having a first format to a second format.

23. The application program interface of claim 22 , wherein the format adapter is operable to parse the speech recognition results in the first format, extract from the speech recognition results text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result, and generate speech recognition results in the second format using the extracted text data representing the recognized speech, information relating to a confidence in each speech recognition result, and timestamp information indicating a location of each portion of a speech recognition result.

24. The application program interface of claim 23 , wherein the indextype comprises the text data representing the recognized speech, the information relating to a confidence in each speech recognition result, and the timestamp information indicating a location of each portion of a speech recognition result in the database management system.

25. The application program interface of claim 24 , wherein the interface is operable to provide interaction comprising performing a query of the text data representing the recognized speech.

26. The application program interface of claim 25 , wherein the query of the text data representing the recognized speech relates to the confidence information and/or the timestamp information.

27. The application program interface of claim 26 , wherein results of the query indicate time offsets within each matched media asset where matches occurred and speech recognition confidence of each match occurrence within a matched media asset.

Cited by (0)

No later patents cite this yet.

References (0)

No backward citations on record.