P
US12197870B2ActiveUtilityPatentIndex 57

Bootstrapping topic detection in conversations

Assignee: SOLVENTUM INTELLECTUAL PROPERTIES COMPANYPriority: Apr 15, 2020Filed: Mar 18, 2021Granted: Jan 14, 2025
Est. expiryApr 15, 2040(~13.8 yrs left)· nominal 20-yr term from priority
Inventors:POLZIN THOMAS SCHENG HUAKOLL DETLEF
G06F 40/166G06F 16/35G06F 40/284G06F 40/30G06F 16/353
57
PatentIndex Score
0
Cited by
8
References
14
Claims

Abstract

A computer system and method identifies topics in conversations, such as a conversation between a doctor and patient during a medical examination. The system and method generates, based on first text (such as a document corpus including previous clinical documentation), a plurality of sentence embeddings representing a plurality of semantic representations in a plurality of sentences in the training text. The system and method generate a classifier based on the second text, which includes a plurality of sections associated with a plurality of topics, and the plurality of sentence embeddings. The system and method generate, based on a sentence (such as a sentence in a doctor-patient conversation) and the classifier, an identifier of a topic to associate with the first sentence. The system and method may also insert the sentence into a section, associated with the identified topic, in a document (such as a clinical note).

Claims

exact text as granted — not AI-modified
The invention claimed is: 
     
       1. A method, for identifying a first topic represented by a first sentence, performed by at least one computer processor executing computer program instructions stored on at least one non-transitory computer readable medium, the method comprising:
 (A) generating, based on first text, a plurality of sentence embeddings representing a plurality of semantic representations of a plurality of sentences in the training text; 
 (B) generating, based on second text and the plurality of sentence embeddings, the second text comprising a plurality of sections associated with a plurality of topics, a classifier; 
 (C) generating, based on the first sentence and the classifier, a first identifier of the first topic to associate with the first sentence; and 
 (D) inserting the first sentence into a first section of a first document, the first section being associated with the first topic. 
 
     
     
       2. The method of  claim 1 , further comprising:
 (E) generating, based on a second sentence and the classifier, a second identifier of a second topic to associate with the second sentence. 
 
     
     
       3. The method of  claim 2 , further comprising:
 (F) inserting the second sentence into a second section of the first document, the second section being associated with the second topic. 
 
     
     
       4. The method of  claim 1 :
 wherein the second text comprises a plurality of documents; 
 wherein the plurality of documents comprises a first document comprising a first section in the plurality of sections, wherein the first section is associated with a first one of the plurality of topics; and 
 wherein the plurality of documents comprises a second document comprising a second section in the plurality of sections, wherein the second section is associated with the first one of the plurality of topics. 
 
     
     
       5. The method of  claim 4 :
 wherein the first document comprises a third section in the plurality of sections, wherein the third section is associated with a second one of the plurality of topics; and 
 wherein the second document comprises a fourth section in the plurality of sections, wherein the fourth section is associated with the second one of the plurality of topics. 
 
     
     
       6. The method of  claim 1 , further comprising:
 (E) generating, based on the classifier and data representing an utterance, an identifier of a topic to associate with the utterance. 
 
     
     
       7. The method of  claim 1 , wherein the first text includes the second text. 
     
     
       8. A system comprising a non-transitory computer-readable medium having computer-readable instructions stored thereon, wherein the computer-readable instructions are executable by at least one computer processor to perform a method for identifying a first topic represented by a first sentence, the method comprising:
 (A) generating, based on first text, a plurality of sentence embeddings representing a plurality of semantic representations of a plurality of sentences in the training text; 
 (B) generating, based on second text and the plurality of sentence embeddings, the second text comprising a plurality of sections associated with a plurality of topics, a classifier; 
 (C) generating, based on the first sentence and the classifier, a first identifier of the first topic to associate with the first sentence; and 
 (D) inserting the first sentence into a first section of a first document, the first section being associated with the first topic. 
 
     
     
       9. The system of  claim 8 , wherein the method further comprises:
 (E) generating, based on a second sentence and the classifier, a second identifier of a second topic to associate with the second sentence. 
 
     
     
       10. The system of  claim 9 , wherein the method further comprises:
 (F) inserting the second sentence into a second section of the first document, the second section being associated with the second topic. 
 
     
     
       11. The system of  claim 8 :
 wherein the second text comprises a plurality of documents; 
 wherein the plurality of documents comprises a first document comprising a first section in the plurality of sections, wherein the first section is associated with a first one of the plurality of topics; and 
 wherein the plurality of documents comprises a second document comprising a second section in the plurality of sections, wherein the second section is associated with the first one of the plurality of topics. 
 
     
     
       12. The system of  claim 11 :
 wherein the first document comprises a third section in the plurality of sections, wherein the third section is associated with a second one of the plurality of topics; and 
 wherein the second document comprises a fourth section in the plurality of sections, wherein the fourth section is associated with the second one of the plurality of topics. 
 
     
     
       13. The system of  claim 8 , wherein the method further comprises:
 (E) generating, based on the classifier and data representing an utterance, an identifier of a topic to associate with the utterance. 
 
     
     
       14. The system of  claim 8 , wherein the first text includes the second text.

Cited by (0)

No later patents cite this yet.

References (0)

No backward citations on record.