US8965873B2 · Expired · Utility · PatentIndex 62

Methods and systems for eliminating duplicate events

Assignee: KHAN OMAR HABIB · Priority: Mar 31, 2004 · Filed: Mar 16, 2012 · Granted: Feb 24, 2015
Est. expiry: Mar 31, 2024 (expired) · nominal 20-yr term from priority
Inventors: KHAN OMAR HABIB, LAWRENCE STEPHEN R
G06F 17/30864 · G06F 16/951 · G06F 16/334
PatentIndex Score: 62 · Cited by: 3 · References: 51 · Claims: 18

Abstract

Computer-implemented systems and methods are described for eliminating duplicate events. In accordance with some implementations, a computer-implemented system is provided which includes a computer-readable storage medium that stores program code and a processor that executes the program code. The operations performed by the processor may include capturing an event comprising a user interaction with an article stored in association with a client device and determining a level of similarity between the event and a stored event. The operations performed by the processor may also include determining if the event is a duplicate of the stored event based at least in part on the level of similarity between the event and the stored event and indexing the event on the client device dependent on the determination of whether or not the event is a duplicate of the stored event.
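
The flow described in the abstract (capture an event, measure its similarity to stored events, decide whether it is a duplicate, and index it accordingly) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Event` record, the `EventIndex` class, the fixed 0.9 cutoff, and the use of Python's `difflib.SequenceMatcher` as the similarity measure are all assumptions chosen for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Event:
    """Hypothetical event record: a user interaction with an article."""
    article_title: str
    text: str


def similarity(a: Event, b: Event) -> float:
    """Return a similarity level in [0, 1] computed from the events' text."""
    return SequenceMatcher(None, a.text, b.text).ratio()


class EventIndex:
    """Toy in-memory index that maps terms to event IDs (illustrative only)."""

    def __init__(self, threshold: float = 0.9):
        self.threshold = threshold  # assumed fixed cutoff for this sketch
        self.stored = []            # previously captured events
        self.terms = {}             # term -> set of event IDs

    def is_duplicate(self, event: Event) -> bool:
        """True if any stored event is at least `threshold` similar."""
        return any(similarity(event, s) >= self.threshold for s in self.stored)

    def capture(self, event: Event) -> bool:
        """Index the event unless it duplicates a stored one.

        Returns True if the event was indexed, False if it was skipped.
        """
        if self.is_duplicate(event):
            return False
        event_id = len(self.stored)
        self.stored.append(event)
        for term in event.text.lower().split():
            self.terms.setdefault(term, set()).add(event_id)
        return True
```

In this sketch a duplicate is simply skipped. The claims also describe other treatments of duplicates, such as partial indexing or updating access information, which are not modeled here.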

Claims

exact text as granted — not AI-modified
That which is claimed: 
     
       1. A computer-implemented system, comprising:
 a processor for executing program code; and 
 a non-transitory computer-readable storage medium storing program code executable to perform steps comprising:
 capturing an event comprising a user interaction with an article stored in association with a client device; 
 determining an amount of elapsed time between the event and a stored event, wherein the amount of elapsed time is greater than zero; 
 determining a level of similarity between the event and the stored event; 
 modifying a threshold level of similarity used to identify the event as a duplicate based on the amount of elapsed time; and 
 determining if the event is a duplicate of the stored event based at least in part on the level of similarity between the event and the stored event and the threshold level of similarity; and 
 indexing the event on the client device dependent on the determination of whether or not the event is a duplicate of the stored event. 
 
 
     
     
       2. The system of  claim 1 , wherein the event is indexed if the event is determined not to be a duplicate of a stored event. 
     
     
       3. The system of  claim 1 , wherein the event is partially indexed if the event is determined to be a duplicate of a stored event. 
     
     
       4. The system of  claim 3 , wherein partially indexing the event comprises indexing a portion of event data associated with the event. 
     
     
       5. The system of  claim 1 , the program code further executable to partially store the event if the event is determined to be a duplicate event. 
     
     
       6. The system of  claim 1 , the program code further executable to store the event if the event is determined not to be a duplicate of a stored event. 
     
     
       7. The system of  claim 1 , the program code further executable to update access information for the article associated with the event if the event is determined to be a duplicate of a stored event. 
     
     
       8. The system of  claim 1 , wherein determining whether the event is a duplicate comprises comparing indexable text associated with the event with indexable text associated with stored events. 
     
     
       9. The system of  claim 1 , wherein determining whether the event is a duplicate comprises attempting to match an article title associated with the event with article titles associated with stored events. 
     
     
       10. The system of  claim 1 , wherein determining whether the event is a duplicate comprises computing a first duplicate value based at least in part on event data of the captured event and comparing the first duplicate value to stored duplicate values computed for stored events. 
     
     
       11. The system of  claim 10 , wherein the first duplicate value comprises a first hash computed on the event data of the captured event and wherein the stored duplicate values comprise stored hash values computed on the stored events. 
     
     
       12. The system of  claim 1 , wherein determining whether the event is a duplicate comprises determining a client application in focus at the time associated with the event. 
     
     
       13. The system of  claim 1 , wherein indexing the event comprises:
 determining indexable terms associated with the event; 
 assigning an event ID to the event; and 
 associating the event ID with the indexable terms in an index. 
 
     
     
       14. The system of  claim 1 , wherein the event comprises a real-time event or a historical event. 
     
     
       15. The system of  claim 1 , wherein determining a level of similarity between the event and the stored event comprises comparing a storage location of the article associated with the event to a storage location of an article associated with the stored event. 
     
     
       16. The system of  claim 1 , wherein determining if the event is a duplicate of the stored event comprises determining that the event is a duplicate event if the event and the stored event are within a predetermined time amount of elapsed time. 
     
     
       17. A computer-implemented system, comprising:
 a processor for executing program code; and 
 a non-transitory computer-readable storage medium storing program code executable to perform steps comprising:
 capturing an event comprising a user interaction with an article stored in association with a client device; 
 determining an amount of elapsed time between the event and a stored event from a database of stored events, wherein the amount of elapsed time is greater than zero; 
 modifying a threshold level of similarity used to identify the event as a duplicate based on the amount of elapsed time; 
 comparing indexable text associated with the captured event with indexable text associated with the stored event from the database of stored events to determine if the threshold level of similarity is met; 
 determining if the captured event occurred within a predetermined time period after the stored event, wherein the predetermined time period is greater than zero; 
 determining that the captured event is a duplicate of the stored event if the threshold level of similarity between the captured event and the stored event is met and the captured event occurred within the predetermined time period after the stored event; 
 indexing the captured event if the captured event is determined not to be a duplicate of a stored event; 
 storing the captured event if the captured event is determined not to be a duplicate of a stored event; and 
 updating access information for the article associated with the captured event, if the event is determined to be a duplicate of a stored event. 
 
 
     
     
       18. A computer-implemented system, comprising:
 a processor for executing program code; and 
 a non-transitory computer-readable storage medium storing program code executable to perform steps comprising;
 capturing an event comprising a user interaction with a stored article; 
 determining an amount of elapsed time between the event and a stored event wherein the event and the stored event are associated with different times; 
 determining a level of similarity between the event and the stored event; 
 modifying a threshold level of similarity used to identify the event as a duplicate based on the amount of elapsed time; and 
 determining if the event is a duplicate of the stored event based at least in part on the modified threshold level of similarity.
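
As an illustration of the elapsed-time element recited in claims 1, 17, and 18, the following sketch adjusts a similarity threshold according to the time between a captured event and a stored event. The claims do not state how or in which direction the threshold is modified, so the linear increase used here is an assumption, as are the constants, the function names, the SHA-256 hash used as a "duplicate value" (compare claims 10 and 11), and the use of `difflib.SequenceMatcher` as the similarity measure.

```python
import hashlib
from difflib import SequenceMatcher

BASE_THRESHOLD = 0.80   # assumed starting similarity cutoff
MAX_THRESHOLD = 0.99    # assumed cutoff at the edge of the time window
WINDOW_SECONDS = 60.0   # assumed "predetermined time period"


def normalize(text: str) -> str:
    """Lowercase the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def adjusted_threshold(elapsed_seconds: float) -> float:
    """Raise the cutoff linearly with elapsed time (an assumed policy)."""
    fraction = min(max(elapsed_seconds, 0.0) / WINDOW_SECONDS, 1.0)
    return BASE_THRESHOLD + fraction * (MAX_THRESHOLD - BASE_THRESHOLD)


def duplicate_value(text: str) -> str:
    """Hash of the normalized text, usable as a cheap exact-match check."""
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


def is_duplicate(new_text: str, new_time: float,
                 stored_text: str, stored_time: float) -> bool:
    """Decide whether the new event duplicates the stored one.

    Times are in seconds. The new event must occur after the stored
    event and within WINDOW_SECONDS of it.
    """
    elapsed = new_time - stored_time
    if elapsed <= 0 or elapsed > WINDOW_SECONDS:
        return False
    # Identical hashes mean identical normalized text, so similarity is 1.0.
    if duplicate_value(new_text) == duplicate_value(stored_text):
        return True
    level = SequenceMatcher(None, normalize(new_text),
                            normalize(stored_text)).ratio()
    return level >= adjusted_threshold(elapsed)
```

Under these assumed settings, two nearly identical events captured one second apart are treated as duplicates, while the same pair captured almost a minute apart is not, because the cutoff has risen in the meantime.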
