Method for determining correct scanning distance using augmented reality and machine learning models
Abstract
A smart device is provided with an application program for displaying a video feed received from the smart device's camera. The application can determine the coordinates for an intersection point, which is a point on the ground where the smart device is pointing at. The application can display a target on the visual representation of the intersection point. Based on whether the smart device is at an appropriate distance from the intersection point, the user interface can superimpose an indicator on the video feed received from the camera. This can inform the user whether the smart device is at an optimal scan distance from the intersection point (or an object) so that the object can be identified by a machine learning model.
Claims
exact text as granted — not AI-modifiedThe invention claimed is:
1. A method comprising:
displaying a video feed, wherein the video feed is captured from a first position;
determining an intersection point between a ground plane and the first position;
determining a distance between the first position and the intersection point;
displaying an indicator in the user interface, wherein a characteristic of the indicator is determined based on the distance between the first position and the intersection point;
identifying, using an object recognition model, an object shown in the video feed; and
displaying information about the identified object.
2. The method of claim 1 , wherein the intersection point is a point on a ground plane where a perpendicular hypothetical line extending from the first position would intersect the ground plane.
3. The method of claim 2 , wherein the intersection point corresponds to a centerpoint of a field of view of a camera located at the first position.
4. The method of claim 1 , wherein the indicator is displayed at a point over the video corresponding to the intersection point.
5. The method of claim 4 , wherein the indicator is displayed over a visual representation of the intersection point.
6. The method of claim 5 , wherein the indicator is displayed within the same plane as a visual representation of the ground plane.
7. The method of claim 1 , further comprising capturing an image of the object for use by the object recognition model in identifying the object.
8. The method of claim 7 , wherein the image of the object is captured automatically based upon the distance between the first position and the intersection point.
9. The method of claim 8 , wherein the image of the object is captured automatically when the distance between the first position and the intersection point is within an optimal scan distance.
10. The method of claim 9 , wherein the optimal scan distance is determined based on an altitude of the first position relative to the ground plane.
11. The method of claim 9 , wherein the optical scan distance is determined based on a field of view and an angular separation of pixels of a camera located at the first position.
12. The method of claim 7 , wherein a notification is displayed in the video feed upon the capture of the image of the object.
13. The method of claim 1 , wherein the user interface is configured to change the characteristic of the indicator when the distance between the first position and the intersection point changes.
14. The method of claim 1 , further comprising:
comparing the distance between the first position and the intersection point with a threshold distance; and
displaying the characteristic of the indicator based on the comparison.
15. A non-transitory computer-accessible medium having stored thereon computer-executable instructions for an application for identifying objects executable by a computing hardware arrangement, wherein, when the computing hardware arrangement executes the instructions, the computing hardware arrangement is configured to perform procedures comprising:
displaying a video feed, wherein the video feed is captured from a first position;
determining a distance between the first position and an intersection point;
comparing the distance between the device and the intersection point with a threshold distance; and
displaying an indicator, wherein the indicator is determined based on the comparison;
identifying, using an object recognition model, a product shown in the video feed; and
displaying information about the identified product.
16. The non-transitory computer-accessible medium of claim 15 , wherein the intersection point is a point on a ground plane where a perpendicular hypothetical line extending from the first position would intersect the ground plane.
17. The non-transitory computer-accessible medium of claim 15 , further comprising:
capturing a frame of the video feed when a condition is met;
conducting a search query for the product identified in the frame; and
displaying a listing including the product.
18. The non-transitory computer-accessible medium of claim 17 , wherein the condition is met when the device is within an optimal scan distance of the product.
19. The non-transitory computer-accessible medium of claim 18 , wherein the optimal scan distance is determined based on an altitude of the first position relative to the ground plane.
20. The non-transitory computer-accessible medium of claim 18 , wherein the optical scan distance is determined based on a field of view and an angular separation of pixels of a camera located at the first position.Cited by (0)
No later patents cite this yet.
References (0)
No backward citations on record.