US8675926B2 · Active · Utility · PatentIndex 92

Distinguishing live faces from flat surfaces

Assignee: ZHANG ZHENGYOU
Priority: Jun 8, 2010
Filed: Jun 8, 2010
Granted: Mar 18, 2014
Est. expiry: Jun 8, 2030 (~3.9 yrs left) · nominal 20-yr term from priority
Inventors: ZHANG ZHENGYOU, CAI QIN, KASSELMAN PIETER R, BAKER ARTHUR H
G06V 40/176, G06V 10/464, G06V 40/45, G06V 40/171, G06V 40/167, G06F 21/32, G06V 40/161, G06V 40/165, G06F 21/6245
PatentIndex Score: 92
Cited by: 35
References: 29
Claims: 20

Abstract

Multiple images including a face presented by a user are accessed. One or more determinations are made based on the multiple images, such as a determination of whether the face included in the multiple images is a 3-dimensional structure or a flat surface and/or a determination of whether motion is present in one or more face components (e.g., eyes or mouth). If it is determined that the face included in the multiple images is a 3-dimensional structure or that motion is present in the one or more face components, then an indication is provided that the user can be authenticated. However, if it is determined that the face included in the multiple images is a flat surface or that motion is not present in the one or more face components, then an indication is provided that the user cannot be authenticated.
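
The decision logic in the abstract can be sketched as below. This is one possible reading, not the patent's implementation: the function name `liveness_indication` and the convention that `None` means "this check was not run" are assumptions made for illustration. Any check that was run and failed blocks authentication, which matches the abstract's negative case and the "only if both" wording of claims 6, 12, and 19.

```python
def liveness_indication(is_3d=None, has_motion=None):
    """Return True if the user can be authenticated, False otherwise.

    is_3d:      result of the 3-D-structure-versus-flat-surface check
    has_motion: result of the face-component motion check
    None means the check was not performed (hypothetical convention).
    """
    performed = [result for result in (is_3d, has_motion) if result is not None]
    if not performed:
        raise ValueError("at least one liveness check must be performed")
    # A flat surface or a motionless face component blocks authentication.
    return all(performed)


if __name__ == "__main__":
    print(liveness_indication(is_3d=True, has_motion=True))   # both checks pass
    print(liveness_indication(is_3d=True, has_motion=False))  # motion check fails
```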

Claims

exact text as granted — not AI-modified
What is claimed is: 
     
       1. A method comprising:
 accessing multiple images including a face presented by a user; 
 determining, based on the multiple images, whether the face included in the multiple images is a 3-dimensional structure or a flat surface by:
 accessing a first image and a second image of the multiple images; 
 generating a warped image by warping the first image using a homography matrix; 
 determining an image difference between the warped image and the second image; and 
 determining that the face included in the multiple images is a 3-dimensional structure if the difference between the warped image and the second image meets a threshold value, and otherwise determining that the face included in the multiple images is a flat surface; 
 
 indicating that the user can be authenticated if the face included in the multiple images is a 3-dimensional structure; and 
 indicating that the user cannot be authenticated if the face included in the multiple images is a flat surface. 
 
     
     
       2. A method as recited in claim 1, further comprising:
 extracting one or more feature points from the first image and one or more feature points from the second image; 
 matching feature points across the first image and the second image; and 
 using feature points matched across the first image and the second image to generate the homography matrix. 
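
The warp-and-compare test of claims 1 and 2 might look roughly like the sketch below. It assumes the homography has already been estimated from matched feature points and is passed in as its inverse `h_inv` (a hypothetical parameter mapping second-image pixels back to first-image pixels). Images are plain lists of grayscale rows, sampling is nearest-neighbour, the difference is a mean absolute difference, and "meets a threshold value" is read as "greater than or equal to"; none of these choices come from the claims.

```python
def warp_image(image, h_inv):
    """Warp a grayscale image using the inverse homography h_inv (3x3 list).

    Each output pixel (x, y) is sampled from the source location h_inv maps
    it to. Pixels that map outside the source image are set to None.
    """
    height, width = len(image), len(image[0])
    warped = [[None] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            w = h_inv[2][0] * x + h_inv[2][1] * y + h_inv[2][2]
            if w == 0:
                continue
            sx = round((h_inv[0][0] * x + h_inv[0][1] * y + h_inv[0][2]) / w)
            sy = round((h_inv[1][0] * x + h_inv[1][1] * y + h_inv[1][2]) / w)
            if 0 <= sx < width and 0 <= sy < height:
                warped[y][x] = image[sy][sx]
    return warped


def image_difference(warped, second):
    """Mean absolute difference over pixels that have a warped value."""
    total, count = 0.0, 0
    for warped_row, second_row in zip(warped, second):
        for warped_value, second_value in zip(warped_row, second_row):
            if warped_value is not None:
                total += abs(warped_value - second_value)
                count += 1
    return total / count if count else 0.0


def is_three_dimensional(first, second, h_inv, threshold):
    """A planar scene is explained by the homography, so its residual is small."""
    return image_difference(warp_image(first, h_inv), second) >= threshold
```

The intuition is that a photograph held in front of the camera is a plane, so a single homography accounts for its motion between frames and the residual stays small; a real face has depth that a homography cannot model, which leaves a larger residual.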
 
     
     
       3. A method as recited in claim 1, the determining the image difference comprising identifying a first set of sub-regions in the first image, a second set of sub-regions in the second image, and determining a difference between color values of pixels in the first set of sub-regions and the second set of sub-regions.
     
     
       4. A method as recited in claim 1, further comprising:
 repeating the accessing a first image and a second image, generating a warped image, and determining an image difference for multiple pairs of images of the multiple images; and 
 using the image differences determined for the multiple pairs of images in determining whether the face included in the multiple images is a 3-dimensional structure or a flat surface. 
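
Claim 4 does not say how the per-pair image differences are combined. One simple possibility, shown here purely as an assumption, is to average them and compare the mean against a threshold; the function name and the averaging rule are illustrative only.

```python
def aggregate_pair_differences(differences, threshold):
    """Combine image differences from several image pairs into one decision.

    Returns True (3-dimensional structure) when the mean difference meets
    the threshold, and False (flat surface) otherwise. Averaging is an
    assumed combination rule; voting or taking a median would also fit.
    """
    if not differences:
        raise ValueError("need at least one image-pair difference")
    mean_difference = sum(differences) / len(differences)
    return mean_difference >= threshold
```

Using several pairs makes the decision less sensitive to a single noisy frame or a poorly estimated homography.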
 
     
     
       5. A method as recited in claim 1, the multiple images comprising a sequence of three or more images, and one or more intervening images in the sequence of three or more images being between the first image and the second image in the sequence of three or more images.
     
     
       6. A method as recited in claim 1, further comprising:
 determining, based on the multiple images, whether motion is present in one or more components of the face; and 
 the indicating that the user can be authenticated comprising indicating that the user can be authenticated only if both the face included in the multiple images is a 3-dimensional structure and if motion is present in the one or more components of the face. 
 
     
     
       7. A method as recited in claim 6, the one or more components including an eye of the face.
     
     
       8. A method comprising:
 accessing multiple images including a face presented by a user; 
 determining, based on the multiple images, whether the face included in the multiple images is a 3-dimensional structure or a flat surface by:
 accessing a first image and a second image of the multiple images; 
 reconstructing a 3-dimensional scene based on the first image and the second image using structure from motion estimation; 
 fitting the 3-dimensional scene to a plane; 
 analyzing characteristics of the 3-dimensional scene fitted to the plane; and 
 determining that the face included in the multiple images is a 3-dimensional structure if the characteristics meet a threshold, and otherwise determining that the face included in the multiple images is a flat surface; 
 
 indicating that the user can be authenticated if the face included in the multiple images is a 3-dimensional structure; and 
 indicating that the user cannot be authenticated if the face included in the multiple images is a flat surface. 
 
     
     
       9. A method as recited in claim 8, further comprising:
 extracting one or more feature points from the first image and one or more feature points from the second image; 
 matching feature points across the first image and the second image; and 
 using feature points matched across the first image and the second image to reconstruct the 3-dimensional scene. 
 
     
     
       10. A method as recited in claim 8, the analyzing comprising:
 generating a first value that is a distance between a feature in the reconstructed 3D scene and the plane; 
 generating a second value that is a distance between two facial components on the plane; and 
 calculating a ratio of the first value to the second value. 
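
The plane fit of claim 8 and the ratio of claim 10 can be illustrated with the sketch below. It assumes the 3-D points have already been reconstructed (the structure-from-motion step is not shown), fits a plane of the form z = a*x + b*y + c by least squares, and treats the two facial components as 3-D points lying on or near that plane. The function names, the choice of plane parameterisation, and the ">=" reading of "meet a threshold" are all assumptions.

```python
import math


def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def fit_plane(points):
    """Least-squares fit of z = a*x + b*y + c to (x, y, z) points."""
    sxx = sxy = sx = syy = sy = sxz = syz = sz = 0.0
    for x, y, z in points:
        sxx += x * x; sxy += x * y; sx += x
        syy += y * y; sy += y
        sxz += x * z; syz += y * z; sz += z
    normal = [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, float(len(points))]]
    rhs = [sxz, syz, sz]
    det = det3(normal)
    if abs(det) < 1e-12:
        raise ValueError("points are degenerate; cannot fit a plane")
    coeffs = []
    for col in range(3):  # Cramer's rule on the normal equations
        replaced = [row[:] for row in normal]
        for r in range(3):
            replaced[r][col] = rhs[r]
        coeffs.append(det3(replaced) / det)
    return tuple(coeffs)


def depth_ratio(plane, feature, component_a, component_b):
    """Ratio of feature-to-plane distance to the distance between components."""
    a, b, c = plane
    x, y, z = feature
    first_value = abs(a * x + b * y - z + c) / math.sqrt(a * a + b * b + 1.0)
    second_value = math.dist(component_a, component_b)
    return first_value / second_value


def is_three_dimensional_scene(ratio, threshold):
    return ratio >= threshold
```

Dividing by the distance between two facial components makes the measure independent of the arbitrary scale of the reconstruction, so one threshold can be applied regardless of how far the face is from the camera.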
 
     
     
       11. A method as recited in claim 8, further comprising:
 repeating the accessing a first and a second image, reconstructing the 3-dimensional scene, and determining that the face is a 3-dimensional structure or a flat surface for multiple pairs of images of the multiple images; and 
 using the determinations that the face is a 3-dimensional structure or a flat surface for the multiple pairs of images in determining whether the face included in the multiple images is a 3-dimensional structure or a flat surface. 
 
     
     
       12. A method as recited in claim 8, further comprising:
 determining, based on the multiple images, whether motion is present in one or more components of the face; and 
 the indicating that the user can be authenticated comprising indicating that the user can be authenticated only if both the face included in the multiple images is a 3-dimensional structure and if motion is present in the one or more components of the face. 
 
     
     
       13. A method as recited in claim 12, the one or more components including a mouth of the face.
     
     
       14. A computing device comprising:
 an image capture component configured to capture multiple images including a face presented by a user; 
 a live face detection module, implemented at least in part in hardware, configured to:
 access the multiple images; 
 determine, based on the multiple images, whether the face included in the multiple images is a 3-dimensional structure or a flat surface by:
 accessing a first image and a second image of the multiple images; 
 generating a warped image by warping the first image using a homography matrix; 
 determining an image difference between the warped image and the second image; and 
 determining that the face included in the multiple images is a 3-dimensional structure if the difference between the warped image and the second image meets a threshold value, and otherwise determining that the face included in the multiple images is a flat surface; 
 
 indicate that the user can be authenticated if the face included in the multiple images is a 3-dimensional structure; and 
 indicate that the user cannot be authenticated if the face included in the multiple images is a flat surface. 
 
 
     
     
       15. A computing device as recited in claim 14, the live face detection module being further configured to:
 extract one or more feature points from the first image and one or more feature points from the second image; 
 match feature points across the first image and the second image; and 
 use feature points matched across the first image and the second image to generate the homography matrix. 
 
     
     
       16. A computing device as recited in claim 14, the determining the image difference comprising identifying a first set of sub-regions in the first image, a second set of sub-regions in the second image, and determining a difference between color values of pixels in the first set of sub-regions and the second set of sub-regions.
     
     
       17. A computing device as recited in claim 14, the live face detection module being further configured to:
 repeat the accessing a first image and a second image, generating a warped image, and determining an image difference for multiple pairs of images of the multiple images; and 
 use the image differences determined for the multiple pairs of images in determining whether the face included in the multiple images is a 3-dimensional structure or a flat surface. 
 
     
     
       18. A computing device as recited in claim 14, the multiple images comprising a sequence of three or more images, and one or more intervening images in the sequence of three or more images being between the first image and the second image in the sequence of three or more images.
     
     
       19. A computing device as recited in claim 14, the live face detection module being further configured to:
 determine, based on the multiple images, whether motion is present in one or more components of the face; and 
 indicate that the user can be authenticated only if both the face included in the multiple images is a 3-dimensional structure and if motion is present in the one or more components of the face. 
 
     
     
       20. A computing device as recited in claim 19, the live face detection module being further configured to determine whether motion is present in one or more components of the face by:
 extracting a face component from each of the multiple images; 
 matching face components across the multiple images; 
 determining a face component difference between two of the multiple images; and 
 determining whether motion is present in the one or more components based at least in part on the face component difference between the two of the multiple images.
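
The face-component motion check of claim 20 could be sketched as follows. The sketch assumes the component (for example an eye or mouth region) has already been extracted from each image and matched across images, so each entry of `patches` is an equally sized grayscale patch given as a list of rows. The mean-absolute-difference measure, the consecutive-pair comparison, and the function names are assumptions, not details from the claims.

```python
def component_difference(patch_a, patch_b):
    """Mean absolute difference between two equally sized grayscale patches."""
    total, count = 0, 0
    for row_a, row_b in zip(patch_a, patch_b):
        for value_a, value_b in zip(row_a, row_b):
            total += abs(value_a - value_b)
            count += 1
    return total / count


def motion_present(patches, threshold):
    """True if any two consecutive patches differ by at least the threshold."""
    return any(
        component_difference(earlier, later) >= threshold
        for earlier, later in zip(patches, patches[1:])
    )
```

With this measure, a blink or a mouth movement shows up as a large difference between some pair of frames, whereas a printed photograph yields near-identical patches throughout the sequence.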
