CVRL Data Sets

 
 

Biometrics Data Sets


We have been collecting data sets and conducting baseline and advanced personal identification studies using biometric measurements.  We are committed to releasing all data collected to eligible research groups, with appropriate controls to forbid on-line distribution outside the research community. Data is distributed using rsync.


Please make sure you follow these instructions:

  1. Download all applicable license agreements.  Several of our datasets require more than one license agreement.

  2. Have the license agreement reviewed and signed BY AN INDIVIDUAL AUTHORIZED TO MAKE LEGAL COMMITMENTS ON BEHALF OF YOUR INSTITUTION.  WE CANNOT ACCEPT LICENSES SIGNED BY STUDENTS OR FACULTY MEMBERS.  YOUR INSTITUTION'S LEGAL OFFICE MUST REVIEW AND EXECUTE THE LICENSE.

  3. Return the properly signed license agreement via your INSTITUTIONAL  e-mail address (we cannot accept license agreements sent through third party e-mail providers) to cvrl@nd.edu or fax, attention D. Wright, to 1-574-631-9260. 

  4. Include in the e-mail/cover page the full name, title, address and phone number of the institution and institutional point of contact.

Upon our receipt and approval of the license agreement, download instructions will be sent to the licensee.


As of this writing, the following modal collections are available.


  1. The Point and Shoot Face and Person Recognition Challenge (PaSC) data collection.  Information about the PaSC effort may be found at http://face.nist.gov and http://www.cs.colostate.edu/pasc.   To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee. Publications using this database must cite the paper listed in the license agreement.

  2. Prediction of Gender from Iris Image:GFI. (License Agreement) This is a set of (left eye, right eye) iris images acquired using an LG 4000 iris sensor.  It is balanced on gender, and has been used in research on predicting the gender of the person from features of the iris texture.  There are images for 750 males and 750 females, left eye and right eye, for 3,000 total images.  The images are 480x640, 8-bit / pixel intensity images taken under near-infrared illumination.

  3. FACE Features Set.  The FACE Features Set comprises feature patterns for imagery that is amenable to human-assisted face clustering. The features were computed for faces observed in blurry point-and-shoot videos, images of women seen before and after the application of makeup, and photographs of twins.  Please follow the instructions in the README file regarding citation.   Download the file Here

  4. SN-Flip Crowd Video Database (license agreement): Comprising 190 subjects recorded in 28 crowd videos over a two year period, SN-Flip captures variations in illumination, facial expression, scale, focus and pose. The videos were recorded with point-and-shoot camcorders from the Cisco Flip family of products, so the image quality is representative of typical web videos. Ground truth information for subject identities and social groups is included to facilitate future research in vision-driven social network analysis.  To obtain access to this database, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.  Publications using this database must cite the paper listed in the license agreement.

  5. ND-CrossSensor-Iris-2013 Dataset.  (License Agreement) This dataset was initially released for the  Cross Sensor Iris Recognition Challenge associated with the BTAS 2013 conference (http://www3.nd.edu/~asgroi/Competition/CrossSensorCompetition.htm) This dataset occupies about 114 GBytes, and consists of 27 sessions of data with 676 unique subjects. An average session contains 160 unique subjects which have multiple images from both the LG2200 and LG4000 iris sensors. There are 29,986 images from the LG4000 and 116,564 images from the LG2200. Every subject occurs in at least two sessions across the entire data set. This data set spans three years, 2008 to 2010. The initial images are taken from both sensors and are 640 by 480. There are additional images included in this data set, known as the modified LG2200 images. The original images have been stretched vertically by 5% to compensate for the non-unit aspect ratio of the digitizer used in the LG2200 computer-hosted runtime acquisition system (this elongation was suggested by Imad Malhas of IrisGuard Inc. in 2009). Hence these additional images are of size 640 by 504.  

  6. ND_Cosmetic_Contact_Lenses_2013 Dataset. (License Agreement This dataset contains iris images of subjects without contact lenses, with soft contact lenses, and with cosmetic contact lenses, acquired using an LG 4000 and an IrisGuard AD100 iris sensor.  The dataset contains 4,200 TIFF files from the LG4000 sensor, 900 TIFF images form the AD100 sensor, and four metadata files describing the images. The download size is approximately 1.6 GB.For a more thorough description, please see the README document (README).  It corresponds to the dataset in: Doyle, J.S.; Bowyer, K.W.; Flynn, P.J., "Variation in accuracy of textured contact lens detection based on sensor and lens pattern," Biometrics: Theory, Applications and Systems (BTAS), 2013 IEEE Sixth International Conference on , vol., no., pp.1,7, Sept. 29 2013-Oct. 2 2013.  To obtain access to this data set, retrieve the  and forward it to your institution's or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.

  7. ND-CrossSensor-Iris-2012 Dataset.  This dataset was initially released for the  Cross Sensor Iris Recognition Challenge associated with the BTAS 2012 conference (https://sites.google.com/a/nd.edu/btas2012crosssensoririscompetition/).  This dataset occupies about 104 GBytes, and consists of 27 sessions of data with 676 unique subjects. An average session contains 160 unique subjects which have multiple images from both the LG2200 and LG4000 iris sensors. There are 29939 images from the LG4000 and 117503 images from the LG2200. Every subject occurs in at least two sessions across the entire data set. This data set spans three years, 2008 to 2010. The initial images are taken from both sensors and are 640 by 480. There are additional images included in this data set, known as the modified LG2200 images. The original images have been stretched vertically by 5% to compensate for the non-unit aspect ratio of the digitizer used in the LG2200 computer-hosted runtime acquisition system (this elongation was suggested by Imad Malhas of IrisGuard Inc. in 2009). Hence these additional images are of size 640 by 504.  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.

  8. ND-TimeLapseIris-2012. The TimeLapseIris data set contains 6797 images collected from 23 subjects (46 different irises) between January 2004 and October 2008.  It corresponds to the data set used in a chapter in the forthcoming book: Template Aging in Iris Biometrics: Evidence of Increased False Reject Rate in ICE 2006, Sarah Baker, Kevin W. Bowyer, Patrick J. Flynn and P. Jonathon Phillips, in Handbook of Iris Recognition, Mark Burge and Kevin W. Bowyer, editors, Springer, 2012.  to demonstrate the effects of elapsed time between probe and gallery image acquisition on iris recognition system performance.  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee. 

  9. ND-2006 Dataset.  The ND-2006 data set contains a total of 13,450 images containing 6 different types of expressions (Neutral, Happiness, Sadness, Surprise, Disgust, and Other). A total of 888 distinct persons, with as many as 63 images per subject, are available in this data set.  This data set corresponds exactly to the data set described in: Faltemier, T.C.;   Bowyer, K.W.;   Flynn, P.J.; Using a Multi-Instance Enrollment Representation to Improve 3D Face Recognition, Proc. First IEEE International Conference on Biometrics: Theory, Applications, and Systems, September 2007, pp. 1-6.  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.

  10. 3D Twins Expression Challenge ("3D TEC") Dataset.  The data set contains 3D face scans for 107 pairs of twins. There are 107 x 2 = 214 individuals, each with a 3D face scan with a smiling expression and a scan with a neutral expression, and so 214 x 2 = 428 total scans. The scans were acquired with a Minolta Vivid 910.  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.

  11. ND-Iris-Template-Aging-2008-2010. The ND_Iris_Template_Aging_2008-2010 dataset contains sequences of iris images of different persons, acquired using an LG 4000 iris sensor.  Images are from Spring 2008, Spring 2009, and Spring 2010.  This allows two different one-year template aging studies, 2008-2009 and 2009-2010, and one two-year template aging study, 2008-2010.  The dataset contains 11,776 TIFF files and it’s download size is 4.5 GB.  This dataset is used in the paper:  Analysis of Template Aging in Iris Biometrics,  Samuel P. Fenker and Kevin W. Bowyer, IEEE Computer Society Biometrics Workshop, June 2012.  .  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.

  12. ND-TWINS-2009-2010.  The data set contains 24050 color photographs of the faces of 435 attendees at the Twins Days Festivals in Twinsburg, Ohio in 2009 and 2010. All images were captured by Nikon D90 SLR cameras. Images were captured under natural light in "indoor" and "outdoor" configurations ("indoor" was a tent).  Facial yaw varied from -90 to +90 degrees in steps of 45 degrees (zero degrees was frontal).  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.

  13. ND-QO-Flip Crowd Video Database (license agreement): The database contains 14 crowd videos of 90 subjects, five of whom appear in multiple videos and 85 of whom appear in one video.  These videos were acquired between November 2009 and May 2010.  To obtain access to this database, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.  Publications using this database must cite the paper listed in the license agreement.

  14. Collection ND_Iris_Contact_Lenses_2010.  This a dataset of iris images that was used in a study of the effects of wearing contact lenses on the performance of iris recognition: "Degradation of Iris Recognition Performance Due to Non-Cosmetic Prescription Contact Lenses", Sarah E. Baker, Amanda Hentz, Kevin W. Bowyer, and Patrick J. Flynn, Computer Vision and Image Understanding 114 (9), 1030-1044, September 2010.  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members.  Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.  Publications using this database must cite the paper listed in the license agreement.

  15. ND-IRIS-0405 Iris Image Dataset.  A technical report describing this data set is here. The data set contains 64,980 iris images obtained from 356 subjects (712 unique irises) between January 2004 and May 2005.  To obtain access to this data set, retrieve the license agreement and forward it to your institution or company’s legal office for review and signature. The license agreement MUST be reviewed and signed by an individual authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Once the form is signed, it may be scanned and emailed to cvrl@nd.edu, or faxed to the attention of D. Wright at +1 574 631-9260.  Upon receipt and approval of the license agreement, download instructions will be sent to the licensee.  Publications using this database must cite the paper listed in the license agreement.

  16. Multiple Biometric Grand Challenge (MBGC) version 2 data collection.  Information about the MBGC program may be found here.  The MBGC v2 and MBGC v1 data sets are separately licensed; MBGC v1 licensees must complete a new license for MBGC v2. (MBGC-v2.0-data-license-agreement) (MBGC-v2.0-UTD-video-DB-permission) (MBGC-v2.0-UTD-video-license-agreement)

  17. Multiple Biometric Grand Challenge (MBGC) version 1 data collection.  Information about the MBGC program may be found here(MBGC v1.0 license agreement)

  18. Iris Challenge Evaluation (ICE 2005) data collection. With the release of the ND-IRIS-0405 data set described above, the ICE 2005 data set is no longer available.

  19. Face Recognition Grand Challenge (FRGC v2.0) data collection. Information about the FRGC program may be found here. .  Note: the FRGC 1.0a data collection has been superseded by the FRGC v2.0 collection and is no longer available. As of 4/2/2014 the Bee Software has been separated from the FRGC 2.0 dataset.  If you wish to download it as well as the FRGC 2.0 dataset, please download and execute the Bee software license agreement.   (FRGC 2.0 license agreement) (BEE software license agreement)

  20. Face and Ocular Challenge Series (FOCS) data collection.  Information about the FOCS program may be found here.  The license agreement, UTD-video-DB-Permission agreement and the UTD-video-license-agreement must all be properly executed and sent in.  (license agreement)(UTD-video-DB-permission) (UTD-video-license-agreement)


The following UND biometrics data sets may be obtained using instructions at the bottom of the page.


  1. Collection B (license agreement) : 33,287 visible-light frontal face images captured from 487 human subjects from 2002 through 2004.  Each subject was photographed with a high-resolution digital camera (1600x1200 or 2272x1704) under different lighting and expression conditions. Many subjects were photographed every week for 10 weeks in the Spring of 2002, 13 weeks in the fall of 2002, and 15 weeks in the spring of 2003. The number of images per subject ranges from 4 to 227 with an average of 68. Hence, this database provides a significant amount of 'repeat data' to assess performance of face recognition systems with respect to time elapsed since enrollment.

  2. Collection C (license agreement): 2,492 LWIR frontal face images from 241 human subjects captured in 2002.  All images are 320x240 and were captured in 2002 by a Merlin-Uncooled camera purchased from Indigo Systems in 2001.

  3. Collection D (license agreement) : 953 3D (+ corresponding 2D) frontal face images from 277 human subjects, captured in 2003.  These images were acquired with a Minolta Vivid 900 3D range scanner.

  4. Collection E (license agreement) : 464 visible-light profile (ear) images from 114 human subjects captured in 2002.

  5. Collection F (license agreement) : 942 3D (+ corresponding 2D) profile (ear) images from 302 human subjects captured in 2003 and 2004.

  6. Collection G (license agreement): 738 3D (+ corresponding 2D) profile (ear) images from 235 human subjects captured between 2003 and 2005.

  7. Collection H (license agreement): 1191 3D (+ corresponding 2D) images of the back  (non-palm) portion of 223 different human hands captured between 2003 and 2005.

  8. Collection J2 (license agreement): 1800 3D (+ corresponding 2D) profile (ear) images from 415 human subjects captured between 2003 and 2005. Corresponds to data used in Yan and Bowyer, "Biometric recognition using three-dimensional ear shape," PAMI 29(8), August 2007.

  9. Collection X1 (license agreement) : 2292 IR frontal face images and 2292 visible frontal face images from 82 human subjects captured from 2002-2004.

  10. Collection X2 (license agreement): 1191 3D (+ corresponding 2D) images of the back of the hand from 223 human subjects, captured from 2003-2004

  11. Collection NDOff-2007 (license agreement): 6940 3D (+ corresponding 2D) images of 387 human subject faces. The subjects were asked to orient their heads in a variety of ways. The face yaw and pitch are encoded in the file names.  Corresponds approximately to data used in: + Timothy Faltemier, Kevin W. Bowyer and Patrick J. Flynn, Rotated Profile Signatures for robust 3D feature detection, Proc. Automatic Face & Gesture Recognition (FG ’08), Sept. 2008, Amsterdam.        http://dx.doi.org/10.1109/AFGR.2008.4813413


If you wish to obtain any element of the UND Biometrics Database:

  1. Download the applicable license agreement.

  2. The license agreement MUST be reviewed and signed by the individual or entity authorized to make legal commitments on behalf of the institution or corporation.  We cannot accept licenses signed by students or faculty members. Your institution’s legal office must review and execute the license.

  3. Return the properly signed license agreement of the collection via fax (+1-574-631-9260, attention D. Wright) or email a scanned copy to cvrl@nd.edu .

  4. Wait for approval and information about shipment/transmission of the data. All the data will be distributed via the internet using the rsync utility.