Dynamic Batch Size Selection for Batch Mode Active Learning in Biometrics

Publication Type:

Conference Paper


S. Chakraborty, V. Balasubramanian, S. Panchanathan


IEEE International Conference on Machine Learning and Applications (ICMLA) (2010)


Robust biometric recognition is of paramount importance in security and surveillance applications. In face-based biometric systems, data is usually collected using a video camera with a high frame rate, and thus the captured data has high redundancy. Selecting the appropriate instances from this data to update a classification model is a significant and valuable challenge. Active learning methods have gained popularity for identifying the salient and exemplar data instances from superfluous sets. Batch mode active learning schemes attempt to select a batch of samples simultaneously rather than updating the model after selecting every single data point. Existing work on batch mode active learning assumes a fixed batch size, which is not a practical assumption in biometric recognition applications. In this paper, we propose a novel framework to dynamically select the batch size using clustering-based unsupervised learning techniques. We also present a batch mode active learning strategy specially suited to handle the high redundancy in biometric datasets. The results obtained on the challenging VidTIMIT and MOBIO datasets corroborate the superiority of dynamic batch size selection over static batch size and also demonstrate the potential of the proposed active learning scheme for use in real-world biometric recognition applications.
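The abstract does not say which clustering method the framework uses or how samples are picked within a batch, so the following is only a minimal sketch of the general idea that the batch size can follow the number of clusters found in the unlabeled pool. It assumes greedy leader clustering with a hypothetical `radius` parameter and picks the first member of each cluster; the function names and toy data are illustrative and are not taken from the paper.

```python
import math


def leader_clusters(points, radius):
    """Greedy leader clustering (an assumed stand-in, not the paper's method).

    A point joins the first cluster whose leader lies within `radius`;
    otherwise it starts a new cluster and becomes its leader.
    """
    leaders = []   # one leader point per cluster
    members = []   # indices of the points assigned to each cluster
    for i, p in enumerate(points):
        for c, leader in enumerate(leaders):
            if math.dist(p, leader) <= radius:
                members[c].append(i)
                break
        else:
            # No existing leader was close enough: open a new cluster.
            leaders.append(p)
            members.append([i])
    return leaders, members


def dynamic_batch(points, radius):
    """Return (batch_size, selected_indices), one sample per cluster."""
    _, members = leader_clusters(points, radius)
    selected = [idx[0] for idx in members]  # first member of each cluster
    return len(selected), selected


# Toy 2-D feature vectors: near-duplicate frames versus varied frames.
redundant = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0)]
diverse = [(0.0, 0.0), (3.0, 0.0), (0.0, 3.0), (5.0, 5.0), (9.0, 1.0)]

print(dynamic_batch(redundant, radius=1.0))
print(dynamic_batch(diverse, radius=1.0))
```

With these toy inputs the redundant pool yields a smaller batch than the diverse pool, even though both contain five samples. A full active learning strategy would normally also weigh how uncertain the current classifier is about each sample, which this sketch leaves out.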


Dr. Shayok Chakraborty

Assistant Research Professor, School of Computing, Informatics, and Decision Systems Engineering; Associate Director, Center for Cognitive Ubiquitous Computing (CUbiC)

Vineeth N Balasubramanian

Assistant Research Professor

Dr. Sethuraman "Panch" Panchanathan

Director, National Science Foundation


The rapid advance of technology and the widespread emergence of modern technological equipment have resulted in the generation of large quantities of digital data. This has expanded the possibilities of solving real-world problems using computational learning…