Zero-Shot Object DetectionAnkan Bansal, Karan Sikka*, Gaurav Sharma^, Rama Chellappa, and Ajay Divakaran*
* SRI International, Princeton, NJ ^ NEC Labs America, Cupertino, CA Fig. 1: We highlight the task of zero-shot object detection where object classes “arm”, “hand”, and “shirt” are observed (seen) during training, while classes “skirt”, and “shoulder” are not seen. These unseen classes are localized by our approach that leverages semantic relationships, obtained via word embeddings, between seen and unseen classes along with the proposed zero-shot detection framework. The example has been generated by our model on images from VisualGenome dataset.
ResultsWe highlight the results obtained on unseen classes in the following figure. Fig. 2: This figure shows some detections made by our background-aware methods. We have used Latent Assignment Based (LAB) model for VisualGenome (rows 1-2) and the Static Background (SB) model (rows 3-4) for MSCOCO. Reasonable detections are shown in blue and two failure cases in red. This figure highlights the effectiveness of our methods in being able to detect unseen classes in a zero-shot setting. DownloadsWe are releasing our seen and unseen class names and train and test splits. This is an attempt to standardize the work done in this area. VGTrain and Test splits Seen and Unseen classes Synset-Word Dictionary MSCOCOTrain and Test splits
Seen and Unseen classes Synset-Word Dictionary PaperOur paper is available here. I also wrote a short blog post about the paper. If you found the paper and data useful, please consider citing our paper using the bibtex: @inproceedings{bansal2018zero, title={Zero-Shot Object Detection}, author={Bansal, Ankan and Sikka, Karan and Sharma, Gaurav and Chellappa, Rama and Divakaran, Ajay}, booktitle={Proceedings of the European Conference on Computer Vision (ECCV)}, pages={384--400}, year={2018} } Acknowledgments This project is sponsored by the Air Force Research Laboratory (AFRL) and Defense Advanced
Research Projects Agency (DARPA) under the contract number USAF/AFMC AFRL FA8750-16-C-0158. This work is also supported by the Intelligence Advanced Research Projects Activity (IARPA)
via Department of Interior/Interior Business Center (DOI/IBC) contract number D17PC00345. The
U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes not
withstanding any copyright annotation thereon. |