On Label Granularity and Object Localization

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

  • Elijah Cole
  • Kimberly Wilber
  • Grant Van Horn
  • Xuan Yang
  • Marco Fornoni
  • Pietro Perona
  • Belongie, Serge
  • Andrew Howard
  • Oisin Mac Aodha

Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granularity. Is it an animal, a bird, or a great horned owl? Which image-level labels should we use? In this paper we study the role of label granularity in WSOL. To facilitate this investigation we introduce iNatLoc500, a new large-scale fine-grained benchmark dataset for WSOL. Surprisingly, we find that choosing the right training label granularity provides a much larger performance boost than choosing the best WSOL algorithm. We also show that changing the label granularity can significantly improve data efficiency.

OriginalsprogEngelsk
TitelComputer Vision – ECCV 2022 : 17th European Conference, Proceedings
RedaktørerShai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
Antal sider17
ForlagSpringer
Publikationsdato2022
Sider604-620
ISBN (Trykt)9783031200793
DOI
StatusUdgivet - 2022
Begivenhed17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel
Varighed: 23 okt. 202227 okt. 2022

Konference

Konference17th European Conference on Computer Vision, ECCV 2022
LandIsrael
ByTel Aviv
Periode23/10/202227/10/2022
NavnLecture Notes in Computer Science
Vol/bind13670 LNCS
ISSN0302-9743

Bibliografisk note

Funding Information:
Acknowledgements. We thank the iNaturalist community for sharing images and species annotations. This work was supported by the Caltech Resnick Sustainability Institute, an NSF Graduate Research Fellowship (grant number DGE1745301), and the Pioneer Centre for AI (DNRF grant number P1).

Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Links

ID: 342672283