The multidimensional wisdom of crowds

Research output: Contribution to journal › Conference article › Research › peer-review

Peter Welinder
Steve Branson
Belongie, Serge
Pietro Perona

Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important method for annotating large datasets. We present a method for estimating the underlying value (e.g. the class) of each image from (noisy) annotations provided by multiple annotators. Our method is based on a model of the image formation and annotation process. Each image has different characteristics that are represented in an abstract Euclidean space. Each annotator is modeled as a multidimensional entity with variables representing competence, expertise and bias. This allows the model to discover and represent groups of annotators that have different sets of skills and knowledge, as well as groups of images that differ qualitatively. We find that our model predicts ground truth labels on both synthetic and real data more accurately than state of the art methods. Experiments also show that our model, starting from a set of binary labels, may discover rich information, such as different "schools of thought" amongst the annotators, and can group together images belonging to separate categories.

Original language	English
Journal	Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010
Publication status	Published - 2010
Externally published	Yes
Event	24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010 - Vancouver, BC, Canada Duration: 6 Dec 2010 → 9 Dec 2010

Conference

Conference	24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010
Country	Canada
City	Vancouver, BC
Period	06/12/2010 → 09/12/2010
Sponsor	Neural Information Processing Systems (NIPS)

ID: 302047406

Forskning

The multidimensional wisdom of crowds

Conference