The fastest pedestrian detector in the west

Research output: Contribution to conferencePaperResearchpeer-review

We demonstrate a multiscale pedestrian detector operating in near real time (∼6 fps on 640×480 images) with state-of-the-art detection performance. The computational bottleneck of many modern detectors is the construction of an image pyramid, typically sampled at 8-16 scales per octave, and associated feature computations at each scale. We propose a technique to avoid constructing such a finely sampled image pyramid without sacrificing performance: our key insight is that for a broad family of features, including gradient histograms, the feature responses computed at a single scale can be used to approximate feature responses at nearby scales. The approximation is accurate within an entire scale octave. This allows us to decouple the sampling of the image pyramid from the sampling of detection scales. Overall, our approximation yields a speedup of 10-100 times over competing methods with only a minor loss in detection accuracy of about 1-2% on the Caltech Pedestrian dataset across a wide range of evaluation settings. The results are confirmed on three additional datasets (INRIA, ETH, and TUD-Brussels) where our method always scores within a few percent of the state-of-the-art while being 1-2 orders of magnitude faster. The approach is general and should be widely applicable.

Original languageEnglish
Publication date2010
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event2010 21st British Machine Vision Conference, BMVC 2010 - Aberystwyth, United Kingdom
Duration: 31 Aug 20103 Sep 2010

Conference

Conference2010 21st British Machine Vision Conference, BMVC 2010
CountryUnited Kingdom
CityAberystwyth
Period31/08/201003/09/2010

ID: 301831759