The fastest pedestrian detector in the west
Research output: Contribution to conference › Paper › Research › peer-review
We demonstrate a multiscale pedestrian detector operating in near real time (∼6 fps on 640×480 images) with state-of-the-art detection performance. The computational bottleneck of many modern detectors is the construction of an image pyramid, typically sampled at 8-16 scales per octave, and associated feature computations at each scale. We propose a technique to avoid constructing such a finely sampled image pyramid without sacrificing performance: our key insight is that for a broad family of features, including gradient histograms, the feature responses computed at a single scale can be used to approximate feature responses at nearby scales. The approximation is accurate within an entire scale octave. This allows us to decouple the sampling of the image pyramid from the sampling of detection scales. Overall, our approximation yields a speedup of 10-100 times over competing methods with only a minor loss in detection accuracy of about 1-2% on the Caltech Pedestrian dataset across a wide range of evaluation settings. The results are confirmed on three additional datasets (INRIA, ETH, and TUD-Brussels) where our method always scores within a few percent of the state-of-the-art while being 1-2 orders of magnitude faster. The approach is general and should be widely applicable.
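The decoupling described in the abstract can be illustrated with a small scale-planning sketch. It is not the authors' implementation. It assumes that features are actually computed at only one pyramid level per octave, and that every other detection scale borrows from the nearest computed level. The helper name `plan_scales` and its parameters are hypothetical. The approximation step itself, which maps features from the source level to the target scale, is not shown because the abstract does not specify it.

```python
import math


def plan_scales(n_octaves, scales_per_octave):
    """Pair each detection scale with the computed pyramid level it borrows from.

    Hypothetical helper (not from the paper): real feature computation is
    assumed to happen only at scales 1, 1/2, 1/4, ... (one per octave).
    Each detection scale is assigned to the nearest such level in log2 space.
    """
    plan = []
    for i in range(n_octaves * scales_per_octave):
        octaves_down = i / scales_per_octave  # -log2 of the detection scale
        # Round half up, then clamp so only n_octaves real levels are used.
        level = min(math.floor(octaves_down + 0.5), n_octaves - 1)
        detection_scale = 2.0 ** -octaves_down
        source_scale = 2.0 ** -level
        # ratio = how far the features must be extrapolated from the source.
        plan.append((detection_scale, source_scale, detection_scale / source_scale))
    return plan


if __name__ == "__main__":
    plan = plan_scales(n_octaves=3, scales_per_octave=8)
    real_levels = {src for _, src, _ in plan}
    print(f"{len(plan)} detection scales, {len(real_levels)} computed levels")
    for det, src, ratio in plan[:6]:
        print(f"detect at {det:.3f} using features from {src:.3f} (ratio {ratio:.3f})")
```

With 3 octaves and 8 scales per octave, this plan serves 24 detection scales from 3 computed pyramid levels, and no detection scale lies more than one octave from its source level.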
Original language | English |
---|---|
Publication date | 2010 |
Publication status | Published - 2010 |
Externally published | Yes |
Event | 2010 21st British Machine Vision Conference, BMVC 2010, Aberystwyth, United Kingdom (31 Aug 2010 → 3 Sep 2010) |