Feature representation of RGB-D images using joint spatial-depth feature pooling

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Feature representation of RGB-D images using joint spatial-depth feature pooling. / Pan, Hong; Olsen, Søren Ingvor; Zhu, Yaping.

I: Pattern Recognition Letters, Bind 80, Nr. 1, 2016, s. 239-248.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Pan, H, Olsen, SI & Zhu, Y 2016, 'Feature representation of RGB-D images using joint spatial-depth feature pooling', Pattern Recognition Letters, bind 80, nr. 1, s. 239-248. https://doi.org/10.1016/j.patrec.2016.04.001

APA

Pan, H., Olsen, S. I., & Zhu, Y. (2016). Feature representation of RGB-D images using joint spatial-depth feature pooling. Pattern Recognition Letters, 80(1), 239-248. https://doi.org/10.1016/j.patrec.2016.04.001

Vancouver

Pan H, Olsen SI, Zhu Y. Feature representation of RGB-D images using joint spatial-depth feature pooling. Pattern Recognition Letters. 2016;80(1):239-248. https://doi.org/10.1016/j.patrec.2016.04.001

Author

Pan, Hong ; Olsen, Søren Ingvor ; Zhu, Yaping. / Feature representation of RGB-D images using joint spatial-depth feature pooling. I: Pattern Recognition Letters. 2016 ; Bind 80, Nr. 1. s. 239-248.

Bibtex

@article{506f9692c5dc428b8c6b8eb6fedb6f3e,
title = "Feature representation of RGB-D images using joint spatial-depth feature pooling",
abstract = "Recent development in depth imaging technology makes acquisition of depth information easier. With the additional depth cue, RGB-D cameras can provide effective support for many RGB-D perception tasks beyond traditional RGB information. However, current feature representation based on RGB-D images utilizes depth information only to extract local features, without considering it to improve robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split a 2D image plane into sub-regions for feature pooling of RGB-D images. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards their depth topological structures. Instead, we propose a novel joint spatial-depth pooling (JSDP) scheme which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and along the depth direction. By combining the JSDP with standard feature extraction and feature encoding modules, we outperform state-of-the-art methods on benchmarks for RGB-D object classification, detection and scene recognition.",
keywords = "Faculty of Science, Computer Science, Image analysis, Computer Vision, RGB-D feature representation",
author = "Hong Pan and Olsen, {S{\o}ren Ingvor} and Yaping Zhu",
year = "2016",
doi = "10.1016/j.patrec.2016.04.001",
language = "English",
volume = "80",
pages = "239--248",
journal = "Pattern Recognition Letters",
issn = "0167-8655",
publisher = "Elsevier BV * North-Holland",
number = "1",

}

RIS

TY - JOUR

T1 - Feature representation of RGB-D images using joint spatial-depth feature pooling

AU - Pan, Hong

AU - Olsen, Søren Ingvor

AU - Zhu, Yaping

PY - 2016

Y1 - 2016

N2 - Recent development in depth imaging technology makes acquisition of depth information easier. With the additional depth cue, RGB-D cameras can provide effective support for many RGB-D perception tasks beyond traditional RGB information. However, current feature representation based on RGB-D images utilizes depth information only to extract local features, without considering it to improve robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split a 2D image plane into sub-regions for feature pooling of RGB-D images. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards their depth topological structures. Instead, we propose a novel joint spatial-depth pooling (JSDP) scheme which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and along the depth direction. By combining the JSDP with standard feature extraction and feature encoding modules, we outperform state-of-the-art methods on benchmarks for RGB-D object classification, detection and scene recognition.

AB - Recent development in depth imaging technology makes acquisition of depth information easier. With the additional depth cue, RGB-D cameras can provide effective support for many RGB-D perception tasks beyond traditional RGB information. However, current feature representation based on RGB-D images utilizes depth information only to extract local features, without considering it to improve robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split a 2D image plane into sub-regions for feature pooling of RGB-D images. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards their depth topological structures. Instead, we propose a novel joint spatial-depth pooling (JSDP) scheme which further partitions SPM using the depth cue and pools features simultaneously in 2D image plane and along the depth direction. By combining the JSDP with standard feature extraction and feature encoding modules, we outperform state-of-the-art methods on benchmarks for RGB-D object classification, detection and scene recognition.

KW - Faculty of Science

KW - Computer Science

KW - Image analysis

KW - Computer Vision

KW - RGB-D feature representation

U2 - 10.1016/j.patrec.2016.04.001

DO - 10.1016/j.patrec.2016.04.001

M3 - Journal article

VL - 80

SP - 239

EP - 248

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

SN - 0167-8655

IS - 1

ER -

ID: 165352582