SITTA: Single Image Texture Translation for Data Augmentation
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Standard
SITTA: Single Image Texture Translation for Data Augmentation. / Li, Boyi; Cui, Yin; Lin, Tsung Yi; Belongie, Serge.
Computer Vision – ECCV 2022 Workshops, Proceedings: Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part II. ed. / Leonid Karlinsky; Tomer Michaeli; Ko Nishino. Springer, 2023. p. 3-20 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 13802 LNCS).
RIS
TY - GEN
T1 - SITTA: Single Image Texture Translation for Data Augmentation
T2 - 17th European Conference on Computer Vision, ECCV 2022
AU - Li, Boyi
AU - Cui, Yin
AU - Lin, Tsung Yi
AU - Belongie, Serge
N1 - Funding Information: Acknowledgement. This work was supported in part by the Pioneer Centre for AI, DNRF grant number P1. Publisher Copyright: © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Recent advances in data augmentation enable one to translate images by learning the mapping between a source domain and a target domain. Existing methods tend to learn the distributions by training a model on a variety of datasets, with results evaluated largely in a subjective manner. Relatively few works in this area, however, study the potential use of image synthesis methods for recognition tasks. In this paper, we propose and explore the problem of image translation for data augmentation. We first propose a lightweight yet efficient model for translating texture to augment images based on a single input of source texture, allowing for fast training and testing, referred to as Single Image Texture Translation for data Augmentation (SITTA). Then we explore the use of augmented data in long-tailed and few-shot image classification tasks. We find that the proposed augmentation method and workflow are capable of translating the texture of input data into a target domain, leading to consistently improved image recognition performance. Finally, we examine how SITTA and related image translation methods can provide a basis for a data-efficient, “augmentation engineering” approach to model training.
AB - Recent advances in data augmentation enable one to translate images by learning the mapping between a source domain and a target domain. Existing methods tend to learn the distributions by training a model on a variety of datasets, with results evaluated largely in a subjective manner. Relatively few works in this area, however, study the potential use of image synthesis methods for recognition tasks. In this paper, we propose and explore the problem of image translation for data augmentation. We first propose a lightweight yet efficient model for translating texture to augment images based on a single input of source texture, allowing for fast training and testing, referred to as Single Image Texture Translation for data Augmentation (SITTA). Then we explore the use of augmented data in long-tailed and few-shot image classification tasks. We find that the proposed augmentation method and workflow are capable of translating the texture of input data into a target domain, leading to consistently improved image recognition performance. Finally, we examine how SITTA and related image translation methods can provide a basis for a data-efficient, “augmentation engineering” approach to model training.
U2 - 10.1007/978-3-031-25063-7_1
DO - 10.1007/978-3-031-25063-7_1
M3 - Article in proceedings
AN - SCOPUS:85151063823
SN - 9783031250620
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 3
EP - 20
BT - Computer Vision – ECCV 2022 Workshops, Proceedings
A2 - Karlinsky, Leonid
A2 - Michaeli, Tomer
A2 - Nishino, Ko
PB - Springer
Y2 - 23 October 2022 through 27 October 2022
ER -