Video text detection and recognition: Dataset and benchmark

Publication: Contribution to journal › Conference article › Research › peer-reviewed

Phuc Xuan Nguyen
Kai Wang
Serge Belongie

This paper focuses on the problem of text detection and recognition in videos. Even though text detection and recognition in images has seen much progress in recent years, relatively little work has been done to extend these solutions to the video domain. In this work, we extend an existing end-to-end solution for text recognition in natural images to video. We explore a variety of methods for training local character models and for capitalizing on the temporal redundancy of text in video. We present detection performance using the Video Analysis and Content Extraction (VACE) benchmarking framework on the ICDAR 2013 Robust Reading Challenge 3 video dataset and on a new video text dataset. We also propose a new performance metric based on precision-recall curves to measure the performance of text recognition in videos. Using this metric, we provide early video text recognition results on both datasets.

Original language	English
Journal	2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
Pages (from-to)	776-783
Number of pages	8
DOI	https://doi.org/10.1109/WACV.2014.6836024
Status	Published - 2014
Externally published	Yes
Event	2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014 - Steamboat Springs, CO, USA. Duration: 24 Mar 2014 → 26 Mar 2014

Conference

Conference	2014 IEEE Winter Conference on Applications of Computer Vision, WACV 2014
Country	USA
City	Steamboat Springs, CO
Period	24/03/2014 → 26/03/2014

ID: 302044488