This chapter presents a method for caption text detection. The proposed
method will be included in a generic indexing system dealing with other semantic
concepts which are to be automatically detected as well. To have a coherent detection
system, the various object detection algorithms use a common image description,
a hierarchical region-based image model. The proposed method takes advantage
of texture and geometric features to detect the caption text. Texture features are
estimated using wavelet analysis and mainly applied for text candidate spotting. In
turn, text characteristics verification relies on geometric features,which are estimated
exploiting the region-based image model. Analysis of the region hierarchy provides
the final caption text objects. The final step of consistency analysis for output is
performed by a binarization algorithm that robustly estimates the thresholds on the
caption text area of support.
CitationLeón, M. [et al.]. Region-based caption text extraction. "Lecture Notes in Electrical Engineering", 2013, vol. 158, p. 21-36.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder. If you wish to make any use of the work not provided for in the law, please contact: firstname.lastname@example.org