Mostra el registre d'ítem simple

dc.contributorEscalera, Sergio
dc.contributorLu, Shijian
dc.contributor.authorPolzounov, Andrei
dc.date.accessioned2017-03-03T11:50:20Z
dc.date.available2017-03-03T11:50:20Z
dc.date.issued2017-01
dc.identifier.urihttp://hdl.handle.net/2117/101911
dc.descriptionEn col·laboració amb la Universitat de Barcelona (UB) i la Universitat Rovira i Virgili (URV)
dc.description.abstractIn recent years, text recognition has achieved remarkable success in recognizing scanned document text. However, word recognition in natural images is still an open problem, which generally requires time consuming post-processing steps. We present a novel architecture for individual word detection in scene images based on semantic segmentation. Our contributions are twofold: the concept of WordFence, which detects border areas surrounding each individual word and a unique pixelwise weighted softmax loss function which penalizes background and emphasizes small text regions. WordFence ensures that each word is detected individually, and the new loss function provides a strong training signal to both text and word border localization. The proposed technique avoids intensive post-processing by combining semantic word segmentation with a voting scheme for merging segmentations of multiple scales, producing an end-to-end word detection system. We achieve superior localization recall on common benchmark datasets - 92% recall on ICDAR11 and ICDAR13 and 63% recall on SVT. Furthermore, end-to-end word recognition achieves state-of-the-art 86% F-Score on ICDAR13.
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
dc.subject.lcshMachine learning
dc.subject.lcshNeural networks (Computer science)
dc.subject.otherartificial intelligence
dc.subject.otherobject detection
dc.subject.othertext detection
dc.subject.othertext recognition
dc.titleWordFences: Text localization and recognition
dc.typeMaster thesis
dc.subject.lemacAprenentatge automàtic
dc.subject.lemacXarxes neuronals (Informàtica)
dc.identifier.slug122778
dc.rights.accessOpen Access
dc.date.updated2017-02-11T05:00:07Z
dc.audience.educationlevelMàster
dc.audience.mediatorFacultat d'Informàtica de Barcelona
dc.audience.degreeMÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2012)
dc.contributor.covenanteeInstitute for Infocomm Research


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple