Mostra el registre d'ítem simple

dc.contributor.authorAlegria, Iñaki
dc.contributor.authorAranberri, Nora
dc.contributor.authorComas Umbert, Pere Ramon
dc.contributor.authorFresno, Víctor
dc.contributor.authorGamallo, Pablo
dc.contributor.authorPadró, Lluís
dc.contributor.authorSan Vicente Roncal, Iñaki
dc.contributor.authorTurmo Borras, Jorge
dc.contributor.authorZubiaga, Arkaitz
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics
dc.date.accessioned2014-07-07T11:09:10Z
dc.date.available2014-07-07T11:09:10Z
dc.date.created2014
dc.date.issued2014
dc.identifier.citationAlegria, I. [et al.]. TweetNorm_es: an annotated corpus for Spanish microtext normalization. A: International Conference on Language Resources and Evaluation. "Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)". Reykjavik: European Language Resources Association (ELRA), 2014, p. 2274-2278.
dc.identifier.isbn978-2-9517408-8-4
dc.identifier.urihttp://hdl.handle.net/2117/23411
dc.description.abstractIn this paper we introduce TweetNorm es, an annotated corpus of tweets in Spanish language, which we make publicly available under the terms of the CC-BY license. This corpus is intended for development and testing of microtext normalization systems. It was created for Tweet-Norm, a tweet normalization workshop and shared task, and is the result of a joint annotation effort from different research groups. In this paper we describe the methodology defined to build the corpus as well as the guidelines followed in the annotation process. We also present a brief overview of the Tweet-Norm shared task, as the first evaluation environment where the corpus was used.
dc.format.extent5 p.
dc.language.isoeng
dc.publisherEuropean Language Resources Association (ELRA)
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subject.lcshSpanish language -- 21st century
dc.subject.otherMicrotext normalization
dc.subject.otherTwitter
dc.subject.otherphonology
dc.titleTweetNorm_es: an annotated corpus for Spanish microtext normalization
dc.typeConference lecture
dc.subject.lemacCastellà -- Fonologia
dc.contributor.groupUniversitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://www.lrec-conf.org/proceedings/lrec2014/pdf/442_Paper.pdf
dc.rights.accessOpen Access
local.identifier.drac14920822
dc.description.versionPostprint (published version)
local.citation.authorAlegria, I.; Aranberri, N.; Comas, P.R.; Fresno, V.; Gamallo, P.; Padro, L.; San Vicente, I.; Turmo, J.; Zubiaga, A.
local.citation.contributorInternational Conference on Language Resources and Evaluation
local.citation.pubplaceReykjavik
local.citation.publicationNameProceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
local.citation.startingPage2274
local.citation.endingPage2278


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple