Mostra el registre d'ítem simple

dc.contributor.authorGijón Agudo, Manuel
dc.contributor.authorVilalta Arias, Armand
dc.contributor.authorGarcia Gasulla, Dario
dc.contributor.otherUniversitat Politècnica de Catalunya. Doctorat en Intel·ligència Artificial
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2020-05-19T07:40:40Z
dc.date.available2020-05-19T07:40:40Z
dc.date.issued2019
dc.identifier.citationGijón, M.; Vilalta, A.; Garcia-Gasulla, D. Analyzing distances in word embeddings and their relation with seme analysis. A: International Conference of the Catalan Association for Artificial Intelligence. "Proceedings of the 22nd International Conference of the Catalan Association for Artificial Intelligence". IOS Press, 2019, p. 407-416.
dc.identifier.isbn978-1-64368-015-6
dc.identifier.urihttp://hdl.handle.net/2117/188023
dc.description.abstractWord embeddings have recently become a fundamental tool of Natural Language Processing, with application to tasks like machine translation or image annotation. The high-dimensional space defined by these embeddings is typically explored and exploited through distance-based operations. In this paper we work on the problem of finding words related between them in a text embedding. This relationship can be of different kind, we focus in semantic relations like synonymy and antonym. We explore the idea of using the distance between norms instead of, like other authors has done before, the vector that units them. We present different norms, some of them well known in the literature and others no so widely used and also we introduce a new one and its theoretical mathematical framework. We also give an explanation of why them work properly or not and compare their performance on the two most used embeddings, GloVe and Word2Vec.
dc.format.extent10 p.
dc.language.isoeng
dc.publisherIOS Press
dc.subjectÀrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Llenguatge natural
dc.subject.lcshComputational linguistics
dc.subject.lcshSemantic computing
dc.subject.otherWord embeddings
dc.subject.otherEmbedding space
dc.subject.otherDistances
dc.subject.otherSemantic relations
dc.subject.otherWordNet
dc.subject.otherHigh dimensional vector spaces
dc.titleAnalyzing distances in word embeddings and their relation with seme analysis
dc.typeConference report
dc.subject.lemacLingüística computacional
dc.identifier.doi10.3233/FAIA190153
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ebooks.iospress.nl/volumearticle/52866
dc.rights.accessOpen Access
local.identifier.drac28383474
dc.description.versionPostprint (author's final draft)
local.citation.authorGijón, M.; Vilalta, A.; García-Gasulla, D.
local.citation.contributorInternational Conference of the Catalan Association for Artificial Intelligence
local.citation.publicationNameProceedings of the 22nd International Conference of the Catalan Association for Artificial Intelligence
local.citation.startingPage407
local.citation.endingPage416


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple