Analyzing distances in word embeddings and their relation with seme analysis
dc.contributor.author | Gijón Agudo, Manuel |
dc.contributor.author | Vilalta Arias, Armand |
dc.contributor.author | Garcia Gasulla, Dario |
dc.contributor.other | Universitat Politècnica de Catalunya. Doctorat en Intel·ligència Artificial |
dc.contributor.other | Barcelona Supercomputing Center |
dc.date.accessioned | 2020-05-19T07:40:40Z |
dc.date.available | 2020-05-19T07:40:40Z |
dc.date.issued | 2019 |
dc.identifier.citation | Gijón, M.; Vilalta, A.; Garcia-Gasulla, D. Analyzing distances in word embeddings and their relation with seme analysis. In: International Conference of the Catalan Association for Artificial Intelligence. "Proceedings of the 22nd International Conference of the Catalan Association for Artificial Intelligence". IOS Press, 2019, p. 407-416. |
dc.identifier.isbn | 978-1-64368-015-6 |
dc.identifier.uri | http://hdl.handle.net/2117/188023 |
dc.description.abstract | Word embeddings have recently become a fundamental tool of Natural Language Processing, with applications to tasks like machine translation or image annotation. The high-dimensional space defined by these embeddings is typically explored and exploited through distance-based operations. In this paper we work on the problem of finding related words in a text embedding. This relationship can be of different kinds; we focus on semantic relations like synonymy and antonymy. We explore the idea of using the distance between norms instead of, as other authors have done before, the vector that joins them. We present different norms, some of them well known in the literature and others not so widely used, and we introduce a new one together with its theoretical mathematical framework. We also explain why they work properly or not, and compare their performance on the two most used embeddings, GloVe and Word2Vec. |
dc.format.extent | 10 p. |
dc.language.iso | eng |
dc.publisher | IOS Press |
dc.subject | UPC subject areas::Computer science::Artificial intelligence::Natural language |
dc.subject.lcsh | Computational linguistics |
dc.subject.lcsh | Semantic computing |
dc.subject.other | Word embeddings |
dc.subject.other | Embedding space |
dc.subject.other | Distances |
dc.subject.other | Semantic relations |
dc.subject.other | WordNet |
dc.subject.other | High dimensional vector spaces |
dc.title | Analyzing distances in word embeddings and their relation with seme analysis |
dc.type | Conference report |
dc.subject.lemac | Computational linguistics |
dc.identifier.doi | 10.3233/FAIA190153 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://ebooks.iospress.nl/volumearticle/52866 |
dc.rights.access | Open Access |
local.identifier.drac | 28383474 |
dc.description.version | Postprint (author's final draft) |
local.citation.author | Gijón, M.; Vilalta, A.; García-Gasulla, D. |
local.citation.contributor | International Conference of the Catalan Association for Artificial Intelligence |
local.citation.publicationName | Proceedings of the 22nd International Conference of the Catalan Association for Artificial Intelligence |
local.citation.startingPage | 407 |
local.citation.endingPage | 416 |