Mostra el registre d'ítem simple
Correspondence analysis of textual data involving contextual information: CA-GALT on principal components
dc.contributor.author | Bécue Bertaut, Mónica María |
dc.contributor.author | Pages, Jerome |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa |
dc.date.accessioned | 2016-01-20T15:37:58Z |
dc.date.issued | 2015-06-01 |
dc.identifier.citation | Becue-Bertaut, M., Pages, J. Correspondence analysis of textual data involving contextual information: CA-GALT on principal components. "Advances in data analysis and classification", 01 Juny 2015, vol. 9, núm. 2, p. 125-142. |
dc.identifier.issn | 1862-5347 |
dc.identifier.uri | http://hdl.handle.net/2117/81756 |
dc.description.abstract | Correspondence analysis on an aggregated lexical table is a typical practice in textual analysis in which a contextual categorical variable is used to aggregate documents, depending on the categories to which they belong. This work generalises this approach and considers several quantitative, categorical or mixed contextual variables. The result is a new method that we have called 'correspondence analysis on a generalised aggregated lexical table'. A favoured application derives from surveys by questionnaire, including both open-ended and closed questions. The free-text answers are encoded into a respondents words frequency table called a lexical table. The closed questions, either quantitative or categorical, form the contextual variables. The primary objective is to establish a typology of the variables and a typology of the words from their mutual relationships as grasped from jointly analysing the textual and contextual tables. Validation tests are offered, particularly in the form of confidence ellipses. The comprehensive and numerous properties of the method, similar to correspondence analysis properties, are detailed. Promising results are obtained as indicated by an application to a marketing survey conducted among 1,000 respondents. |
dc.format.extent | 18 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Matemàtiques i estadística |
dc.subject.other | Correspondence analysis |
dc.subject.other | Textual and contextual data |
dc.subject.other | Textual analysis |
dc.subject.other | Lexical table |
dc.subject.other | Confidence ellipses |
dc.subject.other | Contingency table |
dc.subject.other | Canonical correspondence-analysis |
dc.subject.other | regression |
dc.subject.other | tables |
dc.title | Correspondence analysis of textual data involving contextual information: CA-GALT on principal components |
dc.type | Article |
dc.identifier.doi | 10.1007/s11634-014-0171-9 |
dc.subject.ams | Classificació AMS::62 Statistics::62H Multivariate analysis |
dc.relation.publisherversion | http://link.springer.com/article/10.1007%2Fs11634-014-0171-9 |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 16615163 |
dc.description.version | Postprint (published version) |
dc.relation.projectid | info:eu-repo/grantAgreement/MINECO//ECO2012-35584/ES/METODOS CUANTITATIVOS PARA LA MEDICION Y VALORACION DE RIESGOS EN EMPRESAS ASEGURADORAS/ |
dc.date.lift | 10000-01-01 |
local.citation.author | Becue-Bertaut, M.; Pages, J. |
local.citation.publicationName | Advances in data analysis and classification |
local.citation.volume | 9 |
local.citation.number | 2 |
local.citation.startingPage | 125 |
local.citation.endingPage | 142 |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Articles de revista [712]