Correspondence analysis of textual data involving contextual information: CA-GALT on principal components

Bécue Bertaut, Mónica María; Pages, Jerome

doi:10.1007/s11634-014-0171-9

Visualitza/Obre

art%3A10.1007%2Fs11634-014-0171-9.pdf (868,3Kb) (Accés restringit) Sol·licita una còpia a l'autor

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Bécue Bertaut, Mónica María

Pages, Jerome

Tipus de documentArticle

Data publicació2015-06-01

Condicions d'accésAccés restringit per política de l'editorial

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

ProjecteMETODOS CUANTITATIVOS PARA LA MEDICION Y VALORACION DE RIESGOS EN EMPRESAS ASEGURADORAS (MINECO-ECO2012-35584)

Abstract

Correspondence analysis on an aggregated lexical table is a typical practice in textual analysis in which a contextual categorical variable is used to aggregate documents, depending on the categories to which they belong. This work generalises this approach and considers several quantitative, categorical or mixed contextual variables. The result is a new method that we have called 'correspondence analysis on a generalised aggregated lexical table'. A favoured application derives from surveys by questionnaire, including both open-ended and closed questions. The free-text answers are encoded into a respondents words frequency table called a lexical table. The closed questions, either quantitative or categorical, form the contextual variables. The primary objective is to establish a typology of the variables and a typology of the words from their mutual relationships as grasped from jointly analysing the textual and contextual tables. Validation tests are offered, particularly in the form of confidence ellipses. The comprehensive and numerous properties of the method, similar to correspondence analysis properties, are detailed. Promising results are obtained as indicated by an application to a marketing survey conducted among 1,000 respondents.

CitacióBecue-Bertaut, M., Pages, J. Correspondence analysis of textual data involving contextual information: CA-GALT on principal components. "Advances in data analysis and classification", 01 Juny 2015, vol. 9, núm. 2, p. 125-142.

URIhttp://hdl.handle.net/2117/81756

DOI10.1007/s11634-014-0171-9

ISSN1862-5347

Versió de l'editorhttp://link.springer.com/article/10.1007%2Fs11634-014-0171-9

Col·leccions

Departament d'Estadística i Investigació Operativa - Articles de revista [719]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
art%3A10.1007%2Fs11634-014-0171-9.pdf		868,3Kb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Correspondence analysis of textual data involving contextual information: CA-GALT on principal components

Visualitza/Obre

Explora