Interviewer chatbot using NLG

Pladevall Morros, Jaume

Visualitza/Obre

170838.pdf (2,970Mb) (Accés restringit)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Pladevall Morros, Jaume

Tutor / directorTurmo Borras, Jorge

; Salord Quetglas, Lluís

Realitzat a/ambOpground

Tipus de documentProjecte Final de Màster Oficial

Data2022-06-28

Condicions d'accésAccés restringit per decisió de l'autor

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

The purpose of this project is to design and implement a conversational agent that can conduct a prescreening interview to Opground users. A finite-state template-based task-oriented dialogue system was chosen to comply with Opground's requirements. This system is built on top of ConvLab-2 given its modularity and personalization of the agent's components. To improve the user experience when answering the interview questions a proof of concept (PoC) feedback generation model, Answer2Feedback, is proposed. Research for available datasets was done, but none was found that matched our task objective. Therefore, a job interview single-turn dataset is created from Opground's interviews. Then a subset of the dataset, that was cleaned and prepared to contain only Spanish questions about spoken languages to test the concept, was manually annotated. The dataset with the feedback was split into two. One with only annotations from one annotator and the other with annotations from two annotators. The latter dataset contains the former annotations. Two different Spanish pre-trained language models were used to experiment (t5 and mt5), where the mt5 was already fine-tuned for summarization. It was also experimented with the kind of input fed into the model. For two cases, the first case, with only the answer given by the candidate and the second case, with the answer plus the important entities from a business perspective extracted with a Named Entity Recognition (NER) model. Therefore, eight final fine-tuned models (model used: t5 or mt5; dataset used: one-annotator or two-annotator; input used: without entities or with entities) were compared in feedback generation. Extrinsic metrics BLEU and ROUGE, and intrinsic metric perplexity were used to evaluate the models. The evaluation results of the feedback were ambiguous depending on the metric. Finally, perplexity was selected as the metric to evaluate the quality of feedback generation after the results of the PoC indicated a correlation between perplexity and the naturalness of the feedback generated. Concluding, models fine-tuned with more instances and using entities in the input improve feedback generation.

MatèriesNatural language processing (Computer science), Tractament del llenguatge natural (Informàtica)

TitulacióMÀSTER UNIVERSITARI EN INNOVACIÓ I RECERCA EN INFORMÀTICA (Pla 2012)

URIhttp://hdl.handle.net/2117/372831

Col·leccions

Màsters oficials - Master in Innovation and Research in Informatics - MIRI [454]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
170838.pdf		2,970Mb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Interviewer chatbot using NLG

Visualitza/Obre

Explora