pith. machine review for the scientific record.

arxiv: 1901.08149 · v2 · submitted 2019-01-23 · 💻 cs.CL

Recognition: unknown

TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents

keywords: absolute, approach, conversational, improvement, learning, model, models, state-of-the-art

We introduce a new approach to generative data-driven dialogue systems (e.g. chatbots) called TransferTransfo which is a combination of a Transfer learning based training scheme and a high-capacity Transformer model. Fine-tuning is performed by using a multi-task objective which combines several unsupervised prediction tasks. The resulting fine-tuned model shows strong improvements over the current state-of-the-art end-to-end conversational models like memory augmented seq2seq and information-retrieval models. On the privately held PERSONA-CHAT dataset of the Conversational Intelligence Challenge 2, this approach obtains a new state-of-the-art, with respective perplexity, Hits@1 and F1 metrics of 16.28 (45 % absolute improvement), 80.7 (46 % absolute improvement) and 19.5 (20 % absolute improvement).
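The three metrics quoted in the abstract (perplexity, Hits@1 and F1) can be illustrated with a minimal sketch. This is not the challenge's official evaluation code: the function names, the whitespace tokenization and the lowercasing are assumptions made here for illustration, and the abstract's Hits@1 and F1 figures appear to be on a 0-100 scale, whereas the functions below return values in [0, 1].

```python
import math
from collections import Counter


def perplexity(token_log_probs):
    """Perplexity as exp of the mean negative log-likelihood per token.

    token_log_probs: natural-log probabilities the model assigned to each
    gold token (hypothetical input format, assumed for this sketch).
    """
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def hits_at_1(candidate_scores, gold_indices):
    """Fraction of examples whose gold candidate receives the top score.

    candidate_scores: one list of scores per example, one score per candidate.
    gold_indices: index of the correct candidate for each example.
    """
    hits = 0
    for scores, gold in zip(candidate_scores, gold_indices):
        best = max(range(len(scores)), key=scores.__getitem__)
        if best == gold:
            hits += 1
    return hits / len(gold_indices)


def word_f1(prediction, reference):
    """Word-overlap F1 between a generated reply and a reference reply.

    Assumes simple lowercasing and whitespace splitting; a real evaluation
    script may normalize text differently.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Lower is better for perplexity, while higher is better for Hits@1 and F1, which is worth keeping in mind when reading the improvement figures above.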

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context

    cs.IR 2026-04 unverdicted novelty 6.0

    PeReGrINE is a graph-based benchmark that restructures Amazon Reviews 2023 with temporal cutoffs and introduces dissonance analysis to measure how well retrieval-conditioned models match user style and product consensus.