Multi-Task Cross-Lingual Sequence Tagging from Scratch

Ruslan Salakhutdinov; William Cohen; Zhilin Yang

arxiv: 1603.06270 · v2 · pith:OWIV4QONnew · submitted 2016-03-20 · 💻 cs.CL · cs.LG

Zhilin Yang, Ruslan Salakhutdinov, William Cohen

classification cs.CL, cs.LG

keywords model, cross-lingual, multi-task, sequence, tagging, deep, independent, joint

Abstract

We present a deep hierarchical recurrent neural network for sequence tagging. Given a sequence of words, our model employs deep gated recurrent units on both character and word levels to encode morphology and context information, and applies a conditional random field layer to predict the tags. Our model is task independent, language independent, and feature engineering free. We further extend our model to multi-task and cross-lingual joint training by sharing the architecture and parameters. Our model achieves state-of-the-art results in multiple languages on several benchmark tasks including POS tagging, chunking, and NER. We also demonstrate that multi-task and cross-lingual joint training can improve the performance in various cases.
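
The abstract says a conditional random field layer predicts the tags. In a linear-chain CRF, the best tag sequence is commonly found with Viterbi decoding, and the sketch below illustrates that step only. It is not taken from the paper: the function name `viterbi_decode`, the two-argument interface, and the assumption that per-position tag scores come from the word-level recurrent encoder are all hypothetical choices made for illustration.

```python
from typing import List, Sequence


def viterbi_decode(
    emissions: Sequence[Sequence[float]],
    transitions: Sequence[Sequence[float]],
) -> List[int]:
    """Return the highest-scoring tag sequence for a linear-chain CRF.

    emissions[t][j]   : score of tag j at position t (assumed encoder output)
    transitions[i][j] : score of moving from tag i to tag j (assumed learned)
    """
    if not emissions:
        return []
    num_tags = len(emissions[0])

    # best[j] holds the best score of any path that ends in tag j
    # at the position processed so far.
    best = list(emissions[0])
    backpointers: List[List[int]] = []

    for t in range(1, len(emissions)):
        new_best: List[float] = []
        pointers: List[int] = []
        for j in range(num_tags):
            # Pick the previous tag that maximises the score of reaching tag j.
            prev = max(range(num_tags), key=lambda i: best[i] + transitions[i][j])
            new_best.append(best[prev] + transitions[prev][j] + emissions[t][j])
            pointers.append(prev)
        best = new_best
        backpointers.append(pointers)

    # Start from the best final tag and follow the backpointers to the start.
    last = max(range(num_tags), key=lambda j: best[j])
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

With all transition scores set to zero, the decoder reduces to picking the best tag independently at each position. Non-zero transition scores let the model penalize or reward particular tag pairs, which is the reason to place a CRF layer on top of per-token scores in tasks such as chunking and NER, where adjacent tags constrain each other.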

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks
cs.CL 2019-07 unverdicted novelty 6.0

DropAttention regularizes attention weights in fully-connected self-attention networks to reduce overfitting and improve performance.