Semi-Supervised Learning for Neural Machine Translation

Cheng, Yong; Xu, Wei; He, Zhongjun; He, Wei; Wu, Hua; Sun, Maosong; Liu, Yang

arXiv:1606.04596 (cs)

[Submitted on 15 Jun 2016 (v1), last revised 10 Dec 2016 (this version, v3)]

Abstract: While end-to-end neural machine translation (NMT) has made remarkable progress recently, NMT systems rely only on parallel corpora for parameter estimation. Since parallel corpora are usually limited in quantity, quality, and coverage, especially for low-resource languages, it is appealing to exploit monolingual corpora to improve NMT. We propose a semi-supervised approach for training NMT models on the concatenation of labeled (parallel corpora) and unlabeled (monolingual corpora) data. The central idea is to reconstruct the monolingual corpora using an autoencoder, in which the source-to-target and target-to-source translation models serve as the encoder and decoder, respectively. Our approach can exploit monolingual corpora not only of the target language but also of the source language. Experiments on the Chinese-English dataset show that our approach achieves significant improvements over state-of-the-art SMT and NMT systems.
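
The reconstruction idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the probability tables `P_S2T` and `P_T2S`, the single-token "sentences", the function names, and the optional top-`k` truncation are all assumptions made here for illustration. A source sentence x is encoded into a latent translation y by the source-to-target model and decoded back by the target-to-source model, so the reconstruction probability is the sum over y of P(y | x) * P(x | y).

```python
import math

# Hypothetical toy translation tables over single-token "sentences".
# P_S2T[x][y] = P(y | x): source-to-target model, used as the encoder.
# P_T2S[y][x] = P(x | y): target-to-source model, used as the decoder.
P_S2T = {
    "chat": {"cat": 0.9, "dog": 0.1},
    "chien": {"cat": 0.2, "dog": 0.8},
}
P_T2S = {
    "cat": {"chat": 0.85, "chien": 0.15},
    "dog": {"chat": 0.1, "chien": 0.9},
}


def reconstruction_prob(x, s2t, t2s, k=None):
    """Probability of reconstructing x from itself via a latent translation.

    Sums P(y | x) * P(x | y) over candidate translations y. If k is given,
    only the k most probable candidates are kept (an assumed approximation
    for when the full sum is intractable).
    """
    candidates = sorted(s2t[x].items(), key=lambda kv: kv[1], reverse=True)
    if k is not None:
        candidates = candidates[:k]
    return sum(p_y * t2s[y].get(x, 0.0) for y, p_y in candidates)


def monolingual_objective(corpus, s2t, t2s, k=None):
    """Sum of log reconstruction probabilities over a monolingual corpus."""
    return sum(math.log(reconstruction_prob(x, s2t, t2s, k)) for x in corpus)


if __name__ == "__main__":
    print(reconstruction_prob("chat", P_S2T, P_T2S))
    print(monolingual_objective(["chat", "chien"], P_S2T, P_T2S))
```

In a semi-supervised setup of the kind the abstract describes, a term like `monolingual_objective` would be added to the usual parallel-corpus likelihood; for target-side monolingual data the roles of the two tables would be swapped.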

Comments:	Corrected a typo
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1606.04596 [cs.CL]
	(or arXiv:1606.04596v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1606.04596

Submission history

From: Yang Liu
[v1] Wed, 15 Jun 2016 00:22:27 UTC (223 KB)
[v2] Wed, 10 Aug 2016 19:08:20 UTC (223 KB)
[v3] Sat, 10 Dec 2016 20:02:52 UTC (223 KB)
