edgelm/examples/wmt20/README.md
This page provides pointers to the models of Facebook-FAIR's WMT'20 news translation task submission (Chen et al., 2020).
Translation models:

| Model | Description | Download |
|---|---|---|
| `transformer.wmt20.ta-en` | Ta->En | download (.tar.gz) |
| `transformer.wmt20.en-ta` | En->Ta | download (.tar.gz) |
| `transformer.wmt20.iu-en.news` | Iu->En (News domain) | download (.tar.gz) |
| `transformer.wmt20.en-iu.news` | En->Iu (News domain) | download (.tar.gz) |
| `transformer.wmt20.iu-en.nh` | Iu->En (Nunavut Hansard domain) | download (.tar.gz) |
| `transformer.wmt20.en-iu.nh` | En->Iu (Nunavut Hansard domain) | download (.tar.gz) |

Language models:

| Model | Description | Download |
|---|---|---|
| `transformer_lm.wmt20.en` | En Language Model | download (.tar.gz) |
| `transformer_lm.wmt20.ta` | Ta Language Model | download (.tar.gz) |
| `transformer_lm.wmt20.iu.news` | Iu Language Model (News domain) | download (.tar.gz) |
| `transformer_lm.wmt20.iu.nh` | Iu Language Model (Nunavut Hansard domain) | download (.tar.gz) |
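
The identifiers in the two tables follow a regular pattern: a `transformer.wmt20.` or `transformer_lm.wmt20.` prefix, the language pair or language, and a `.news` or `.nh` domain suffix for the Inuktitut models only. The sketch below builds those strings and rejects combinations that are not listed. `mt_model_name`, `lm_model_name`, and `IU_DOMAINS` are hypothetical names introduced for illustration; they are not part of fairseq.

```python
from typing import Optional

# Domains that appear in the tables above; only Inuktitut models carry one.
IU_DOMAINS = ("news", "nh")


def _domain_suffix(lang: str, domain: Optional[str]) -> str:
    """Return '.news' or '.nh' for Inuktitut and '' for the other languages."""
    if lang == "iu":
        if domain not in IU_DOMAINS:
            raise ValueError(f"Inuktitut models need a domain from {IU_DOMAINS}")
        return f".{domain}"
    if domain is not None:
        raise ValueError(f"no domain-specific model is listed for {lang!r}")
    return ""


def mt_model_name(src: str, tgt: str, domain: Optional[str] = None) -> str:
    """Hypothetical helper: identifier of a listed translation model."""
    # Only En<->Ta and En<->Iu pairs appear in the translation table.
    if sorted((src, tgt)) not in (["en", "iu"], ["en", "ta"]):
        raise ValueError(f"no translation model is listed for {src}->{tgt}")
    other = tgt if src == "en" else src
    return f"transformer.wmt20.{src}-{tgt}{_domain_suffix(other, domain)}"


def lm_model_name(lang: str, domain: Optional[str] = None) -> str:
    """Hypothetical helper: identifier of a listed language model."""
    if lang not in ("en", "ta", "iu"):
        raise ValueError(f"no language model is listed for {lang!r}")
    return f"transformer_lm.wmt20.{lang}{_domain_suffix(lang, domain)}"


print(mt_model_name("en", "iu", "news"))  # transformer.wmt20.en-iu.news
print(lm_model_name("ta"))                # transformer_lm.wmt20.ta
```

The returned string can be passed as the second argument to `torch.hub.load('pytorch/fairseq', ...)`.
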
import torch
# English to Tamil translation
en2ta = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.en-ta')
en2ta.translate("Machine learning is great!") # 'இயந்திரக் கற்றல் அருமை!'
# Tamil to English translation
ta2en = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.ta-en')
ta2en.translate("இயந்திரக் கற்றல் அருமை!") # 'Machine learning is great!'
# English to Inuktitut translation
en2iu = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.en-iu.news')
en2iu.translate("machine learning is great!") # 'ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ ᐱᐅᔪᒻᒪᕆᒃ!'
# Inuktitut to English translation
iu2en = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.iu-en.news')
iu2en.translate("ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ ᐱᐅᔪᒻᒪᕆᒃ!") # 'Machine learning excellence!'
# Sample from the English LM
en_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.en')
en_lm.sample("Machine learning is") # 'Machine learning is a type of artificial intelligence that uses machine learning to learn from data and make predictions.'
# Sample from the Tamil LM
ta_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.ta')
ta_lm.sample("இயந்திரக் கற்றல் என்பது செயற்கை நுண்ணறிவின்") # 'இயந்திரக் கற்றல் என்பது செயற்கை நுண்ணறிவின் ஒரு பகுதியாகும்.'
# Sample from the Inuktitut LM
iu_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.iu.news')
iu_lm.sample("ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ") # 'ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ, ᐊᒻᒪᓗ ᓯᓚᐅᑉ ᐊᓯᙳᖅᐸᓪᓕᐊᓂᖓᓄᑦ ᖃᓄᐃᓕᐅᕈᑎᒃᓴᑦ, ᐃᓚᖃᖅᖢᑎᒃ ᐅᑯᓂᖓ:'
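
Because each language pair is available in both directions, a quick sanity check is to translate a sentence out of English and back again. The sketch below is a minimal illustration, not part of fairseq: `round_trip` is a hypothetical helper that takes any two callables mapping a string to a string, such as the `translate` methods of the hub models loaded above.

```python
from typing import Callable, Tuple

# Any function that maps a source sentence to a translated sentence.
Translate = Callable[[str], str]


def round_trip(forward: Translate, backward: Translate, sentence: str) -> Tuple[str, str]:
    """Translate `sentence` with `forward`, then translate the result with `backward`.

    Returns the intermediate translation and the back-translation.
    """
    pivot = forward(sentence)
    return pivot, backward(pivot)


# Usage with the hub models loaded above (this downloads the checkpoints):
# pivot, back = round_trip(en2ta.translate, ta2en.translate, "Machine learning is great!")
```

Passing callables rather than model objects keeps the helper independent of fairseq, so it can be exercised with simple stand-in functions before any checkpoint is downloaded.
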
@inproceedings{chen2020facebook,
title={Facebook AI's WMT20 News Translation Task Submission},
author={Peng-Jen Chen and Ann Lee and Changhan Wang and Naman Goyal and Angela Fan and Mary Williamson and Jiatao Gu},
booktitle={Proc. of WMT},
year={2020},
}