Back to Nlp Progress

Question answering

chinese/question_answering.md

0.32.6 KB
Original Source

Question answering

Question answering is the task of answering a question.

Table of contents

Reading comprehension

CMRC 2018

The Chinese Machine Reading Comprehension (CMRC 2018) is a SQuAD-like reading comprehension dataset that consists of 20,000 questions annotated on Wikipedia paragraphs by human experts. The dataset can be downloaded here. Below we show the F1 and EM scores both on the test set and the challenge set.

ModelTest F1Test EMChallenge F1Challenge EMPaper
Human performance97.992.495.290.4A Span-Extraction Dataset for Chinese Machine Reading Comprehension
Dual BERT (w / SQuAD; Cui et al., 2019)90.273.655.227.8Cross-Lingual Machine Reading Comprehension
Dual BERT (Cui et al., 2019)88.170.447.923.8Cross-Lingual Machine Reading Comprehension

DRCD

The Delta Reading Comprehension Dataset (DRCD) is a SQuAD-like reading comprehension dataset that contains 30,000+ questions on 10,014 paragraphs from 2,108 Wikipedia articles. The dataset can be downloaded here.

ModelF1EMPaper
Human performance93.380.4DRCD: a Chinese Machine Reading Comprehension Dataset
Dual BERT (w / SQuAD; Cui et al., 2019)91.685.4Cross-Lingual Machine Reading Comprehension
Dual BERT (Cui et al., 2019)90.383.7Cross-Lingual Machine Reading Comprehension

DuReader

DuReader is a large-scale reading comprehension dataset that is based on the logs of Baidu Search and contains 200k questions, 420k answers, and 1M documents. For more information, refer to its website to see the introduction. You can download the dataset here. The best models can be view on the public leaderboard.