TriviaQA: A Large Scale Dataset for Reading Comprehension and Question Answering
TriviaQA is a reading comprehension dataset containing over 650K question-answer-evidence triples. TriviaQA includes 95K question-answer pairs authored by trivia enthusiasts and independently gathered evidence documents, six per question on average, that provide high quality distant supervision for answering the questions. The details can be found in our ACL 17 paper TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
In Association for Computational Linguistics (ACL) 2017, Vancouver, Canada.
The TriviaQA leaderboard is now live on Codalab. Submit your predictions for evaluation on the test set!