id: cord-020813-0wc23ixy
author: Hashemi, Helia
title: ANTIQUE: A Non-factoid Question Answering Benchmark
date: 2020-03-24
pages:
extension: .txt
mime: text/plain
words: 2941
sentences: 185
flesch: 59
summary: Despite the importance of the task, the community still feels the significant lack of large-scale non-factoid question answering collections with real questions and comprehensive relevance judgments. Despite the widely-known importance of studying answer passage retrieval for non-factoid questions [1, 2, 8, 18], the research progress for this task is limited by the availability of high-quality public data. Although WikiPassageQA is an invaluable contribution to the community, it does not cover all aspects of the non-factoid question answering task and has the following limitations: (i) it only contains an average of 1.7 relevant passages per question and does not cover many questions with multiple correct answers; (ii) it was created from the Wikipedia website, containing only formal text; (iii) more importantly, the questions in the WikiPassageQA dataset were generated by crowdworkers, which is different from the questions that users ask in real-world systems; (iv) the relevant passages in WikiPassageQA contain the answer to the question in addition to some surrounding text. In contrast, ANTIQUE provides a reliable collection with complete relevance annotations for evaluating non-factoid QA models.
cache: ./cache/cord-020813-0wc23ixy.txt
txt: ./txt/cord-020813-0wc23ixy.txt