id: work_3jdx4rpw3ne7pdgamiplej6n4e
author: Emily M. Bender
title: Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science
date: 2018
pages: 17
extension: .pdf
mime: application/pdf
words: 11314
sentences: 890
flesch: 51
summary: data statements will help alleviate issues related to exclusion and bias in language technology; lead to better precision in claims ... A data statement is a characterization of a dataset which provides context to allow developers and users to ... as data statements bring our datasets and their represented populations into better focus, they should ... NLP needs data statements (§3) and relate our proposal to current practice (§4). Recent studies have documented the fact that limitations in training data lead to ethically problematic limitations in the resulting NLP systems. NLP papers using datasets for training or test data tend ... statements should be included in every NLP publication which presents new datasets and in the ... the proposal is that data statements be standardized and required components of research papers. ... 'data statements' in all publications and documentation for all NLP systems.
cache: ./cache/work_3jdx4rpw3ne7pdgamiplej6n4e.pdf
txt: ./txt/work_3jdx4rpw3ne7pdgamiplej6n4e.txt