id author title date pages extension mime words sentences flesch summary cache txt 10_1101-2021_01_08_425887 Hu, Yan Auto-CORPus: Automated and Consistent Outputs from Research Publications 2021 10 .pdf application/pdf 6886 553 53 the same structured model, so that these can be used as input to rule-based or deep learning algorithms for data extraction. example, at this point in this article the main headers are 'abstract' followed by 'introduction' and 'materials and methods' that could make up a digraph. We use this process to evaluate new potential synonyms for existing terms and identify abstract → introduction → materials → results → discussion → conclusion → acknowledgements → footnotes section → references. Based on the digraph, we then assigned data and data description to be synonyms of the materials section, and participants From the analysis of ego-networks four new potential categories were identified: disclosure, graphical abstract, highlights and participants. Newly identified synonyms for existing IAO terms (00006xx) from the digraph mapping of 2,441 publications. Newly identified synonyms for existing IAO terms (00006xx) from the digraph mapping of 2,441 publications. ./cache/10_1101-2021_01_08_425887.pdf ./txt/10_1101-2021_01_08_425887.txt