id: cord-348972-r94fhpe0
author: Gussow, Ayal B.
title: Machine-learning approach expands the repertoire of anti-CRISPR protein families
date: 2020-07-29
pages:
extension: .txt
mime: text/plain
words: 9653
sentences: 505
flesch: 54
summary: The most striking and obvious common feature of the Acrs is their small size (weighted mean Acr length: 104 aa, Table 1), and the tendency to form sets of small proteins that are encoded by co-directional and closely spaced genes in (pro)virus genomes (hereafter directons; Fig. 1, Table 1). As genes encoding Acrs tend to form small directons, we sought to estimate a heuristic maximum threshold for the mean directon size in a candidate family that would enrich our protein set for true Acrs. The initial set consisted of 232,616 clusters and was first filtered for clusters that included at least one member with an HTH-domain-containing protein encoded downstream, and at least one member from a self-targeting genome, two hallmark Acr characteristics (ref. 20).
cache: ./cache/cord-348972-r94fhpe0.txt
txt: ./txt/cord-348972-r94fhpe0.txt
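
The cluster filtering described in the summary can be illustrated with a minimal sketch. It is not the authors' implementation: the `Member` and `Cluster` types, their field names, and the `max_mean_directon_size` parameter are hypothetical, and the summary does not state the threshold value, so it is left as a required argument.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List


@dataclass
class Member:
    """One protein in a cluster (all field names are hypothetical)."""
    protein_id: str
    directon_size: int            # number of genes in the protein's directon
    hth_downstream: bool          # HTH-domain protein encoded downstream?
    self_targeting_genome: bool   # encoded in a self-targeting genome?


@dataclass
class Cluster:
    cluster_id: str
    members: List[Member]


def passes_hallmark_filter(cluster: Cluster) -> bool:
    """At least one member with a downstream HTH protein AND
    at least one member from a self-targeting genome."""
    return (any(m.hth_downstream for m in cluster.members)
            and any(m.self_targeting_genome for m in cluster.members))


def mean_directon_size(cluster: Cluster) -> float:
    """Mean directon size over all members of a non-empty cluster."""
    return mean(m.directon_size for m in cluster.members)


def filter_candidates(clusters: List[Cluster],
                      max_mean_directon_size: float) -> List[Cluster]:
    """Keep clusters that pass the hallmark filter and whose mean
    directon size does not exceed the (assumed) heuristic threshold."""
    return [
        c for c in clusters
        if passes_hallmark_filter(c)
        and mean_directon_size(c) <= max_mean_directon_size
    ]
```

Empty clusters never reach the mean computation, because `any()` over no members is false and the condition short-circuits. The two hallmark conditions may be met by different members of the same cluster, which matches the "at least one member" wording of the summary.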