Creating study carrel named bioinformatics-from-biorxiv
Initializing database
Creating cache from Bioarxiv xml file
10.1101/2021.01.08.425887
10_1101-2021_01_08_425887.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2021.01.08.425897
10_1101-2021_01_08_425897.pdf

10.1101/2021.01.08.425855
10_1101-2021_01_08_425855.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2020.03.27.012757
10_1101-2020_03_27_012757.pdf

10.1101/2021.01.08.425976
10_1101-2021_01_08_425976.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2021.01.08.425918
10_1101-2021_01_08_425918.pdf

10.1101/2021.01.08.425952
10_1101-2021_01_08_425952.pdf

10.1101/2021.01.08.426008
10_1101-2021_01_08_426008.pdf

10.1101/2021.01.08.425967
10_1101-2021_01_08_425967.pdf

10.1101/2021.01.08.425379
10_1101-2021_01_08_425379.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: Resource temporarily unavailable
10.1101/2020.04.17.043323
10_1101-2020_04_17_043323.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2021.01.06.425560
10_1101-2021_01_06_425560.pdf

10.1101/2021.01.06.425494
10_1101-2021_01_06_425494.pdf

10.1101/2021.01.07.425637
10_1101-2021_01_07_425637.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2021.01.06.425581
10_1101-2021_01_06_425581.pdf

10.1101/2021.01.07.425782
10_1101-2021_01_07_425782.pdf

10.1101/2021.01.07.425773
10_1101-2021_01_07_425773.pdf

10.1101/2021.01.07.425794
10_1101-2021_01_07_425794.pdf

10.1101/2021.01.07.425801
10_1101-2021_01_07_425801.pdf

10.1101/2021.01.08.425885
10_1101-2021_01_08_425885.pdf

10.1101/2020.10.26.351783
10_1101-2020_10_26_351783.pdf

10.1101/2021.01.06.425546
10_1101-2021_01_06_425546.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: Resource temporarily unavailable
10.1101/2021.01.06.425544
10_1101-2021_01_06_425544.pdf

10.1101/2021.01.06.425569
10_1101-2021_01_06_425569.pdf

10.1101/2021.01.05.425266
10_1101-2021_01_05_425266.pdf

10.1101/2021.01.06.425550
10_1101-2021_01_06_425550.pdf

10.1101/2021.01.05.425414
10_1101-2021_01_05_425414.pdf

10.1101/2020.12.26.424429
10_1101-2020_12_26_424429.pdf

10.1101/2021.01.07.425716
10_1101-2021_01_07_425716.pdf

10.1101/2021.01.07.425697
10_1101-2021_01_07_425697.pdf

10.1101/2020.11.13.381475
10_1101-2020_11_13_381475.pdf

10.1101/2021.01.04.425315
10_1101-2021_01_04_425315.pdf

10.1101/2021.01.05.425384
10_1101-2021_01_05_425384.pdf

10.1101/2020.05.22.110247
10_1101-2020_05_22_110247.pdf

10.1101/2021.01.02.425006
10_1101-2021_01_02_425006.pdf

10.1101/332965
10_1101-332965.pdf

10.1101/2021.01.05.425409
10_1101-2021_01_05_425409.pdf

10.1101/2021.01.05.425417
10_1101-2021_01_05_425417.pdf

10.1101/2021.01.05.425508
10_1101-2021_01_05_425508.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2021.01.04.425335
10_1101-2021_01_04_425335.pdf

10.1101/2020.12.14.422697
10_1101-2020_12_14_422697.pdf

10.1101/2020.01.29.925354
10_1101-2020_01_29_925354.pdf

10.1101/2020.08.13.249839
10_1101-2020_08_13_249839.pdf

/data-disk/reader-compute/reader-classic/bin/bioarxiv-harvest.sh: fork: retry: No child processes
10.1101/2021.01.04.425250
10_1101-2021_01_04_425250.pdf

10.1101/2020.09.09.289074
10_1101-2020_09_09_289074.pdf

10.1101/2021.01.04.425285
10_1101-2021_01_04_425285.pdf

10.1101/2020.12.24.424332
10_1101-2020_12_24_424332.pdf

10.1101/2021.01.04.425288
10_1101-2021_01_04_425288.pdf

10.1101/436634
10_1101-436634.pdf

10.1101/2020.08.28.271981
10_1101-2020_08_28_271981.pdf

parallel: Warning: Only enough available processes to run 50 jobs in parallel.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf 
parallel: Warning: or /proc/sys/kernel/pid_max may help.
2021-01-10 13:53:39 URL:https://www.biorxiv.org/ [37867/37867] -> "./cache/10_1101-2021_01_08_425976.pdf" [1]
2021-01-10 13:53:39 URL:https://www.biorxiv.org/ [37867/37867] -> "./cache/10_1101-2021_01_06_425494.pdf" [1]
parallel: Warning: No more processes: Decreasing number of running jobs to 49.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
2021-01-10 13:53:39 URL:https://www.biorxiv.org/ [37867/37867] -> "./cache/10_1101-2021_01_06_425546.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425855v1.full.pdf [997276] -> "./cache/10_1101-2021_01_08_425855.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425952v1.full.pdf [756020] -> "./cache/10_1101-2021_01_08_425952.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.426008v1.full.pdf [1085923] -> "./cache/10_1101-2021_01_08_426008.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425887v1.full.pdf [2003129] -> "./cache/10_1101-2021_01_08_425887.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425967v1.full.pdf [1024766] -> "./cache/10_1101-2021_01_08_425967.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425782v1.full.pdf [515471] -> "./cache/10_1101-2021_01_07_425782.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425697v1.full.pdf [325711] -> "./cache/10_1101-2021_01_07_425697.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2020.10.26.351783v2.full.pdf [464291] -> "./cache/10_1101-2020_10_26_351783.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425773v1.full.pdf [698997] -> "./cache/10_1101-2021_01_07_425773.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425637v1.full.pdf [2059489] -> "./cache/10_1101-2021_01_07_425637.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2020.04.17.043323v2.full.pdf [1380957] -> "./cache/10_1101-2020_04_17_043323.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.06.425581v1.full.pdf [1124172] -> "./cache/10_1101-2021_01_06_425581.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.06.425560v1.full.pdf [347001] -> "./cache/10_1101-2021_01_06_425560.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2020.12.26.424429v2.full.pdf [635843] -> "./cache/10_1101-2020_12_26_424429.pdf" [1]
2021-01-10 13:53:40 URL:https://www.biorxiv.org/content/10.1101/2021.01.05.425417v1.full.pdf [726035] -> "./cache/10_1101-2021_01_05_425417.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425801v1.full.pdf [1867988] -> "./cache/10_1101-2021_01_07_425801.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425794v1.full.pdf [2432557] -> "./cache/10_1101-2021_01_07_425794.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.07.425716v1.full.pdf [1494051] -> "./cache/10_1101-2021_01_07_425716.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2020.11.13.381475v2.full.pdf [885062] -> "./cache/10_1101-2020_11_13_381475.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/332965v6.full.pdf [1032701] -> "./cache/10_1101-332965.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2020.09.09.289074v3.full.pdf [1253804] -> "./cache/10_1101-2020_09_09_289074.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.05.425384v1.full.pdf [1150864] -> "./cache/10_1101-2021_01_05_425384.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2020.12.24.424332v2.full.pdf [502039] -> "./cache/10_1101-2020_12_24_424332.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.02.425006v2.full.pdf [2060734] -> "./cache/10_1101-2021_01_02_425006.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.04.425315v1.full.pdf [2312938] -> "./cache/10_1101-2021_01_04_425315.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.04.425285v1.full.pdf [1270813] -> "./cache/10_1101-2021_01_04_425285.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.05.425409v1.full.pdf [2965978] -> "./cache/10_1101-2021_01_05_425409.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.06.425550v1.full.pdf [1855323] -> "./cache/10_1101-2021_01_06_425550.pdf" [1]
2021-01-10 13:53:41 URL:https://www.biorxiv.org/content/10.1101/2021.01.05.425508v1.full.pdf [1540504] -> "./cache/10_1101-2021_01_05_425508.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2021.01.04.425288v1.full.pdf [2595876] -> "./cache/10_1101-2021_01_04_425288.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2020.05.22.110247v3.full.pdf [2010779] -> "./cache/10_1101-2020_05_22_110247.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2021.01.06.425544v1.full.pdf [4030944] -> "./cache/10_1101-2021_01_06_425544.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2020.08.13.249839v3.full.pdf [2612711] -> "./cache/10_1101-2020_08_13_249839.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2021.01.05.425414v2.full.pdf [5645102] -> "./cache/10_1101-2021_01_05_425414.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2021.01.06.425569v1.full.pdf [3141887] -> "./cache/10_1101-2021_01_06_425569.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/2020.01.29.925354v7.full.pdf [3585828] -> "./cache/10_1101-2020_01_29_925354.pdf" [1]
2021-01-10 13:53:42 URL:https://www.biorxiv.org/content/10.1101/436634v4.full.pdf [2530420] -> "./cache/10_1101-436634.pdf" [1]
2021-01-10 13:53:43 URL:https://www.biorxiv.org/content/10.1101/2020.08.28.271981v2.full.pdf [3422825] -> "./cache/10_1101-2020_08_28_271981.pdf" [1]
2021-01-10 13:53:43 URL:https://www.biorxiv.org/content/10.1101/2020.12.14.422697v2.full.pdf [3566702] -> "./cache/10_1101-2020_12_14_422697.pdf" [1]
2021-01-10 13:53:44 URL:https://www.biorxiv.org/content/10.1101/2021.01.04.425250v1.full.pdf [7491933] -> "./cache/10_1101-2021_01_04_425250.pdf" [1]
2021-01-10 13:53:44 URL:https://www.biorxiv.org/content/10.1101/2020.03.27.012757v3.full.pdf [8740861] -> "./cache/10_1101-2020_03_27_012757.pdf" [1]
2021-01-10 13:53:45 URL:https://www.biorxiv.org/content/10.1101/2021.01.05.425266v1.full.pdf [10964191] -> "./cache/10_1101-2021_01_05_425266.pdf" [1]
2021-01-10 13:53:45 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425918v1.full.pdf [16151735] -> "./cache/10_1101-2021_01_08_425918.pdf" [1]
2021-01-10 13:53:46 URL:https://www.biorxiv.org/content/10.1101/2021.01.04.425335v2.full.pdf [3704787] -> "./cache/10_1101-2021_01_04_425335.pdf" [1]
2021-01-10 13:53:49 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425379v1.full.pdf [23721305] -> "./cache/10_1101-2021_01_08_425379.pdf" [1]
2021-01-10 13:53:50 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425885v1.full.pdf [23691029] -> "./cache/10_1101-2021_01_08_425885.pdf" [1]
2021-01-10 13:53:51 URL:https://www.biorxiv.org/content/10.1101/2021.01.08.425897v1.full.pdf [5264590] -> "./cache/10_1101-2021_01_08_425897.pdf" [1]
=== updating bibliographic database
/data-disk/reader-compute/reader-classic/bin/bioarxiv2cache.sh: fork: retry: No child processes
Building study carrel named bioinformatics-from-biorxiv
parallel: Warning: Only enough available processes to run 21 jobs in parallel.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf 
parallel: Warning: or /proc/sys/kernel/pid_max may help.
parallel: Warning: No more processes: Decreasing number of running jobs to 20.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
  FILE: cache/10_1101-2021_01_08_425976.pdf
OUTPUT: txt/10_1101-2021_01_08_425976.txt
  FILE: cache/10_1101-2021_01_08_425887.pdf
OUTPUT: txt/10_1101-2021_01_08_425887.txt
  FILE: cache/10_1101-2021_01_08_425897.pdf
OUTPUT: txt/10_1101-2021_01_08_425897.txt
  FILE: cache/10_1101-2021_01_08_426008.pdf
OUTPUT: txt/10_1101-2021_01_08_426008.txt
  FILE: cache/10_1101-2021_01_08_425967.pdf
OUTPUT: txt/10_1101-2021_01_08_425967.txt
  FILE: cache/10_1101-2021_01_06_425581.pdf
OUTPUT: txt/10_1101-2021_01_06_425581.txt
  FILE: cache/10_1101-2021_01_07_425801.pdf
OUTPUT: txt/10_1101-2021_01_07_425801.txt
  FILE: cache/10_1101-2021_01_06_425560.pdf
OUTPUT: txt/10_1101-2021_01_06_425560.txt
  FILE: cache/10_1101-2020_04_17_043323.pdf
OUTPUT: txt/10_1101-2020_04_17_043323.txt
  FILE: cache/10_1101-2021_01_06_425494.pdf
OUTPUT: txt/10_1101-2021_01_06_425494.txt
  FILE: cache/10_1101-2021_01_07_425782.pdf
OUTPUT: txt/10_1101-2021_01_07_425782.txt
  FILE: cache/10_1101-2021_01_07_425637.pdf
OUTPUT: txt/10_1101-2021_01_07_425637.txt
  FILE: cache/10_1101-2020_03_27_012757.pdf
OUTPUT: txt/10_1101-2020_03_27_012757.txt
  FILE: cache/10_1101-2021_01_08_425952.pdf
OUTPUT: txt/10_1101-2021_01_08_425952.txt
  FILE: cache/10_1101-2021_01_07_425794.pdf
OUTPUT: txt/10_1101-2021_01_07_425794.txt
  FILE: cache/10_1101-2021_01_07_425773.pdf
OUTPUT: txt/10_1101-2021_01_07_425773.txt
  FILE: cache/10_1101-2021_01_08_425918.pdf
OUTPUT: txt/10_1101-2021_01_08_425918.txt
  FILE: cache/10_1101-2021_01_06_425546.pdf
OUTPUT: txt/10_1101-2021_01_06_425546.txt
/data-disk/reader-compute/reader-classic/bin/file2txt.sh: fork: retry: No child processes
  FILE: cache/10_1101-2021_01_08_425855.pdf
OUTPUT: txt/10_1101-2021_01_08_425855.txt
  FILE: cache/10_1101-2021_01_08_425885.pdf
OUTPUT: txt/10_1101-2021_01_08_425885.txt
  FILE: cache/10_1101-2021_01_06_425544.pdf
OUTPUT: txt/10_1101-2021_01_06_425544.txt
  FILE: cache/10_1101-2020_10_26_351783.pdf
OUTPUT: txt/10_1101-2020_10_26_351783.txt
  FILE: cache/10_1101-2021_01_07_425697.pdf
OUTPUT: txt/10_1101-2021_01_07_425697.txt
  FILE: cache/10_1101-2020_12_26_424429.pdf
OUTPUT: txt/10_1101-2020_12_26_424429.txt
  FILE: cache/10_1101-2021_01_05_425266.pdf
OUTPUT: txt/10_1101-2021_01_05_425266.txt
  FILE: cache/10_1101-2021_01_05_425414.pdf
OUTPUT: txt/10_1101-2021_01_05_425414.txt
  FILE: cache/10_1101-2021_01_06_425569.pdf
OUTPUT: txt/10_1101-2021_01_06_425569.txt
  FILE: cache/10_1101-2021_01_05_425384.pdf
OUTPUT: txt/10_1101-2021_01_05_425384.txt
  FILE: cache/10_1101-2021_01_07_425716.pdf
OUTPUT: txt/10_1101-2021_01_07_425716.txt
  FILE: cache/10_1101-332965.pdf
OUTPUT: txt/10_1101-332965.txt
  FILE: cache/10_1101-2021_01_05_425409.pdf
OUTPUT: txt/10_1101-2021_01_05_425409.txt
  FILE: cache/10_1101-2020_11_13_381475.pdf
OUTPUT: txt/10_1101-2020_11_13_381475.txt
  FILE: cache/10_1101-2021_01_02_425006.pdf
OUTPUT: txt/10_1101-2021_01_02_425006.txt
  FILE: cache/10_1101-2021_01_04_425315.pdf
OUTPUT: txt/10_1101-2021_01_04_425315.txt
  FILE: cache/10_1101-2020_05_22_110247.pdf
OUTPUT: txt/10_1101-2020_05_22_110247.txt
  FILE: cache/10_1101-2020_12_14_422697.pdf
OUTPUT: txt/10_1101-2020_12_14_422697.txt
  FILE: cache/10_1101-2021_01_06_425550.pdf
OUTPUT: txt/10_1101-2021_01_06_425550.txt
  FILE: cache/10_1101-2020_12_24_424332.pdf
OUTPUT: txt/10_1101-2020_12_24_424332.txt
/data-disk/reader-compute/reader-classic/bin/file2txt.sh: fork: retry: No child processes
  FILE: cache/10_1101-2021_01_05_425508.pdf
OUTPUT: txt/10_1101-2021_01_05_425508.txt
  FILE: cache/10_1101-2020_09_09_289074.pdf
OUTPUT: txt/10_1101-2020_09_09_289074.txt
  FILE: cache/10_1101-2021_01_04_425288.pdf
OUTPUT: txt/10_1101-2021_01_04_425288.txt
  FILE: cache/10_1101-2020_01_29_925354.pdf
OUTPUT: txt/10_1101-2020_01_29_925354.txt
  FILE: cache/10_1101-2021_01_04_425250.pdf
OUTPUT: txt/10_1101-2021_01_04_425250.txt
  FILE: cache/10_1101-2021_01_05_425417.pdf
OUTPUT: txt/10_1101-2021_01_05_425417.txt
/data-disk/reader-compute/reader-classic/bin/file2txt.sh: fork: retry: No child processes
  FILE: cache/10_1101-2021_01_04_425285.pdf
OUTPUT: txt/10_1101-2021_01_04_425285.txt
  FILE: cache/10_1101-436634.pdf
OUTPUT: txt/10_1101-436634.txt
  FILE: cache/10_1101-2020_08_13_249839.pdf
OUTPUT: txt/10_1101-2020_08_13_249839.txt
  FILE: cache/10_1101-2020_08_28_271981.pdf
OUTPUT: txt/10_1101-2020_08_28_271981.txt
  FILE: cache/10_1101-2021_01_04_425335.pdf
OUTPUT: txt/10_1101-2021_01_04_425335.txt
  FILE: cache/10_1101-2021_01_08_425379.pdf
OUTPUT: txt/10_1101-2021_01_08_425379.txt
parallel: Warning: Cannot spawn any jobs. Raising ulimit -u or 'nproc' in /etc/security/limits.conf
parallel: Warning: or /proc/sys/kernel/pid_max may help.
parallel: Warning: Only enough available processes to run 9 jobs in parallel.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf 
parallel: Warning: or /proc/sys/kernel/pid_max may help.
parallel: Warning: Only enough available processes to run 1 jobs in parallel.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf 
parallel: Warning: or /proc/sys/kernel/pid_max may help.
parallel: Warning: Only enough available processes to run 15 jobs in parallel.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf 
parallel: Warning: or /proc/sys/kernel/pid_max may help.
parallel: Warning: No more processes: Decreasing number of running jobs to 14.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
parallel: Warning: Only enough available processes to run 29 jobs in parallel.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf 
parallel: Warning: or /proc/sys/kernel/pid_max may help.
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
10_1101-2021_01_08_425976  txt/../ent/10_1101-2021_01_08_425976.ent
10_1101-2021_01_06_425494  txt/../pos/10_1101-2021_01_06_425494.pos
10_1101-2021_01_08_425976  txt/../pos/10_1101-2021_01_08_425976.pos
10_1101-2021_01_06_425494  txt/../ent/10_1101-2021_01_06_425494.ent
10_1101-2021_01_06_425546  txt/../ent/10_1101-2021_01_06_425546.ent
parallel: Warning: No more processes: Decreasing number of running jobs to 49.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
parallel: Warning: No more processes: Decreasing number of running jobs to 48.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
/data-disk/reader-compute/reader-classic/bin/txt2keywords.sh: fork: retry: No child processes
10_1101-2021_01_08_425976  txt/../wrd/10_1101-2021_01_08_425976.wrd
10_1101-2021_01_07_425782  txt/../ent/10_1101-2021_01_07_425782.ent
10_1101-2021_01_08_425967  txt/../pos/10_1101-2021_01_08_425967.pos
10_1101-2021_01_08_426008  txt/../pos/10_1101-2021_01_08_426008.pos
10_1101-2021_01_08_425967  txt/../wrd/10_1101-2021_01_08_425967.wrd
parallel: Warning: No more processes: Decreasing number of running jobs to 8.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
/data-disk/reader-compute/reader-classic/bin/txt2keywords.sh: fork: retry: No child processes
10_1101-2021_01_08_426008  txt/../wrd/10_1101-2021_01_08_426008.wrd
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37738 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425967  txt/../ent/10_1101-2021_01_08_425967.ent
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37879 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38133 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_426008  txt/../ent/10_1101-2021_01_08_426008.ent
10_1101-2021_01_08_425887  txt/../pos/10_1101-2021_01_08_425887.pos
10_1101-2020_12_26_424429  txt/../ent/10_1101-2020_12_26_424429.ent
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38093 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37790 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37921 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
10_1101-2020_04_17_043323  txt/../wrd/10_1101-2020_04_17_043323.wrd
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37739 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_06_425581  txt/../pos/10_1101-2021_01_06_425581.pos
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37628 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425887  txt/../wrd/10_1101-2021_01_08_425887.wrd
10_1101-2021_01_08_425918  txt/../wrd/10_1101-2021_01_08_425918.wrd
10_1101-2020_04_17_043323  txt/../pos/10_1101-2020_04_17_043323.pos
10_1101-2020_04_17_043323  txt/../ent/10_1101-2020_04_17_043323.ent
10_1101-2021_01_07_425773  txt/../ent/10_1101-2021_01_07_425773.ent
=== file2bib.sh ===
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: fork: retry: No child processes
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 39125 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38082 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425887  txt/../ent/10_1101-2021_01_08_425887.ent
/data-disk/reader-compute/reader-classic/bin/txt2pos.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2pos.sh: fork: retry: No child processes
10_1101-2021_01_08_425918  txt/../pos/10_1101-2021_01_08_425918.pos
=== file2bib.sh ===
         id: 10_1101-2021_01_06_425494
     author: Baldwin, Quenisha
      title: The topological free energy of proteins
       date: 2021
      pages: 
  extension: .pdf
        txt: ./txt/10_1101-2021_01_06_425494.txt
      cache: ./cache/10_1101-2021_01_06_425494.pdf

Content-Encoding	UTF-8
Content-Language	en
Content-Type	application/xhtml+xml; charset=UTF-8
Content-Type-Hint	text/html; charset=utf-8
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.html.HtmlParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	7
dc:title	bioRxiv.org - the preprint server for Biology
description	bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
generator	Drupal 7 (http://drupal.org)
resourceName	b'10_1101-2021_01_06_425494.pdf'
title	bioRxiv.org - the preprint server for Biology
twitter:card	summary
twitter:description	Welcome to the bioRxiv homepage.
twitter:site	@biorxivpreprint
twitter:title	bioRxiv
viewport	width=device-width, initial-scale=1, maximum-scale=3, minimum-scale=1, user-scalable=yes
10_1101-2021_01_06_425550  txt/../ent/10_1101-2021_01_06_425550.ent
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37978 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37922 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
         id: 10_1101-2021_01_08_425976
     author: Ahuja, Yuri
      title: Semi-supervised Calibration of Risk with Noisy Event Times (SCORNET) Using Electronic Health Record Data
       date: 2021
      pages: 
  extension: .pdf
        txt: ./txt/10_1101-2021_01_08_425976.txt
      cache: ./cache/10_1101-2021_01_08_425976.pdf

Content-Encoding	UTF-8
Content-Language	en
Content-Type	application/xhtml+xml; charset=UTF-8
Content-Type-Hint	text/html; charset=utf-8
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.html.HtmlParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	4
dc:title	bioRxiv.org - the preprint server for Biology
description	bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
generator	Drupal 7 (http://drupal.org)
resourceName	b'10_1101-2021_01_08_425976.pdf'
title	bioRxiv.org - the preprint server for Biology
twitter:card	summary
twitter:description	Welcome to the bioRxiv homepage.
twitter:site	@biorxivpreprint
twitter:title	bioRxiv
viewport	width=device-width, initial-scale=1, maximum-scale=3, minimum-scale=1, user-scalable=yes
10_1101-2021_01_08_425952  txt/../pos/10_1101-2021_01_08_425952.pos
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37448 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38129 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425897  txt/../wrd/10_1101-2021_01_08_425897.wrd
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38086 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: fork: retry: No child processes
         id: 10_1101-2021_01_06_425546
     author: Osthus, Dave
      title: Fast and Accurate Influenza Forecasting in the United States with Inferno
       date: 2021
      pages: 
  extension: .pdf
        txt: ./txt/10_1101-2021_01_06_425546.txt
      cache: ./cache/10_1101-2021_01_06_425546.pdf

Content-Encoding	UTF-8
Content-Language	en
Content-Type	application/xhtml+xml; charset=UTF-8
Content-Type-Hint	text/html; charset=utf-8
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.html.HtmlParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	5
dc:title	bioRxiv.org - the preprint server for Biology
description	bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
generator	Drupal 7 (http://drupal.org)
resourceName	b'10_1101-2021_01_06_425546.pdf'
title	bioRxiv.org - the preprint server for Biology
twitter:card	summary
twitter:description	Welcome to the bioRxiv homepage.
twitter:site	@biorxivpreprint
twitter:title	bioRxiv
viewport	width=device-width, initial-scale=1, maximum-scale=3, minimum-scale=1, user-scalable=yes
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38048 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425952  txt/../wrd/10_1101-2021_01_08_425952.wrd
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37986 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38036 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38124 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425952  txt/../ent/10_1101-2021_01_08_425952.ent
=== file2bib.sh ===
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: fork: retry: No child processes
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38252 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425855  txt/../ent/10_1101-2021_01_08_425855.ent
10_1101-2021_01_07_425637  txt/../pos/10_1101-2021_01_07_425637.pos
10_1101-2021_01_08_425897  txt/../pos/10_1101-2021_01_08_425897.pos
10_1101-2021_01_06_425581  txt/../ent/10_1101-2021_01_06_425581.ent
10_1101-2021_01_07_425782  txt/../pos/10_1101-2021_01_07_425782.pos
10_1101-2021_01_05_425384  txt/../ent/10_1101-2021_01_05_425384.ent
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38134 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37994 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_06_425544  txt/../ent/10_1101-2021_01_06_425544.ent
10_1101-2021_01_07_425801  txt/../pos/10_1101-2021_01_07_425801.pos
10_1101-2021_01_06_425494  txt/../wrd/10_1101-2021_01_06_425494.wrd
parallel: Warning: No more processes: Decreasing number of running jobs to 13.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
10_1101-2021_01_08_425918  txt/../ent/10_1101-2021_01_08_425918.ent
10_1101-2021_01_07_425697  txt/../ent/10_1101-2021_01_07_425697.ent
10_1101-2021_01_07_425794  txt/../ent/10_1101-2021_01_07_425794.ent
10_1101-2021_01_07_425794  txt/../pos/10_1101-2021_01_07_425794.pos
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38099 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_07_425801  txt/../ent/10_1101-2021_01_07_425801.ent
=== file2bib.sh ===
         id: 10_1101-2021_01_08_426008
     author: Pipes, Lenore
      title: AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees
       date: 2021
      pages: 7
  extension: .pdf
        txt: ./txt/10_1101-2021_01_08_426008.txt
      cache: ./cache/10_1101-2021_01_08_426008.pdf

Content-Type	application/pdf
Creation-Date	2021-01-09T06:37:34Z
Last-Modified	2021-01-10T13:53:39Z
Last-Save-Date	2021-01-10T13:53:39Z
PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	153
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-09T06:37:34Z
date	2021-01-10T13:53:39Z
dc:format	application/pdf; version=1.6
dc:title	AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees
dcterms:created	2021-01-09T06:37:34Z
dcterms:modified	2021-01-10T13:53:39Z
meta:creation-date	2021-01-09T06:37:34Z
meta:save-date	2021-01-10T13:53:39Z
modified	2021-01-10T13:53:39Z
pdf:PDFVersion	1.6
pdf:charsPerPage	['3555', '5141', '5584', '2225', '1264', '1496', '1290']
pdf:docinfo:created	2021-01-09T06:37:34Z
pdf:docinfo:creator_tool	TeX
pdf:docinfo:custom:PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
pdf:docinfo:modified	2021-01-10T13:53:39Z
pdf:docinfo:producer	pdfTeX-1.40.20
pdf:docinfo:title	AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees
pdf:docinfo:trapped	False
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '3', '0', '0', '0', '0', '0']
producer	pdfTeX-1.40.20
resourceName	b'10_1101-2021_01_08_426008.pdf'
title	AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees
trapped	False
xmp:CreatorTool	TeX
xmpMM:DocumentID	uuid:fdfd1c65-a18d-ac42-91bc-281909f7fc61
xmpTPg:NPages	7
10_1101-2021_01_05_425266  txt/../ent/10_1101-2021_01_05_425266.ent
/data-disk/reader-compute/reader-classic/bin/txt2pos.sh: fork: retry: No child processes
10_1101-2021_01_06_425546  txt/../pos/10_1101-2021_01_06_425546.pos
10_1101-2021_01_07_425773  txt/../pos/10_1101-2021_01_07_425773.pos
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 38094 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_06_425560  txt/../pos/10_1101-2021_01_06_425560.pos
=== file2bib.sh ===
         id: 10_1101-2021_01_07_425801
     author: Hou, Yapeng
      title: Fibrinolysis influences SARS-CoV-2 infection in ciliated cells
       date: 2021
      pages: 18
  extension: .pdf
        txt: ./txt/10_1101-2021_01_07_425801.txt
      cache: ./cache/10_1101-2021_01_07_425801.pdf

Author	Administrator
Content-Type	application/pdf
Creation-Date	2021-01-08T03:07:53Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	518
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-08T03:07:53Z
creator	Administrator
date	2021-01-10T13:53:40Z
dc:creator	Administrator
dc:format	application/pdf; version=1.5
dc:language	zh-CN
dc:title	Fibrinolysis influences SARS-CoV-2 infection in ciliated cells
dcterms:created	2021-01-08T03:07:53Z
dcterms:modified	2021-01-10T13:53:40Z
language	zh-CN
meta:author	Administrator
meta:creation-date	2021-01-08T03:07:53Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['722', '1871', '3487', '3197', '3371', '3756', '3316', '3064', '627', '5265', '5443', '3671', '1281', '462', '884', '1108', '859', '952']
pdf:docinfo:created	2021-01-08T03:07:53Z
pdf:docinfo:creator	Administrator
pdf:docinfo:creator_tool	Microsoft® Word 2016
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Microsoft® Word 2016
pdf:docinfo:title	Fibrinolysis influences SARS-CoV-2 infection in ciliated cells
pdf:encrypted	false
pdf:hasMarkedContent	true
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Microsoft® Word 2016
resourceName	b'10_1101-2021_01_07_425801.pdf'
title	Fibrinolysis influences SARS-CoV-2 infection in ciliated cells
xmp:CreatorTool	Microsoft® Word 2016
xmpMM:DocumentID	uuid:7ce519dd-1dd2-11b2-0a00-7109276d7200
xmpTPg:NPages	18
=== file2bib.sh ===
         id: 10_1101-2021_01_07_425697
     author: Luo, Yin
      title: Capsule network for protein ubiquitination site prediction
       date: 2021
      pages: 14
  extension: .pdf
        txt: ./txt/10_1101-2021_01_07_425697.txt
      cache: ./cache/10_1101-2021_01_07_425697.pdf

Content-Type	application/pdf
Creation-Date	2021-01-10T13:53:40Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	166
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-10T13:53:40Z
date	2021-01-10T13:53:40Z
dc:format	application/pdf; version=1.7
dc:language	en-US
dc:title	Capsule network for protein ubiquitination site prediction
dcterms:created	2021-01-10T13:53:40Z
dcterms:modified	2021-01-10T13:53:40Z
language	en-US
meta:creation-date	2021-01-10T13:53:40Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.7
pdf:charsPerPage	['2688', '3466', '2582', '2785', '2196', '2236', '2406', '2250', '1972', '2118', '2509', '2637', '2867', '846']
pdf:docinfo:created	2021-01-10T13:53:40Z
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:title	Capsule network for protein ubiquitination site prediction
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
resourceName	b'10_1101-2021_01_07_425697.pdf'
title	Capsule network for protein ubiquitination site prediction
xmpMM:DocumentID	uuid:7ce5345f-1dd2-11b2-0a00-080a276d7200
xmpTPg:NPages	14
=== file2bib.sh ===
         id: 10_1101-2020_04_17_043323
     author: Waschke, Johannes
      title: <em>linus</em>: Conveniently explore, share, and present large-scale biological trajectory data from a web browser
       date: 2021
      pages: 13
  extension: .pdf
        txt: ./txt/10_1101-2020_04_17_043323.txt
      cache: ./cache/10_1101-2020_04_17_043323.pdf

Content-Type	application/pdf
Creation-Date	2021-01-10T13:53:40Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	279
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-10T13:53:40Z
date	2021-01-10T13:53:40Z
dc:format	application/pdf; version=1.5
dc:title	linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser
dcterms:created	2021-01-10T13:53:40Z
dcterms:modified	2021-01-10T13:53:40Z
meta:creation-date	2021-01-10T13:53:40Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['2135', '5735', '5933', '5935', '5592', '4833', '4365', '900', '1468', '2008', '949', '612', '752']
pdf:docinfo:created	2021-01-10T13:53:40Z
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Skia/PDF m89
pdf:docinfo:title	linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Skia/PDF m89
resourceName	b'10_1101-2020_04_17_043323.pdf'
title	linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser
xmpMM:DocumentID	uuid:7ce4fdf1-1dd2-11b2-0a00-120a27bd3700
xmpTPg:NPages	13
=== file2bib.sh ===
         id: 10_1101-2021_01_05_425384
     author: Halilaj, Iva
      title: Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
       date: 2021
      pages: 17
  extension: .pdf
        txt: ./txt/10_1101-2021_01_05_425384.txt
      cache: ./cache/10_1101-2021_01_05_425384.pdf

Content-Type	application/pdf
Creation-Date	2021-01-10T13:53:40Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	323
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-10T13:53:40Z
date	2021-01-10T13:53:40Z
dc:format	application/pdf; version=1.4
dc:title	Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
dcterms:created	2021-01-10T13:53:40Z
dcterms:modified	2021-01-10T13:53:40Z
meta:creation-date	2021-01-10T13:53:40Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['2051', '2105', '2362', '1765', '1582', '1343', '1585', '2011', '2055', '2129', '1890', '2654', '2281', '336', '336', '336', '336']
pdf:docinfo:created	2021-01-10T13:53:40Z
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	PyPDF2
pdf:docinfo:title	Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	PyPDF2
resourceName	b'10_1101-2021_01_05_425384.pdf'
title	Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
xmpMM:DocumentID	uuid:7ce62e13-1dd2-11b2-0a00-9809271d5700
xmpTPg:NPages	17
=== file2bib.sh ===
         id: 10_1101-2020_12_26_424429
     author: Nayar, Gowri
      title: Analysis and Forecasting of Global RT-PCR Primers for SARS-CoV-2
       date: 2021
      pages: 13
  extension: .pdf
        txt: ./txt/10_1101-2020_12_26_424429.txt
      cache: ./cache/10_1101-2020_12_26_424429.pdf

Author	
Content-Type	application/pdf
Creation-Date	2021-01-06T13:51:52Z
Keywords	
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	167
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
cp:subject	
created	2021-01-06T13:51:52Z
creator	
date	2021-01-10T13:53:40Z
dc:creator	
dc:format	application/pdf; version=1.5
dc:subject	
dc:title	Analysis and Forecasting of Global RT-PCR Primers for SARS-CoV-2
dcterms:created	2021-01-06T13:51:52Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	
meta:creation-date	2021-01-06T13:51:52Z
meta:keyword	
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['4413', '4643', '5089', '4501', '3150', '1086', '507', '499', '562', '617', '721', '498', '439']
pdf:docinfo:created	2021-01-06T13:51:52Z
pdf:docinfo:creator	
pdf:docinfo:creator_tool	LaTeX with hyperref
pdf:docinfo:custom:PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
pdf:docinfo:keywords	
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	pdfTeX-1.40.20
pdf:docinfo:subject	
pdf:docinfo:title	Analysis and Forecasting of Global RT-PCR Primers for SARS-CoV-2
pdf:docinfo:trapped	False
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	pdfTeX-1.40.20
resourceName	b'10_1101-2020_12_26_424429.pdf'
subject	
title	Analysis and Forecasting of Global RT-PCR Primers for SARS-CoV-2
trapped	False
xmp:CreatorTool	LaTeX with hyperref
xmpMM:DocumentID	uuid:7ce5549f-1dd2-11b2-0a00-5508275d6100
xmpTPg:NPages	13
10_1101-2020_10_26_351783  txt/../ent/10_1101-2020_10_26_351783.ent
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37503 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
=== file2bib.sh ===
         id: 10_1101-2020_09_09_289074
     author: Ortuso, Francesco
      title: Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex
       date: 2021
      pages: 23
  extension: .pdf
        txt: ./txt/10_1101-2020_09_09_289074.txt
      cache: ./cache/10_1101-2020_09_09_289074.pdf

Author	Administrator
Content-Type	application/pdf
Creation-Date	2021-01-03T12:48:26Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	336
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-03T12:48:26Z
creator	Administrator
date	2021-01-10T13:53:40Z
dc:creator	Administrator
dc:format	application/pdf; version=1.4
dc:title	Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex
dcterms:created	2021-01-03T12:48:26Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	Administrator
meta:creation-date	2021-01-03T12:48:26Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['1600', '3000', '4455', '3521', '3822', '4499', '3952', '3723', '4011', '3749', '880', '374', '754', '677', '731', '1103', '787', '1724', '1655', '871', '1698', '668', '1039']
pdf:docinfo:created	2021-01-03T12:48:26Z
pdf:docinfo:creator	Administrator
pdf:docinfo:creator_tool	PScript5.dll Version 5.2.2
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Acrobat Distiller 8.1.0 (Windows)
pdf:docinfo:title	Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Acrobat Distiller 8.1.0 (Windows)
resourceName	b'10_1101-2020_09_09_289074.pdf'
title	Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex
xmp:CreatorTool	PScript5.dll Version 5.2.2
xmpMM:DocumentID	uuid:7ce60f7d-1dd2-11b2-0a00-3e0927fd5800
xmpTPg:NPages	23
10_1101-2020_03_27_012757  txt/../wrd/10_1101-2020_03_27_012757.wrd
10_1101-2021_01_06_425569  txt/../ent/10_1101-2021_01_06_425569.ent
10_1101-2021_01_07_425637  txt/../ent/10_1101-2021_01_07_425637.ent
=== file2bib.sh ===
         id: 10_1101-2021_01_06_425550
     author: Chen, Nae-Chyun
      title: Improving variant calling using population data and deep learning
       date: 2021
      pages: 17
  extension: .pdf
        txt: ./txt/10_1101-2021_01_06_425550.txt
      cache: ./cache/10_1101-2021_01_06_425550.pdf

Author	
Content-Type	application/pdf
Creation-Date	2021-01-06T13:30:43Z
Keywords	
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	3048
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
cp:subject	
created	2021-01-06T13:30:43Z
creator	
date	2021-01-10T13:53:40Z
dc:creator	
dc:format	application/pdf; version=1.5
dc:subject	
dc:title	Improving variant calling using population data and deep learning
dcterms:created	2021-01-06T13:30:43Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	
meta:creation-date	2021-01-06T13:30:43Z
meta:keyword	
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['1892', '3079', '2785', '1502', '3130', '2042', '2228', '2130', '2295', '2597', '2427', '2331', '1956', '1888', '1890', '2459', '2403']
pdf:docinfo:created	2021-01-06T13:30:43Z
pdf:docinfo:creator	
pdf:docinfo:creator_tool	LaTeX with hyperref
pdf:docinfo:custom:PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
pdf:docinfo:keywords	
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	pdfTeX-1.40.20
pdf:docinfo:subject	
pdf:docinfo:title	Improving variant calling using population data and deep learning
pdf:docinfo:trapped	False
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	pdfTeX-1.40.20
resourceName	b'10_1101-2021_01_06_425550.pdf'
subject	
title	Improving variant calling using population data and deep learning
trapped	False
xmp:CreatorTool	LaTeX with hyperref
xmpMM:DocumentID	uuid:7ce548ad-1dd2-11b2-0a00-1909277d8900
xmpTPg:NPages	17
=== file2bib.sh ===
         id: 10_1101-2021_01_08_425887
     author: Hu, Yan
      title: Auto-CORPus: Automated and Consistent Outputs from Research Publications
       date: 2021
      pages: 10
  extension: .pdf
        txt: ./txt/10_1101-2021_01_08_425887.txt
      cache: ./cache/10_1101-2021_01_08_425887.pdf

Author	
Content-Type	application/pdf
Creation-Date	2021-01-08T14:22:00Z
Keywords	
Last-Modified	2021-01-10T13:53:39Z
Last-Save-Date	2021-01-10T13:53:39Z
PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	201
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
cp:subject	
created	2021-01-08T14:22:00Z
creator	
date	2021-01-10T13:53:39Z
dc:creator	
dc:format	application/pdf; version=1.5
dc:subject	
dc:title	Auto-CORPus: Automated and Consistent Outputs from Research Publications
dcterms:created	2021-01-08T14:22:00Z
dcterms:modified	2021-01-10T13:53:39Z
meta:author	
meta:creation-date	2021-01-08T14:22:00Z
meta:keyword	
meta:save-date	2021-01-10T13:53:39Z
modified	2021-01-10T13:53:39Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['4806', '2694', '5605', '5607', '3771', '3643', '2723', '4024', '3545', '3485']
pdf:docinfo:created	2021-01-08T14:22:00Z
pdf:docinfo:creator	
pdf:docinfo:creator_tool	LaTeX with hyperref
pdf:docinfo:custom:PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.20 (TeX Live 2019) kpathsea version 6.3.1
pdf:docinfo:keywords	
pdf:docinfo:modified	2021-01-10T13:53:39Z
pdf:docinfo:producer	pdfTeX-1.40.20
pdf:docinfo:subject	
pdf:docinfo:title	Auto-CORPus: Automated and Consistent Outputs from Research Publications
pdf:docinfo:trapped	False
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['1', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	pdfTeX-1.40.20
resourceName	b'10_1101-2021_01_08_425887.pdf'
subject	
title	Auto-CORPus: Automated and Consistent Outputs from Research Publications
trapped	False
xmp:CreatorTool	LaTeX with hyperref
xmpMM:DocumentID	uuid:7ce4275f-1dd2-11b2-0a00-0b0927bd3700
xmpTPg:NPages	10
10_1101-2020_03_27_012757  txt/../pos/10_1101-2020_03_27_012757.pos
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2021_01_05_425414  txt/../ent/10_1101-2021_01_05_425414.ent
=== file2bib.sh ===
         id: 10_1101-2021_01_08_425855
     author: Wu, Canbiao
      title: DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites.
       date: 2021
      pages: 31
  extension: .pdf
        txt: ./txt/10_1101-2021_01_08_425855.txt
      cache: ./cache/10_1101-2021_01_08_425855.pdf

Content-Type	application/pdf
Creation-Date	2021-01-10T13:53:39Z
Last-Modified	2021-01-10T13:53:39Z
Last-Save-Date	2021-01-10T13:53:39Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	364
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-10T13:53:39Z
date	2021-01-10T13:53:39Z
dc:format	application/pdf; version=1.4
dc:title	DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites.
dcterms:created	2021-01-10T13:53:39Z
dcterms:modified	2021-01-10T13:53:39Z
meta:creation-date	2021-01-10T13:53:39Z
meta:save-date	2021-01-10T13:53:39Z
modified	2021-01-10T13:53:39Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['1607', '1916', '1126', '1990', '1775', '2001', '2031', '1882', '2110', '2101', '1667', '905', '2003', '1987', '1741', '1883', '1718', '1706', '1520', '1804', '1730', '1537', '1901', '1905', '606', '2016', '1101', '336', '336', '336', '336']
pdf:docinfo:created	2021-01-10T13:53:39Z
pdf:docinfo:modified	2021-01-10T13:53:39Z
pdf:docinfo:title	DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites.
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
resourceName	b'10_1101-2021_01_08_425855.pdf'
title	DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites.
xmpMM:DocumentID	uuid:7ce43a60-1dd2-11b2-0a00-9e09276d7200
xmpTPg:NPages	31
10_1101-2021_01_04_425315  txt/../ent/10_1101-2021_01_04_425315.ent
10_1101-2020_12_24_424332  txt/../ent/10_1101-2020_12_24_424332.ent
10_1101-2021_01_08_425897  txt/../ent/10_1101-2021_01_08_425897.ent
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37902 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2021_01_08_425855  txt/../pos/10_1101-2021_01_08_425855.pos
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
=== file2bib.sh ===
         id: 10_1101-2021_01_04_425315
     author: Chen, Lulu
      title: Sample-wise unsupervised deconvolution of complex tissues
       date: 2021
      pages: 29
  extension: .pdf
        txt: ./txt/10_1101-2021_01_04_425315.txt
      cache: ./cache/10_1101-2021_01_04_425315.pdf

Author	Lulu Chen
Content-Type	application/pdf
Creation-Date	2021-01-04T23:53:51Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	1489
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-04T23:53:51Z
creator	Lulu Chen
date	2021-01-10T13:53:40Z
dc:creator	Lulu Chen
dc:format	application/pdf; version=1.7
dc:language	zh-TW
dc:title	Sample-wise unsupervised deconvolution of complex tissues
dcterms:created	2021-01-04T23:53:51Z
dcterms:modified	2021-01-10T13:53:40Z
language	zh-TW
meta:author	Lulu Chen
meta:creation-date	2021-01-04T23:53:51Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.7
pdf:charsPerPage	['1311', '1662', '2963', '2208', '1333', '1654', '1262', '1408', '934', '1313', '1963', '1083', '1865', '2277', '1818', '524', '514', '1301', '2816', '555', '518', '593', '557', '736', '2652', '2768', '1153', '3421', '1687']
pdf:docinfo:created	2021-01-04T23:53:51Z
pdf:docinfo:creator	Lulu Chen
pdf:docinfo:creator_tool	Microsoft® Word 2019
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Microsoft® Word 2019
pdf:docinfo:title	Sample-wise unsupervised deconvolution of complex tissues
pdf:encrypted	false
pdf:hasMarkedContent	true
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '2', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Microsoft® Word 2019
resourceName	b'10_1101-2021_01_04_425315.pdf'
title	Sample-wise unsupervised deconvolution of complex tissues
xmp:CreatorTool	Microsoft® Word 2019
xmpMM:DocumentID	uuid:A61ECC4D-00FA-4B39-93FF-9569F8B8E46A
xmpTPg:NPages	29
=== file2bib.sh ===
         id: 10_1101-332965
     author: Banerjee, Arpita
      title: Identification and design of vinyl sulfone inhibitors against Cryptopain-1 – a cysteine protease from cryptosporidiosis-causing <em>Cryptosporidium parvum</em>
       date: 2021
      pages: 30
  extension: .pdf
        txt: ./txt/10_1101-332965.txt
      cache: ./cache/10_1101-332965.pdf

Appligent	AppendPDF Pro 5.5 Linux Kernel 2.6 64bit Oct  2 2014 Library 10.1.0
Author	Administrator
Content-Type	application/pdf
Creation-Date	2021-01-06T15:06:22Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	425
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-06T15:06:22Z
creator	Administrator
date	2021-01-10T13:53:40Z
dc:creator	Administrator
dc:format	application/pdf; version=1.4
dc:title	96865564
dcterms:created	2021-01-06T15:06:22Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	Administrator
meta:creation-date	2021-01-06T15:06:22Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['706', '2387', '3912', '3428', '3769', '3016', '3658', '3316', '3652', '3492', '3337', '2856', '2939', '1528', '1350', '1426', '1431', '1380', '1499', '1749', '2476', '1620', '1380', '702', '724', '401', '402', '400', '668', '664']
pdf:docinfo:created	2021-01-06T15:06:22Z
pdf:docinfo:creator	Administrator
pdf:docinfo:creator_tool	Appligent AppendPDF Pro 5.5
pdf:docinfo:custom:Appligent	AppendPDF Pro 5.5 Linux Kernel 2.6 64bit Oct  2 2014 Library 10.1.0
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Acrobat Distiller 8.1.0 (Windows)
pdf:docinfo:title	96865564
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Acrobat Distiller 8.1.0 (Windows)
resourceName	b'10_1101-332965.pdf'
title	96865564
xmp:CreatorTool	Appligent AppendPDF Pro 5.5
xmpMM:DocumentID	uuid:8af60254-b03e-11b2-0a00-782dad000000
xmpTPg:NPages	30
10_1101-2021_01_06_425581  txt/../wrd/10_1101-2021_01_06_425581.wrd
10_1101-2020_10_26_351783  txt/../pos/10_1101-2020_10_26_351783.pos
10_1101-2021_01_06_425560  txt/../ent/10_1101-2021_01_06_425560.ent
10_1101-2021_01_07_425637  txt/../wrd/10_1101-2021_01_07_425637.wrd
10_1101-2021_01_06_425560  txt/../wrd/10_1101-2021_01_06_425560.wrd
10_1101-2020_11_13_381475  txt/../ent/10_1101-2020_11_13_381475.ent
=== file2bib.sh ===
         id: 10_1101-2021_01_07_425794
     author: Chisanga, David
      title: Impact of gene annotation choice on the quantification of RNA-seq data
       date: 2021
      pages: 24
  extension: .pdf
        txt: ./txt/10_1101-2021_01_07_425794.txt
      cache: ./cache/10_1101-2021_01_07_425794.pdf

Content-Type	application/pdf
Creation-Date	2021-01-08T01:31:03Z
Last-Modified	2021-01-10T13:53:39Z
Last-Save-Date	2021-01-10T13:53:39Z
PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.21 (TeX Live 2020) kpathsea version 6.3.2
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	340
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-08T01:31:03Z
date	2021-01-10T13:53:39Z
dc:format	application/pdf; version=1.5
dc:title	Impact of gene annotation choice on the quantification of RNA-seq data
dcterms:created	2021-01-08T01:31:03Z
dcterms:modified	2021-01-10T13:53:39Z
meta:creation-date	2021-01-08T01:31:03Z
meta:save-date	2021-01-10T13:53:39Z
modified	2021-01-10T13:53:39Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['2242', '2821', '2618', '2632', '2654', '1873', '2649', '2185', '2667', '2100', '1901', '2821', '1498', '1827', '1472', '1386', '2641', '1229', '2875', '2744', '2158', '2317', '2215', '463']
pdf:docinfo:created	2021-01-08T01:31:03Z
pdf:docinfo:creator_tool	TeX
pdf:docinfo:custom:PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.21 (TeX Live 2020) kpathsea version 6.3.2
pdf:docinfo:modified	2021-01-10T13:53:39Z
pdf:docinfo:producer	pdfTeX-1.40.21
pdf:docinfo:title	Impact of gene annotation choice on the quantification of RNA-seq data
pdf:docinfo:trapped	False
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['2', '0', '0', '0', '0', '2', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '2', '0', '0', '0']
producer	pdfTeX-1.40.21
resourceName	b'10_1101-2021_01_07_425794.pdf'
title	Impact of gene annotation choice on the quantification of RNA-seq data
trapped	False
xmp:CreatorTool	TeX
xmpMM:DocumentID	uuid:7ce4d3da-1dd2-11b2-0a00-bb0827fd5800
xmpTPg:NPages	24
10_1101-2021_01_07_425773  txt/../wrd/10_1101-2021_01_07_425773.wrd
10_1101-2021_01_06_425550  txt/../pos/10_1101-2021_01_06_425550.pos
10_1101-2021_01_06_425544  txt/../pos/10_1101-2021_01_06_425544.pos
10_1101-2021_01_07_425801  txt/../wrd/10_1101-2021_01_07_425801.wrd
10_1101-2020_12_26_424429  txt/../pos/10_1101-2020_12_26_424429.pos
10_1101-2021_01_07_425716  txt/../ent/10_1101-2021_01_07_425716.ent
=== file2bib.sh ===
         id: 10_1101-2021_01_04_425250
     author: Thibodeau, Asa
      title: A read count-based method to detect multiplets and their cellular origins from snATAC-seq data
       date: 2021
      pages: 32
  extension: .pdf
        txt: ./txt/10_1101-2021_01_04_425250.txt
      cache: ./cache/10_1101-2021_01_04_425250.pdf

Content-Type	application/pdf
Creation-Date	2021-01-04T18:10:51Z
Last-Modified	2021-01-10T13:53:41Z
Last-Save-Date	2021-01-10T13:53:41Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	1511
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-04T18:10:51Z
date	2021-01-10T13:53:41Z
dc:format	application/pdf; version=1.4
dc:title	A read count-based method to detect multiplets and their cellular origins from snATAC-seq data
dcterms:created	2021-01-04T18:10:51Z
dcterms:modified	2021-01-10T13:53:41Z
meta:creation-date	2021-01-04T18:10:51Z
meta:save-date	2021-01-10T13:53:41Z
modified	2021-01-10T13:53:41Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['2433', '3265', '3115', '3083', '3173', '3376', '3036', '2845', '3309', '1871', '2931', '3153', '3138', '3024', '2317', '2568', '1281', '1061', '1292', '1472', '1077', '1387', '584', '890', '1029', '1015', '767', '1230', '985', '652', '1257', '1028']
pdf:docinfo:created	2021-01-04T18:10:51Z
pdf:docinfo:creator_tool	Word
pdf:docinfo:modified	2021-01-10T13:53:41Z
pdf:docinfo:producer	macOS Version 10.15.2 (Build 19C57) Quartz PDFContext
pdf:docinfo:title	A read count-based method to detect multiplets and their cellular origins from snATAC-seq data
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	macOS Version 10.15.2 (Build 19C57) Quartz PDFContext
resourceName	b'10_1101-2021_01_04_425250.pdf'
title	A read count-based method to detect multiplets and their cellular origins from snATAC-seq data
xmp:CreatorTool	Word
xmpMM:DocumentID	uuid:7ce6bbb5-1dd2-11b2-0a00-a808276d7200
xmpTPg:NPages	32
=== file2bib.sh ===
         id: 10_1101-2021_01_07_425637
     author: Arneson, Adriana
      title: A mammalian methylation array for profiling methylation levels at conserved sequences
       date: 2021
      pages: 39
  extension: .pdf
        txt: ./txt/10_1101-2021_01_07_425637.txt
      cache: ./cache/10_1101-2021_01_07_425637.pdf

Author	Ernst, Jason
Content-Type	application/pdf
Creation-Date	2021-01-07T17:32:37Z
Last-Modified	2021-01-10T12:02:05Z
Last-Save-Date	2021-01-10T12:02:05Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	596
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-07T17:32:37Z
creator	Ernst, Jason
date	2021-01-10T12:02:05Z
dc:creator	Ernst, Jason
dc:format	application/pdf; version=1.5
dc:language	en-US
dc:title	A mammalian methylation array for profiling methylation levels at conserved sequences
dcterms:created	2021-01-07T17:32:37Z
dcterms:modified	2021-01-10T12:02:05Z
language	en-US
meta:author	Ernst, Jason
meta:creation-date	2021-01-07T17:32:37Z
meta:save-date	2021-01-10T12:02:05Z
modified	2021-01-10T12:02:05Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['1899', '2339', '2223', '2387', '2621', '2463', '2523', '2277', '2258', '2261', '2377', '2414', '2508', '2378', '2346', '1987', '2308', '2430', '2120', '2211', '1815', '2243', '1644', '1586', '1000', '1003', '1723', '875', '1980', '1114', '1577', '1293', '1092', '1917', '602', '910', '3079', '3137', '2921']
pdf:docinfo:created	2021-01-07T17:32:37Z
pdf:docinfo:creator	Ernst, Jason
pdf:docinfo:creator_tool	Microsoft® Word 2013
pdf:docinfo:modified	2021-01-10T12:02:05Z
pdf:docinfo:producer	Microsoft® Word 2013
pdf:docinfo:title	A mammalian methylation array for profiling methylation levels at conserved sequences
pdf:encrypted	false
pdf:hasMarkedContent	true
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Microsoft® Word 2013
resourceName	b'10_1101-2021_01_07_425637.pdf'
title	A mammalian methylation array for profiling methylation levels at conserved sequences
xmp:CreatorTool	Microsoft® Word 2013
xmpMM:DocumentID	uuid:54fddfc9-1dd2-11b2-0a00-010927edca00
xmpTPg:NPages	39
10_1101-2020_05_22_110247  txt/../ent/10_1101-2020_05_22_110247.ent
10_1101-2021_01_07_425794  txt/../wrd/10_1101-2021_01_07_425794.wrd
/data-disk/reader-compute/reader-classic/bin/txt2ent.sh: fork: retry: No child processes
10_1101-332965  txt/../ent/10_1101-332965.ent
10_1101-2021_01_07_425697  txt/../pos/10_1101-2021_01_07_425697.pos
10_1101-2021_01_07_425782  txt/../wrd/10_1101-2021_01_07_425782.wrd
10_1101-2021_01_05_425409  txt/../ent/10_1101-2021_01_05_425409.ent
10_1101-2020_03_27_012757  txt/../ent/10_1101-2020_03_27_012757.ent
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2021_01_05_425266  txt/../pos/10_1101-2021_01_05_425266.pos
10_1101-2021_01_05_425508  txt/../ent/10_1101-2021_01_05_425508.ent
10_1101-2021_01_06_425569  txt/../pos/10_1101-2021_01_06_425569.pos
=== file2bib.sh ===
         id: 10_1101-2021_01_06_425569
     author: Chen, Li
      title: Metabolite discovery through global annotation of untargeted metabolomics data
       date: 2021
      pages: 31
  extension: .pdf
        txt: ./txt/10_1101-2021_01_06_425569.txt
      cache: ./cache/10_1101-2021_01_06_425569.pdf

Appligent	AppendPDF Pro 5.5 Linux Kernel 2.6 64bit Oct  2 2014 Library 10.1.0
Author	Li Chen
Content-Type	application/pdf
Creation-Date	2021-01-06T15:21:03Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	2082
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-06T15:21:03Z
creator	Li Chen
date	2021-01-10T13:53:40Z
dc:creator	Li Chen
dc:format	application/pdf; version=1.7
dc:title	65426741
dcterms:created	2021-01-06T15:21:03Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	Li Chen
meta:creation-date	2021-01-06T15:21:03Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.7
pdf:charsPerPage	['2460', '3458', '3307', '3490', '3574', '3281', '3462', '557', '3319', '3611', '2974', '3309', '3055', '2272', '2755', '2191', '2509', '2022', '2581', '2742', '3352', '1113', '1761', '1851', '1816', '1866', '1570', '1229', '1227', '1006', '448']
pdf:docinfo:created	2021-01-06T15:21:03Z
pdf:docinfo:creator	Li Chen
pdf:docinfo:creator_tool	Appligent AppendPDF Pro 5.5
pdf:docinfo:custom:Appligent	AppendPDF Pro 5.5 Linux Kernel 2.6 64bit Oct  2 2014 Library 10.1.0
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Microsoft: Print To PDF
pdf:docinfo:title	65426741
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '74', '13', '56', '145', '128', '245', '113', '152', '102', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Microsoft: Print To PDF
resourceName	b'10_1101-2021_01_06_425569.pdf'
title	65426741
xmp:CreatorTool	Appligent AppendPDF Pro 5.5
xmpMM:DocumentID	uuid:9035ca28-b03e-11b2-0a00-782dad000000
xmpTPg:NPages	31
=== file2bib.sh ===
         id: 10_1101-2020_08_28_271981
     author: He, Jiahua
      title: Full-length <em>de novo</em> protein structure determination from cryo-EM maps using deep learning
       date: 2021
      pages: 26
  extension: .pdf
        txt: ./txt/10_1101-2020_08_28_271981.txt
      cache: ./cache/10_1101-2020_08_28_271981.pdf

Author	
Content-Type	application/pdf
Creation-Date	2021-01-06T00:42:03Z
Keywords	
Last-Modified	2021-01-10T13:53:41Z
Last-Save-Date	2021-01-10T13:53:41Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	1227
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
cp:subject	
created	2021-01-06T00:42:03Z
creator	
date	2021-01-10T13:53:41Z
dc:creator	
dc:format	application/pdf; version=1.7
dc:subject	
dc:title	Full-length de novo protein structure determination from cryo-EM maps using deep learning
dcterms:created	2021-01-06T00:42:03Z
dcterms:modified	2021-01-10T13:53:41Z
meta:author	
meta:creation-date	2021-01-06T00:42:03Z
meta:keyword	
meta:save-date	2021-01-10T13:53:41Z
modified	2021-01-10T13:53:41Z
pdf:PDFVersion	1.7
pdf:charsPerPage	['1943', '2468', '2238', '2509', '2310', '2351', '2176', '2351', '2170', '2402', '2310', '2469', '2660', '2374', '901', '2797', '2994', '3097', '1592', '1650', '893', '738', '968', '1010', '1120', '1087']
pdf:docinfo:created	2021-01-06T00:42:03Z
pdf:docinfo:creator	
pdf:docinfo:creator_tool	LaTeX with hyperref
pdf:docinfo:keywords	
pdf:docinfo:modified	2021-01-10T13:53:41Z
pdf:docinfo:producer	dvips + GPL Ghostscript 9.26
pdf:docinfo:subject	
pdf:docinfo:title	Full-length de novo protein structure determination from cryo-EM maps using deep learning
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '12', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	dvips + GPL Ghostscript 9.26
resourceName	b'10_1101-2020_08_28_271981.pdf'
subject	
title	Full-length de novo protein structure determination from cryo-EM maps using deep learning
xmp:CreatorTool	LaTeX with hyperref
xmpMM:DocumentID	uuid:7e5f0079-5213-11eb-0000-c489bb077109
xmpTPg:NPages	26
10_1101-2020_09_09_289074  txt/../ent/10_1101-2020_09_09_289074.ent
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2ent.sh: fork: retry: No child processes
10_1101-2021_01_04_425335  txt/../ent/10_1101-2021_01_04_425335.ent
=== file2bib.sh ===
         id: 10_1101-2021_01_06_425560
     author: Lengyel, Attila
      title: Review and performance evaluation of trait-based between-community dissimilarity measures
       date: 2021
      pages: 50
  extension: .pdf
        txt: ./txt/10_1101-2021_01_06_425560.txt
      cache: ./cache/10_1101-2021_01_06_425560.pdf

Author	Administrator
Content-Type	application/pdf
Creation-Date	2021-01-06T14:24:23Z
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	926
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-06T14:24:23Z
creator	Administrator
date	2021-01-10T13:53:40Z
dc:creator	Administrator
dc:format	application/pdf; version=1.4
dc:title	Review and performance evaluation of trait-based between-community dissimilarity measures
dcterms:created	2021-01-06T14:24:23Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	Administrator
meta:creation-date	2021-01-06T14:24:23Z
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['631', '2163', '1813', '2515', '2232', '2246', '1574', '2045', '2545', '2302', '2239', '1483', '2390', '2358', '2253', '2429', '1998', '2237', '1992', '2450', '2589', '2584', '2577', '2239', '2492', '2317', '2355', '2544', '2531', '2591', '2481', '1781', '1920', '1945', '1906', '1890', '1942', '1982', '2039', '1876', '1958', '1769', '776', '1101', '711', '1620', '418', '1651', '706', '748']
pdf:docinfo:created	2021-01-06T14:24:23Z
pdf:docinfo:creator	Administrator
pdf:docinfo:creator_tool	PScript5.dll Version 5.2.2
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	Acrobat Distiller 8.1.0 (Windows)
pdf:docinfo:title	Review and performance evaluation of trait-based between-community dissimilarity measures
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '54', '185', '63', '0', '0', '0', '97', '150', '0', '0', '0', '129', '27', '42', '46', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '4', '5', '3', '2', '0', '1', '2', '0', '0', '1', '213', '0', '0', '0', '0', '0', '0', '0']
producer	Acrobat Distiller 8.1.0 (Windows)
resourceName	b'10_1101-2021_01_06_425560.pdf'
title	Review and performance evaluation of trait-based between-community dissimilarity measures
xmp:CreatorTool	PScript5.dll Version 5.2.2
xmpMM:DocumentID	uuid:7ce5b446-1dd2-11b2-0a00-2c09271d5700
xmpTPg:NPages	50
10_1101-2021_01_06_425546  txt/../wrd/10_1101-2021_01_06_425546.wrd
10_1101-2021_01_05_425414  txt/../pos/10_1101-2021_01_05_425414.pos
10_1101-2021_01_02_425006  txt/../ent/10_1101-2021_01_02_425006.ent
10_1101-2021_01_05_425384  txt/../pos/10_1101-2021_01_05_425384.pos
10_1101-2021_01_04_425250  txt/../ent/10_1101-2021_01_04_425250.ent
10_1101-2020_08_28_271981  txt/../ent/10_1101-2020_08_28_271981.ent
10_1101-2021_01_05_425417  txt/../ent/10_1101-2021_01_05_425417.ent
=== file2bib.sh ===
         id: 10_1101-436634
     author: Cotto, Kelsy C.
      title: RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer
       date: 2021
      pages: 48
  extension: .pdf
        txt: ./txt/10_1101-436634.txt
      cache: ./cache/10_1101-436634.pdf

Author	Administrator
Content-Type	application/pdf
Creation-Date	2021-01-05T20:13:51Z
Last-Modified	2021-01-10T13:53:41Z
Last-Save-Date	2021-01-10T13:53:41Z
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	460
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
created	2021-01-05T20:13:51Z
creator	Administrator
date	2021-01-10T13:53:41Z
dc:creator	Administrator
dc:format	application/pdf; version=1.4
dc:title	RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer
dcterms:created	2021-01-05T20:13:51Z
dcterms:modified	2021-01-10T13:53:41Z
meta:author	Administrator
meta:creation-date	2021-01-05T20:13:51Z
meta:save-date	2021-01-10T13:53:41Z
modified	2021-01-10T13:53:41Z
pdf:PDFVersion	1.4
pdf:charsPerPage	['2833', '4130', '3829', '3602', '4185', '4041', '3947', '4165', '3764', '3267', '3607', '3731', '3257', '3577', '3167', '3487', '3501', '2297', '2181', '2205', '2143', '1060', '1918', '1786', '337', '921', '828', '722', '1106', '822', '561', '335', '1110', '1283', '690', '653', '912', '885', '1351', '1363', '1135', '1175', '1281', '928', '819', '796', '335', '1000']
pdf:docinfo:created	2021-01-05T20:13:51Z
pdf:docinfo:creator	Administrator
pdf:docinfo:creator_tool	PScript5.dll Version 5.2.2
pdf:docinfo:modified	2021-01-10T13:53:41Z
pdf:docinfo:producer	Acrobat Distiller 8.1.0 (Windows)
pdf:docinfo:title	RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0']
producer	Acrobat Distiller 8.1.0 (Windows)
resourceName	b'10_1101-436634.pdf'
title	RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer
xmp:CreatorTool	PScript5.dll Version 5.2.2
xmpMM:DocumentID	uuid:7ce6c54e-1dd2-11b2-0a00-a10827fd5800
xmpTPg:NPages	48
=== file2bib.sh ===
         id: 10_1101-2020_01_29_925354
     author: Bartoszewicz, Jakub M.
      title: Interpretable detection of novel human viruses from genome sequencing data
       date: 2021
      pages: 14
  extension: .pdf
        txt: ./txt/10_1101-2020_01_29_925354.txt
      cache: ./cache/10_1101-2020_01_29_925354.pdf

Author	
Content-Type	application/pdf
Creation-Date	2020-12-11T18:12:32Z
Keywords	
Last-Modified	2021-01-10T13:53:40Z
Last-Save-Date	2021-01-10T13:53:40Z
PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.21 (TeX Live 2020) kpathsea version 6.3.2
X-Parsed-By	['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser']
X-TIKA:content_handler	ToTextContentHandler
X-TIKA:embedded_depth	0
X-TIKA:parse_time_millis	1695
access_permission:assemble_document	true
access_permission:can_modify	true
access_permission:can_print	true
access_permission:can_print_degraded	true
access_permission:extract_content	true
access_permission:extract_for_accessibility	true
access_permission:fill_in_form	true
access_permission:modify_annotations	true
cp:subject	
created	2020-12-11T18:12:32Z
creator	
date	2021-01-10T13:53:40Z
dc:creator	
dc:format	application/pdf; version=1.5
dc:subject	
dc:title	Interpretable detection of novel human viruses from genome sequencing data
dcterms:created	2020-12-11T18:12:32Z
dcterms:modified	2021-01-10T13:53:40Z
meta:author	
meta:creation-date	2020-12-11T18:12:32Z
meta:keyword	
meta:save-date	2021-01-10T13:53:40Z
modified	2021-01-10T13:53:40Z
pdf:PDFVersion	1.5
pdf:charsPerPage	['4623', '6065', '5996', '6060', '5314', '5390', '5311', '6292', '4380', '2648', '3121', '6311', '8125', '5094']
pdf:docinfo:created	2020-12-11T18:12:32Z
pdf:docinfo:creator	
pdf:docinfo:creator_tool	LaTeX with hyperref
pdf:docinfo:custom:PTEX.Fullbanner	This is pdfTeX, Version 3.14159265-2.6-1.40.21 (TeX Live 2020) kpathsea version 6.3.2
pdf:docinfo:keywords	
pdf:docinfo:modified	2021-01-10T13:53:40Z
pdf:docinfo:producer	pdfTeX-1.40.21
pdf:docinfo:subject	
pdf:docinfo:title	Interpretable detection of novel human viruses from genome sequencing data
pdf:docinfo:trapped	False
pdf:encrypted	false
pdf:hasMarkedContent	false
pdf:hasXFA	false
pdf:hasXMP	true
pdf:unmappedUnicodeCharsPerPage	['8', '8', '8', '8', '10', '9', '8', '8', '8', '8', '8', '8', '8', '8']
producer	pdfTeX-1.40.21
resourceName	b'10_1101-2020_01_29_925354.pdf'
subject	
title	Interpretable detection of novel human viruses from genome sequencing data
trapped	False
xmp:CreatorTool	LaTeX with hyperref
xmpMM:DocumentID	uuid:7ce5e8c1-1dd2-11b2-0a00-cd0927edca00
xmpTPg:NPages	14
10_1101-2021_01_04_425288  txt/../ent/10_1101-2021_01_04_425288.ent
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
=== file2bib.sh ===
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint Try decreasing the value of OMP_NUM_THREADS.
/data-disk/reader-compute/reader-classic/bin/file2bib.sh: line 39: 37737 Aborted                 $FILE2BIB "$FILE" > "$OUTPUT"
10_1101-2020_08_13_249839  txt/../ent/10_1101-2020_08_13_249839.ent
10_1101-2021_01_04_425315  txt/../pos/10_1101-2021_01_04_425315.pos
10_1101-2020_11_13_381475  txt/../pos/10_1101-2020_11_13_381475.pos
10_1101-2021_01_08_425379  txt/../pos/10_1101-2021_01_08_425379.pos
10_1101-2021_01_08_425855  txt/../wrd/10_1101-2021_01_08_425855.wrd
10_1101-2021_01_08_425885  txt/../pos/10_1101-2021_01_08_425885.pos
10_1101-2021_01_07_425716  txt/../pos/10_1101-2021_01_07_425716.pos
10_1101-2021_01_05_425409  txt/../pos/10_1101-2021_01_05_425409.pos
10_1101-2021_01_08_425885  txt/../ent/10_1101-2021_01_08_425885.ent
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
10_1101-332965  txt/../pos/10_1101-332965.pos
10_1101-2021_01_06_425544  txt/../wrd/10_1101-2021_01_06_425544.wrd
10_1101-2020_10_26_351783  txt/../wrd/10_1101-2020_10_26_351783.wrd
10_1101-2020_05_22_110247  txt/../pos/10_1101-2020_05_22_110247.pos
10_1101-2020_12_14_422697  txt/../ent/10_1101-2020_12_14_422697.ent
10_1101-2021_01_05_425266  txt/../wrd/10_1101-2021_01_05_425266.wrd
10_1101-2021_01_04_425285  txt/../ent/10_1101-2021_01_04_425285.ent
10_1101-2020_01_29_925354  txt/../ent/10_1101-2020_01_29_925354.ent
10_1101-2021_01_02_425006  txt/../pos/10_1101-2021_01_02_425006.pos
10_1101-436634  txt/../ent/10_1101-436634.ent
10_1101-2021_01_06_425550  txt/../wrd/10_1101-2021_01_06_425550.wrd
10_1101-2021_01_06_425569  txt/../wrd/10_1101-2021_01_06_425569.wrd
10_1101-2020_12_24_424332  txt/../pos/10_1101-2020_12_24_424332.pos
10_1101-2021_01_08_425379  txt/../wrd/10_1101-2021_01_08_425379.wrd
10_1101-2021_01_04_425335  txt/../pos/10_1101-2021_01_04_425335.pos
10_1101-2020_12_26_424429  txt/../wrd/10_1101-2020_12_26_424429.wrd
10_1101-2021_01_08_425379  txt/../ent/10_1101-2021_01_08_425379.ent
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2020_12_14_422697  txt/../pos/10_1101-2020_12_14_422697.pos
10_1101-2021_01_05_425417  txt/../pos/10_1101-2021_01_05_425417.pos
10_1101-2020_08_13_249839  txt/../pos/10_1101-2020_08_13_249839.pos
10_1101-2021_01_05_425508  txt/../pos/10_1101-2021_01_05_425508.pos
10_1101-2020_09_09_289074  txt/../pos/10_1101-2020_09_09_289074.pos
10_1101-2020_01_29_925354  txt/../pos/10_1101-2020_01_29_925354.pos
10_1101-2021_01_07_425697  txt/../wrd/10_1101-2021_01_07_425697.wrd
parallel: Warning: No more processes: Decreasing number of running jobs to 7.
parallel: Warning: Raising ulimit -u or /etc/security/limits.conf may help.
10_1101-2021_01_04_425250  txt/../pos/10_1101-2021_01_04_425250.pos
10_1101-2021_01_04_425288  txt/../pos/10_1101-2021_01_04_425288.pos
10_1101-2021_01_08_425885  txt/../wrd/10_1101-2021_01_08_425885.wrd
10_1101-2021_01_04_425285  txt/../pos/10_1101-2021_01_04_425285.pos
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2021_01_05_425414  txt/../wrd/10_1101-2021_01_05_425414.wrd
10_1101-2020_08_28_271981  txt/../pos/10_1101-2020_08_28_271981.pos
10_1101-2021_01_05_425384  txt/../wrd/10_1101-2021_01_05_425384.wrd
10_1101-2021_01_04_425315  txt/../wrd/10_1101-2021_01_04_425315.wrd
10_1101-2020_11_13_381475  txt/../wrd/10_1101-2020_11_13_381475.wrd
10_1101-436634  txt/../pos/10_1101-436634.pos
10_1101-2021_01_07_425716  txt/../wrd/10_1101-2021_01_07_425716.wrd
10_1101-2021_01_02_425006  txt/../wrd/10_1101-2021_01_02_425006.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2021_01_05_425409  txt/../wrd/10_1101-2021_01_05_425409.wrd
10_1101-2020_05_22_110247  txt/../wrd/10_1101-2020_05_22_110247.wrd
10_1101-332965  txt/../wrd/10_1101-332965.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2021_01_04_425335  txt/../wrd/10_1101-2021_01_04_425335.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2020_12_14_422697  txt/../wrd/10_1101-2020_12_14_422697.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2020_08_13_249839  txt/../wrd/10_1101-2020_08_13_249839.wrd
10_1101-2020_09_09_289074  txt/../wrd/10_1101-2020_09_09_289074.wrd
10_1101-2020_01_29_925354  txt/../wrd/10_1101-2020_01_29_925354.wrd
10_1101-2021_01_04_425250  txt/../wrd/10_1101-2021_01_04_425250.wrd
10_1101-2021_01_05_425417  txt/../wrd/10_1101-2021_01_05_425417.wrd
10_1101-2021_01_05_425508  txt/../wrd/10_1101-2021_01_05_425508.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2keywords.sh: fork: retry: No child processes
10_1101-2020_12_24_424332  txt/../wrd/10_1101-2020_12_24_424332.wrd
10_1101-2021_01_04_425285  txt/../wrd/10_1101-2021_01_04_425285.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
10_1101-2021_01_04_425288  txt/../wrd/10_1101-2021_01_04_425288.wrd
10_1101-2020_08_28_271981  txt/../wrd/10_1101-2020_08_28_271981.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
10_1101-436634  txt/../wrd/10_1101-436634.wrd
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: No child processes
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
/data-disk/reader-compute/reader-classic/bin/txt2urls.sh: fork: retry: Resource temporarily unavailable
Done mapping.
Reducing bioinformatics-from-biorxiv
=== reduce.pl bib ===
         id = 10_1101-2021_01_08_425887
     author = Hu, Yan
      title = Auto-CORPus: Automated and Consistent Outputs from Research Publications
       date = 2021
      pages = 10
  extension = .pdf
       mime = application/pdf
      words = 6886
  sentences = 553
     flesch = 53
    summary = the same structured model, so that these can be used as input to rule-based or deep learning algorithms for data extraction. example, at this point in this article the main headers are 'abstract' followed by 'introduction' and 'materials and methods' that could make up a digraph. We use this process to evaluate new potential synonyms for existing terms and identify abstract → introduction → materials → results → discussion → conclusion → acknowledgements → footnotes section → references. Based on the digraph, we then assigned data and data description to be synonyms of the materials section, and participants From the analysis of ego-networks four new potential categories were identified: disclosure, graphical abstract, highlights and participants. Newly identified synonyms for existing IAO terms (00006xx) from the digraph mapping of 2,441 publications. Newly identified synonyms for existing IAO terms (00006xx) from the digraph mapping of 2,441 publications.
      cache = ./cache/10_1101-2021_01_08_425887.pdf
       txt  = ./txt/10_1101-2021_01_08_425887.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_08_425855
     author = Wu, Canbiao
      title = DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites.
       date = 2021
      pages = 31
  extension = .pdf
       mime = application/pdf
      words = 7280
  sentences = 806
     flesch = 53
    summary = DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites. deep learning model DeepHBV to predict HBV integration sites by learning local learning model DeepHBV to predict HBV integration sites by learning local genomic DeepHBV effectively predicts HBV integration sites by adding genomic features. mixed HBV integration sequences, positive genome feature samples, and randomly peaks and DeepHBV with HBV integration sequences + TCGA Pan Cancer peaks) on model trained with HBV integrated sequences + TCGA Pan Cancer showed an performed better compared with DeepHBV model with HBV integration sequences + HBV integration sites + TCGA Pan Cancer, a cluster of attention weights much output of DeepHBV with HBV integration sites plus TCGA Pan Cancer showed the of DeepHBV with HBV integration sequences + TCGA Pan Cancer showed strong DeepHBV with HBV integration sequences + TCGA Pan Cancer model on (a) DeepHBV with HBV integration sequences + TCGA Pan Cancer model on (a) 
      cache = ./cache/10_1101-2021_01_08_425855.pdf
       txt  = ./txt/10_1101-2021_01_08_425855.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_08_425976
     author = Ahuja, Yuri
      title = Semi-supervised Calibration of Risk with Noisy Event Times (SCORNET) Using Electronic Health Record Data
       date = 2021
      pages = 
  extension = .pdf
       mime = application/xhtml+xml
      words = 72
  sentences = 16
     flesch = 15
    summary = bioRxiv.org the preprint server for Biology Skip to main content Home Submit ALERTS / RSS Search for this keyword Advanced Search Subject Areas All Articles Animal Behavior and Cognition Biochemistry Bioengineering Bioinformatics Biophysics Cancer Biology Cell Biology Clinical Trials Developmental Biology Ecology Epidemiology Evolutionary Biology Genetics Genomics Immunology Microbiology Molecular Biology Neuroscience Paleontology Pathology Pharmacology and Toxicology Physiology Plant Biology Scientific Communication and Education Synthetic Biology Systems Biology Zoology View by Month
      cache = ./cache/10_1101-2021_01_08_425976.pdf
       txt  = ./txt/10_1101-2021_01_08_425976.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_08_426008
     author = Pipes, Lenore
      title = AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees
       date = 2021
      pages = 7
  extension = .pdf
       mime = application/pdf
      words = 3682
  sentences = 382
     flesch = 63
    summary = AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees Despite the exponential increase in the size of sequence databases of homologous genes, few methods exist to cluster At low identities, these methods produce uneven clusters where the majority of sequences are are no clustering methods that can accurately cluster large taxonomically divergent metabarcoding reference databases such as databases (Schoch et al., 2020), there is a need for new computationally efficient methods that can cluster divergent sequences. To cluster divergent sequences, we developed AncestralClust clustering methods: UCLUST (Edgar, 2010), meshclust2 (James dataset against UCLUST because it is the most widely used clustering program and it performs better than CD-HIT on low identity We developed a phylogenetic-based clustering method, AncestralClust, specifically to cluster divergent metabarcode sequences. Comparisons of clustering methods using 13,043 COI sequences from 11 different species.
      cache = ./cache/10_1101-2021_01_08_426008.pdf
       txt  = ./txt/10_1101-2021_01_08_426008.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2020_04_17_043323
     author = Waschke, Johannes
      title = <em>linus</em>: Conveniently explore, share, and present large-scale biological trajectory data from a web browser
       date = 2021
      pages = 13
  extension = .pdf
       mime = application/pdf
      words = 6528
  sentences = 571
     flesch = 54
    summary = linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser linus: Conveniently explore, share, and present large-scale biological trajectory data In biology, we are often confronted with information-rich, large-scale trajectory data, but exploring and communicating We provide a python script that reads trajectory data and enriches them with additional features, such as edge bundling or custom axes and generates an interactive web-based visualisation that can be shared offline from diffusion MRI imaging (Liu et al., 2020), or tracking data such as cell trajectories or animal trails (Romero-Ferrero et visualisation tool linus, making it easier to explore 3D trajectory data from any device without a local installation of Creating a visualisation package with linus is done in a few simple steps (Fig. 1a): The user imports trajectory data from a Figure 1 Browser-based exploration and sharing of trajectory visualizations with linus.
      cache = ./cache/10_1101-2020_04_17_043323.pdf
       txt  = ./txt/10_1101-2020_04_17_043323.txt
=== reduce.pl bib ===
         id = 10_1101-2021_01_06_425560
     author = Lengyel, Attila
      title = Review and performance evaluation of trait-based between-community dissimilarity measures
       date = 2021
      pages = 50
  extension = .pdf
       mime = application/pdf
      words = 15548
  sentences = 2320
     flesch = 67
    summary = Review and performance evaluation of trait-based between-community dissimilarity measures 2 Review and performance evaluation of trait-based between-community dissimilarity measures 2 2. In this paper we reviewed the trait-based dissimilarity indices available in the 16 dissimilarities calculated by different indices correlate with environmental distances. beta diversity, dissimilarity index, distance metric, community ecology, functional traits 39 including several families of trait-based dissimilarity indices. FDissim indices incorporate trait information into the calculation of dissimilarity in different 162 Indices following this approach represent each community with a typical trait value, and 185 2005) or trait-based dissimilarity of species (Lepš 220 of the similarity indices for presence/absence data disregarding species properties, while the 281 ordinariness values in the species-based (dis-)similarity indices. Ricotta & Pavoine (2015) introduced a new family of trait-based similarity measures called 331 For species-based analyses, Ricotta & Podani (2017) suggested a general formula of distance 336 compared how strongly the dissimilarity indices correlate with the environmental distance 515 
      cache = ./cache/10_1101-2021_01_06_425560.pdf
       txt  = ./txt/10_1101-2021_01_06_425560.txt
=== reduce.pl bib ===
         id = 10_1101-2021_01_06_425494
     author = Baldwin, Quenisha
      title = The topological free energy of proteins
       date = 2021
      pages = 
  extension = .pdf
       mime = application/xhtml+xml
      words = 72
  sentences = 16
     flesch = 15
    summary = bioRxiv.org the preprint server for Biology Skip to main content Home Submit ALERTS / RSS Search for this keyword Advanced Search Subject Areas All Articles Animal Behavior and Cognition Biochemistry Bioengineering Bioinformatics Biophysics Cancer Biology Cell Biology Clinical Trials Developmental Biology Ecology Epidemiology Evolutionary Biology Genetics Genomics Immunology Microbiology Molecular Biology Neuroscience Paleontology Pathology Pharmacology and Toxicology Physiology Plant Biology Scientific Communication and Education Synthetic Biology Systems Biology Zoology View by Month
      cache = ./cache/10_1101-2021_01_06_425494.pdf
       txt  = ./txt/10_1101-2021_01_06_425494.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_07_425637
     author = Arneson, Adriana
      title = A mammalian methylation array for profiling methylation levels at conserved sequences
       date = 2021
      pages = 39
  extension = .pdf
       mime = application/pdf
      words = 12153
  sentences = 1117
     flesch = 58
    summary = characterize the CpGs on the mammalian methylation array with various genomic annotations. Array probes are sequences of length 50bp flanking a target CpG based on the human reference We added probes targeting 1986 CpGs to the mammalian methylation array based on All 37488 CpGs profiled on the mammalian methylation array apply to humans, but only a CpGs on the mammalian array cover 6871 human and 5659 mouse genes when each DNA methylation samples for three species: human (n=10 arrays), mouse (n=20), and rat (n=15), synthetic DNA data from 3 species: human (n=10 mammalian arrays), mouse (n=20), and rat CpG and gene coverage of probes on the mammalian methylation array across CpG island and chromatin state analysis of mammalian methylation probes. probes targeting the same CpG that can also be found on the human EPIC array that were not mammalian methylation array to the human (hg19) and mouse (mm10) genome using QUASR 
      cache = ./cache/10_1101-2021_01_07_425637.pdf
       txt  = ./txt/10_1101-2021_01_07_425637.txt
=== reduce.pl bib ===
         id = 10_1101-2021_01_07_425801
     author = Hou, Yapeng
      title = Fibrinolysis influences SARS-CoV-2 infection in ciliated cells
       date = 2021
      pages = 18
  extension = .pdf
       mime = application/pdf
      words = 6841
  sentences = 912
     flesch = 71
    summary = proteases, may cleave the furin site of SARS-CoV-2 S protein and  subunits of epithelial sodium channels ( 15 TMPRSS2, and ACE2 were significantly upregulated in severe COVID-19 patients and SARS-CoV-2 infected 22 Plasmin cleaves the furin site in SARS-CoV S protein (Kam et al. lung epithelial cells and whether SARS-CoV-2 infection alters their expression at the single-cell level. severe/moderate COVID-19 patients and SARS-CoV-2 infected cell lines, mainly owning to ciliated cells. The expression levels of proteases (PLAU, FURIN, TMPRSS2, PLG), ACE2, and SCNN1G in 11 cell 88 Expression levels of PLAU, SCNN1G, and ACE2 in SARS-CoV-2 infection 93 epithelial cell lines infected with SARS-CoV-2: A549, Calu-3, and NHBE (Blanco-Melo et al. CoV-2 infection also increased the expression level of ACE2 in A549 cells (P < 0.05) (Smith et al. Our data showed that the respiratory cells co-express SARS-CoV-2 receptor, ENaC 137 Changes of proteases, ACE2, and SCNN1G in respiratory cell lines after SARS-CoV-2 
      cache = ./cache/10_1101-2021_01_07_425801.pdf
       txt  = ./txt/10_1101-2021_01_07_425801.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_07_425794
     author = Chisanga, David
      title = Impact of gene annotation choice on the quantification of RNA-seq data
       date = 2021
      pages = 24
  extension = .pdf
       mime = application/pdf
      words = 9715
  sentences = 1233
     flesch = 66
    summary = Despite the importance of gene annotations in RNA-seq data analysis, very little research has been conducted to examine how differences in annotations impact on gene compared the effect of human genome annotations from popular databases including Ensembl, GENCODE and RefSeq on various aspects of RNA-seq analysis and they showed gene-level expression quantification in an RNA-seq data analysis pipeline. The Ensembl, RefSeq-NCBI and RefSeqRsubread annotations were provided to featureCounts to generate read counts for genes Gene expression data generated using TaqMan RT-PCR and Illumina's BeadChip microarray were used to validate the gene-level quantification results from the RNA-seq The Ensembl and NCBI RefSeq annotations are among the most widely used gene annotations that have been utilized for RNA-seq gene expression quantification in the field. led to a better concordance in gene expression between the RNA-seq data and the RTPCR data, compared to the use of Ensembl and RefSeq-NCBI annotations.
      cache = ./cache/10_1101-2021_01_07_425794.pdf
       txt  = ./txt/10_1101-2021_01_07_425794.txt
=== reduce.pl bib ===
         id = 10_1101-2021_01_06_425569
     author = Chen, Li
      title = Metabolite discovery through global annotation of untargeted metabolomics data
       date = 2021
      pages = 31
  extension = .pdf
       mime = application/pdf
      words = 11555
  sentences = 1084
     flesch = 59
    summary = to yeast and mouse data, we identify a half-dozen novel metabolites, including thiamine and taurine Peak annotation occurs in a single global optimization step, based on linear programming, connected nodes matches the atom mass difference and (ii) only co-eluting peaks are connected by edges receive a positive score for MS2 spectra similarity match between the connected nodes, and With a score assigned for each potential node and edge annotation, we formulate the global network A final edge annotation score S( 𝑢, 𝑣, 𝑎 , 𝑏 , 𝐷 ) for choosing candidate formula 𝑎 for node u, A final edge annotation score S( 𝑢, 𝑣, 𝑎 , 𝑏 , 𝐷 ) for choosing candidate formula 𝑎 for node u, A global network optimization approach for untargeted metabolomics data annotation NetID applies global optimization for metabolomics data annotation and metabolite A global network optimization approach for untargeted metabolomics data annotation (NetID).
      cache = ./cache/10_1101-2021_01_06_425569.pdf
       txt  = ./txt/10_1101-2021_01_06_425569.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_06_425550
     author = Chen, Nae-Chyun
      title = Improving variant calling using population data and deep learning
       date = 2021
      pages = 17
  extension = .pdf
       mime = application/pdf
      words = 6912
  sentences = 754
     flesch = 55
    summary = We further evaluated the performance of the models using two whole-exome sequencing (WES) datasets from a recently released set of genome and exome data [23] (Figure 2). Among the populationresolved false-positive errors, more than two third (71.0%) are uncommon (allele frequency ≤ 5%) among the 1000Genomes samples, whereas there are only 11.4% uncommon variants for population-induced false positives. This observation supports the hypothesis that the population-aware model uses allele frequency to adjust its variant calls. A potential concern for population-aware variant calling models is increasing false negative rate for novel alleles. To better understand the zero-frequency variants, we called variants using the DeepVariant PacBio model with the PrecisionFDA v2 35x HG003 reads set sequenced with the We evaluate potential biases introduced by population information in variant calling by comparing population-aware models that use allele frequencies from different Despite greater overall accuracy, we note that the population-aware model underperforms on variants with zero allele frequencies in 1000Genomes.
      cache = ./cache/10_1101-2021_01_06_425550.pdf
       txt  = ./txt/10_1101-2021_01_06_425550.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2020_12_26_424429
     author = Nayar, Gowri
      title = Analysis and Forecasting of Global RT-PCR Primers for SARS-CoV-2
       date = 2021
      pages = 13
  extension = .pdf
       mime = application/pdf
      words = 4921
  sentences = 407
     flesch = 61
    summary = SARS-CoV-2 primers in use today by measuring the number of mismatches between primer sequence and genome targets with respect to the sequenced SARS-CoV-2 genomes, we can measure how the targeted proteins are mutating. primer sequences and protocols developed for six different regions – USA, Germany, China, Hong Kong, Japan, and Thailand – percent of genomes hit by each PCR test, labelled by the country and target gene region. Figure 6 shows the average number of mismatches over time, grouped by the genomes sampled The results of this study also demonstrate that each primer target develops a different number of mismatches over time The mutations that lead to mismatches between gene PCR primers and their targets reflect the sequence evolution of the consistent with the observed increasing number of mismatches over time, and shows that evolution of SARS-CoV-2 genomes is of the RdRp gene, primer target creates a disproportionate number of mismatches when compared to genomes sequenced within
      cache = ./cache/10_1101-2020_12_26_424429.pdf
       txt  = ./txt/10_1101-2020_12_26_424429.txt
=== reduce.pl bib ===
         id = 10_1101-2021_01_05_425384
     author = Halilaj, Iva
      title = Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
       date = 2021
      pages = 17
  extension = .pdf
       mime = application/pdf
      words = 5110
  sentences = 1327
     flesch = -15
    summary = of COVID-19 patients, expediting the models' transition from research to clinical practice. The open source website https://covid19risk.ai/ currently incorporates nine models from six an inclusive platform for predictive models related to COVID-19. supplement their judgment with patient-specific predictions from externally-validated models Keywords: Covid-19, predictive models, diagnosis, prognosis, nomogram, machine We, as researchers working on COVID-19 models, saw an urgent need for a web-based Our aim for this platform is to include validated prediction models (TRIPOD type 2b and 3) published AI prediction models related to all aspects of COVID-19, including diagnosis, COVID-19 predictive models will serve as a decision aid for doctors. Decision Support System for Severity Risk Prediction and Triage of COVID-19 Patients Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19 Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
      cache = ./cache/10_1101-2021_01_05_425384.pdf
       txt  = ./txt/10_1101-2021_01_05_425384.txt
=== reduce.pl bib ===
         id = 10_1101-2021_01_07_425697
     author = Luo, Yin
      title = Capsule network for protein ubiquitination site prediction
       date = 2021
      pages = 14
  extension = .pdf
       mime = application/pdf
      words = 5391
  sentences = 552
     flesch = 59
    summary = Capsule network for protein ubiquitination site prediction Capsule network for protein ubiquitination site years, some calculation methods have been developed to predict potential ubiquitination sites. this paper, a deep learning model, "Caps-Ubi," is proposed that uses a capsule network for protein network layer are used as a feature extractor to obtain the functional domains in the protein Data of protein ubiquitination sites the amino acid sequence around the protein ubiquitination site; namely, one-of-K encoding and the performance of various window sizes in one-of-21 and amino acid continuous encoding modes. modes is the best on the capsule network: this proposed Caps-Ubi model achieved an accuracy, In this paper, a new deep learning model for predicting protein ubiquitination sites is proposed, ubiquitination sites in proteins. Large-scale prediction of protein ubiquitination sites machine learning method with substrate motifs to predict ubiquitin-conjugation site on machine learning method with substrate motifs to predict ubiquitin-conjugation site on 
      cache = ./cache/10_1101-2021_01_07_425697.pdf
       txt  = ./txt/10_1101-2021_01_07_425697.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_04_425315
     author = Chen, Lulu
      title = Sample-wise unsupervised deconvolution of complex tissues
       date = 2021
      pages = 29
  extension = .pdf
       mime = application/pdf
      words = 7333
  sentences = 1028
     flesch = 66
    summary = tool enables statistically-principled subtype-level downstream analyses, such as detecting subtypespecific differentially expressed genes (sDEG) and differential dependency networks (DDN) nuclear-norm regularized low-rank matrix factorization problem (Wang, Hoffman et al. regularization to optimize the estimation of between-sample variations in each subtype to recover sample-specific deconvolution and optimization solver used in swCAM algorithm, followed by Sample-specific deconvolution problem formulation and the assumption of hidden low-rank pattern CAM-estimated subtype-specific expression matrix serves as the initial reference 𝑺. The objective function of swCAM for sample-specific deconvolution problem and its reformulation As swCAM focuses on subtype-specific variation estimation, simulating biological variance The observations for 300 genes in 50 samples were simulated with subtype-specific expression Gene co-expressed function modules detected by WGCNA on swCAM estimated sample-specific Gene co-expressed function modules detected by WGCNA on swCAM estimated samplespecific expression for each subtype with λ=5 and δ=1 or 0.1. capacity of swCAM to estimate sample-specific signals in each subtype using simulations where 
      cache = ./cache/10_1101-2021_01_04_425315.pdf
       txt  = ./txt/10_1101-2021_01_04_425315.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-332965
     author = Banerjee, Arpita
      title = Identification and design of vinyl sulfone inhibitors against Cryptopain-1 – a cysteine protease from cryptosporidiosis-causing <em>Cryptosporidium parvum</em>
       date = 2021
      pages = 30
  extension = .pdf
       mime = application/pdf
      words = 9346
  sentences = 1457
     flesch = 72
    summary = protease well by making favorable interactions with important residues of the enzyme. Keywords: Vinyl sulfone inhibitors, Cryptopain-1, Cysteine protease, Molecular residues that were contacted by ligand subgroups across the enzymatic cleft, in one or The ligand subgroup-contacting residues in each complex had been mutated to Alanine; favorably interacting subsite residues (derived from Supplementary Table 1) in the Interactions: enzyme subsite residues ligand subgroups with the ligand ring systems showed highly positive ddGbind values for thiophen group in always, showed favorable interactions even when no ligand group was placed near it. with no ligand group placed near the residue, the interactions were unfavorable. around the ligand subgroups of the best-scored vinyl sulfones compounds (PubChem IDs Figure1: Illustration of the typical binding of vinyl sulfone inhibitors to cysteine protease enzymes. Figure 3: All the residues that are contacted by one or more ligands in the docked complexes of 
      cache = ./cache/10_1101-332965.pdf
       txt  = ./txt/10_1101-332965.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_04_425250
     author = Thibodeau, Asa
      title = A read count-based method to detect multiplets and their cellular origins from snATAC-seq data
       date = 2021
      pages = 32
  extension = .pdf
       mime = application/pdf
      words = 9658
  sentences = 910
     flesch = 59
    summary = A read count-based method to detect multiplets and their cellular origins from snATAC-seq data Similar to other droplet-based single cell assays, single nucleus ATAC-seq (snATAC-seq) data harbor multiplets 17 found that when snATAC-seq samples were adequately sequenced (e.g., >20k valid read pairs per cell), ATAC-52 ATAC-DoubletDetector detected heterotypic multiplets introduced in PBMC samples with high recall 126 detected multiplets were homotypic (76.7-84.3% in islets, 63-78.7% in PBMCs), with cell types being distributed 207 with respect to their cell proportions for both homotypic and heterotypic multiplet types (Fig. 5d-e, Extended Data 208 ATAC-DoubletDetector for identifying multiplets from snATAC-seq data with enough reads per nuclei, it can also 239 Fig. 2: ATAC-DoubletDetector identifies heterotypic and homotypic multiplets in human PBMC snATAC-seq data. e, The number of cells and percentage of multiplets detected by ATAC-DoubletDetector in PBMC and islet samples. Extended Data Fig. 6: ATAC-DoubletDetector detects both homotypic and heterotypic multiplets at high read depth.
      cache = ./cache/10_1101-2021_01_04_425250.pdf
       txt  = ./txt/10_1101-2021_01_04_425250.txt
=== reduce.pl bib ===
         id = 10_1101-2020_01_29_925354
     author = Bartoszewicz, Jakub M.
      title = Interpretable detection of novel human viruses from genome sequencing data
       date = 2021
      pages = 14
  extension = .pdf
       mime = application/pdf
      words = 13578
  sentences = 1392
     flesch = 61
    summary = As the state-of-the art approach for the openview detection of pathogens is genome sequencing (5, 6), it learning (17) to predict host range for a small set of three wellstudied species directly from viral sequences. predicting whether a new virus can potentially infect humans. boundary separates human viruses from other DNA sequences generated the reads from the genomes of human-infecting constituting reads yields a prediction for the whole sequence. predictions from all the reads originating from a given genome In the Fig. 1 we present example filters, visualized as "maxcontrib" sequence logos based on mean partial Shapley values prediction directly from next-generation sequencing reads Three receptor-binding domains (RBDs) are colored in blue, white and red according to the predicted infectious potential of the corresponding genomic sequence. Interpretable detection of novel human viruses from genome sequencing data Interpretable detection of novel human viruses from genome sequencing data
      cache = ./cache/10_1101-2020_01_29_925354.pdf
       txt  = ./txt/10_1101-2020_01_29_925354.txt
=== reduce.pl bib ===
         id = 10_1101-2020_09_09_289074
     author = Ortuso, Francesco
      title = Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex
       date = 2021
      pages = 23
  extension = .pdf
       mime = application/pdf
      words = 7878
  sentences = 977
     flesch = 67
    summary = Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex SARS-CoV-2, COVID-19, mutations, Spike, ACE2 SARS-CoV-2 entry in human cells is mediated by the interaction between the viral Spike protein and protein variants in the SARS-CoV-2 population as the result of mutations, and it is unclear if these SARS-CoV-2 (the COVID-19 virus) and human cells, through the analysis of Spike/ACE2 complexes. future mutations targeting the ACE2/Spike binding and detected by sequencing SARS-CoV-2 on a We obtained structural models of the SARS-CoV-2 Spike interacting with the human ACE2 from three contributing to the interaction between Spike and ACE2, according to GBPM (see Table 1 and Fig 3 for A less frequent mutation amongst those predicted to contribute to the ACE2/Spike interaction is population and non-zero GBPM average score in the ACE2/Spike interaction models. ACE2 variants with non-zero GBPM score in the Spike interaction model.
      cache = ./cache/10_1101-2020_09_09_289074.pdf
       txt  = ./txt/10_1101-2020_09_09_289074.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2021_01_06_425546
     author = Osthus, Dave
      title = Fast and Accurate Influenza Forecasting in the United States with Inferno
       date = 2021
      pages = 
  extension = .pdf
       mime = application/xhtml+xml
      words = 72
  sentences = 16
     flesch = 15
    summary = bioRxiv.org the preprint server for Biology Skip to main content Home Submit ALERTS / RSS Search for this keyword Advanced Search Subject Areas All Articles Animal Behavior and Cognition Biochemistry Bioengineering Bioinformatics Biophysics Cancer Biology Cell Biology Clinical Trials Developmental Biology Ecology Epidemiology Evolutionary Biology Genetics Genomics Immunology Microbiology Molecular Biology Neuroscience Paleontology Pathology Pharmacology and Toxicology Physiology Plant Biology Scientific Communication and Education Synthetic Biology Systems Biology Zoology View by Month
      cache = ./cache/10_1101-2021_01_06_425546.pdf
       txt  = ./txt/10_1101-2021_01_06_425546.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
         id = 10_1101-2020_08_28_271981
     author = He, Jiahua
      title = Full-length <em>de novo</em> protein structure determination from cryo-EM maps using deep learning
       date = 2021
      pages = 26
  extension = .pdf
       mime = application/pdf
      words = 10020
  sentences = 1169
     flesch = 72
    summary = Full-length de novo protein structure determination from cryo-EM maps using deep learning structure types were predicted by a second DenseNet. Finally, the protein sequence was aligned to the main-chain according to the predicted Cα probabilities, amino acid types, and secondary structure amino acid type, and secondary structure type for each main-chain point, the target protein sequence The second network (i.e. DenseNet B) is used to predict the amino acid type and secondary structure type of a main-chain local dense point (LDP). Figure 3 shows a comparison of the predicted Cα models for the protein chains of different lengths The authors acknowledge professor Daisuke Kihara and his students Genki Terashi and Sai Raghavendra Maddhuri Venkata Subramaniya from Purdue University for providing their datasets. A New Protocol for Atomic-Level Protein Structure Modeling and Refinement Using Low-to-Medium Resolution Cryo-EM Density Maps. Figure 8: Protein models reconstructed by DeepMM and Phenix for the Chain A of 6DW1
      cache = ./cache/10_1101-2020_08_28_271981.pdf
       txt  = ./txt/10_1101-2020_08_28_271981.txt
=== reduce.pl bib ===
         id = 10_1101-436634
     author = Cotto, Kelsy C.
      title = RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer
       date = 2021
      pages = 48
  extension = .pdf
       mime = application/pdf
      words = 14993
  sentences = 1337
     flesch = 59
    summary = RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer somatic variants from genomic data with splice junctions from transcriptomic data to identify isoforms, we annotated them with the Variant Effect Predictor (VEP), SpliceAI, and GenotypeTissue Expression (GTEx) junction counts and compared our results to other tools that integrate tools, the unbiased nature of RegTools has allowed us to identify novel splice variants and identify potential cis-acting splice-relevant variants in tumors (www.regtools.org). To demonstrate the utility of RegTools in identifying potential splice-relevant variants from tumor transcriptome as described above and its associated variants based on splice junction region For our analysis, we annotated the pairs of associated variants and junctions identified by Pan-cancer analysis of 35 tumor types identifies somatic variants that alter canonical We also identify recurrent splice altering variants in genes not known to be cancer genes RegTools contains three sub-modules: "variants", "junctions", and "cis-splice-effects".
      cache = ./cache/10_1101-436634.pdf
       txt  = ./txt/10_1101-436634.txt
Building ./etc/reader.txt
10_1101-2021_01_08_425379
10_1101-2020_03_27_012757
10_1101-2021_01_08_425885
10_1101-2021_01_08_425379
10_1101-2020_12_14_422697
10_1101-2021_01_08_425885
                number of items: 50
                   sum of words: 175,544
          average size in words: 7,632
      average readability score: 52

                          nouns: preprint; cell; data; version; author; review; copyright; holder; funder; peer; preprintthis; %; cells; license; genes; perpetuity; gene; analysis; number; expression; sequences; cancer; sequence; model; type; fig; samples; p; figure; time; rights; reuse; permission; species; types; licenseavailable; methods; seq; models; values; results; information; licensemade; sample; datasets; lines; b; score; dataset; protein
                          verbs: is; was; are; were; be; posted; certified; has; using; display; used; granted; biorxiv; based; have; made; allowed; reserved; found; �; shown; associated; been; see; compared; identified; obtained; given; known; applied; showed; including; had; generated; calculated; predicted; selected; identify; read; observed; performed; defined; set; shows; derived; computed; included; according; estimated; use
                     adjectives: -; available; different; single; other; international; human; high; same; specific; such; non; more; new; average; multiple; higher; low; first; similar; scalar; significant; variant; relative; negative; biological; positive; genomic; large; functional; common; clinical; regulatory; novel; �; genome; total; several; experimental; subclonal; individual; small; true; top; deep; many; additional; general; wide; main
                        adverbs: not; also; then; only; more; however; well; respectively; most; as; thus; therefore; here; further; significantly; even; e.g.; first; highly; out; at; next; very; finally; often; specifically; least; together; still; previously; randomly; directly; less; much; relatively; above; instead; particularly; so; potentially; up; hence; fully; differentially; furthermore; currently; moreover; http://creativecommons.org/licenses/by-nc-nd/4.0/; recently; rather
                       pronouns: we; it; our; their; i; its; they; them; us; itself; https://doi.org/10.1101/2021.01.06.425569; https://doi.org/10.1101/2021.01.08.425918; his; one; swcam; adroit; he; em; https://doi.org/10.1101/2021.01.04.425335; themselves; https://doi.org/10.1101/2021.01.07.425716; bl; http://paperpile.com/b/h8ctd0/cq1b; her; ng; il-27ra; you; yj; mine; λ; us-; ua; u; she; s; ourselves; n; matchdrugwithdisease; m; https://doi.org/10.1016/j.cell.2011.02.013; 𝜆𝜃; 𝜆; 𝒗𝑖; 𝑺; 𝑢-; 𝑘1; 𝑖𝑗; 𝑓; 𝑉-; ―
                   proper nouns: january; al; et; �; ⋅; j.; m.; nc; rna; s.; c; nd; international; figure; sars; m; −; a.; il-27; s; r.; d.; c.; cov-2; l.; e.; fig; j; p.; p; h.; b.; k.; t.; supplementary; g.; b; a; methods; y.; d; e; .; t; n; g; cell; hla; by; k
                       keywords: january; international; figure; sars; rna; fig; cell; biology; variant; supp; gwas; genome; gene; dna; disease; datum; cancer; blast; ace2; 𝐷27; 𝐷23; vntr; vae; usc; uclust; type; tumor; trajectory; toronto; time; table; t69; sørensen; supplementary; subclonal; study; stat1; srs; spike; simlr; sequence; section; sec; rsubread; rpgg; rpe1; rmsd; ricotta; q18; prescription

       one topic; one dimension: 10
                        file(s): ./cache/10_1101-2021_01_08_425887.pdf
                      titles(s): Auto-CORPus: Automated and Consistent Outputs from Research Publications

    three topics; one dimension: org; com; 10
                        file(s): , , ./cache/10_1101-2021_01_06_425560.pdf
                      titles(s): Computing the Riemannian curvature of image patch and single-cell RNA sequencing data manifolds using extrinsic differential geometry | Competitive binding of STATs to receptor phospho-Tyr motifs accounts for altered cytokine responses in autoimmune disorders | Review and performance evaluation of trait-based between-community dissimilarity measures

  five topics; three dimensions: http com paperpile; org 10 2021; 10 org https; cell 10 doi; 10 org doi
                        file(s): , , , , 
                      titles(s): Competitive binding of STATs to receptor phospho-Tyr motifs accounts for altered cytokine responses in autoimmune disorders | Evaluating the transcriptional fidelity of cancer models | Integrated cross-study datasets of genetic dependencies in cancer | AdRoit: an accurate and robust method to infer complex transcriptome composition | LiquidCNA: tracking subclonal evolution from longitudinal liquid biopsies using somatic copy number alterations

      Type: biorxiv
     title: bioinformatics-from-biorxiv
      date: 2021-01-10
      time: 13:51
  username: emorgan
    patron: Eric Morgan
     email: emorgan@nd.edu
     input: vUYT7OIQgp.xml
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
         id: 10_1101-2021_01_08_425976
     author: Ahuja, Yuri
      title: Semi-supervised Calibration of Risk with Noisy Event Times (SCORNET) Using Electronic Health Record Data
       date: 2021
      words: 72.0
  sentences: 16.0
      pages: 
     flesch: 15.0
      cache: ./cache/10_1101-2021_01_08_425976.pdf
        txt: ./txt/10_1101-2021_01_08_425976.txt
    summary: bioRxiv.org the preprint server for Biology Skip to main content Home Submit ALERTS / RSS Search for this keyword Advanced Search Subject Areas All Articles Animal Behavior and Cognition Biochemistry Bioengineering Bioinformatics Biophysics Cancer Biology Cell Biology Clinical Trials Developmental Biology Ecology Epidemiology Evolutionary Biology Genetics Genomics Immunology Microbiology Molecular Biology Neuroscience Paleontology Pathology Pharmacology and Toxicology Physiology Plant Biology Scientific Communication and Education Synthetic Biology Systems Biology Zoology View by Month

         id: 10_1101-2021_01_06_425581
     author: Alicea, Bradly J
      title: Periodicity in the embryo: emergence of order in space, diffusion of order in time
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_07_425637
     author: Arneson, Adriana
      title: A mammalian methylation array for profiling methylation levels at conserved sequences
       date: 2021
      words: 12153.0
  sentences: 1117.0
      pages: 39
     flesch: 58.0
      cache: ./cache/10_1101-2021_01_07_425637.pdf
        txt: ./txt/10_1101-2021_01_07_425637.txt
    summary: characterize the CpGs on the mammalian methylation array with various genomic annotations. Array probes are sequences of length 50bp flanking a target CpG based on the human reference We added probes targeting 1986 CpGs to the mammalian methylation array based on All 37488 CpGs profiled on the mammalian methylation array apply to humans, but only a CpGs on the mammalian array cover 6871 human and 5659 mouse genes when each DNA methylation samples for three species: human (n=10 arrays), mouse (n=20), and rat (n=15), synthetic DNA data from 3 species: human (n=10 mammalian arrays), mouse (n=20), and rat CpG and gene coverage of probes on the mammalian methylation array across CpG island and chromatin state analysis of mammalian methylation probes. probes targeting the same CpG that can also be found on the human EPIC array that were not mammalian methylation array to the human (hg19) and mouse (mm10) genome using QUASR 

         id: 10_1101-2020_10_26_351783
     author: Badam, Tejaswi V.S.
      title: A validated generally applicable approach using the systematic assessment of disease modules by GWAS reveals a multi-omic module strongly associated with risk factors in multiple sclerosis
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_06_425494
     author: Baldwin, Quenisha
      title: The topological free energy of proteins
       date: 2021
      words: 72.0
  sentences: 16.0
      pages: 
     flesch: 15.0
      cache: ./cache/10_1101-2021_01_06_425494.pdf
        txt: ./txt/10_1101-2021_01_06_425494.txt
    summary: bioRxiv.org the preprint server for Biology Skip to main content Home Submit ALERTS / RSS Search for this keyword Advanced Search Subject Areas All Articles Animal Behavior and Cognition Biochemistry Bioengineering Bioinformatics Biophysics Cancer Biology Cell Biology Clinical Trials Developmental Biology Ecology Epidemiology Evolutionary Biology Genetics Genomics Immunology Microbiology Molecular Biology Neuroscience Paleontology Pathology Pharmacology and Toxicology Physiology Plant Biology Scientific Communication and Education Synthetic Biology Systems Biology Zoology View by Month

         id: 10_1101-332965
     author: Banerjee, Arpita
      title: Identification and design of vinyl sulfone inhibitors against Cryptopain-1 – a cysteine protease from cryptosporidiosis-causing <em>Cryptosporidium parvum</em>
       date: 2021
      words: 9346.0
  sentences: 1457.0
      pages: 30
     flesch: 72.0
      cache: ./cache/10_1101-332965.pdf
        txt: ./txt/10_1101-332965.txt
    summary: protease well by making favorable interactions with important residues of the enzyme. Keywords: Vinyl sulfone inhibitors, Cryptopain-1, Cysteine protease, Molecular residues that were contacted by ligand subgroups across the enzymatic cleft, in one or The ligand subgroup-contacting residues in each complex had been mutated to Alanine; favorably interacting subsite residues (derived from Supplementary Table 1) in the Interactions: enzyme subsite residues ligand subgroups with the ligand ring systems showed highly positive ddGbind values for thiophen group in always, showed favorable interactions even when no ligand group was placed near it. with no ligand group placed near the residue, the interactions were unfavorable. around the ligand subgroups of the best-scored vinyl sulfones compounds (PubChem IDs Figure1: Illustration of the typical binding of vinyl sulfone inhibitors to cysteine protease enzymes. Figure 3: All the residues that are contacted by one or more ligands in the docked complexes of 

         id: 10_1101-2020_01_29_925354
     author: Bartoszewicz, Jakub M.
      title: Interpretable detection of novel human viruses from genome sequencing data
       date: 2021
      words: 13578.0
  sentences: 1392.0
      pages: 14
     flesch: 61.0
      cache: ./cache/10_1101-2020_01_29_925354.pdf
        txt: ./txt/10_1101-2020_01_29_925354.txt
    summary: As the state-of-the art approach for the openview detection of pathogens is genome sequencing (5, 6), it learning (17) to predict host range for a small set of three wellstudied species directly from viral sequences. predicting whether a new virus can potentially infect humans. boundary separates human viruses from other DNA sequences generated the reads from the genomes of human-infecting constituting reads yields a prediction for the whole sequence. predictions from all the reads originating from a given genome In the Fig. 1 we present example filters, visualized as "maxcontrib" sequence logos based on mean partial Shapley values prediction directly from next-generation sequencing reads Three receptor-binding domains (RBDs) are colored in blue, white and red according to the predicted infectious potential of the corresponding genomic sequence. Interpretable detection of novel human viruses from genome sequencing data Interpretable detection of novel human viruses from genome sequencing data

         id: 10_1101-2021_01_08_425967
     author: Camacho-Hernández, Diego A.
      title: Partition Quantitative Assessment (PQA): A quantitative methodology to assess the embedded noise in clustered omics and systems biology data
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_07_425716
     author: Cao, Yingying
      title: Comprehensive comparison of transcriptomes in SARS-CoV-2 infection: alternative entry routes and innate immune responses
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_06_425569
     author: Chen, Li
      title: Metabolite discovery through global annotation of untargeted metabolomics data
       date: 2021
      words: 11555.0
  sentences: 1084.0
      pages: 31
     flesch: 59.0
      cache: ./cache/10_1101-2021_01_06_425569.pdf
        txt: ./txt/10_1101-2021_01_06_425569.txt
    summary: to yeast and mouse data, we identify a half-dozen novel metabolites, including thiamine and taurine Peak annotation occurs in a single global optimization step, based on linear programming, connected nodes matches the atom mass difference and (ii) only co-eluting peaks are connected by edges receive a positive score for MS2 spectra similarity match between the connected nodes, and With a score assigned for each potential node and edge annotation, we formulate the global network A final edge annotation score S( 𝑢, 𝑣, 𝑎 , 𝑏 , 𝐷 ) for choosing candidate formula 𝑎 for node u, A final edge annotation score S( 𝑢, 𝑣, 𝑎 , 𝑏 , 𝐷 ) for choosing candidate formula 𝑎 for node u, A global network optimization approach for untargeted metabolomics data annotation NetID applies global optimization for metabolomics data annotation and metabolite A global network optimization approach for untargeted metabolomics data annotation (NetID).

         id: 10_1101-2021_01_04_425315
     author: Chen, Lulu
      title: Sample-wise unsupervised deconvolution of complex tissues
       date: 2021
      words: 7333.0
  sentences: 1028.0
      pages: 29
     flesch: 66.0
      cache: ./cache/10_1101-2021_01_04_425315.pdf
        txt: ./txt/10_1101-2021_01_04_425315.txt
    summary: tool enables statistically-principled subtype-level downstream analyses, such as detecting subtypespecific differentially expressed genes (sDEG) and differential dependency networks (DDN) nuclear-norm regularized low-rank matrix factorization problem (Wang, Hoffman et al. regularization to optimize the estimation of between-sample variations in each subtype to recover sample-specific deconvolution and optimization solver used in swCAM algorithm, followed by Sample-specific deconvolution problem formulation and the assumption of hidden low-rank pattern CAM-estimated subtype-specific expression matrix serves as the initial reference 𝑺. The objective function of swCAM for sample-specific deconvolution problem and its reformulation As swCAM focuses on subtype-specific variation estimation, simulating biological variance The observations for 300 genes in 50 samples were simulated with subtype-specific expression Gene co-expressed function modules detected by WGCNA on swCAM estimated sample-specific Gene co-expressed function modules detected by WGCNA on swCAM estimated samplespecific expression for each subtype with λ=5 and δ=1 or 0.1. capacity of swCAM to estimate sample-specific signals in each subtype using simulations where 

         id: 10_1101-2021_01_06_425550
     author: Chen, Nae-Chyun
      title: Improving variant calling using population data and deep learning
       date: 2021
      words: 6912.0
  sentences: 754.0
      pages: 17
     flesch: 55.0
      cache: ./cache/10_1101-2021_01_06_425550.pdf
        txt: ./txt/10_1101-2021_01_06_425550.txt
    summary: We further evaluated the performance of the models using two whole-exome sequencing (WES) datasets from a recently released set of genome and exome data [23] (Figure 2). Among the populationresolved false-positive errors, more than two third (71.0%) are uncommon (allele frequency ≤ 5%) among the 1000Genomes samples, whereas there are only 11.4% uncommon variants for population-induced false positives. This observation supports the hypothesis that the population-aware model uses allele frequency to adjust its variant calls. A potential concern for population-aware variant calling models is increasing false negative rate for novel alleles. To better understand the zero-frequency variants, we called variants using the DeepVariant PacBio model with the PrecisionFDA v2 35x HG003 reads set sequenced with the We evaluate potential biases introduced by population information in variant calling by comparing population-aware models that use allele frequencies from different Despite greater overall accuracy, we note that the population-aware model underperforms on variants with zero allele frequencies in 1000Genomes.

         id: 10_1101-2021_01_07_425794
     author: Chisanga, David
      title: Impact of gene annotation choice on the quantification of RNA-seq data
       date: 2021
      words: 9715.0
  sentences: 1233.0
      pages: 24
     flesch: 66.0
      cache: ./cache/10_1101-2021_01_07_425794.pdf
        txt: ./txt/10_1101-2021_01_07_425794.txt
    summary: Despite the importance of gene annotations in RNA-seq data analysis, very little research has been conducted to examine how differences in annotations impact on gene compared the effect of human genome annotations from popular databases including Ensembl, GENCODE and RefSeq on various aspects of RNA-seq analysis and they showed gene-level expression quantification in an RNA-seq data analysis pipeline. The Ensembl, RefSeq-NCBI and RefSeqRsubread annotations were provided to featureCounts to generate read counts for genes Gene expression data generated using TaqMan RT-PCR and Illumina''s BeadChip microarray were used to validate the gene-level quantification results from the RNA-seq The Ensembl and NCBI RefSeq annotations are among the most widely used gene annotations that have been utilized for RNA-seq gene expression quantification in the field. led to a better concordance in gene expression between the RNA-seq data and the RTPCR data, compared to the use of Ensembl and RefSeq-NCBI annotations.

         id: 10_1101-436634
     author: Cotto, Kelsy C.
      title: RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer
       date: 2021
      words: 14993.0
  sentences: 1337.0
      pages: 48
     flesch: 59.0
      cache: ./cache/10_1101-436634.pdf
        txt: ./txt/10_1101-436634.txt
    summary: RegTools: Integrated analysis of genomic and transcriptomic data for the discovery of splicing variants in cancer somatic variants from genomic data with splice junctions from transcriptomic data to identify isoforms, we annotated them with the Variant Effect Predictor (VEP), SpliceAI, and GenotypeTissue Expression (GTEx) junction counts and compared our results to other tools that integrate tools, the unbiased nature of RegTools has allowed us to identify novel splice variants and identify potential cis-acting splice-relevant variants in tumors (www.regtools.org). To demonstrate the utility of RegTools in identifying potential splice-relevant variants from tumor transcriptome as described above and its associated variants based on splice junction region For our analysis, we annotated the pairs of associated variants and junctions identified by Pan-cancer analysis of 35 tumor types identifies somatic variants that alter canonical We also identify recurrent splice altering variants in genes not known to be cancer genes RegTools contains three sub-modules: "variants", "junctions", and "cis-splice-effects".

         id: 10_1101-2021_01_05_425409
     author: Dholakia, Dhwani
      title: HLA-SPREAD: A comprehensive resource for HLA associated diseases, drug reactions and SNPs across populations
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_05_425384
     author: Halilaj, Iva
      title: Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19
       date: 2021
      words: 5110.0
  sentences: 1327.0
      pages: 17
     flesch: -15.0
      cache: ./cache/10_1101-2021_01_05_425384.pdf
        txt: ./txt/10_1101-2021_01_05_425384.txt
    summary: of COVID-19 patients, expediting the models'' transition from research to clinical practice. The open source website https://covid19risk.ai/ currently incorporates nine models from six an inclusive platform for predictive models related to COVID-19. supplement their judgment with patient-specific predictions from externally-validated models Keywords: Covid-19, predictive models, diagnosis, prognosis, nomogram, machine We, as researchers working on COVID-19 models, saw an urgent need for a web-based Our aim for this platform is to include validated prediction models (TRIPOD type 2b and 3) published AI prediction models related to all aspects of COVID-19, including diagnosis, COVID-19 predictive models will serve as a decision aid for doctors. Decision Support System for Severity Risk Prediction and Triage of COVID-19 Patients Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19 Covid19Risk.ai: An open source repository and online calculator of prediction models for early diagnosis and prognosis of Covid-19

         id: 10_1101-2020_08_28_271981
     author: He, Jiahua
      title: Full-length <em>de novo</em> protein structure determination from cryo-EM maps using deep learning
       date: 2021
      words: 10020.0
  sentences: 1169.0
      pages: 26
     flesch: 72.0
      cache: ./cache/10_1101-2020_08_28_271981.pdf
        txt: ./txt/10_1101-2020_08_28_271981.txt
    summary: Full-length de novo protein structure determination from cryo-EM maps using deep learning structure types were predicted by a second DenseNet. Finally, the protein sequence was aligned to the main-chain according to the predicted Cα probabilities, amino acid types, and secondary structure amino acid type, and secondary structure type for each main-chain point, the target protein sequence The second network (i.e. DenseNet B) is used to predict the amino acid type and secondary structure type of a main-chain local dense point (LDP). Figure 3 shows a comparison of the predicted Cα models for the protein chains of different lengths The authors acknowledge professor Daisuke Kihara and his students Genki Terashi and Sai Raghavendra Maddhuri Venkata Subramaniya from Purdue University for providing their datasets. A New Protocol for Atomic-Level Protein Structure Modeling and Refinement Using Low-to-Medium Resolution Cryo-EM Density Maps. Figure 8: Protein models reconstructed by DeepMM and Phenix for the Chain A of 6DW1

         id: 10_1101-2021_01_07_425801
     author: Hou, Yapeng
      title: Fibrinolysis influences SARS-CoV-2 infection in ciliated cells
       date: 2021
      words: 6841.0
  sentences: 912.0
      pages: 18
     flesch: 71.0
      cache: ./cache/10_1101-2021_01_07_425801.pdf
        txt: ./txt/10_1101-2021_01_07_425801.txt
    summary: proteases, may cleave the furin site of SARS-CoV-2 S protein and  subunits of epithelial sodium channels ( 15 TMPRSS2, and ACE2 were significantly upregulated in severe COVID-19 patients and SARS-CoV-2 infected 22 Plasmin cleaves the furin site in SARS-CoV S protein (Kam et al. lung epithelial cells and whether SARS-CoV-2 infection alters their expression at the single-cell level. severe/moderate COVID-19 patients and SARS-CoV-2 infected cell lines, mainly owning to ciliated cells. The expression levels of proteases (PLAU, FURIN, TMPRSS2, PLG), ACE2, and SCNN1G in 11 cell 88 Expression levels of PLAU, SCNN1G, and ACE2 in SARS-CoV-2 infection 93 epithelial cell lines infected with SARS-CoV-2: A549, Calu-3, and NHBE (Blanco-Melo et al. CoV-2 infection also increased the expression level of ACE2 in A549 cells (P < 0.05) (Smith et al. Our data showed that the respiratory cells co-express SARS-CoV-2 receptor, ENaC 137 Changes of proteases, ACE2, and SCNN1G in respiratory cell lines after SARS-CoV-2 

         id: 10_1101-2021_01_08_425887
     author: Hu, Yan
      title: Auto-CORPus: Automated and Consistent Outputs from Research Publications
       date: 2021
      words: 6886.0
  sentences: 553.0
      pages: 10
     flesch: 53.0
      cache: ./cache/10_1101-2021_01_08_425887.pdf
        txt: ./txt/10_1101-2021_01_08_425887.txt
    summary: the same structured model, so that these can be used as input to rule-based or deep learning algorithms for data extraction. example, at this point in this article the main headers are ''abstract'' followed by ''introduction'' and ''materials and methods'' that could make up a digraph. We use this process to evaluate new potential synonyms for existing terms and identify abstract → introduction → materials → results → discussion → conclusion → acknowledgements → footnotes section → references. Based on the digraph, we then assigned data and data description to be synonyms of the materials section, and participants From the analysis of ego-networks four new potential categories were identified: disclosure, graphical abstract, highlights and participants. Newly identified synonyms for existing IAO terms (00006xx) from the digraph mapping of 2,441 publications. Newly identified synonyms for existing IAO terms (00006xx) from the digraph mapping of 2,441 publications.

         id: 10_1101-2021_01_05_425508
     author: Iwasaki, Yuki
      title: Human cell-dependent, directional, time-dependent changes in the mono- and oligonucleotide compositions of SARS-CoV-2 genomes
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_04_425335
     author: Ji, Guoli
      title: Learning association for single-cell transcriptomics by integrating profiling of gene expression and alternative polyadenylation
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_12_24_424332
     author: Jolly, Bani
      title: Genetic epidemiology of variants associated with immune escape from global SARS-CoV-2 genomes
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_05_425414
     author: Lakatos, Eszter
      title: LiquidCNA: tracking subclonal evolution from longitudinal liquid biopsies using somatic copy number alterations
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_06_425560
     author: Lengyel, Attila
      title: Review and performance evaluation of trait-based between-community dissimilarity measures
       date: 2021
      words: 15548.0
  sentences: 2320.0
      pages: 50
     flesch: 67.0
      cache: ./cache/10_1101-2021_01_06_425560.pdf
        txt: ./txt/10_1101-2021_01_06_425560.txt
    summary: Review and performance evaluation of trait-based between-community dissimilarity measures 2 Review and performance evaluation of trait-based between-community dissimilarity measures 2 2. In this paper we reviewed the trait-based dissimilarity indices available in the 16 dissimilarities calculated by different indices correlate with environmental distances. beta diversity, dissimilarity index, distance metric, community ecology, functional traits 39 including several families of trait-based dissimilarity indices. FDissim indices incorporate trait information into the calculation of dissimilarity in different 162 Indices following this approach represent each community with a typical trait value, and 185 2005) or trait-based dissimilarity of species (Lepš 220 of the similarity indices for presence/absence data disregarding species properties, while the 281 ordinariness values in the species-based (dis-)similarity indices. Ricotta & Pavoine (2015) introduced a new family of trait-based similarity measures called 331 For species-based analyses, Ricotta & Podani (2017) suggested a general formula of distance 336 compared how strongly the dissimilarity indices correlate with the environmental distance 515 

         id: 10_1101-2021_01_07_425782
     author: Lu, Tianyu
      title: dynUGENE: an R package for uncertainty-aware gene regulatory network inference, simulation, and visualization
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_08_13_249839
     author: Lu, Tsung-Yu
      title: Profiling variable-number tandem repeat variation across populations using repeat-pangenome graphs
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_07_425697
     author: Luo, Yin
      title: Capsule network for protein ubiquitination site prediction
       date: 2021
      words: 5391.0
  sentences: 552.0
      pages: 14
     flesch: 59.0
      cache: ./cache/10_1101-2021_01_07_425697.pdf
        txt: ./txt/10_1101-2021_01_07_425697.txt
    summary: Capsule network for protein ubiquitination site prediction Capsule network for protein ubiquitination site years, some calculation methods have been developed to predict potential ubiquitination sites. this paper, a deep learning model, "Caps-Ubi," is proposed that uses a capsule network for protein network layer are used as a feature extractor to obtain the functional domains in the protein Data of protein ubiquitination sites the amino acid sequence around the protein ubiquitination site; namely, one-of-K encoding and the performance of various window sizes in one-of-21 and amino acid continuous encoding modes. modes is the best on the capsule network: this proposed Caps-Ubi model achieved an accuracy, In this paper, a new deep learning model for predicting protein ubiquitination sites is proposed, ubiquitination sites in proteins. Large-scale prediction of protein ubiquitination sites machine learning method with substrate motifs to predict ubiquitin-conjugation site on machine learning method with substrate motifs to predict ubiquitin-conjugation site on 

         id: 10_1101-2021_01_05_425266
     author: Morales, Manuel A.
      title: DeepStrain: A Deep Learning Workflow for the Automated Characterization of Cardiac Mechanics
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_425952
     author: Nash, Anthony
      title: rdrugtrajectory: An R Package for the Analysis of Drug Prescriptions in Electronic Health Care Records
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_12_26_424429
     author: Nayar, Gowri
      title: Analysis and Forecasting of Global RT-PCR Primers for SARS-CoV-2
       date: 2021
      words: 4921.0
  sentences: 407.0
      pages: 13
     flesch: 61.0
      cache: ./cache/10_1101-2020_12_26_424429.pdf
        txt: ./txt/10_1101-2020_12_26_424429.txt
    summary: SARS-CoV-2 primers in use today by measuring the number of mismatches between primer sequence and genome targets with respect to the sequenced SARS-CoV-2 genomes, we can measure how the targeted proteins are mutating. primer sequences and protocols developed for six different regions – USA, Germany, China, Hong Kong, Japan, and Thailand – percent of genomes hit by each PCR test, labelled by the country and target gene region. Figure 6 shows the average number of mismatches over time, grouped by the genomes sampled The results of this study also demonstrate that each primer target develops a different number of mismatches over time The mutations that lead to mismatches between gene PCR primers and their targets reflect the sequence evolution of the consistent with the observed increasing number of mismatches over time, and shows that evolution of SARS-CoV-2 genomes is of the RdRp gene, primer target creates a disproportionate number of mismatches when compared to genomes sequenced within

         id: 10_1101-2021_01_04_425285
     author: Nugent, Cameron M.
      title: debar, a sequence-by-sequence denoiser for COI-5P DNA barcode data
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_09_09_289074
     author: Ortuso, Francesco
      title: Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex
       date: 2021
      words: 7878.0
  sentences: 977.0
      pages: 23
     flesch: 67.0
      cache: ./cache/10_1101-2020_09_09_289074.pdf
        txt: ./txt/10_1101-2020_09_09_289074.txt
    summary: Structural Genetics of circulating variants affecting the SARS-CoV-2 Spike / human ACE2 complex SARS-CoV-2, COVID-19, mutations, Spike, ACE2 SARS-CoV-2 entry in human cells is mediated by the interaction between the viral Spike protein and protein variants in the SARS-CoV-2 population as the result of mutations, and it is unclear if these SARS-CoV-2 (the COVID-19 virus) and human cells, through the analysis of Spike/ACE2 complexes. future mutations targeting the ACE2/Spike binding and detected by sequencing SARS-CoV-2 on a We obtained structural models of the SARS-CoV-2 Spike interacting with the human ACE2 from three contributing to the interaction between Spike and ACE2, according to GBPM (see Table 1 and Fig 3 for A less frequent mutation amongst those predicted to contribute to the ACE2/Spike interaction is population and non-zero GBPM average score in the ACE2/Spike interaction models. ACE2 variants with non-zero GBPM score in the Spike interaction model.

         id: 10_1101-2021_01_06_425546
     author: Osthus, Dave
      title: Fast and Accurate Influenza Forecasting in the United States with Inferno
       date: 2021
      words: 72.0
  sentences: 16.0
      pages: 
     flesch: 15.0
      cache: ./cache/10_1101-2021_01_06_425546.pdf
        txt: ./txt/10_1101-2021_01_06_425546.txt
    summary: bioRxiv.org the preprint server for Biology Skip to main content Home Submit ALERTS / RSS Search for this keyword Advanced Search Subject Areas All Articles Animal Behavior and Cognition Biochemistry Bioengineering Bioinformatics Biophysics Cancer Biology Cell Biology Clinical Trials Developmental Biology Ecology Epidemiology Evolutionary Biology Genetics Genomics Immunology Microbiology Molecular Biology Neuroscience Paleontology Pathology Pharmacology and Toxicology Physiology Plant Biology Scientific Communication and Education Synthetic Biology Systems Biology Zoology View by Month

         id: 10_1101-2020_11_13_381475
     author: Pabis, Kamil
      title: Triplex and other DNA motifs show motif-specific associations with mitochondrial DNA deletions and species lifespan
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_05_22_110247
     author: Pacini, Clare
      title: Integrated cross-study datasets of genetic dependencies in cancer
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_03_27_012757
     author: Peng, Da
      title: Evaluating the transcriptional fidelity of cancer models
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_426008
     author: Pipes, Lenore
      title: AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees
       date: 2021
      words: 3682.0
  sentences: 382.0
      pages: 7
     flesch: 63.0
      cache: ./cache/10_1101-2021_01_08_426008.pdf
        txt: ./txt/10_1101-2021_01_08_426008.txt
    summary: AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees Despite the exponential increase in the size of sequence databases of homologous genes, few methods exist to cluster At low identities, these methods produce uneven clusters where the majority of sequences are are no clustering methods that can accurately cluster large taxonomically divergent metabarcoding reference databases such as databases (Schoch et al., 2020), there is a need for new computationally efficient methods that can cluster divergent sequences. To cluster divergent sequences, we developed AncestralClust clustering methods: UCLUST (Edgar, 2010), meshclust2 (James dataset against UCLUST because it is the most widely used clustering program and it performs better than CD-HIT on low identity We developed a phylogenetic-based clustering method, AncestralClust, specifically to cluster divergent metabarcode sequences. Comparisons of clustering methods using 13,043 COI sequences from 11 different species.

         id: 10_1101-2021_01_07_425773
     author: Robitaille, Michael C.
      title: A Self-Supervised Machine Learning Approach for Objective Live Cell Segmentation and Analysis
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_425897
     author: Soleymanjahi, Saeed
      title: APOBEC1 mediated C-to-U RNA editing: target sequence and trans-acting factor contribution to 177 RNA editing events in 119 murine transcripts in-vivo
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_425885
     author: Sritharan, Duluxan
      title: Computing the Riemannian curvature of image patch and single-cell RNA sequencing data manifolds using extrinsic differential geometry
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_04_425250
     author: Thibodeau, Asa
      title: A read count-based method to detect multiplets and their cellular origins from snATAC-seq data
       date: 2021
      words: 9658.0
  sentences: 910.0
      pages: 32
     flesch: 59.0
      cache: ./cache/10_1101-2021_01_04_425250.pdf
        txt: ./txt/10_1101-2021_01_04_425250.txt
    summary: A read count-based method to detect multiplets and their cellular origins from snATAC-seq data Similar to other droplet-based single cell assays, single nucleus ATAC-seq (snATAC-seq) data harbor multiplets 17 found that when snATAC-seq samples were adequately sequenced (e.g., >20k valid read pairs per cell), ATAC-52 ATAC-DoubletDetector detected heterotypic multiplets introduced in PBMC samples with high recall 126 detected multiplets were homotypic (76.7-84.3% in islets, 63-78.7% in PBMCs), with cell types being distributed 207 with respect to their cell proportions for both homotypic and heterotypic multiplet types (Fig. 5d-e, Extended Data 208 ATAC-DoubletDetector for identifying multiplets from snATAC-seq data with enough reads per nuclei, it can also 239 Fig. 2: ATAC-DoubletDetector identifies heterotypic and homotypic multiplets in human PBMC snATAC-seq data. e, The number of cells and percentage of multiplets detected by ATAC-DoubletDetector in PBMC and islet samples. Extended Data Fig. 6: ATAC-DoubletDetector detects both homotypic and heterotypic multiplets at high read depth.

         id: 10_1101-2021_01_02_425006
     author: Wang, Qi
      title: Analysis of next- and third-generation RNA-Seq data reveals the structures of alternative transcription units in bacterial genomes
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_06_425544
     author: Wang, Xia
      title: Complex Systems Analysis Informs on the Spread of COVID-19
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2020_04_17_043323
     author: Waschke, Johannes
      title: <em>linus</em>: Conveniently explore, share, and present large-scale biological trajectory data from a web browser
       date: 2021
      words: 6528.0
  sentences: 571.0
      pages: 13
     flesch: 54.0
      cache: ./cache/10_1101-2020_04_17_043323.pdf
        txt: ./txt/10_1101-2020_04_17_043323.txt
    summary: linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser linus: Conveniently explore, share, and present large-scale biological trajectory data In biology, we are often confronted with information-rich, large-scale trajectory data, but exploring and communicating We provide a python script that reads trajectory data and enriches them with additional features, such as edge bundling or custom axes and generates an interactive web-based visualisation that can be shared offline from diffusion MRI imaging (Liu et al., 2020), or tracking data such as cell trajectories or animal trails (Romero-Ferrero et visualisation tool linus, making it easier to explore 3D trajectory data from any device without a local installation of Creating a visualisation package with linus is done in a few simple steps (Fig. 1a): The user imports trajectory data from a Figure 1 Browser-based exploration and sharing of trajectory visualizations with linus.

         id: 10_1101-2021_01_04_425288
     author: Wei, Qi
      title: Predicting chemotherapy response using a variational autoencoder approach
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_425379
     author: Wilmes, Stephan
      title: Competitive binding of STATs to receptor phospho-Tyr motifs accounts for altered cytokine responses in autoimmune disorders
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_425855
     author: Wu, Canbiao
      title: DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites.
       date: 2021
      words: 7280.0
  sentences: 806.0
      pages: 31
     flesch: 53.0
      cache: ./cache/10_1101-2021_01_08_425855.pdf
        txt: ./txt/10_1101-2021_01_08_425855.txt
    summary: DeepHBV: A deep learning model to predict hepatitis B virus (HBV) integration sites. deep learning model DeepHBV to predict HBV integration sites by learning local learning model DeepHBV to predict HBV integration sites by learning local genomic DeepHBV effectively predicts HBV integration sites by adding genomic features. mixed HBV integration sequences, positive genome feature samples, and randomly peaks and DeepHBV with HBV integration sequences + TCGA Pan Cancer peaks) on model trained with HBV integrated sequences + TCGA Pan Cancer showed an performed better compared with DeepHBV model with HBV integration sequences + HBV integration sites + TCGA Pan Cancer, a cluster of attention weights much output of DeepHBV with HBV integration sites plus TCGA Pan Cancer showed the of DeepHBV with HBV integration sequences + TCGA Pan Cancer showed strong DeepHBV with HBV integration sequences + TCGA Pan Cancer model on (a) DeepHBV with HBV integration sequences + TCGA Pan Cancer model on (a) 

         id: 10_1101-2020_12_14_422697
     author: Yang, Tao
      title: AdRoit: an accurate and robust method to infer complex transcriptome composition
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_08_425918
     author: Yogodzinski, Christopher H
      title: A global cancer data integrator reveals principles of synthetic lethality, sex disparity and immunotherapy.
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

         id: 10_1101-2021_01_05_425417
     author: Zielezinski, Andrzej
      title: Taxonomy-aware, sequence similarity ranking reliably predicts phage-host relationships
       date: 2021
      words: nan
  sentences: nan
      pages: 
     flesch: nan
      cache: 
        txt: 
    summary: 

==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel