Align to AllTheBacteria with low RAM (1-2 GB) and quickly (seconds for just thousands of hits, 15 mins for a gene present in all 2.4 million genomes) using @shenwei356.bsky.social's LexicMap. So you can basically BLAST AllTheBacteria locally, but you need 3-4 TB of disk for the index.
https://www.biorxiv.org/content/10.1101/2024.08.30.610459v1
Reposted from
Zamin Iqbal
Latest data from AllTheBacteria described in our updated preprint. All Illumina WGS bacterial+archaeal sequence data to Aug 2024, consistently assembled, QC'd, and now AMR-profiled. That's 2.4 million genomes. Gene annotations almost done, coming next. Please use!
www.biorxiv.org/content/10.1...