Construction of the microbial reference gene catalog and metagenomic assembled genome database for goat rumen microbiome.
Before running, please ensure that the appropriate software and databases are installed.
Note: Some software is easier to use when configured in a virtual environment, such as MetaWRAP and KofamScan.
Database | Version | Description | Availability |
---|---|---|---|
goat | Genome assembly ARS1.2 | Goat reference genome | https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_001704415.2/ |
NCBI-nr | 20221012 | Protein database | https://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/ |
EggNOG | EggNOG5.0 | EggNOG annotation | http://eggnog5.embl.de/#/app/downloads |
CARD | v3.0.9 | CARD annotion | https://card.mcmaster.ca |
VFDB | 20200703 | VFDB annotion | http://www.mgc.ac.cn/VFs/main.htm |
GTDB-tk | release214 | Taxonomic assignments of MAGs | https://gtdb.ecogenomic.org/downloads |
Note: Update the database versions if necessary.
Metagenomic raw data processing: reads trimming and DNA contamination removal, generating high-quality data.
Assembly: assemble short reads into contigs.
Construct non-redundant gene catalog: predict ORFs and remove redundancy to build the non-redundant gene catalog.
Taxonomic annotation: align protein sequences from the non-redundant gene catalog to the NCBI-nr database to obtain taxonomic annotation information.
Functional annotation: align protein sequences from the non-redundant gene catalog to the eggNOG, KEGG, CAZy, CARD, and VFDB databases to obtain functional annotation information.
The steps related to the reconstruction of metagenome-assembled genomes (MAGs) are outlined in this section. Binning, refining, reassembling, annotating genomes, and estimating MAG abundances were carried out using the modules of the metaWRAP pipeline. MAG dereplication was performed using dRep software. Taxonomic classification and phylogenetic analysis were conducted with GTDB-tk and PhyloPhlAn, respectively.
Note: many custom scripts were involved in the analysis process. Please contact me for access.