Releases: KwanLab/Autometa
Release 2.2.3
- Fix "TypeError: boolean value of NA is ambiguous" Issue #360 in #361 by @chasemc and @Sidduppal
Full Changelog: 2.2.2...2.2.3
Release 2.2.2
What's Changed
- β¬οΈ bump readthedocs-sphinx-search; 0.1.1 to 0.3.2 by @chasemc in #354
- π Fixes Biopython deprecation warning by @Sidduppal in #350
- Fix "ncbi rsync error when downloading markers" #347 in #352 by @Sidduppal
- β¬οΈ Pin BioPython to >=1.82
- β¬οΈ Pin Pandas to >=1.5
- Release 2.2.2 by @Sidduppal in #359
Full Changelog: 2.2.1...2.2.2
Release 2.2.1
What's Changed
- π§βπ§ π Fix docs by @Sidduppal in #323
- Step by step walkthrough by @kaw97 in #314
- π¨ π Issue 330 redo by @shaneroesemann in #338
- π₯π³π Fix docker-builds by removing pinned dependencies by @WiscEvan in #340
- singularity image urls by @kaw97 in #316
- helpful error message added by @RajnishProgrammer in #307
- π:bug::snake: Fix bug for 0 passing predictions by @WiscEvan in #341
- π Fix GTDB database setup by @Sidduppal in #329
- ππ Fix GTDB taxon-binning workflow by @shaneroesemann in #339
- π Update documentation by @WiscEvan in #342
- ππ¨π₯π Fix UnboundLocalError bug by @WiscEvan in #325
- ππ¨π Refactor 'unclassified' handling by @kaw97 and @WiscEvan in #343
- Release 2.2.1 by @WiscEvan in #344
New Contributors
- @RajnishProgrammer made their first contribution in #307
Full Changelog: 2.2.0...2.2.1
Release 2.2.0
Release 2.2.0
Breaking changes π£ ππ
Autometa's autometa-taxonomy
and autometa-binning-summary
entrypoints (instead of taking --ncbi
as an input parameter) now take --dbdir
and --dbtype
to allow the user to use either the NCBI or GTDB database.
Command | π Previous π | π New π |
---|---|---|
autometa-taxonomy |
--ncbi |
--dbdir <ncbi-database-dirpath> , --dbtype ncbi (choices: ncbi, gtdb) |
autometa-binning-summary |
--ncbi |
--dbdir <ncbi-database-dirpath> , --dbtype ncbi (choices: ncbi, gtdb) |
NOTE: For implementation details for integrating other taxonomy databases see #284
π Additional autometa workflows
- Addition of workflow where only required inputs are reads and assembly
autometa.sh
andautometa-large-data-mode.sh
now requiretaxa_routine
as an input parameter (choices are"ncbi"
or"ncbi_gtdb"
. Autometa workflow now contains an optional sub-workflow where binning is guided by GTDB taxonomy after retrieving bacteria and archaea classified using the NCBI database.
TaxonomyDatabase
π GTDB taxonomy integration to use within Autometa's taxon-binning and genome-binning workflows.
For more information on GTDB database setup see the Autometa GTDB database documentation
(autometa) evan@userserver:~/Autometa$ autometa-setup-gtdb -h
usage: autometa-setup-gtdb [-h] --reps-faa REPS_FAA --dbdir DBDIR [--cpus CPUS]
optional arguments:
-h, --help show this help message and exit
--reps-faa REPS_FAA Path to directory containing GTDB ref genome animo acid data sequences. Can be tarballed.
--dbdir DBDIR Path to output GTDB database directory
--cpus CPUS Number of cpus to use for diamond-formatting GTDB database
TL;DR
Abstraction of taxonomy databases using TaxonomyDatabase
abstract base class with required abstractmethods
.
This is currently implemented for both the NCBI and GTDB taxonomy database. Future taxonomy database integrations should follow the format from the TaxonomyDatabase
class.
CAMI formatter
Binning validation/benchmarking utilities added/updated such as formatting autometa binning results into biobox format
Misc
- π Fix pytest requirements in GH actions
- πβ¬οΈ pin
scipy
andjoblib
to avoidhdbscan
import error - ππ€« Fix deprecated pandas method invocation in
bedtools.py
What's Changed
- π Added contribution documentation by @jason-c-kwan in #277
- π¨π Add CAMI formatter entrypoint
autometa-cami-format
by @WiscEvan in #276 - π¨π Update deprecated pandas method invocation by @WiscEvan in #279
- β¬οΈ π force scipy==1.8 by @kaw97 in #286
- π New bash files for Autometa workflow by @samche42 in #281
- β¬οΈ π¨ Allow the use of gtdb taxonomy in Autometa by @Sidduppal in #284
- Add
--average-method
parameter toautometa-benchmark
by @WiscEvan in #290 - GTDB integration by @Sidduppal and @WiscEvan in #284
New Contributors
- @jason-c-kwan made their first contribution in #277
- @kaw97 made their first contribution in #286
Full Changelog: 2.1.0...2.2.0
2.1.0
[2.1.0] - 2022-05-12
Added
- π
autometa
entrypoint to retrieve info on--citation
,--version
and no argument lists other autometa commands - π The autometa version is written after completion of each autometa-related nextflow process
- π The autometa version is emitted at the beginning of the autometa bash workflow.
- π NCBI.py will now search
delnodes.dmp
for any taxids retrieved from blast output and convert to root (taxid=1)
Fixed
- ππ Renamed incorrect variables
$coverage
to$coverages
and$clustering_method
to$cluster_method
inworkflows/autometa.sh
- ππ Fix use of
kwargs
in kmersembed(..., *kwargs)
function - ππ Fix passing
n_jobs
to kmersembed(..., n_jobs=1)
function - π π³Add
params.autometa_image_tag
tomock_data_reporter
process - π π³ Add
params.autometa_image_tag
toget_genomes_for_mock
process
Dependencies
- β¬οΈπ Pinned black version to 22.3.0
- β¬οΈ:green_apple: upgrade diamond to version 2.0.14
Removed
- π₯π Removed
tests/requirements.txt
and consolidated totests/environment.yml
What's Changed (from the git logs)
- π¨π Add taxid conversion from
'unclassified'
to1
for taxon-profiling benchmarking by @WiscEvan in #260 - π Update installation directions for bash workflow by @WiscEvan in #262
- ππ Fix parsing taxids to delnodes set by @WiscEvan in #265
- β¬οΈ π Upgrade Diamond 2.0.9 -> 2.0.14 by @chasemc in #264
- π¨ππ Add
n_jobs
toembed(...)
func. Replace**method_args
w/**method_kwargs
by @WiscEvan in #267 - Versioning by @WiscEvan in #269
- π¨π Add
autometa --version
to autometa nextflow files by @WiscEvan in #271 - ππ Fix variable name for autometa-binning-summary by @WiscEvan in #272
- Release 2.1.0 by @WiscEvan in #273
Full Changelog: 2.0.3...2.1.0
2.0.3
What's Changed
- π Add CITATION file by @shaneroesemann in #242
- ππ¨π Allow utf-8-sig sample sheets #250
- πππ Replace incorrect runtime defaults specified in nf-core launch command #252
- π¨π Continue nextflow workflow (without termination) if bins are not recovered from a particular dataset #253
- π¨π
autometa-benchmark
entrypoint, when checking taxid classifications, now converts any taxids of0
to1
#254 - π NCBI connection is checked prior to NCBI database files download rather than pinging google DNS #258
- π Fix
autometa-length-filter
bug where specifying a directory when writing to--output-fasta
was required (now optional) #256 - π β¬οΈ:whale::green_apple: Bump version to 2.0.3 in
VERSION
andmanifest.version
innextflow.config
s.t. autometa docker images used in nextflow workflow correspond to most recent release
New Contributors
- @shaneroesemann made their first contribution in #242
Full Changelog: 2.0.2...2.0.3
2.0.2
What's Changed
- π Fix the NoneType error during initating the LCA object (#246) by @chtsai0105 in #247
New Contributors
- @chtsai0105 made their first contribution in #247
Full Changelog: 2.0.1...2.0.2
2.0.1
What's Changed
- Environment and directory structure updates by @WiscEvan in #8
- resolved #10 Contributors added and copyright year updated to 2020. by @WiscEvan in #15
- Resolving issues #16, #17, #18, #21 and update to Autometa API and Logger by @WiscEvan in #25
- Resolved #19 added docstring, fixed nproc and removed depth function by @Sidduppa 8000 l in #29
- Documentation by @Sidduppal in #34
- Issue #5 Working conda recipe by @WiscEvan in #38
- πfound in coverage by @WiscEvan in #49
- fixes #2 by @WiscEvan in #47
- Contributing Guidelines by @WiscEvan in #50
- Add Markers class documentation. by @WiscEvan in #62
- Add main.py documentation (fixes #60) by @WiscEvan in #63
- Add functionality to bin without taxonomy. Update docstrings by @WiscEvan in #65
- Documentation by @Sidduppal in #45
- Remove merge conflict resolution lines (Fixes #68) by @WiscEvan in #69
- π¨π Add mock import of modules and link to contribution guidelines (fixes #22) by @Sidduppal in #70
- Resolves #55 Environ by @Sidduppal in #76
- fixes-#54 Metagenome by @WiscEvan in #66
- Update MAG class to MetaBin by @WiscEvan in #67
- hmmer by @Sidduppal in #72
- LCA by @WiscEvan in #78
- Fix writing by @WiscEvan in #82
- Update majority_vote by @WiscEvan in #81
- pre-commit hooks by @WiscEvan in #92
- verbose bug by @WiscEvan in #90
- Recursive DBSCAN by @WiscEvan in #84
- databases and utilities by @WiscEvan in #77
- Rank-specific binning by @WiscEvan in #96
- diamond.py by @Sidduppal in #87
- Add support request issue template. by @WiscEvan in #97
- ncbi.py by @Sidduppal in #83
- Samtools by @Sidduppal in #103
- Binning stats/taxonomy summary by @WiscEvan in #99
- decision tree classifier by @WiscEvan in #100
- Fix config and setup of user project by @WiscEvan in #104
- Update project docstrings by @WiscEvan in #108
- CI/CD by @WiscEvan in #101
- π Change > to >= when calculating N50 by @chasemc in #119
- Fix Dockerfile by @WiscEvan in #123
- Add support for gzipped assemblies by @WiscEvan in #129
- Update bug report template by @WiscEvan in #130
- Remove --multiprocess from autometa-kmers entrypoint by @WiscEvan in #127
- Add GC content std.dev. limit and coverage std. dev. limit Binning metrics by @WiscEvan in #120
- Nextflow implementation template by @WiscEvan in #118
- Update documentation by @Sidduppal in #121
- Add feature to download google drive datasets by @ajlail98 in #138
- Add densmap embed method and fix binning-summary cluster column bug by @WiscEvan in #176
- Classification and Clustering Benchmarking by @WiscEvan in #141
- Nfcore and structuring modules for collaboration by @chasemc in #157
- Delete .gitattribute - there is a .gitattributes by @chasemc in #190
- π₯π Remove duplicate standard slurm profiles by @WiscEvan in #195
- Fix import error in databases.py by @WiscEvan in #194
- Fix/Create mock data subworkflow by @chasemc in #206
- π³:bug: Docker fix :whale: by @WiscEvan in #213
- π Update Documentation by @WiscEvan in #212
- π¨ Add typehints and update kmers docstring by @WiscEvan in #208
- π¨π Add specific parsers for domtblout and tblout for hmmscan output formats by @WiscEvan in #201
- ππ¨π Update metagenome.length_filter(...) by @WiscEvan in #210
- Fix bedtools genomecov deprecation (coverage calculation) by @WiscEvan in #209
- π¨π Add bash-implementations of Autometa workflows by @WiscEvan in #202
- Nextflow documentation by @chasemc in #184
- π₯π Reformat benchmarking docs by @WiscEvan in #215
- Refactor autometa-taxonomy-lca by @WiscEvan in #211
- π¨π³ Replace jason-c-kwan with jasonkwan for docker images by @WiscEvan in #217
- Update mock_data_reporter.Dockerfile by @chasemc in #220
- π³ππ Add docker CI and update links by @WiscEvan in #216
- Simplify Licensing by @chasemc in #222
- Add check for nr.dmnd and nr.gz by @chasemc in #221
- π Change Nextflow I/O behavior by @WiscEvan in #218
- ππ¨ add/update coverage handling by @WiscEvan in #223
- ππ Large data mode by @WiscEvan in #207
- Merge main into dev by @WiscEvan in #224
- pytest & codecov CI by @WiscEvan in #227
- Refactor samplesheet by @WiscEvan in #228
- π π¨ π Fix kingdom-handling and mounting NCBI databases into docker container by @WiscEvan in #229
- Add error handling strategies for nextflow processes by @WiscEvan in #231
- Release 2.0.0 by @WiscEvan in #233
- ππ¨β¬οΈπ₯ update build files and respective docs by @WiscEvan in #234
- Add badges and links to README.md by @WiscEvan in #235
- GH action: Add dynamic docker tags by @WiscEvan in #236
- Update nextflow-workflow.rst by @samche42 in #238
- ππ₯ Fix bedtools coverage calculation bug by @WiscEvan in #243
New Contributors
Full Changelog: 1.0.3...2.0.1
2.0.0
Full Changelog: 2.0...2.0.0
1.0.3
- π Deprecate python2.7 code
- π³ π Dockerfile fix