-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Enhanced MetaWorks Pipeline: An improved and extended version of the original MetaWorks multi-marker metabarcode processing pipeline
This enhanced version of MetaWorks builds upon the excellent foundation established by Porter & Hajibabaei (2022), adding significant improvements:
- π³ Docker Containerization: Complete containerized environment eliminates dependency conflicts and ensures reproducible results across systems
- β‘ Parallel RDP Classifier: Multi-threaded taxonomic assignment dramatically reduces processing time for large datasets
- π§© Modular Architecture: Rules split into logical modules (preprocessing, denoising, pseudogene filtering) for easier maintenance and customization
- π§ Flexible Sample Input: Support for both CSV-based sample sheets and folder-based sample discovery
- π Enhanced Python Scripts: Modernized Python 3 scripts with improved error handling and performance
- π Advanced Statistics: Comprehensive quality metrics and processing statistics at each pipeline step
Section | Description |
---|---|
Installation Guide | Step-by-step installation instructions |
Quick Start | Get running in 10 minutes |
Supported Markers | Complete list of supported metabarcoding markers |
Workflows | Detailed workflow documentation |
Configuration | Pipeline configuration options |
Troubleshooting | Common issues and solutions |
API Reference | Command-line interface documentation |
Examples | Real-world usage examples |
Contributing | How to contribute to this project |
Marker | Target Taxa | Classifier Available |
---|---|---|
COI | Animals, Eukaryotes | β |
16S rRNA | Bacteria, Archaea | β |
ITS | Fungi | β |
rbcL | Plants, Diatoms | β |
12S | Fish, Vertebrates | β |
18S | Eukaryotes | β |
28S | Fungi | β |
Process multiple metabarcoding markers in a single, harmonized workflow using consistent bioinformatic approaches across all supported markers.
Generate high-resolution ESVs or traditional OTUs with taxonomic assignments and confidence scores using the RDP Classifier.
Built with Snakemake for reproducible, scalable processing on everything from laptops to high-performance computing clusters.
- ITS markers: Automatic removal of flanking conserved rRNA regions
- Protein-coding markers: Pseudogene filtering using profile HMMs
- Quality control: Comprehensive read quality assessment and filtering
Research Applications
- Biodiversity assessments
- Environmental monitoring
- Ecological studies
- Taxonomic surveys
Operational Applications
- Biomonitoring programs
- Environmental impact assessments
- Water quality monitoring
- Conservation projects
If you use this enhanced pipeline, please cite the original MetaWorks paper:
Porter, T. M., & Hajibabaei, M. (2022). MetaWorks: A flexible, scalable bioinformatic pipeline for high-throughput multi-marker biodiversity assessments. PLOS ONE, 17(9), e0274260. [doi:10.1371/journal.pone.0274260](https://doi.org/10.1371/journal.pone.0274260)
Additional Citations:
- COI classifier: Porter, T. M., & Hajibabaei, M. (2018). Scientific Reports, 8, 4226.
- Pseudogene filtering: Porter, T.M., & Hajibabaei, M. (2021). BMC Bioinformatics, 22: 256.
- RDP classifier: Wang, Q., et al. (2007). Applied and Environmental Microbiology, 73(16), 5261β5267.
- Issues: [Report bugs or request features](https://github.com/Hajibabaei-Lab/MetaWorks-2.0/issues)
- Discussions: [Join community discussions](https://github.com/Hajibabaei-Lab/MetaWorks-2.0/discussions)
- Contributing: See our Contributing Guidelines
- [Original MetaWorks Repository](https://github.com/terrimporter/MetaWorks)
- [Original MetaWorks Website](https://terrimporter.github.io/MetaWorksSite)
- [Hajibabaei Lab](https://github.com/Hajibabaei-Lab)
Getting Started: Ready to dive in? Head to our Installation Guide or Quick Start to begin processing your metabarcoding data.