Interactive tutorials for using scikit-bio in biological research.
This repository hosts tutorial materials that were first taught at the ISMB 2024 conference. They were later updated and expanded to cover new features of scikit-bio. The current tutorials are up to date with scikit-bio 0.7.0.
The tutorials are broken down into eight sections. They can be directly launched in Google Colab via the following links.
- Working with various omic data types
- Analyzing microbial communities
- Comparing microbial community structure
- Inferring and associating critical features
- Predicting host and environmental traits
These sections use the EMP500 dataset, available for download at this Dropbox link.
- Protein remote homology detection and structural alignment using deep learning
- ProtTrans: Towards Cracking the Language of Life’s Code Through Self-Supervised Learning
- UniFrac: a New Phylogenetic Method for Comparing Microbial Communities
- Establishing microbial composition measurement standards with reference frames
- Unifying the analysis of high-throughput sequencing datasets: characterizing RNA-seq, 16S rRNA gene sequencing and selective growth experiments by compositional data analysis
- Balances: a New Perspective for Microbiome Analysis
- Compositionally Aware Phylogenetic Beta-Diversity Measures Better Resolve Microbiomes Associated with Phenotype