|
| 1 | +# FAQs |
| 2 | + |
| 3 | +# TwistMethylFlow FAQs |
| 4 | + |
| 5 | +## Pipeline Steps |
| 6 | + |
| 7 | +### Generate Reference Genome |
| 8 | +??? question "What is the purpose of Generate Reference Genome?" |
| 9 | + This step creates reference genome index files for **Bismark**, which are required for bisulfite read alignment. |
| 10 | + |
| 11 | +### Raw Data QC |
| 12 | +??? question "How is raw sequencing data quality assessed?" |
| 13 | + Raw Data QC uses **FastQC** to evaluate per-base quality, GC content, adapter contamination, and other metrics. |
| 14 | + |
| 15 | +### Adapter Trimming |
| 16 | +??? question "How should adapters and low-quality bases be handled?" |
| 17 | + **Trim Galore** removes adapters and low-quality bases from reads, ensuring high-quality data for alignment. |
| 18 | + |
| 19 | +### Align Reads |
| 20 | +??? question "How are reads aligned to the reference genome?" |
| 21 | + **Bismark (Bowtie2)** aligns bisulfite-treated reads to the reference genome. |
| 22 | + Accurate alignment is crucial for downstream methylation analysis. |
| 23 | + |
| 24 | +### Deduplicate Removal |
| 25 | +??? question "What is deduplicate removal?" |
| 26 | + **Bismark** removes PCR duplicates from aligned reads to prevent bias in methylation calling. |
| 27 | + |
| 28 | +### Sort and Indexing |
| 29 | +??? question "How are BAM files prepared after alignment?" |
| 30 | + **Samtools** sorts and indexes deduplicated BAM files for efficient access in downstream analyses. |
| 31 | + |
| 32 | +### Extract Methylation Calls |
| 33 | +??? question "How are methylation calls extracted?" |
| 34 | + **Bismark** extracts cytosine methylation calls from aligned reads, generating the input for differential methylation analysis. |
| 35 | + |
| 36 | +### Summary Report |
| 37 | +??? question "How is the summary report generated?" |
| 38 | + **Bismark** summarizes alignment statistics and methylation extraction results, including conversion efficiency. |
| 39 | + |
| 40 | +### Alignment QC |
| 41 | +??? question "How is alignment quality assessed?" |
| 42 | + **Qualimap** generates metrics such as coverage, mapping quality, and duplication rate for aligned reads. |
| 43 | + |
| 44 | +### QC Reporting |
| 45 | +??? question "How are QC reports aggregated?" |
| 46 | + **MultiQC** compiles QC reports from FastQC, Bismark, and Qualimap into a single comprehensive report. |
| 47 | + |
| 48 | +### Differential Methylation |
| 49 | +??? question "How is differential methylation analysis performed?" |
| 50 | + **EdgeR** and **MethylKit** identify differentially methylated positions or regions, considering replicates and experimental design. |
| 51 | + |
| 52 | +### Post Processing |
| 53 | +??? question "How are results visualized?" |
| 54 | + **ggplot2** creates summary plots like heatmaps, volcano plots, and coverage plots for differential methylation results. |
| 55 | + |
| 56 | +### GO Analysis |
| 57 | +??? question "How is Gene Ontology analysis performed?" |
| 58 | + Functional enrichment analysis identifies pathways associated with differentially methylated genes, providing biological interpretation. |
| 59 | + |
| 60 | +--- |
| 61 | + |
| 62 | +## Nextflow Workflow Concepts |
| 63 | + |
| 64 | +### General Nextflow |
| 65 | +??? question "What is Nextflow?" |
| 66 | + Nextflow is a workflow management system for reproducible and scalable scientific pipelines. |
| 67 | + It allows users to define tasks and data flow using **processes** and **channels**. |
| 68 | + |
| 69 | +### Processes |
| 70 | +??? question "What are processes in Nextflow?" |
| 71 | + Processes encapsulate a task with defined inputs, outputs, and commands. |
| 72 | + They run independently and can scale across computing resources. |
| 73 | + |
| 74 | +### Channels |
| 75 | +??? question "What are channels in Nextflow?" |
| 76 | + Channels are asynchronous data streams that connect processes, passing input and output data between them. |
| 77 | + |
| 78 | +### Outputs |
| 79 | +??? question "How do I define outputs in Nextflow?" |
| 80 | + Outputs are declared using the `output` block in processes or workflows. |
| 81 | + This defines what data is saved for downstream steps or exported from the workflow. |
| 82 | + |
| 83 | +### Parallel Execution |
| 84 | +??? question "How does Nextflow handle parallel execution?" |
| 85 | + Nextflow automatically runs independent processes in parallel based on available compute resources, enabling scalable execution. |
| 86 | + |
| 87 | +### Reproducibility |
| 88 | +??? question "How does Nextflow ensure reproducibility?" |
| 89 | + Nextflow supports containerization (Docker/Singularity) and environment specification, ensuring the same pipeline produces identical results across systems. |
| 90 | + |
| 91 | +### Workflow Profiles |
| 92 | +??? question "What are Nextflow profiles?" |
| 93 | + Profiles allow users to define environment-specific configurations, such as compute clusters, Docker images, and resource limits, for flexible pipeline execution. |
| 94 | + |
| 95 | +### Learning Resources |
| 96 | +??? question "Where can I learn more about Nextflow?" |
| 97 | + - [Nextflow Documentation](https://www.nextflow.io/docs/latest/index.html) |
| 98 | + - [Nextflow Tutorials](https://nf-co.re/docs/usage/tutorials/nextflow) |
| 99 | + - [Hello Nextflow Training](https://training.nextflow.io/2.0/hello_nextflow/) |
| 100 | + |
| 101 | + |
0 commit comments