This README describes the deep sequencing analysis in:
Prevalence and mechanisms of evolutionary contingency in human influenza H3N2 neuraminidase
- ./Fasta/SD93_mutlib_ref.fa: Reference sequence for SD93 mutant library
- ./Fasta/Bil69_mutlib_ref.fa: Reference sequence for Bil69 mutant library
- Raw read files in fastq format from NIH SRA database BioProject PRJNA790468
-
Merge overlapping paired-end reads using PEAR
pear -f [FASTQ FILE FOR FORWARD READ] -r [FASTQ FILE FOR FORWARD READ] -o [OUTPUT FASTQ FILE]
- Output files should be placed in the fastq/ folder and named as:
- SD93_ipt.assembled.fastq for input library
- SD93_rep1.assembled.fastq for post-passaged library (replicate 1)
- SD93_rep2.assembled.fastq for post-passaged library (replicate 2)
- Output files should be placed in the fastq/ folder and named as:
-
Count variants from fastq files
python3 script/SD93_NA_fastq2count.py
- Input files:
- ./Fasta/SD93_mutlib_ref.fa
- fastq/SD93_ipt.assembled.fastq
- fastq/SD93_rep1.assembled.fastq
- fastq/SD93_rep2.assembled.fastq
- Output file:
- Input files:
-
Filtering low count variants
python3 script/SD93_NA_filter.py
- Input file:
- Output file:
-
Plot post-passaged frequency of each variant
Rscript script/SD93_NA_plot_freq.R
- Input file:
- Output file:
-
Merge overlapping paired-end reads
pear -f [FASTQ FILE FOR FORWARD READ] -r [FASTQ FILE FOR FORWARD READ] -o [OUTPUT FASTQ FILE]
- Output files should be placed in the fastq/ folder and named as:
- Bil69_ipt.assembled.fastq for input library
- Bil69_rep1.assembled.fastq for post-passaged library (replicate 1)
- Bil69_rep2.assembled.fastq for post-passaged library (replicate 2)
- Output files should be placed in the fastq/ folder and named as:
-
Compute enrichment value for each variant from fastq files
python3 script/Bil69_NA_fastq2enrich.py
- Input files:
- ./Fasta/Bil69_mutlib_ref.fa
- fastq/Bil69_ipt.assembled.fastq
- fastq/Bil69_rep1.assembled.fastq
- fastq/Bil69_rep2.assembled.fastq
- Output files:
- Input files:
-
Filter irrelevant variants
python3 script/Bil69_NA_filter.py
- Input file:
- Output file:
-
Plot enrichment of each variant
Rscript script/Bil69_NA_plot_enrich.R
- Input file:
- Output file: