GitHub - Gab-23/Genomics-Pipeline: Brief genomics pipeline made as a project to take a set of .fastq files and perform every step (also quality controls) up to the variant calling process (outputs a complete folder with also .vcf files).

BRIEF PROGRAM OUTLINE

The program excecutes a complete pipeline of genomics analysis of a family The aim is variant prioritization and potential discovery of mendellian diseases

All the provided files are stored in a specific folder inside a server. The starting materials include:

The original FASTQ files, coming from a exome sequencing experiment
The reference genome and its indexing
The target sequences, according to the exons in the reference genome

The main steps excecuted by this program, according to the usual pipeline will be:

BAM files generation
- Reads, coming from a sequencing procedure are aligned against an already indexed reference genome
- BAM files are obtained
Quality Control step
- FASTQC to assess the quality of the reads
- BAMQC to assess the quality of the alignment
- MULTIQC to obtain a unique summary
Variant Calling
- Using the freebayes tools a VCF file containing called variants is generated
- The file is then filtrated in order to obtain only the variants with the right pattern of transmission
- The file is intersected with respect to the target regions of the reference exome
- The file is then sorted to keep a coherent ordering of the genotypes
BG files generation, for subsequent variant visualization on the UCSC Genome Browser

The output of the program will be a directory, named as the family number, containing 17 directories and 157 files

HOW TO INITIALIZE THE PROGRAM

The program will first ask for the target directory, where the output will be stored

NOTE: If the specified directory is NOT found, the program will create it
By further editing the program, other path-containing variables may be changed
The program can hold up to 10 arguments, since the project asked for a 10 families analysis
If you want to compute less than 10 arguments, mind typing STOP as a last argument, an error is raised otherwise
Arguments MUST be composed in this way: <family_number> _ <disease_model>

NOTE: Disease models either are Autosomic Recessive (AR), or Autosomic Dominant (AD) EXAMPLE: 452_AR, 468_AD ...

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Genomics Report 2024.pdf		Genomics Report 2024.pdf
MK_7_fast		MK_7_fast
README.md		README.md
README.txt		README.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BRIEF PROGRAM OUTLINE

HOW TO INITIALIZE THE PROGRAM

About

Uh oh!

Releases

Packages

Languages

Gab-23/Genomics-Pipeline

Folders and files

Latest commit

History

Repository files navigation

BRIEF PROGRAM OUTLINE

HOW TO INITIALIZE THE PROGRAM

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages