Directory
Sequence analysis »
Visualization »
Table of Contents
This repository contains scripts used to analyze and visualize mutations in the SARS-CoV-2 spike gene.
Current efforts involve detection of evolutionarily convergent SARS-CoV-2 spike mutations, mutations that enable mAb and antiviral resistance, and mutations that increase transmissibility of the disease.
Scripts are compatable with Python 3.7 - Python 3.11, Pandas 1.5.1+, Biopython 1.79+, Streamlit 1.18.0+.
- Anaconda distribution package recommended for conda installations.
- Pip installation requires pip 19.3+.
conda install pandas
pip install pandas
conda install -c conda-forge biopython
pip install biopython
conda install -c conda-forge streamlit
pip install streamlit
conda install -c plotly plotly_express
pip install plotly_express
Data visualization was performed on the sequence analysis output for SARS-CoV-2 sequences submitted to GISAID for the state of Arizona within the last 90 days. Visualization was performed using the python library plotly express and presented in a dashboard using the python library streamlit.
Local directory structure should emulate the format in the visualization folder for this repository.
- The main streamlit script should be in the parent directory while dashboard pages, and data should reside in their respective sub-directories.
- For more information on how to execute the script, visit this streamlit help page.
- Continued development of analysis and visualization script.