Skip to content

Streamlit-based web application for scientific data cleaning, basic statistics, and interactive data visualization. Designed for researchers, students, and small labs to easily upload .csv or .tsv files, clean data, generate plots, and download results — all without writing code.

Notifications You must be signed in to change notification settings

VMansell92/CleanVizBio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧬 CleanVizBio

CleanVizBio is a web-based bioinformatics and data visualization tool built with Python and Streamlit. Designed for researchers, students, and data analysts, it simplifies the process of exploring, visualizing, and reporting scientific data — no coding required.


🚀 Features

  • 📂 Upload .csv or .tsv data files
  • 🧹 Clean datasets by removing empty rows/columns and renaming headers
  • 📊 Visualize data with:
    • Histogram
    • Box Plot
    • Scatter Plot
    • Heatmap (correlation)
    • PCA (Principal Component Analysis)
    • Volcano Plot for log2FC and p-values
  • 📈 View PCA variance explained
  • 💾 Download plots as .png
  • 📄 Generate a Markdown analysis report summarizing stats, PCA, and dataset info

📎 Report Export (Markdown)

Users can export a .md report containing:

  • Dataset overview
  • Summary statistics
  • PCA results (if available)

This report can be:

  • Opened in VS Code, Typora, or Dillinger.io
  • Converted to PDF using tools like pandoc or Markdown editors

🛠️ Tech Stack

  • Python 3
  • Streamlit
  • Pandas, NumPy, Seaborn, Matplotlib
  • Scikit-learn
  • Tabulate (for Markdown formatting)

📦 Run Locally

git clone https://github.com/VMansell92/CleanVizBio.git
cd CleanVizBio
python -m venv venv
venv\Scripts\activate     # (or source venv/bin/activate on Mac/Linux)
pip install -r requirements.txt
streamlit run app.py

🔗 Live App

👉 https://cleanvizbio-4fr2zewe8ykbzmbzi9leor.streamlit.app

📄 License

MIT License

👩‍💻 Author

Victoria Mansell
🔗 GitHub @VMansell92
📫 vmansell92@gmail.com
🚀 Open to freelance and bioinformatics-related opportunities!

About

Streamlit-based web application for scientific data cleaning, basic statistics, and interactive data visualization. Designed for researchers, students, and small labs to easily upload .csv or .tsv files, clean data, generate plots, and download results — all without writing code.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages