
A six-frame translation and pairwise alignment pipeline that maps DNA sequences to their best-fit reference target protein.


Rohak72/Efficient-Sequence-Analyzer


Efficient-Sequence-Analyzer

A wrapper for Expasy Translate and EMBOSS Needle that automates the six-frame translation and pairwise alignment feedback loop. Available as both a command-line tool and a published web app!
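To make the loop concrete, here is a minimal pure-Python sketch of the idea: translate a DNA sequence in all six reading frames, score each frame against a target protein with a global alignment, and keep the best one. It is not the project's code. The real pipeline delegates to Expasy Translate and EMBOSS Needle, while this sketch assumes the standard genetic code and a simple match/mismatch/gap scoring scheme in place of Needle's substitution matrix. All function names (`six_frame_translations`, `global_alignment_score`, `best_frame`) are hypothetical.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna):
    """Map frame labels (+1..+3, -1..-3) to translated protein strings."""
    forward = dna.upper()
    reverse = reverse_complement(forward)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(forward[offset:])
        frames[f"-{offset + 1}"] = translate(reverse[offset:])
    return frames


def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch score with a linear gap penalty (assumed scoring)."""
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def best_frame(dna, target):
    """Return (frame label, translated protein, score) for the top-scoring frame."""
    scored = [
        (label, protein, global_alignment_score(protein, target))
        for label, protein in six_frame_translations(dna).items()
    ]
    return max(scored, key=lambda item: item[2])


if __name__ == "__main__":
    print(six_frame_translations("ATGGCCTAA"))
    print(best_frame("ATGGCCAAATAA", "MAK"))
```

Stop codons are kept as `*` in the output rather than trimmed, so a frame that ends in a stop pays one gap penalty against a target without one. A real run would likely handle open reading frames and scoring matrices more carefully.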

NOTE: If you're only interested in the source code, you can clone just the cli-tool branch with `git clone -b cli-tool --single-branch https://github.com/Rohak72/Efficient-Sequence-Analyzer.git`

Tech Stack:

  • Backend: Python, FastAPI, SQLite (via SQLAlchemy ORM)
  • Frontend: React, Node.js, Tailwind CSS
  • Deployment: AWS Lambda, API Gateway, Vercel Hosting
  • Async Processors: AWS S3 + Worker Lambda + DynamoDB

Future Plans:

  • ✅ Build async processes to ingest compute-expensive alignment tasks within API Gateway's 30-second limit.
  • ⏱ Rework the real-time S3 database approach to reduce request overload.
  • ⏱ Incorporate Litestream to handle concurrent processes with write-ahead logging to a designated S3 bucket.
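The async item above can be pictured as a submit-then-poll pattern: the API handler records the job and returns right away, a separate worker does the slow alignment, and the client checks status later. The sketch below is only an illustration of that general pattern under stated assumptions, not the project's implementation. Plain dictionaries stand in for the S3 bucket and the DynamoDB table, and every name (`submit_job`, `run_worker`, `get_status`) is hypothetical.

```python
import uuid

# In-memory stand-ins (assumptions): one dict plays the S3 bucket that holds
# uploaded inputs, the other plays the DynamoDB table that tracks job state.
uploads = {}
jobs = {}


def submit_job(payload):
    """API-side step: store the input, mark the job PENDING, return at once."""
    job_id = uuid.uuid4().hex
    uploads[job_id] = payload
    jobs[job_id] = {"status": "PENDING", "result": None}
    return job_id


def run_worker(job_id, task):
    """Worker-side step: run the slow task outside the request/response cycle."""
    jobs[job_id]["status"] = "RUNNING"
    try:
        jobs[job_id]["result"] = task(uploads[job_id])
        jobs[job_id]["status"] = "DONE"
    except Exception as exc:  # record the failure so pollers are not left waiting
        jobs[job_id]["status"] = "FAILED"
        jobs[job_id]["result"] = str(exc)


def get_status(job_id):
    """Polling step: report the current state of a job, or UNKNOWN."""
    return dict(jobs.get(job_id, {"status": "UNKNOWN", "result": None}))


if __name__ == "__main__":
    job = submit_job("ATGGCCTAA")
    print(get_status(job))   # still pending; the request has already returned
    run_worker(job, len)     # `len` is a placeholder for an alignment task
    print(get_status(job))
```

Because the submit step only writes a record and returns an ID, its latency does not depend on how long the alignment takes, which is what keeps the request inside the gateway's time limit.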

