Skip to content
@Daylily-Informatics

Daylily Informatics

Multi-omic Informatics and Operations Consulting Services.

| John Major

I am a bioinformatician, scalable operations architect, Scientist, Artist, software engineer, and systems thinker working at the intersection of biology, clinical genomics, data science & cutting-edge computational solutions.


🚀 About Me

🔬 Scientist: Passionate about unraveling biological complexity.
💻 Open Source Advocate: Building tools to accelerate discovery and collaboration.
🌱 Innovator: Driving sustainable, impactful solutions in informatics and beyond.
🏗️ Builder: Contributor to the 🧬 Human Genome Project and several successful 🏥 clinical diagnostic startup companies.

🔗 Connect With Me


🔧 Skills & Expertise

🧬 Bioinformatics: Clinical WGS, RNA-seq, and variant analysis.
📊 Data Science: Python, R, machine learning.
☁️ HPC & Cloud Computing: AWS, Slurm, high-performance computing.
🌐 Open Source Development: FastAPI, Snakemake, and more.
🏥 Clinical Diagnostic Operations: designing and running scalable diagnostic workflows.
⚖️ Clinical Diagnostic Regulation & Compliance: expertise in CLIA/CAP standards and certification processes.


🌟 Key Projects

🌼 Daylily Ephemeral Cluster && Omics Analysis Workflows

daylily-ephemeral-cluster: Infrastructure as code allows on-demand creation of arbitrarily large self-scaling clusters.
Features:

  • Built using AWS Parallel Cluster and Parallel Cluster UI.
  • Scans AWS Regions and AZs to determine best spot market pricing, and creates clusters where spot pricing is most competitive.
  • Highly performant globally shared filesystem via FSx Lustre mirroring reference and other data from S3.
  • Reproducible and predictable runtimes and costs.
  • Automateable.
  • Fine resolution budget tracking of jobs and resources.
  • Real time cost reporting and decision gating capabilities.
  • Will run any slurm based workflow manager ( snakemake, CROMWELL, nextflow, ...).
  • Tight coupling of reference data allows highly performant and nimble ephemeral cluster lifecycles.
  • Designed to be ephemeral-- packaged tools facilitate rapid creation. monitoring, updating, archiving and deleting of ephemeral clusters.

daylily-omics-analysis: Achieving ~$2–$5 per 30x no-amp WGS from FASTQ to VCF.
Features:

  • Optimized to run w/in a daylily-ephemeral-cluster framework.
  • Industry-leading accuracy, speed, cost, auditability, scalability, QC views & observability.
  • Reproducible, sustainable, growing & open-source omics analysis workflows.
  • Automated infrastructure management with predictive and real-time cost visibility for storage, data transfer, and compute.
  • Open source & free: Deploy daylily in ~1hr and begin returning completed WGS analysis shortly thereafter.

🛠 Snakemake Executor Plugin

snakemake-executor-plugin-pcluster-slurm
A plugin designed to integrate Snakemake workflows with AWS ParallelCluster’s Slurm workload manager.

🧪 Laboratory Information Management Systems (LIMS)

bloom:
A templated, abstract, polymorphic, and opinionated LIMS for efficient laboratory data management. Real time COGS moitoring and operational decision gating.

🖨 Zebra Label Printing

zebra_day:
A library and API for network-connected Zebra printers, managing ZPL label templates and numerous printers with ease.


🌐 API Wrappers and Integrations


🖼 Image Processing Tools


🎨 Color Space Conversion Utilities

  • rgbw_colorspace_converter:
    Utility for RGB to RGBW conversion, supporting HSV, HSI, HSL, and HEX, focused on LED-based projects.

📖 Obsidian Integration Tools

  • gravity_well:
    Imports text, markdown, and PDF files into Obsidian with NLP-derived tags and enhanced metadata tracking.

🎭 Artistic Projects

  • pyramidtriangles:
    Software for artistic LED installations, derived from the grgbrn baaahs2014 codebase.

👀 Explore more in my repositories or get in touch!


🌍 Open Source Advocate

I’m committed to contributing to the global scientific community by creating tools and sharing knowledge. Let’s collaborate to push the boundaries of what’s possible in science and technology.


👀 Interested in what I’m building? Follow, star, or get in touch.

Pinned Loading

  1. zebra_day zebra_day Public

    Zebra Label Printing library & API (for network connected printers). Manage 100s of printers and ZPL label templates in one place.

    Python 10 2

  2. github_markdown_text_colorizer github_markdown_text_colorizer Public

    Service to return images of text where the input text can have font, font size and color set

    Python 1

  3. img_stitcher_day img_stitcher_day Public

    Playing around with merging images taken of the outside of a tube.

    Python 1

Repositories

Showing 10 of 42 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…