Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      0020Updated Aug 9, 2025Aug 9, 2025
    • Blazing-Fast Bioinformatic Operations on Python DataFrames
      Python
      22694010Updated Aug 4, 2025Aug 4, 2025
    • Rust
      1102Updated Aug 4, 2025Aug 4, 2025
    • Benchmarks of various genomic ranges operations
      Jupyter Notebook
      1021Updated Jul 21, 2025Jul 21, 2025
    • evo2

      Public
      Genome modeling and design across all domains of life
      Jupyter Notebook
      327000Updated Mar 23, 2025Mar 23, 2025
    • TeX
      0001Updated Mar 20, 2025Mar 20, 2025
    • A set of native implementation of common bioinformatics algorithms to be used as Arrow-Datafusion or SeQuiLa (Apache Spark) extensions.
      Rust
      00180Updated Mar 13, 2025Mar 13, 2025
    • Self service for Data Science labs
      HCL
      48021Updated Jan 18, 2025Jan 18, 2025
    • Jupyter Notebook
      46531Updated Jan 18, 2025Jan 18, 2025
    • Apache DataFusion Comet Spark Accelerator
      Rust
      229001Updated Nov 24, 2024Nov 24, 2024
    • Jupyter Notebook
      1100Updated Oct 27, 2024Oct 27, 2024
    • phenodb

      Public
      Serverless vector database for deep phenotyping
      0000Updated Aug 29, 2024Aug 29, 2024
    • Fine-tuning LLaMA 2 for rare disease concept normalization
      Jupyter Notebook
      3000Updated Aug 9, 2024Aug 9, 2024
    • sequila

      Public
      SeQuiLa: Distributed analytics for genomics based on Apache Spark!
      HTML
      710248Updated Aug 2, 2024Aug 2, 2024
    • PhenoGPT

      Public
      Jupyter Notebook
      8000Updated May 16, 2024May 16, 2024
    • Launcher shortcuts for classic Jupyter Notebook & JupyterLab
      Python
      10000Updated Feb 26, 2024Feb 26, 2024
    • PhenoTagger
      GAP
      17000Updated Jan 24, 2024Jan 24, 2024
    • rnafusion

      Public
      RNA-seq analysis pipeline for detection gene-fusions
      Nextflow
      116000Updated Dec 1, 2023Dec 1, 2023
    • rust-bio

      Public
      This library provides implementations of many algorithms and data structures that are useful for bioinformatics. All provided implementations are rigorously tested via continuous integration.
      Rust
      213000Updated Nov 18, 2023Nov 18, 2023
    • coitrees

      Public
      A very fast interval tree data structure
      Rust
      9000Updated Nov 10, 2023Nov 10, 2023
    • iitii

      Public
      Implicit Interval Tree with Interpolation Index
      Jupyter Notebook
      4000Updated Nov 9, 2023Nov 9, 2023
    • A little benchmarking tool for Python
      Python
      6000Updated Oct 15, 2023Oct 15, 2023
    • Python
      2000Updated Aug 24, 2023Aug 24, 2023
    • ds-images

      Public
      Shell
      00154Updated May 25, 2023May 25, 2023
    • sparkseq

      Public
      Scala
      0000Updated Mar 5, 2023Mar 5, 2023
    • popgen

      Public
      Scala
      0000Updated Mar 5, 2023Mar 5, 2023
    • Scala
      0000Updated Mar 5, 2023Mar 5, 2023
    • disq

      Public
      A library for manipulating bioinformatics sequencing formats in Apache Spark
      Java
      12100Updated Jan 29, 2023Jan 29, 2023
    • cannoli

      Public
      Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
      Scala
      17000Updated Jan 29, 2023Jan 29, 2023
    • Python
      2000Updated Jan 17, 2023Jan 17, 2023