Skip to content

mfz/GOR.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GOR.jl

Stable Dev Build Status

Genome ordered streams in Julia

GOR.jl is a Julia library to operate on genome ordered streams. Elements of genome ordered streams are sorted by chromosome and position, the first two items of each row element. Streaming allows operations on data sets that are larger than available memory, and genomic order speeds up relational operations like joins.

GOR supports the Tables.jl interface. This means it works with all sources and sinks that conform to the Tables.jl interface, e.g.

  • CSV files,
  • SQLite3 tables,
  • Parquet files,
  • and DataFrames,

and are ordered by chromosome and position.

GOR.jl allows creation of complex pipelines by joining together operators using the |> syntax. Each operator is implemented as a Julia iterator that is parameterized on input and output types. This allows for easy extension of the library by user-defined operations.

About

Genome ordered streams in Julia

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •