Releases: csiro-fair/marimba
Marimba: v1.0.1
Marimba is a Python framework for structuring, processing, packaging, and distributing FAIR scientific image datasets. Developed collaboratively by CSIRO and MBARI, Marimba provides core functionality for ensuring the FAIR compliance of scientific image data.
This initial stable release delivers a comprehensive framework for researchers, data scientists, and engineers working with large-scale image datasets, particularly in marine science and related fields.
Key Features
- Project-Based Workflow: Standardized structure for managing the entire processing pipeline from import to distribution
- Pipeline Architecture: Reusable, modular pipelines for processing data from specific instruments or systems
- Collection Management: Organized import and processing of data collections with parallel execution
- FAIR Compliance: Built-in support for the iFDO (image FAIR Digital Object) standard
- Metadata Management: Comprehensive metadata capture, embedding, and reporting at multiple levels
- Image Processing: Standard library for image conversion, compression, resizing, and thumbnail generation
- Dataset Packaging: Standardized approach for packaging processed datasets with provenance tracking
- Extensible Design: Easy to extend with custom pipelines for specialized data processing needs
Installation
Marimba can be installed using pip:
pip install marimbaSystem requirements:
- Python 3.10+
- ffmpeg (for video processing)
Documentation
For detailed usage instructions, see:
Feedback
This is our first stable release and we welcome your feedback! Please open issues for any bugs you encounter or features you'd like to see in future versions.
Thank you to all contributors who helped make this release possible.