DeltaLink is a lightweight tool for running basic operations and queries on Delta Lake tables stored in Azure Data Lake Storage (ADLS). It relies on Unity Catalog for secure, unified data governance.
- 🔍 Query managed or external Delta tables directly from ADLS
- 🛠️ Perform basic CRUD operations (Create, Read, Update, Delete) on Delta tables
- 🔐 Integrates with Unity Catalog for fine-grained access control
- ⚡ Optimized for performance and scalability using Daft and Ray
- Data exploration and validation in Delta Lake tables
- Lightweight data engineering workflows
- CRUD over REST endpoint
- Azure Data Lake Storage (ADLS)
- Daft (with Delta table and UC support)
- FastAPI (serves the REST API with an OpenAPI v3 spec)
- Poetry (Python project manager)
- Ray (optional; used for the SQL endpoint)
- Unity Catalog for access control
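To illustrate how Daft, Delta Lake and Unity Catalog fit together, here is a minimal sketch of reading a Delta table registered in Unity Catalog. It is not DeltaLink's own code: the helper names `qualified_table_name` and `read_delta_table` are hypothetical, and the endpoint, token and table name are values you would supply yourself. It assumes a Daft installation with Delta Lake and Unity Catalog support.

```python
def qualified_table_name(catalog: str, schema: str, table: str) -> str:
    """Build the three-level Unity Catalog name: catalog.schema.table."""
    parts = [catalog, schema, table]
    if any(not part or "." in part for part in parts):
        raise ValueError("catalog, schema and table must be non-empty and contain no dots")
    return ".".join(parts)


def read_delta_table(endpoint: str, token: str, table_name: str):
    """Return a lazy Daft DataFrame for a Delta table registered in Unity Catalog.

    Hypothetical helper for illustration; not part of DeltaLink.
    """
    # Imported lazily so the name helper above works even without Daft installed.
    import daft
    from daft.unity_catalog import UnityCatalog

    # Unity Catalog resolves the table name to its storage location and
    # applies the access rules attached to the token.
    unity = UnityCatalog(endpoint=endpoint, token=token)
    uc_table = unity.load_table(table_name)

    # Nothing is read until the DataFrame is materialized (e.g. with .show()).
    return daft.read_deltalake(uc_table)
```

A call such as `read_delta_table(endpoint, token, qualified_table_name("main", "sales", "orders")).show()` would then print the first rows, assuming that table exists and the token grants read access.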
/
├── deltalink/ # Python module
├── tests/ # Unit and integration tests
├── pyproject.toml
└── README.md
- Clone the repository:
git clone https://github.com/your-username/deltalink.git
cd deltalink
- Install dependencies:
poetry install
- Configure credentials and endpoints.
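The configuration step is not spelled out here, so the following is only a sketch of one common approach: reading credentials and endpoints from environment variables. The variable names (`DELTALINK_UC_ENDPOINT`, `DELTALINK_UC_TOKEN`, `DELTALINK_STORAGE_ACCOUNT`) and the `Settings` class are assumptions made for illustration, not names that DeltaLink is known to read.

```python
import os
from dataclasses import dataclass, field
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    """Hypothetical settings container; field names are illustrative only."""
    uc_endpoint: str
    uc_token: str = field(repr=False)  # keep the secret out of logs and reprs
    storage_account: str = field(default="")


# Maps each settings field to the (assumed) environment variable that fills it.
_ENV_VARS = {
    "uc_endpoint": "DELTALINK_UC_ENDPOINT",
    "uc_token": "DELTALINK_UC_TOKEN",
    "storage_account": "DELTALINK_STORAGE_ACCOUNT",
}


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Load settings from `env` (defaults to os.environ), failing fast on gaps."""
    env = os.environ if env is None else env
    missing = [var for var in _ENV_VARS.values() if not env.get(var)]
    if missing:
        raise RuntimeError("Missing required settings: " + ", ".join(missing))
    return Settings(**{name: env[var] for name, var in _ENV_VARS.items()})
```

Failing fast with the full list of missing variables tends to be easier to debug than an authentication error raised later by Unity Catalog or ADLS.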
# TODO
# TODO
Run tests using:
pytest tests/
This project is licensed under the MIT License. See the LICENSE file for details.
Contributions are welcome! Please open issues or submit pull requests for improvements or new features.