Change the repository type filter
All
Repositories list
36 repositories
- Accelerates migrations to Databricks by automating code conversion and migration validation
ucx
PublicAutomated migrations to Unity Catalogsandbox
PublicExperimental labs projectsdqx
PublicDatabricks framework to validate Data Quality of pySpark DataFrames- Python Testing for Databricks
- API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation
blueprint
PublicBaseline for Databricks Labs projects written in Python- Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines
mosaic
Public- Lightweight SQL execution wrapper only on top of Databricks SDK
- Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
discoverx
PublicA Swiss-Army-knife for your Data Intelligence platform administration.dbx
Public🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.- Databricks Add-on for Splunk
- Capture deep metrics on one or all assets within a Databricks workspace
pylint-plugin
PublicDatabricks Plugin for PyLinttika-ocr
Publicgeoscan
Public- Automated provisioning of an industry Lakehouse with enterprise data model
databricks-sdk-r
PublicDatabricks SDK for R (Experimental)transpiler
Publicdatabricks-sync
Public- Delta Sharing + MLflow for ML model & experiment exchange (arcuate delta - a fan shaped river delta)
feature-factory
Public