This repository was archived by the owner on Mar 4, 2025. It is now read-only.

inigohidalgo/kedro-ibis-dataset

Kedro IbisDataSet

This was a proof of concept designed to kickstart discussions around integrating Ibis and Kedro. The functionality has since been merged into Kedro; refer to the official implementation by @deepyaman.


Connector to load data from SQL tables into Python Kedro pipelines as Ibis tables. This makes it possible to connect to databases and build queries dynamically, with Kedro's node-pipeline framework composing the Ibis expressions.
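As a rough illustration of building a query inside a node, the sketch below narrows a table expression without executing it. It assumes the dataset hands the node an Ibis table expression; the function name and the `amount` and `order_id` columns are hypothetical and not part of this package.

```python
def filter_large_orders(orders, min_amount=100):
    """Kedro node: narrow an Ibis table expression without executing it.

    `orders` is assumed to be the Ibis table expression loaded by the dataset.
    The `amount` and `order_id` columns are hypothetical.
    """
    # Comparing a column with a value builds a boolean expression, not a result.
    is_large = orders["amount"] >= min_amount
    # filter() and select() each return a new expression; nothing runs here.
    return orders.filter(is_large).select("order_id", "amount")
```

A function like this could be registered with `kedro.pipeline.node`, using a catalog entry such as `ibis_dataset_table` as its input, so that downstream nodes keep extending the same expression.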

To save data into SQL tables, the connector can save either pandas dataframes or ibis expressions.

To instantiate a DataSet, you must currently pass a connection string of the form {backend}://{database-connection-string}. See this section of the Ibis documentation for more details.
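To make the shape of the connection string concrete, here is a minimal sketch that splits it into its two parts. The helper `split_connection_string` and the example URL are hypothetical; this is not how Ibis or this package parse the string, only an illustration of the format.

```python
def split_connection_string(con: str) -> tuple[str, str]:
    """Split '{backend}://{database-connection-string}' into its two parts."""
    backend, sep, rest = con.partition("://")
    if not sep or not backend:
        raise ValueError(f"expected '{{backend}}://...', got {con!r}")
    return backend, rest


# Hypothetical example value, in the same form as the `con` credential.
backend, rest = split_connection_string("postgres://user:password@localhost:5432/mydb")
print(backend)  # postgres
print(rest)     # user:password@localhost:5432/mydb
```

The part before `://` names the Ibis backend, and everything after it is passed along as that backend's own connection details.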

How to use

For an example of usage in a Kedro pipeline, see this project.

1. pip install the package

pip install kedro-ibis-dataset

2. Use as any other Kedro DataSet

credentials.yml

database_creds:
    con: backend://connection-url

catalog.yml

ibis_dataset_table:
  type: kedro_ibis_dataset.IbisDataSet
  table_name: table_name
  credentials: database_creds
  save_args:
    overwrite: true  # or false

For more information on how to use Ibis to connect to SQL databases, see the Ibis documentation.
