Conduct market research on c̶a̶t̶a̶l̶o̶g̶s̶ metastores #141

@astrojuanlu

At the moment, each platform offers full read and write capabilities only for its own catalog, and read-only capabilities for its competitors' catalogs:

image

(source)

More importantly, data catalogs aren't new, but we're seeing catalogs created for different use cases and business needs: technical, business, and operational (source).

These are just some of the open source ones [1] that have been in the news recently. There are also Project Nessie, the Hive Metastore, the Iceberg REST Catalog, and probably others I'm missing. Then there are the commercial, vendor-driven ones.

And then we have... the Kedro Catalog!

We've sometimes had questions such as "how does the Kedro Catalog compare to the Unity Catalog?" The answer is that they're complementary, but this is not immediately clear to users (see kedro-org/kedro-plugins#542).

It's clear that this is going to be a hot topic in the data engineering space in the coming months, so we should have a good answer to how Kedro interacts with all of these.

Footnotes

  1. counting Polaris as open source
