Replies: 3 comments
-
Lance benifits from Amoro
Amoro benifits from Lance
Roadmap(mindstorm)
|
Beta Was this translation helpful? Give feedback.
-
Thanks for bringing up this discussion. Lance format has indeed expanded the boundaries of data lake tables. People have often asked me how data lake tables (like Iceberg) can store unstructured data - now it appears that the Lance format provides an excellent answer to this question. I am very much looking forward to the integration work between Lance and Amoro. From Amoro's integration history with Iceberg, we can see that an excellent management system can greatly simplify the usage burden of data lake tables, accelerate their adoption in enterprises, and further speed up the creation of business value, which is exactly the result my team hopes to achieve. So big +1, not only as a contributor to Amoro, but also as a member of Tencent Cloud's big data team. |
Beta Was this translation helpful? Give feedback.
-
If Amoro supports Lance, it would be a milestone for the project, as no other lake management platform currently supports this table format. Supporting Lance table management would be a significant step forward for Amoro in the domain of unstructured data. However, it also comes with many challenges. All in all, I believe it's a worthwhile endeavor. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Recently, Lance format has sparked phenomenal discussion across technical communities. As a next-generation data lake format optimized for AI workloads such as ML pipelines, vector search, and multi-modal processing, Lance offers unique advantages in performance and flexibility. Exploring synergies between Amoro's unified lakehouse management and Lance's AI-centric capabilities could unlock new potentials in scalable, intelligent data architectures.
What is Lance
Lance: https://github.com/lancedb/lance
As offical said:
Modern columnar data format for ML. Convert from Parquet in 2-lines of code for 100x faster random access, zero-cost schema evolution, rich secondary indices, versioning, and more.
Lance format could be considered as a new table format like Iceberg. The table format(aka. dataset) api and capabilities are similar except Lance was born in python and rust and more friendly to python and AI users. Under the table format layer, Lance also integrates its file format, which optimized IO and scheduling models for multi-modal data.
The key features of Lance that other table formats like Iceberg don't have:
Beta Was this translation helpful? Give feedback.
All reactions