Reaching out from the Ray Team! #742

@richardliaw

Search before continuing

  • I have searched the Data-Juicer issues and found no similar feature requests.

Description

Hi! This is a really exciting project. I'm one of the authors of Ray, and I am currently working on Ray Data. We're seeing more users mention Data-Juicer as a key library on top of Ray.

I'd love to better understand how we can improve Ray Data for your users and use cases.

In particular, I noticed that you might not be using some of the recent Ray Data improvements (hash-shuffle, joins) that were introduced in Ray 2.46.

It also seems like Data-Juicer has a lot of standalone operators; I wonder if we can feature some of these as examples on the Ray Data side in a streaming execution pipeline.

If you're interested, we'd love to connect (perhaps over email or WeChat). Let us know!

Use case

No response

Additional information

No response

Are you willing to submit a PR for this feature?

  • Yes, I'd like to help by submitting a PR!

Metadata

Labels

enhancement: New feature or request
