Replies: 1 comment
Do you need something like this? Feel free to browse the file, since it has all the UDF use cases we support.
I haven't found a pattern for returning more than one column back from an operation on a worker node.
I found Microsoft.Spark.Sql.DataFrameFunctions and the related DataFrameUdfWrapper, but the only functionality that seems to be available is returning a single column.
In some of my prior Scala code, there are transformations that attach two or more additional columns to an existing DataFrame. You can see pseudocode of that here.
#857 (comment)
In that example we attach only a single additional column, DIM_ResultSurrogate, but the code can easily be extended to return other columns as well.
I'm not looking to chain two UDFs. I want to do all the work in a single UDF and return multiple distinct columns of information, if possible. Is there any way to do that? Maybe a way to return some sort of tuple class that will fit into a single Arrow column output?
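For reference, this is roughly how the "tuple in a single column" idea looks in plain Scala Spark: the UDF returns a case class, Spark stores it as one struct column, and a select expands the struct fields into separate columns. It is only a sketch under assumptions. `LookupResult`, its fields, the `key` column, the `DIM_ResultLabel` name, and the lookup logic are all made up for illustration, and whether the Microsoft.Spark UDF wrappers expose an equivalent is not confirmed here.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, udf}

// Hypothetical result type: each field becomes one field of the struct column.
case class LookupResult(surrogate: Long, label: String)

object MultiColumnUdf {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("multi-column-udf")
      .getOrCreate()
    import spark.implicits._

    // Illustrative input; the real DataFrame would come from elsewhere.
    val df = Seq("a", "bb", "ccc").toDF("key")

    // A single UDF that does all the work once per row and returns
    // a case class, which Spark represents as one struct column.
    val lookup = udf((key: String) => LookupResult(key.length.toLong, key.toUpperCase))

    val result = df
      .withColumn("res", lookup(col("key")))
      // Expand the struct fields into ordinary top-level columns.
      .select(
        col("key"),
        col("res.surrogate").as("DIM_ResultSurrogate"),
        col("res.label").as("DIM_ResultLabel"))

    result.show()
    spark.stop()
  }
}
```

The UDF still produces one column as far as Spark is concerned, so this fits the single-column restriction while carrying several values per row.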