Replies: 1 comment
-
I think we could create a second repository under the "Baler" organization for pre-processing of data before running it through Baler. Thoughts? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Pre-processing of HEP data does not really belong in baler. Implementing pre-processing in the configuration file does not prove to be a more elegant solution (see figure). Baler should require that the user provides the data clean as a pickled pandas dataframe, how to get there is up to the user.
The problem with this for our use case is that each HEP file we use will require slightly different pre-processing, and we don't want these scripts to exist only locally.
TL;DR: Pre-processing functions should not come with Baler, but we want to store and share our functions somewhere safe (on git?)
Beta Was this translation helpful? Give feedback.
All reactions