Loss of good data due to excess Extraneous Same Day exclusions #73

@kradimer

Hello! First, I love this package. I had been cleaning a dataset of over 700K weights using overall z-scores, but some bad weights were still getting through. The weighted moving averages are amazing at catching these. Thank you!

The one thing I wish the package didn't do is throw out nearly all same-day measurements. My algorithm kept one measurement per day, the one with the lowest absolute z-score, whereas growthcleanR threw out 13K such measurements.

I didn't see anything in the configuration options that would let me change that. Am I missing something? Or might you be able to add an option for keeping one measurement per day when it is not otherwise an outlier?

Anyway, thank you again for creating and sharing this package. It's fantastic.
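
To make the request concrete, here is a minimal sketch of the one-per-day rule described above. It is an illustration only, not growthcleanR code and not the original algorithm's source: it is written in plain Python, the field names `subjid`, `date`, `weight_kg`, and `z` are assumptions, and the sample rows are made up. It would be applied only to measurements that survive the other exclusion steps.

```python
def keep_one_per_day(records):
    """For each (subject, date) pair, keep the record with the smallest |z|.

    `records` is a list of dicts with hypothetical keys "subjid", "date",
    and "z" (a precomputed z-score). Ties keep the first record seen.
    """
    best = {}
    for rec in records:
        key = (rec["subjid"], rec["date"])
        # Replace the stored record only if this one is strictly closer to z = 0.
        if key not in best or abs(rec["z"]) < abs(best[key]["z"]):
            best[key] = rec
    return list(best.values())


# Made-up example: two weights on the same day, one of them implausible.
records = [
    {"subjid": "A", "date": "2020-01-05", "weight_kg": 12.1, "z": 0.4},
    {"subjid": "A", "date": "2020-01-05", "weight_kg": 21.1, "z": 5.2},
    {"subjid": "A", "date": "2020-02-10", "weight_kg": 12.6, "z": 0.5},
]
kept = keep_one_per_day(records)
```

With this rule, the first day keeps the 12.1 kg measurement instead of losing both same-day values.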
