Skip to content

Add count/nonnegative outcome variable type to sl3 and poisson family supporting learner properties #356

@Larsvanderlaan

Description

@Larsvanderlaan

Nonnegative regression is useful in a number of contexts. For example, if one wishes to estimate the conditional variance then regressing (Y-E[Y|X])^2 on X using poisson regression makes sure the estimates are nonnegative. Also, count data is fairly common and it is beneficial to use poisson regression over linear regression in such cases.

Nice change would be:
A outcome-type called "count" and learners compatible with this outcome type should support poisson regression (e.g. HAL, glm, glmnet, xgboost, gam). By passing in family objects or other things, you can already make learners do poisson regression, but it would be nice to be natively supported.

This would require some minor changes to learners and the outcome_variable type.

Issue:
For automatic selection of outcome variable type, the difference between categorical variables with integer coding and counts may be unclear. Probably should just default to categorical? Or simply force the user to input the outcome count.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions