You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Nonnegative regression is useful in a number of contexts. For example, if one wishes to estimate the conditional variance then regressing (Y-E[Y|X])^2 on X using poisson regression makes sure the estimates are nonnegative. Also, count data is fairly common and it is beneficial to use poisson regression over linear regression in such cases.
Nice change would be:
A outcome-type called "count" and learners compatible with this outcome type should support poisson regression (e.g. HAL, glm, glmnet, xgboost, gam). By passing in family objects or other things, you can already make learners do poisson regression, but it would be nice to be natively supported.
This would require some minor changes to learners and the outcome_variable type.
Issue:
For automatic selection of outcome variable type, the difference between categorical variables with integer coding and counts may be unclear. Probably should just default to categorical? Or simply force the user to input the outcome count.
The text was updated successfully, but these errors were encountered:
Also, being able to pass outcome types through task$next_in_chain is useful. It is somewhat annoying that ordered/count-valued outcomes get classified as categorical after doing next_in_chain. This is only an issue if you change your outcome though, so mostly minor if the main suggestion is added.
Nonnegative regression is useful in a number of contexts. For example, if one wishes to estimate the conditional variance then regressing (Y-E[Y|X])^2 on X using poisson regression makes sure the estimates are nonnegative. Also, count data is fairly common and it is beneficial to use poisson regression over linear regression in such cases.
Nice change would be:
A outcome-type called "count" and learners compatible with this outcome type should support poisson regression (e.g. HAL, glm, glmnet, xgboost, gam). By passing in family objects or other things, you can already make learners do poisson regression, but it would be nice to be natively supported.
This would require some minor changes to learners and the outcome_variable type.
Issue:
For automatic selection of outcome variable type, the difference between categorical variables with integer coding and counts may be unclear. Probably should just default to categorical? Or simply force the user to input the outcome count.
The text was updated successfully, but these errors were encountered: