
Add RMSEs and targets #182

Merged
14 commits merged on Jan 25, 2023

Conversation

@SvenKlaassen (Member) commented Jan 6, 2023:

Description

Add RMSE evaluations for each nuisance component to the models. The RMSEs can be accessed through .rmses and are added to the summary().
Further, the targets for each component can be accessed through .nuisance_targets, which returns a dictionary containing the nuisance targets for each nuisance component (as an array for each repetition and each coefficient).
The new method evaluate_learners for DoubleML objects allows evaluating the nuisance learners with a callable metric based on vector-shaped (shape (1, n)) inputs y_pred and y_true (see, e.g., scikit-learn).
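For concreteness, a minimal usage sketch of the new diagnostics (the data setup mirrors the IRM example quoted in the review below; accessing .rmses and .nuisance_targets as attributes is an assumption based on the description above, and the learner settings are illustrative):

import numpy as np
import doubleml as dml
from doubleml.datasets import make_irm_data
from sklearn.ensemble import RandomForestRegressor, RandomForestClassifier
from sklearn.metrics import mean_absolute_error

np.random.seed(3141)
ml_g = RandomForestRegressor(n_estimators=100, max_depth=5, min_samples_leaf=2)
ml_m = RandomForestClassifier(n_estimators=100, max_depth=5, min_samples_leaf=2)
data = make_irm_data(theta=0.5, n_obs=500, dim_x=20, return_type='DataFrame')
obj_dml_data = dml.DoubleMLData(data, 'y', 'd')
dml_irm_obj = dml.DoubleMLIRM(obj_dml_data, ml_g, ml_m)
dml_irm_obj.fit()

# RMSE of each nuisance component (also reported in the summary)
print(dml_irm_obj.rmses)
# dictionary of nuisance targets, one array per repetition and coefficient
print(dml_irm_obj.nuisance_targets)
# evaluate the nuisance learners with a user-supplied callable metric
print(dml_irm_obj.evaluate_learners(metric=mean_absolute_error))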

Notes

Fixes a bug so that models and predictions are saved correctly for the PLIV model (#184).

PR Checklist

Please fill out this PR checklist (see our contributing guidelines for details).

  • The title of the pull request summarizes the changes made.
  • The PR contains a detailed description of all changes and additions.
  • References to related issues or PRs are added.
  • The code passes all (unit) tests.
  • Enhancements or new features are equipped with unit tests.
  • The changes adhere to the PEP8 standard.

@@ -483,8 +507,10 @@ def fit(self, n_jobs_cv=None, store_predictions=False, store_models=False):

self._set_score_elements(score_elements, self._i_rep, self._i_treat)

# calculate rmses and store predictions and targets of the nuisance models
self._calc_rmses(preds['predictions'], preds['targets'])
@PhilippBach (Member) commented:
Hi @SvenKlaassen,

thanks for this PR! I think adding some diagnostics is really nice.

I think it should be possible to make this a bit more general by using (maybe only a subset of) sklearn's metrics either by letting users pass a callable for evaluation of the nuisance predictions or by directly supporting the measures. For example, in case $D$ or $Y$ are binary, classification error or cross-entropy loss might be more relevant than the RMSE. What do you think about this?
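For illustration, a user-supplied callable along these lines could look as follows (a sketch only; the (1, n) input shape and the y_true/y_pred argument names follow the PR description, and the 0.5 probability threshold for a binary nuisance part is an arbitrary choice):

import numpy as np

# sketch of a custom metric: misclassification rate for a binary nuisance part,
# written for the vector-shaped (1, n) y_true / y_pred arrays described above
def classification_error(y_true, y_pred):
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()
    return np.mean(y_true != (y_pred > 0.5))

# hypothetical usage after fitting, analogous to the docstring example below:
# dml_irm_obj.evaluate_learners(metric=classification_error)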

Member commented:
I guess that sklearn's measures have built in some methods to handle exceptions that might occur ...

@SvenKlaassen (Member, Author) commented Jan 6, 2023:
Thanks @PhilippBach, that would be helpful. I think I can add this, but keeping RMSE as the default would be useful, since one would otherwise have to specify a different metric for each learner, and RMSE is still useful for classification.
Another option would be a separate method that evaluates the nuisance functions with a metric, while keeping RMSE as the default for the summary.

@SvenKlaassen marked this pull request as ready for review on January 20, 2023, 11:23
@@ -434,7 +456,7 @@ def __psi_deriv(self):
def __all_se(self):
return self._all_se[self._i_treat, self._i_rep]

def fit(self, n_jobs_cv=None, store_predictions=False, store_models=False):
def fit(self, n_jobs_cv=None, store_predictions=True, store_models=False):
Member commented:
Thanks @SvenKlaassen - I think it's a good idea to set the default for store_predictions to True 👍

>>> obj_dml_data = dml.DoubleMLData(data, 'y', 'd')
>>> dml_irm_obj = dml.DoubleMLIRM(obj_dml_data, ml_g, ml_m)
>>> dml_irm_obj.fit()
>>> dml_irm_obj.evaluate_learners(metric=mean_absolute_error)
Member commented:
I'm afraid we run into a small problem here if we use callables intended for classification, for example if we run instead:

import numpy as np
import doubleml as dml
from sklearn.metrics import log_loss
from doubleml.datasets import make_irm_data
from sklearn.ensemble import RandomForestRegressor, RandomForestClassifier
np.random.seed(3141)
ml_g = RandomForestRegressor(n_estimators=100, max_features=20, max_depth=5, min_samples_leaf=2)
ml_m = RandomForestClassifier(n_estimators=100, max_features=20, max_depth=5, min_samples_leaf=2)
data = make_irm_data(theta=0.5, n_obs=500, dim_x=20, return_type='DataFrame')
obj_dml_data = dml.DoubleMLData(data, 'y', 'd')
dml_irm_obj = dml.DoubleMLIRM(obj_dml_data, ml_g, ml_m)
dml_irm_obj.fit()
dml_irm_obj.evaluate_learners(metric=log_loss)

Member commented:
I think we'd have to check whether a learner is a regression or a classification learner and then (optionally) pass two callables, one for the regression tasks and one for the classification tasks. Alternatively, one could pass a keyword referring to the learner_name, but that would probably lead to a messy interface.

I think the default can still be RMSE (or another regression measure) for all nuisance parts, but the option to use a classification measure is probably reasonable. What do you think?
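A rough sketch of such a dispatch, written outside the package (it assumes .learner, .predictions, and .nuisance_targets are dictionaries keyed by the nuisance part, in line with the PR description; the helper name and the metric defaults are illustrative):

import numpy as np
from sklearn.base import is_classifier
from sklearn.metrics import mean_squared_error, log_loss

# hypothetical helper (not part of this PR): choose a metric per nuisance part
# depending on whether the fitted learner is a classifier or a regressor
def evaluate_by_learner_type(dml_obj, reg_metric=mean_squared_error, clf_metric=log_loss):
    results = {}
    for name, learner in dml_obj.learner.items():
        metric = clf_metric if is_classifier(learner) else reg_metric
        y_pred = np.asarray(dml_obj.predictions[name]).ravel()
        y_true = np.asarray(dml_obj.nuisance_targets[name]).ravel()
        mask = np.isfinite(y_true) & np.isfinite(y_pred)  # guard against missing entries
        results[name] = metric(y_true[mask], y_pred[mask])
    return results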

train_preds = list()
train_targets = list()
Member commented:
As far as I can see, we're not using the train_targets right now, right? I guess that's alright, and we can leave an in-sample (fold-wise cross-validated) vs. out-of-sample (cross-fitted) evaluation for later. I'm wondering how we can implement this in a clever way... 🤔

fix exception message

# check metric
if not callable(metric):
raise TypeError('metric should be either a callable. '
Member commented:
I fixed the message in 10b180e

@PhilippBach (Member) commented:
We should also demonstrate the use of the new feature(s) in a short example, see DoubleML/doubleml-docs#114

Successfully merging this pull request may close these issues.

[Bug]: KeyError in DoubleMLPLIV.fit() with multiple instruments and store_predictions=True