Enable external_predictions for short model in benchmarks #238

Merged

Conversation

@lucien1011 commented Apr 6, 2024

Description

This pull request adds an optional input argument fit_args to the method sensitivity_benchmark in the class DoubleML. Most importantly, this addition enables the use of external_predictions when fitting the short models for sensitivity analysis.

The new argument has to be a nested dictionary, as in the following example:

# assumes df is a pandas DataFrame with outcome column 'y', treatment column 'd'
# and covariate columns cov_cols
import doubleml as dml
from sklearn.ensemble import RandomForestClassifier, RandomForestRegressor

dataset = dml.DoubleMLData(
    df,
    y_col='y',
    d_cols='d',
    x_cols=cov_cols,
    force_all_x_finite=False,
)

dml_irm = dml.DoubleMLIRM(
    dataset,
    ml_g=RandomForestRegressor(),   # dummy learners only;
    ml_m=RandomForestClassifier(),  # predictions are supplied externally
)

# some user-specific code computes the external predictions and stores them in
# the columns df['d_prop'], df['y_pred_d0'] and df['y_pred_d1']
external_predictions = dict(
    d=dict(
        ml_m=df['d_prop'].to_numpy().reshape(-1, 1),
        ml_g0=df['y_pred_d0'].to_numpy().reshape(-1, 1),
        ml_g1=df['y_pred_d1'].to_numpy().reshape(-1, 1),
    ),
)

# the model has to be fitted before benchmarking; here the same external
# predictions are reused for the full model
dml_irm.fit(external_predictions=external_predictions)

bm = dml_irm.sensitivity_benchmark(
    benchmarking_set=['covariate_to_be_tested'],
    fit_args=dict(external_predictions=external_predictions),
)

Reference to Issues or PRs

No related issues or PRs to my knowledge.

PR Checklist

Please fill out this PR checklist (see our contributing guidelines for details).

  • The title of the pull request summarizes the changes made.
  • The PR contains a detailed description of all changes and additions.
  • References to related issues or PRs are added.
  • The code passes all (unit) tests.
  • Enhancements or new features are equipped with unit tests.
  • The changes adhere to the PEP8 standards.

@SvenKlaassen
Member

Thanks @lucien1011. I really like this addition.

Maybe you can change the default value to None (to avoid mutable defaults).
And a small unit test would also be great. Maybe add a small comparison of the external predictions against fitted learners (as in https://github.com/DoubleML/doubleml-for-py/blob/main/doubleml/plm/tests/test_plr_external_predictions.py) to https://github.com/DoubleML/doubleml-for-py/blob/main/doubleml/tests/test_sensitivity.py.
Further, a unit test for exceptions on the input arguments in https://github.com/DoubleML/doubleml-for-py/blob/main/doubleml/tests/test_exceptions_ext_preds.py would be nice.
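
A minimal sketch of what such an exception test could look like (the fixture name, the benchmarking covariate 'X1' and the exact exception type are illustrative assumptions, not the test that was eventually added):

import pytest

def test_sensitivity_benchmark_fit_args_type(fitted_dml_irm):
    # fitted_dml_irm: hypothetical fixture providing an already fitted DoubleMLIRM object
    # fit_args has to be a dict; any other type should raise an exception
    with pytest.raises(TypeError):
        fitted_dml_irm.sensitivity_benchmark(
            benchmarking_set=['X1'],
            fit_args=['not', 'a', 'dict'],
        )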

@lucien1011
Author

@SvenKlaassen Thanks for the comments. I will implement those accordingly and update this PR.

@lucien1011
Author

@SvenKlaassen I have updated the PR with the following three items:

  1. Changed the default value of fit_args to None (a rough sketch of the resulting validation is shown below the list).
  2. Added a unit test for the type of fit_args (it must be a dictionary).
  3. Added a unit test comparing the values of delta_theta with and without external_predictions.
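
As an illustration of items 1 and 2, a minimal standalone sketch of the validation pattern (the helper name is hypothetical and the exact error message is an assumption; this is not the code merged in this PR):

def _validate_fit_args(fit_args=None):
    # a default of None avoids a mutable default argument;
    # an empty dict is substituted when nothing is passed
    if fit_args is None:
        return {}
    if not isinstance(fit_args, dict):
        raise TypeError(f"fit_args must be a dict. {type(fit_args).__name__} was passed.")
    return fit_args

# the validated dict can then be forwarded to the fit of the short model,
# e.g. dml_short.fit(**_validate_fit_args(fit_args))
print(_validate_fit_args())                              # {}
print(_validate_fit_args({'external_predictions': {}}))  # {'external_predictions': {}}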

@SvenKlaassen changed the base branch from main to s-ext-pred-benchmark on April 11, 2024 at 18:29
@SvenKlaassen
Member

I will check the coverage on a different branch and then merge into main.

@SvenKlaassen merged commit 3769f81 into DoubleML:s-ext-pred-benchmark on Apr 11, 2024