n and p parametrization on Zero Inflated Negative Binomial #5212

farhanreynaldo · 2021-11-20T11:35:35Z

This PR adds an alternative parametrization (n and p) to Zero Inflated Negative Binomial, based on issue #5196.

ricardoV94

Just did a quick skim. Looks great so far

ricardoV94 · 2021-11-20T11:40:33Z

pymc/tests/test_distributions_random.py

@@ -1502,6 +1502,40 @@ def seeded_zero_inflated_negbinomial_rng_fn(self):
    ]


+class TestZeroInflatedNegativeBinomial(BaseTestDistribution):


This test can be much smaller. Check the other tests for alternative parametrizations in this module, such as NormalTau.

After a quick glance at NormalTau, it seems we only need to test check_pymc_params_match_rv_op?

codecov · 2021-11-20T11:57:21Z

Codecov Report

Merging #5212 (3038835) into main (a5b13d4) will increase coverage by 0.96%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main    #5212      +/-   ##
==========================================
+ Coverage   77.98%   78.94%   +0.96%     
==========================================
  Files          88       88              
  Lines       14215    14248      +33     
==========================================
+ Hits        11085    11248     +163     
+ Misses       3130     3000     -130

Impacted Files	Coverage Δ
pymc/distributions/discrete.py	`98.35% <100.00%> (+<0.01%)`	⬆️
pymc/step_methods/hmc/integration.py	`78.84% <0.00%> (-3.85%)`	⬇️
pymc/parallel_sampling.py	`86.33% <0.00%> (-1.00%)`	⬇️
pymc/step_methods/hmc/nuts.py	`95.00% <0.00%> (-0.63%)`	⬇️
pymc/tests/sampler_fixtures.py	`90.76% <0.00%> (-0.48%)`	⬇️
pymc/distributions/multivariate.py	`71.34% <0.00%> (-0.38%)`	⬇️
pymc/gp/cov.py	`98.07% <0.00%> (ø)`
pymc/sampling_jax.py	`0.00% <0.00%> (ø)`
pymc/bart/bart.py	`95.91% <0.00%> (+0.46%)`	⬆️
pymc/math.py	`68.68% <0.00%> (+0.50%)`	⬆️
... and 3 more

ricardoV94 · 2021-11-21T07:18:24Z

Some of the tests seem to be failing.

farhanreynaldo · 2021-11-21T08:26:54Z

Some of the tests seem to be failing.

This line is failing:

self.check_logcdf(
    ZeroInflatedNegativeBinomial,
    Nat,
    {"psi": Unit, "p": Unit, "n": NatSmall},
    lambda value, psi, p, n: np.log(
        (1 - psi) + psi * sp.nbinom.cdf(value, n, p)
    ),
)

Did I misspecify the parameter domains, {"psi": Unit, "p": Unit, "n": NatSmall}?
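For reference, the expected value that the failing check compares against can be reproduced without SciPy. The sketch below is only an illustration, not PyMC's implementation: `nbinom_cdf` and `zinb_logcdf_reference` are hypothetical helper names, and the negative binomial CDF is computed by summing the pmf directly, assuming n > 0, 0 < p < 1, and a non-negative integer value.

```python
import math


def nbinom_cdf(value, n, p):
    """CDF of a negative binomial with n successes and success probability p.

    Illustrative helper (not part of PyMC or SciPy); assumes n > 0,
    0 < p < 1 and a non-negative integer value.
    """
    total = 0.0
    for k in range(int(value) + 1):
        # log pmf: log C(k + n - 1, k) + n * log(p) + k * log(1 - p)
        log_pmf = (
            math.lgamma(k + n)
            - math.lgamma(k + 1)
            - math.lgamma(n)
            + n * math.log(p)
            + k * math.log1p(-p)
        )
        total += math.exp(log_pmf)
    return total


def zinb_logcdf_reference(value, psi, p, n):
    """Same mixture as the lambda in the test: psi weights the negative binomial part."""
    return math.log((1 - psi) + psi * nbinom_cdf(value, n, p))
```

For example, `zinb_logcdf_reference(0, psi=0.5, p=0.5, n=2)` evaluates `log(0.5 + 0.5 * 0.25)`, since the negative binomial puts mass `p ** n = 0.25` on zero.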

ricardoV94 · 2021-11-21T08:31:10Z

From the trace, it seems the distribution is not rejecting an invalid negative n parameter: https://github.com/pymc-devs/pymc/runs/4276276491?check_suite_focus=true#step:8:429

ricardoV94 · 2021-11-21T08:32:59Z

Which is strange because it passes with the mu/alpha parametrization

ricardoV94 · 2021-11-21T08:42:39Z

Seems like we have a small bug in the logcdf. We have to add a 0 < n bound condition to the logcdf here:

pymc/pymc/distributions/discrete.py

Line 1710 in dc92865

0 < p,

The case n <= 0 was not being tested independently before apparently. Perhaps because it led to a negative p (when converting from mu) which was enough to activate the bound conditions.
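The masking effect described above can be sketched in plain Python. This is an illustration only, not the PyMC code: `mu_alpha_to_n_p`, `old_bounds_ok`, and `new_bounds_ok` are hypothetical names, the conversion n = alpha, p = alpha / (mu + alpha) is the usual negative binomial reparametrization, and the upper bound p <= 1 is assumed here alongside the `0 < p` condition quoted above.

```python
def mu_alpha_to_n_p(mu, alpha):
    """Convert (mu, alpha) to (n, p); assumes mu + alpha != 0."""
    return alpha, alpha / (mu + alpha)


def old_bounds_ok(n, p):
    # Only p is checked (the upper bound p <= 1 is an assumption of this sketch).
    return 0 < p <= 1


def new_bounds_ok(n, p):
    # Same check plus the explicit 0 < n condition.
    return 0 < n and 0 < p <= 1


# mu/alpha route: a negative alpha (i.e. a negative n) pushes p out of range,
# so the p check alone already rejects it.
n, p = mu_alpha_to_n_p(mu=2.0, alpha=-1.0)
masked = old_bounds_ok(n, p)  # False, because p == -1.0

# n/p route: p is supplied directly and can be valid while n is not.
leaked = old_bounds_ok(-1, 0.5)  # True: the invalid n slips through
fixed = new_bounds_ok(-1, 0.5)  # False: rejected by 0 < n
```

With a positive mu, a negative alpha gives either a negative p or a p above 1, which is why the mu/alpha tests never exercised a missing n bound on their own.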

farhanreynaldo · 2021-11-21T08:50:56Z

Seems like we have a small bug in the logcdf. We have to add a 0 < n bound condition to the logcdf here:

pymc/pymc/distributions/discrete.py

Line 1710 in dc92865

0 < p,

The case n<0 was not being tested independently before apparently. Perhaps because it led to a negative p (when converting from mu) which was enough to activate the bound conditions.

I am wondering: in order to test the n and p parametrization, does the logcdf bound (i.e. the n < 0 case) have to be written explicitly? Why can't we rely on the implicit case where a negative n would yield a negative p, which activates the 0 < p bound?

ricardoV94 · 2021-11-21T09:59:04Z

I am wondering: in order to test the n and p parametrization, does the logcdf bound (i.e. the n < 0 case) have to be written explicitly? Why can't we rely on the implicit case where a negative n would yield a negative p, which activates the 0 < p bound?

We are not computing p from n because the new parametrization is already given in terms of p.

ricardoV94 · 2021-11-21T10:43:56Z

Thanks @farhanreynaldo!

farhanreynaldo · 2021-11-21T10:56:03Z

I am wondering: in order to test the n and p parametrization, does the logcdf bound (i.e. the n < 0 case) have to be written explicitly? Why can't we rely on the implicit case where a negative n would yield a negative p, which activates the 0 < p bound?

We are not computing p from n because the new parametrization is already given in terms of p.

Ah right.

farhanreynaldo added 3 commits November 19, 2021 17:25

add n and p parametrization docstring and code

32b7c3b

add test for n and p parametrization

ef7910e

add mu sigma prefix to previous test name

9de255f

ricardoV94 requested changes Nov 20, 2021

make test smaller

dda7ea7

ricardoV94 approved these changes Nov 21, 2021

add n bound to logcdf

3038835

ricardoV94 merged commit 988f481 into pymc-devs:main Nov 21, 2021

farhanreynaldo deleted the np-parametrization branch November 21, 2021 12:01
