Sumtree sampling #60

CasBex · 2023-09-08T15:30:17Z

For all the lofty talk about numerical rounding errors in #59, they are unavoidable even with the improved method. This fix simply checks whether the sampled priority happens to be zero, and if so it walks backwards over the leaves until it finds a node with nonzero priority. If the backward walk finds nothing, it performs a forward walk instead.

This has been tested against the JuliaRL_PrioritizedDQN_CartPole experiment in ReinforcementLearningExperiments.jl with 30 different seeds.
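The fallback described in this comment can be illustrated with a small sketch. It is written in Python for brevity and is not the code from `src/common/sum_tree.jl`; the class `SumTree` and the methods `find`, `nearest_nonzero`, and `sample` are hypothetical names chosen for this example, and the array layout (leaves padded to a power of two) is an assumption.

```python
import random


class SumTree:
    """Illustrative sum tree (hypothetical, not the package's implementation)."""

    def __init__(self, priorities):
        self.n = len(priorities)
        # Pad the leaf level to a power of two so the descent is a plain binary walk.
        self.size = 1
        while self.size < self.n:
            self.size *= 2
        self.tree = [0.0] * (2 * self.size)
        for i, p in enumerate(priorities):
            self.tree[self.size + i] = float(p)
        # Each internal node stores the sum of its two children.
        for i in range(self.size - 1, 0, -1):
            self.tree[i] = self.tree[2 * i] + self.tree[2 * i + 1]

    def total(self):
        return self.tree[1]

    def priority(self, leaf):
        return self.tree[self.size + leaf]

    def find(self, value):
        """Descend from the root to the leaf whose cumulative range holds value."""
        i = 1
        while i < self.size:
            left = 2 * i
            if value <= self.tree[left]:
                i = left
            else:
                value -= self.tree[left]
                i = left + 1
        return i - self.size

    def nearest_nonzero(self, leaf):
        """Walk backwards over the leaves, then forwards, to a nonzero priority."""
        for j in range(leaf - 1, -1, -1):
            if self.priority(j) > 0:
                return j
        for j in range(leaf + 1, self.n):
            if self.priority(j) > 0:
                return j
        raise ValueError("all priorities are zero")

    def sample(self, rng):
        leaf = self.find(rng.random() * self.total())
        # Boundary values or rounding can land on a zero-priority leaf.
        if self.priority(leaf) > 0:
            return leaf
        return self.nearest_nonzero(leaf)


tree = SumTree([0.0, 1.0, 2.0, 0.0])
print(tree.find(0.0), tree.nearest_nonzero(0))  # boundary hit, then fallback
```

In this sketch a draw of exactly `0.0` descends to the first leaf, whose priority is zero, so the backward walk finds nothing and the forward walk returns the next nonzero leaf. A value that overshoots the total by a rounding error lands on the last leaf, and the backward walk recovers.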

jeremiahpslewis · 2023-09-09T13:46:45Z

Looks good! Can you add a test to this?

CasBex · 2023-09-11T08:40:59Z

Tests have been added. Feel free to change the tolerances or the number of iterations in case they take too long. The first test checks that a priority of zero is never sampled; the second checks that the pdf of the samples is what we would expect. The latter, however, requires many samples, so I've added some multithreading to speed it up. Both tests are run with 100 different seeds for the rng.
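The shape of the two checks described in this comment can be sketched as follows. This is a Python illustration under stated assumptions, not the contents of `test/sum_tree.jl`: `sample_index` is a hypothetical stand-in sampler based on cumulative sums, `check_sampler` is a hypothetical helper, and the sample count, tolerance, and number of seeds are arbitrary choices for the example (the PR itself uses 100 seeds and multithreading).

```python
import bisect
import itertools
import random


def sample_index(priorities, rng):
    """Stand-in sampler (hypothetical): draw i with probability p_i / sum(p)."""
    cumulative = list(itertools.accumulate(priorities))
    u = rng.random() * cumulative[-1]
    # bisect_right skips zero-width intervals, so zero priorities are not
    # returned for u in [0, total). A production sampler would also need to
    # guard against u rounding up to the total.
    return bisect.bisect_right(cumulative, u)


def check_sampler(priorities, n_samples, tolerance, seeds):
    """Run both checks for every seed; raises AssertionError on failure."""
    total = sum(priorities)
    for seed in seeds:
        rng = random.Random(seed)
        counts = [0] * len(priorities)
        for _ in range(n_samples):
            counts[sample_index(priorities, rng)] += 1
        for p, c in zip(priorities, counts):
            # Check 1: an entry with priority zero is never sampled.
            if p == 0:
                assert c == 0, "zero-priority entry was sampled"
            # Check 2: the empirical frequency matches p / total.
            assert abs(c / n_samples - p / total) <= tolerance
    return True


check_sampler([0.0, 1.0, 0.0, 3.0], n_samples=20_000, tolerance=0.02, seeds=range(10))
```

The tolerance trades run time against sensitivity: a tighter tolerance needs more samples per seed before the frequency check passes reliably.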

CasBex · 2023-09-11T09:00:24Z

Sorry for the confusion with the tests. Should be good to go now.

jeremiahpslewis

Thanks for adding the test! One more question and a couple of minor details.

src/common/sum_tree.jl

test/sum_tree.jl

codecov · 2023-09-12T07:24:55Z

Codecov Report

Merging #60 (3f3e99f) into main (85de617) will increase coverage by 0.32%.
The diff coverage is 53.33%.

@@            Coverage Diff             @@
##             main      #60      +/-   ##
==========================================
+ Coverage   73.21%   73.54%   +0.32%     
==========================================
  Files          15       15              
  Lines         743      756      +13     
==========================================
+ Hits          544      556      +12     
- Misses        199      200       +1

| Files Changed | Coverage Δ |
|---|---|
| src/common/sum_tree.jl | `81.60% <53.33%> (+1.87%)` ⬆️ |

findmyway

LGTM

Changes were included.

HenriDeh · 2023-09-13T13:04:03Z

@CasBex I'll let you merge in case you want to make a last minute change.

CasBex · 2023-09-13T13:08:08Z

I don't have permissions to merge @HenriDeh. Could you merge?

CasBex added 2 commits September 8, 2023 16:55

Fix numerical rounding errors

72e3791

correct zero-priority outcomes

e74c09b

CasBex mentioned this pull request Sep 8, 2023

Prioritized DQN experiment nonfunctional JuliaReinforcementLearning/ReinforcementLearning.jl#971

Closed

CasBex mentioned this pull request Sep 11, 2023

SumTree sampling errors #59

Closed

SumTree tests

3cf057f

CasBex added 3 commits September 11, 2023 10:42

fixup! SumTree tests

c40390a

fixup! fixup! SumTree tests

b6f7e6d

Remove extra call

fbf0c89

jeremiahpslewis previously requested changes Sep 11, 2023

CasBex added 3 commits September 12, 2023 09:15

Docstring and function rename

35ac46f

Remove float literals

25ee56c

Enclose tests in testset

3f3e99f

findmyway approved these changes Sep 13, 2023

HenriDeh approved these changes Sep 13, 2023

jeremiahpslewis merged commit c89ed6f into JuliaReinforcementLearning:main Sep 13, 2023
