Skip to content

Commit 3d155d8

Browse files
annaluo676yijiezh
authored andcommitted
Add three ORL examples (#934)
* Initial commit for ORL examples * Address comments on bin packing nb; Add news vendor * Finish three notebooks; Add README * Delete duplicated noteboks; Clean up cells * Fix typo & address comments
1 parent ca82a01 commit 3d155d8

20 files changed

+2755
-0
lines changed
Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,24 @@
1+
# Resource Allocation with Amazon SageMaker RL
2+
3+
Resource allocation problems in Operations Research and Computer Science often manifest as online and stochastic decision making tasks. In this domain, we consider three canonical problems and demonstrate how to solve them using RL.
4+
5+
- Bin packing. Given items of different sizes, one needs to pack them into as few bins as possible.
6+
- News vendor. In inventory management, one needs to decide on an ordering decision (how much of an item to purchase from a supplier) to cover a single period of uncertain demand.
7+
- Vehicle routing problem. Given one or more vehicles and a set of locations, one needs to find the route that reaches all locations with minimal operational cost.
8+
9+
## Contents
10+
11+
* `rl_bin_packing_ray_custom.ipynb`: notebook used for training bin packing policy.
12+
* `rl_news_vendor_ray_custom.ipynb`: notebook used for training news vendor policy.
13+
* `rl_vehicle_routing_problem_ray_custom.ipynb`: notebook used for training vehicle routing problem policy.
14+
* `src/`
15+
* `bin_packing_env.py`: customer environment for bin packing problem.
16+
* `news_vendor_env.py`: customer environment for news vendor problem.
17+
* `vrp_env.py`: customer environment for vehicle routing problem.
18+
* `vrp_view_2D.py`: visualizer for the vehicle routing problem.
19+
* `train-bin_packing.py`: PPO config for training bin packing problem.
20+
* `train-news_vendor.py`: PPO config for training news vendor problem.
21+
* `train-vehicle_routing_problem.py`: APEX-DQN config for training vehicle routing problem.
22+
* `model.py`: custom model used in Ray to mask invalid actions.
23+
* `utils.py`: helper function for vehicle routing problem.
24+
* `images/`: images used in the notebooks.
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
../common
Loading
Loading
Loading
Loading

0 commit comments

Comments
 (0)