
Grid Sensor #4399


Merged

merged 20 commits into Unity-Technologies:master on Aug 26, 2020

Conversation

@J-Travnik (Contributor) commented Aug 20, 2020

Proposed change(s)

The Grid Sensor combines the generality of data extraction from Raycasts with the image processing power of Convolutional Neural Networks. The Grid Sensor can be used to collect data in the general form of a "Width x Height x Channel" matrix which can be used for training Reinforcement Learning agents or for data analysis.

Motivation

In ML-Agents there are two main sensors for observing information that is "physically" around the agent.

Raycasts

Raycasts provide the agent the ability to see things along prespecified lines of sight, similar to LIDAR. The kind of data they can extract is up to the developer, including things like:

  • The type of an object (enemy, npc, etc)
  • The health of a unit
  • The damage-per-second of a weapon on the ground

Raycasts are simple to implement and provide enough information for most simple games; when only a few are used, they are also computationally fast. However, there are multiple limiting factors:

  • The rays need to be at the same height as the objects the agent should observe
  • Objects can remain hidden from the rays' lines of sight, and if knowledge of those objects is crucial to the agent's success, this limitation must be compensated for by the agent's network capacity (i.e., a bigger brain with memory is needed)
  • The spatial order of the raycasts (one raycast being to the left/right of another) is thrown away at the model level and must be learned by the agent, which extends training time; using multiple raycasts exacerbates this issue
  • Typically the length of the raycasts is limited because the agent need not know about objects on the other side of the level. Combined with using few raycasts for computational efficiency, this means an agent may not observe objects that fall between the rays, and the issue worsens as the objects decrease in size
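The "objects falling between rays" limitation can be illustrated with a small geometric sketch. This is not ML-Agents code; the function name and all numbers are purely illustrative:

```python
import math

def object_detected(num_rays, fov_deg, obj_angle_deg, obj_dist, obj_radius):
    """Return True if any of `num_rays` evenly spaced rays spanning `fov_deg`
    degrees hits a circular object (all names here are illustrative)."""
    # Angular half-width the object subtends as seen from the agent.
    half_width = math.degrees(math.asin(min(1.0, obj_radius / obj_dist)))
    if num_rays == 1:
        ray_angles = [0.0]
    else:
        step = fov_deg / (num_rays - 1)
        ray_angles = [-fov_deg / 2 + i * step for i in range(num_rays)]
    # The object is seen iff some ray falls within its angular extent.
    return any(abs(a - obj_angle_deg) <= half_width for a in ray_angles)

# A small object centered between two of 5 rays over a 90-degree field of
# view goes undetected, while a larger object at the same spot is seen:
object_detected(5, 90.0, 11.25, 10.0, 0.5)  # -> False
object_detected(5, 90.0, 11.25, 10.0, 3.0)  # -> True
```

With only 5 rays over 90 degrees, the gap between adjacent rays is 22.5 degrees, so anything subtending less than that can slip through undetected, exactly as described above.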

Camera

The Camera provides the agent with either a grayscale or an RGB image of the game environment. There are non-linear relationships between nearby pixels in an image; it is this intuition that helps form the basis of Convolutional Neural Networks (CNNs) and of the established literature on designing networks that take advantage of these relationships between pixels. Following this established literature of applying CNNs to image-based data, ML-Agents' Camera Sensor provides a means by which the agent can include high-dimensional inputs (images) in its observation stream.
However, the Camera Sensor has drawbacks of its own.

  • It requires rendering the scene and is thus computationally slower than alternatives that do not use rendering
  • It has not yet been shown that the Camera Sensor can be used on a headless machine, which means it is not yet possible (if at all) to train an agent on headless infrastructure
  • If the textures of the important objects in the game are updated, the agent needs to be retrained
  • The camera's RGB image provides at most 3 channels to the agent

These limitations provided the motivation towards the development of the Grid Sensor and Grid Observations as described below.

Contribution

An image can be thought of as a matrix with a predefined width (W) and height (H), where each pixel is simply an array of length 3 (in the case of RGB), [Red, Green, Blue], holding the color (channel) intensities at that pixel location. Thus an image is just a 3-dimensional matrix of size W x H x 3. A Grid Observation can be thought of as a generalization of this setup where, in place of a pixel, there is a "cell": an array of length N representing different channel intensities at that cell position.
From a Convolutional Neural Network point of view, the introduction of multiple channels in an "image" isn't a new concept. In fact, the original inspiration for the Grid Sensor came from MinAtar, which introduced a small suite of environments analogous to the Atari Learning Environment but where the representations were 10x10xn binary state representations. What distinguishes Grid Observations is what the data within the channels represents. Instead of limiting the channels to color intensities, the channels within a cell of a Grid Observation generalize to any data that can be represented by a single number (float or int), such as the type of object within a cell or the value of a certain property.
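The data layout described above can be sketched as follows. The actual Grid Sensor is implemented in C#; the dimensions, channel names, and values here are purely illustrative:

```python
import numpy as np

# Hypothetical grid dimensions and channel semantics for a Grid Observation.
WIDTH, HEIGHT = 10, 10
CHANNELS = ["enemy", "npc", "weapon_dps"]  # illustrative channel names

# A Grid Observation is a Width x Height x Channel matrix of floats.
grid = np.zeros((WIDTH, HEIGHT, len(CHANNELS)), dtype=np.float32)

# A one-hot "enemy present" flag in cell (3, 4):
grid[3, 4, CHANNELS.index("enemy")] = 1.0
# A continuous property (a weapon's damage-per-second) in cell (7, 2):
grid[7, 2, CHANNELS.index("weapon_dps")] = 12.5

grid.shape  # -> (10, 10, 3)
```

Unlike an RGB image, nothing restricts a cell's channels to three values or to color intensities; any per-cell property expressible as a number can occupy a channel.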

Additionally, this PR modifies the rpc_utils.py script to accept multiple PNGs, as was demonstrated in a Unity hack week.

See docs/Grid-Sensor.md for further documentation.

Types of change(s)

  • New feature
  • Code refactor
  • Documentation update

Checklist

Other comments

The Grid Sensor was developed in a collaboration between Eidos Montreal and Matsuko.

Developers

  • Jaden Travnik
  • Charles Pearson
  • Martin Certicky
  • Erik Gajdos
  • Romain Trachel
  • Alexandre Peyrot

@CLAassistant commented Aug 20, 2020

CLA assistant check
All committers have signed the CLA.

@chriselion chriselion self-assigned this Aug 20, 2020
GridSensorDummyData dummyData;

// Use built-in tags
const string k_Tag1 = "Player";
Contributor:

Unfortunately, the tags are specific to the project. So previously these tests were passing in our existing one but not a clean one. These tags are "built in" so they'll be present wherever the test is run.

* Initial version
Contributor:

TODO move to main changelog

Contributor:

Will do after merging PR, to avoid conflicts

@chriselion (Contributor):

Note: cancelled the CircleCI tests since their equivalents are running on github actions.

@chriselion (Contributor):

Will merge as soon as yamato tests pass on #4409

@chriselion left a comment

Looks great! I'll do a bit of additional cleanup after this is merged.

Thanks so much for contributing this!

@chriselion chriselion merged commit 4cb9168 into Unity-Technologies:master Aug 26, 2020
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 27, 2021