You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Summary:
Pull Request resolved: #4173
This project adds an internal implementation of https://github.com/microsoft/ArchProbe.
This stack introduces a kernel that can be used to get the number of available registers on a mobile GPU by gradually increasing the number of accessed elements and detecting dramatic drops in performance. See [this paper](https://www.microsoft.com/en-us/research/uploads/prod/2022/02/mobigpu_mobicom22_camera.pdf), page 4, for more information.
This diff finds the number of concurrency groups available for full registry usage, and half registry usage.
On a Samsung Galaxy S22, the latency graphs for full and half registry usage look like this:
Full:
{F1750677545}
Half:
{F1750679467}
Differential Revision: D59497314
Reviewed By: SS-JIA
0 commit comments