Updating MIPROv2 Auto Settings #8165
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR updates the MIPROv2 auto settings based on intuition we've formed on ideal run settings from past experiments.
The new auto settings are calculated by pinning the number of candidates we want to explore for each model in
light
,medium
,heavy
, modes ton=6
,n=12
,n=18
respectively. To calculate the number of trials needed for proper exploration of each candidate, we use the following formula:trials = max(c*M*log(n), c=2, 3/2*n)
, where M is the number of modules in a program. In non zero-shot settings, when both few-shot examples and instructions are being optimized, we set the number of candidate few-shot examples ton
, and the number of candidate instructions ton/2
.The impacts of this change will generally be to ensure that
light
mode explores enough candidates to make an impact on performance, and to generally decrease the # of candidates explored inheavy
mode, which we found was often unnecessarily high before.