Description
Problem
Running PFI takes time. Users can set permutation count and number of examples to use to adjust accuracy vs. speed. The model, number of features and examples affect the time PFI takes to complete from less than a hour to multiple days. A typical case would be that developer has an allowed timeslot for running the experiment, such as 1 hour, 1 day, 1 week... At the moment, estimating the running time seems challenging.
Are there some rules of thumb or some ways to estimate what should be the value for permutation count and number of examples to use to get good results? On other hand, are there some ways to estimate or make quick experiments to see how long it will take to complete? Or, are there ways to see the progress of PFI during the run?
Potential solutions
- rules of thumb on how to set number of permutations and number of examples
- guidelines on how to estimate how increasing number of examples and permutations affect running time
- PMI should report the progress