Problems with SIMBA and using a verbal feedback for the metric #8278

Open
meditans opened this issue May 26, 2025 · 0 comments

Hi,
I was thinking that, in order to improve the prompt when a metric gives a low score, it would be useful for the optimizer to have access to a verbalization of the reason for the low score. I see that as a step towards making the DSPy program more modular: the explanation of each requirement could be colocated with its metric, so the requirements wouldn't need to be bundled into the initial prompt, and the optimization process could fold them into the prompt in whatever way it sees fit.

Looking through the issues I found #7938, in which @okhat and @ratzrattillo arrived at a very similar conclusion and the new SIMBA optimizer was suggested. Here is the example code that was posted there:

program = MyFancyDSPyModule()

def my_composite_metric(example, prediction, trace=None):
    score1 = metric1(example, prediction, trace)
    score2 = metric2(example, prediction, trace)
    score = (score1 + score2) / 2.0
    feedback = f"You scored {score1}/1.0 and {score2}/1.0 on metric1 and metric2, respectively"
    return dspy.Prediction(score=score, feedback=feedback)

optimized_program = dspy.SIMBA(metric=my_composite_metric).compile(program, trainset=[dspy.Example(...), ...])
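
To make the direction I'm describing more concrete, here is a minimal sketch of a metric whose requirement explanation lives next to the metric rather than in the prompt. Everything in it (the requirement text, the cited_answer_metric name, the toy citation check) is hypothetical; only the convention of returning dspy.Prediction(score=..., feedback=...) is taken from the snippet above:

import dspy

# Hypothetical requirement, colocated with the metric instead of being
# bundled into the initial prompt.
REQUIREMENT = "The answer must cite at least one source document in [brackets]."

def cited_answer_metric(example, prediction, trace=None):
    # Score the prediction and, on a low score, verbalize *why* it failed,
    # so the optimizer can fold the requirement into the prompt as it sees fit.
    has_citation = "[" in prediction.answer and "]" in prediction.answer  # toy check
    score = 1.0 if has_citation else 0.0
    feedback = (
        "Good: the answer contains a citation."
        if has_citation
        else f"Low score because this requirement was violated: {REQUIREMENT}"
    )
    return dspy.Prediction(score=score, feedback=feedback)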

What follows are a few problems I encountered when trying this idea, along with a couple of questions. Keep in mind that I tried to use this optimizer in a 0-shot setting: I was only interested in prompt optimization, since the task is quite token-intensive and I didn't want few-shot examples.

  1. Can SIMBA be used for prompt-only (0-shot) learning? Can it also be used to generate synthetic examples for few-shot learning? (See the sketch after this list for what I mean by the former.)
  2. I used the suggested parameters, but in the end the prompt was not optimized (the final program contained the same prompt I started with).
  3. I'm using MLflow to track the optimization process, but nothing was logged for that experiment (things do get logged when I use MIPROv2). This might be related to #8150, "Unable to access optimized prompt on SIMBA optimizer, train logs don't show any loss after specific steps and no initial prompt on lm.inspect_history".
  4. You suggest returning a Prediction with a feedback field, but I can't figure out where that feedback is actually used in the SIMBA optimizer code.
  5. I would like to better understand how this paradigm of bundling feedback with metrics (which I think is the right direction) overlaps with the mechanism of DSPy assertions. Or is it that assertions are test-time constructs, while richer metrics are compile-time constructs?
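
Regarding question 1, here is the kind of invocation I have in mind for a prompt-only run. This is only a sketch based on the constructor parameters I could see: the max_steps and max_demos arguments, and the idea that max_demos=0 skips few-shot demo construction, are my assumptions and not something confirmed by the documentation:

import dspy

# Assumed prompt-only (0-shot) configuration: max_demos=0 is my guess at how
# to disable the demo-appending strategy so that only instructions get optimized.
simba = dspy.SIMBA(
    metric=my_composite_metric,
    max_steps=8,    # number of optimization steps (assumed default)
    max_demos=0,    # assumption: 0 skips few-shot demo construction
)
optimized_program = simba.compile(program, trainset=[dspy.Example(...), ...])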

Thanks!
