Machine translation metrics #1891

mahmoudaymo · 2024-12-05T18:43:13Z

Machine translation metrics such as Bleu or PED are usually computed at corpus level meaning that to evaluate the program we need to feed it all the Examples or Devset instead of one example at a time. How can we use these metrics in DSPY

chenmoneygithub · 2024-12-11T20:53:49Z

I don't fully understand the issue, computing BLEU in DSPy shouldn't be different from other cases. You can define any custom metric that takes in the train/dev/test data, and the output from DSPy program: https://dspy.ai/cheatsheet/#dspy-metrics

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Machine translation metrics #1891

Machine translation metrics #1891

mahmoudaymo commented Dec 5, 2024

chenmoneygithub commented Dec 11, 2024

Machine translation metrics #1891

Machine translation metrics #1891

Comments

mahmoudaymo commented Dec 5, 2024

chenmoneygithub commented Dec 11, 2024