Prompt Performance
Purpose:
Detailed Performance Metrics: Analyze prompt effectiveness using bar charts that display overall cost per LLM interaction, evaluation scores, and the fraction of low-scoring records.
Comparison and Optimization: Compare the performance of different prompts within the same prompt family to determine which ones yield the best results.
Actionable Insights: Use these visual insights to guide A/B testing and prompt optimization efforts, ensuring that you consistently achieve high accuracy.
Example: In an e-commerce scenario, you might use Prompt Performance to evaluate how well your product description summarization prompts are doing. If one prompt variant shows a higher cost per interaction and a lower accuracy score compared to its siblings, you can decide to phase it out in favor of a better-performing alternative, ultimately increasing overall response accuracy.
Last updated