New ask Hacker News story: Ask HN: What are you using for LLM response testing and benchmarking?

Ask HN: What are you using for LLM response testing and benchmarking?
3 by tin7in | 1 comments on Hacker News.
What are you using to test your LLM responses, benchmark them, maybe compare different versions? I've seen a few YC startups focusing on this but I haven't decided yet if we should build this internally or use an external tool.