When Generic Benchmarks Fail: Building a Sales-Domain Evaluation Bench from Scratch

Dev.to