Your human and LLM judges should follow the same criteria.
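One way to enforce this is to define the rubric once and render it both as the annotator guideline and as the LLM judge's prompt. A minimal sketch; the criteria, field names, and helpers here are hypothetical, not from any specific library:

```python
from dataclasses import dataclass

@dataclass
class Criterion:
    name: str
    description: str

# Hypothetical rubric -- the single source of truth for both judges.
RUBRIC = [
    Criterion("faithfulness", "The answer only states facts supported by the source."),
    Criterion("completeness", "The answer addresses every part of the question."),
]

def annotator_guideline(rubric: list[Criterion]) -> str:
    """Render the rubric as instructions for human annotators."""
    return "\n".join(f"- {c.name}: {c.description}" for c in rubric)

def judge_prompt(rubric: list[Criterion], answer: str) -> str:
    """Render the *same* rubric as the LLM judge's prompt."""
    return (
        "Score the answer as pass/fail on each criterion:\n"
        + annotator_guideline(rubric)
        + f"\n\nAnswer:\n{answer}"
    )
```

Because both judges read from the same rubric object, a criteria change propagates to humans and the LLM at once, which is what makes their agreement meaningful to measure.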
Then, once you have sufficient inter-annotator agreement between the LLM and your human annotators, you can transition from manual to automated evaluation. You now iterate faster, and the human annotator can focus on finding edge cases!
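A quick way to check that agreement before flipping the switch: score the same examples with both judges and compute Cohen's kappa. A sketch assuming both judges emit binary pass/fail labels; the labels and the 0.8 threshold are illustrative, while `cohen_kappa_score` is a real scikit-learn function:

```python
from sklearn.metrics import cohen_kappa_score

# Pass/fail labels from each judge on the same set of examples (toy data).
human_labels = [1, 1, 0, 1, 0, 0, 1, 1, 0, 1]
llm_labels   = [1, 1, 0, 1, 0, 1, 1, 1, 0, 1]

# Cohen's kappa corrects for agreement that would happen by chance;
# values above ~0.8 are commonly read as near-perfect agreement.
kappa = cohen_kappa_score(human_labels, llm_labels)
print(f"Cohen's kappa: {kappa:.2f}")

# One possible gate: only hand evaluation off to the LLM judge
# once chance-corrected agreement clears the threshold.
if kappa >= 0.8:
    print("Agreement is high -- safe to lean on the LLM judge.")
```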
Comments
We're building LLM and human "scorers" at @weightsbiases.bsky.social with the same data model for exactly this reason.