Research

Factuality Benchmarks

1 articles in archive

Introducing SimpleQA

A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.

OpenAI Blog506d ago