Research

BrowseComp Benchmark

1 article in archive

BrowseComp: a benchmark for browsing agents

OpenAI Blog, 347d ago