AR
Benchmark Testing and Analysis Lead
ARC Prize Foundation
Lead frontier model evaluation on ARC-AGI benchmarks. Run models, analyze data, and communicate findings on model capabilities and gaps. Remote, fu...
Fully remote· Only United States vor 2 Wochen