Ver s1: Simple test-time scaling
s1: Simple test-time scaling

s1: Simple test-time scaling

Researchers at Stanford, UW, and AI2 developed `s1-32B`, an open-source model that achieves state-of-the-art reasoning performance and clear test-time scaling on challenging benchmarks