Smaller, Weaker, yet Better: Training LLM Reasoners via Compute-Optimal Sampling September 3, 2024 by Comments