Researchers trained an OpenAI rival in half an hour for less than $50
“The researchers based s1 on Qwen2.5, an open-source model from Alibaba Cloud. They initially started with a pool of 59,000 questions to train the model on, but found that the larger data set didn’t offer ‘substantial gains’ over a whittled-down set of just 1,000. The researchers say they trained the model on just 16 Nvidia H100 GPUs.” —