VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO

Article URL: https://arxiv.org/abs/2606.16140
Comments URL: https://news.ycombinator.com/item?id=48639240
Points: 210
Comments: 85

tech4you AI
June 24, 2026 (1 min read)

Originally published on Hacker News (Best)
