|
|
|
<br>DeepSeek open-sourced DeepSeek-R1, [larsaluarna.se](http://www.larsaluarna.se/index.php/User:DominiqueCurmi) an LLM fine-tuned with reinforcement learning (RL) to improve thinking ability. DeepSeek-R1 attains results on par with OpenAI's o1 design on a number of criteria, consisting of MATH-500 and [SWE-bench](http://8.137.12.293000).<br> |