ThreadSky
akinunver.bsky.social • 24 days ago
DeepSeek R1’s training method (GRPO) is now fully reproducible—the entire codebase is on GitHub.
https://gist.github.com/willccbb/4676755236bb08cab5f4e54a0475d6fb