arjunguha.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

arjunguha.bsky.social

hacker / CS professor https://www.khoury.northeastern.edu/~arjunguha/

61 posts 174 followers 76 following

The real lesson from DeepSeek is the importance of good old-fashioned computer science. Every day this week, they've been doing open source releases. The latest is their in-house distributed file system. github.com/deepseek-ai/...

submitted 2 days ago • 1 comment

Please help amplify ARBOR, a fantastic new research opportunity! If you’d like to start contributing, NDIF is now hosting DeepSeek R1 8B and 70B, open for all researchers to experiment on via our API. Sign up for API access here: login.ndif.us

submitted 9 days ago • 0 comments

O1, R1, etc. are so good that we evaluate them on “PhD-level” benchmarks. But these benchmarks are so hard that most people can’t even understand what they are testing. We’ve built a benchmark with problems that are hard to solve but easy to verify, for both humans and models.

submitted 26 days ago • 1 comment

Last one: there are a LOT of people to blame for this one. I think @jasvir.bsky.social is to blame for this problem in "Humanity's Last Exam".

submitted 33 days ago • 2 comments

Ugh, who did this? @joepolitz.bsky.social? Wait, was it @dbp.bsky.social? Someone else from @shriram.bsky.social's group? Also from "Humanity's Last Exam".

submitted 33 days ago • 2 comments

OK, who is responsible for this? Is it @natefoster.bsky.social? Source: "Humanity's Last Exam" www.nytimes.com/2025/01/23/t...

submitted 33 days ago • 1 comment

It was fun to be a part of this. We analyze a dataset of student-LLM interactions on programming tasks and ask: what do students get wrong when prompting?

submitted 36 days ago • 1 comment

Counterintuitively, this shows the power of GC. IIRC I've heard of similar tricks used for high-performance OCaml at Jane Street.

submitted 50 days ago • 3 comments