Profile avatar
arjunguha.bsky.social
hacker / CS professor https://www.khoury.northeastern.edu/~arjunguha/
61 posts 174 followers 76 following
Getting Started
Active Commenter

The real lesson from DeepSeek is the importance of good old-fashioned computer science. Every day this week, they've been doing open source releases. The latest is their in-house distributed file system. github.com/deepseek-ai/...

Please help amplify ARBOR, a fantastic new research opportunity! If you’d like to start contributing, NDIF is now hosting DeepSeek R1 8B and 70B, open for all researchers to experiment on via our API. Sign up for API access here: login.ndif.us

O1, R1, etc. are so good that we evaluate them on “PhD-level” benchmarks. But, these benchmarks are so hard that most people can’t even understand what they are testing. We’ve built a benchmark with problems that are hard to solve but easy to verify: for both humans and models.

Last one: there are a LOT of people to blame for this one. I think @jasvir.bsky.social is to blame for this problem in "Humanity's Last Exam".

Ugh, who did this? @joepolitz.bsky.social ? Wait, was it @dbp.bsky.social ? Someone else from @shriram.bsky.social's group? Also from "Humanity's Last Exam".

OK, who is responsible for this? Is it @natefoster.bsky.social? Source: "Humanity's Last Exam" www.nytimes.com/2025/01/23/t...

It was fun to be a part of this. We analyze a dataset of student-LLM interactions on programming tasks and ask: what do students get wrong when prompting?

Counterintuitively, this shows the power of GC. IIRC I've heard of similar tricks used for high-performance OCaml at Jane Street.