I'm excited to announce two papers of ours which will be presented this summer at @naaclmeeting.bsky.social eting.bsky.social and @iclr-conf.bsky.social ! 🧵 - ThreadSky

About ThreadSky

juand-r.bsky.social • 1 day ago

I'm excited to announce two papers of ours which will be presented this summer at @naaclmeeting.bsky.social https://eting.bsky.social and @iclr-conf.bsky.social !
🧵

Comments

juand-r.bsky.social•1 day ago

1.) [NAACL 25] @kanishka.bsky.social, @amuuueller.bsky.social and I delve into how language models do property inheritance using behavioral and mechanistic analyses.
Thank you, Kanishka and Aaron. I could not have hoped for better collaborators! https://arxiv.org/abs/2410.22590

[👇 https://bsky.app/profile/juand-r.bsky.social/post/3lahqgko2wc2m]

juand-r.bsky.social•1 day ago

2.) [ICLR 2025]
When does CoT help? It turns out that gains are mainly on math and symbolic reasoning.

Check out our paper for a deep dive into MMLU, hundreds of experiments, and a meta-analysis of CoT across 3 conferences covering over 100 papers! https://arxiv.org/abs/2409.12183

juand-r.bsky.social•1 day ago

Thanks to @zaynesprague.bsky.social, @fcyin.bsky.social, Dongwei Jiang, @manyawadhwa.bsky.social, @prasannsinghal.bsky.social, @lucyxyzhao.bsky.social, Xi Ye, @kmahowald.bsky.social and @gregdnlp.bsky.social for making this happen!

patapom.bsky.social•1 day ago

Tasks that need to store intermediate values on the stack basically...
CoT is just implementing a basic memory strategy by storing some states in the sequence of tokens.
Another proof that LLMs are a terrible algorithm...

Posting Rules

Be respectful to others
No spam or self-promotion
Stay on topic
Follow Bluesky's terms of service

Comments

Posting Rules

Reply