4/26 at 3pm:
'The Optimization Landscape of SGD Across the Feature Learning Strength'
Alexander Atanasov · Alexandru Meterez · James Simon · @cpehlevan.bsky.social
Submission: https://openreview.net/forum?id=iEfdvDTcZg
'Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups'
Zakhar Shumaylov · Peter Zaika · James Rowbottom · Ferdia Sherry · @mweber.bsky.social · Carola-Bibiane Schönlieb
Submission: https://openreview.net/forum?id=7PLpiVdnUC
'Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon'
USVSN Sai Prashanth · @nsaphra.bsky.social et al.
Submission: https://openreview.net/forum?id=3E8YNv1HjU
'Deconstructing What Makes a Good Optimizer for Autoregressive Language Models'
@rosieyzh.bsky.social · Depen Morwani · David Brandfonbrener · Nikhil Vyas · Sham Kakade
Submission: https://openreview.net/forum?id=zfeso8ceqr
'HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics'
Jingxuan Fan · Sarah Martinson · Erik Wang · Kaylie Hausknecht · Jonah Brenner · Danxian Liu · Nianli Peng · Corey Wang · Michael Brenner
Submission: https://openreview.net/forum?id=nDTvP6tBMd