w42.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

1/🧵ICLR 2025 Spotlight Research on LM & Memorization! Language models (LMs) often "memorize" data, leading to privacy risks. This paper explores ways to reduce that! Paper: arxiv.org/pdf/2410.02159 Code: github.com/msakarvadia/... Blog: mansisak.com/memorization/

submitted 74 days ago • 1 comment

Microtubule regulation drives an asymmetry in the regeneration of sensory neurons, with specific proteins controlling growth. buff.ly/4ijksHC

submitted 74 days ago • 0 comments

1/13 New Paper!! We try to understand why some LMs self-improve their reasoning while others hit a wall. The key? Cognitive behaviors! Read our paper on how the right cognitive behaviors can make all the difference in a model's ability to improve with RL! 🧵

submitted 74 days ago • 2 comments

Reading and Writing Google Sheets in DuckDB duckdb.org/2025/02/26/g...

submitted 77 days ago • 0 comments

www.nature.com/articles/s41... awesome new work out today! From the Lee lab in the intramural research program at NIMH!

submitted 80 days ago • 0 comments

Our online book on systems principles of LLM scaling is live at jax-ml.github.io/scaling-book/ We hope that it helps you make the most of your computing resources. Enjoy!

submitted 102 days ago • 3 comments

Excited to share 𝐈𝐧𝐟𝐀𝐥𝐢𝐠𝐧! Alignment optimization objective implicitly assumes 𝘴𝘢𝘮𝘱𝘭𝘪𝘯𝘨 from the resulting aligned model. But we are increasingly using different and sometimes sophisticated inference-time compute algorithms. How to resolve this discrepancy?🧵

submitted 136 days ago • 2 comments

what kind news is this? lol

submitted 104 days ago • 0 comments

On monday in our reading group we discuss "Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective" arxiv.org/abs/2412.03487 With Neta Shaul. Join on zoom on Monday at 9am PT / 12pm ET / 6pm CET: portal.valencelabs.com/logg

submitted 112 days ago • 0 comments

🚨 Researchers uncover 4.5M fake stars on GitHub 🌟, often boosting malware disguised as pirated software & crypto bots. Fake stars surge in 2024, posing major risks to open-source trust & security. #CyberSecurity #GitHub #OpenSource #SupplyChainSecurity arxiv.org/abs/2412.13459

submitted 148 days ago • 0 comments

That exhilarating feeling that everything is possible when you open an editor to code, it hopefully never goes away.

submitted 159 days ago • 4 comments

Binary code is pervasive, and binary analysis is a key task in reverse engineering, malware classification, and vulnerability discovery. So, they created Assemblage - the dataset of source-to-binary projects compiled from GitHub. Assemblage - A dataset of binary executable corpuses

submitted 160 days ago • 1 comment

What came first, life or evolution? Does evolution act on non-living materials? Competitive Exclusion among Self-Replicating Molecules Curtails the Tendency of Chemistry to Diversify 🧪 www.nature.com/articles/s41... Self-replicating molecules demonstrate basic principles of Darwinian evolution

submitted 163 days ago • 3 comments

this morning walk, an ideas stuck me: can you play chess on Rubik's Cube (does not have to be 3x3 one)? not just chess with 6 sides, but normal chess board abstracted away to Rubik's Cube representation and operation

submitted 166 days ago • 0 comments

Half of Twitter right now is people getting mad at some random lady that got a literature PhD. Seems a bit crazy to get so mad about, but I do agree woke academia has become silly and we need to go back to when it was about real solid research, like measuring skull sizes to determine personalities

submitted 166 days ago • 20 comments

The amazing, new Qwen2.5-Coder 32B model can now write SQL for any @hf.co dataset ✨

submitted 166 days ago • 1 comment

Good news everyone! A new version of graph-tool is just out! @graph-tool.skewed.de graph-tool.skewed.de Graph-tool is a comprehensive and efficient Python library to work with networks, including structural, dynamical, and statistical algorithms, as well as visualization. 1/N #networkscience

submitted 166 days ago • 8 comments

An aspect of flow matching which I find a bit interesting is that it is covariant under affine changes of coordinate (c.f. optimal transport, which need not be). This allows for a few nice WLOGs, which I imagine have more applications than I realise.

submitted 166 days ago • 1 comment

If you think the out (site) group isn't enjoying thinking like your ingroup, I've lost respect for you. Sorry.

submitted 167 days ago • 0 comments

More formal verification, this time from the engineers at Cloudflare using a lesser-known verification stack: Cloudflare uses racket & rosette, a solver-aided programming system to, ensure the correctness of their DNS query engine configuration blog.cloudflare.com/topaz-policy...

submitted 177 days ago • 0 comments

Some recent discussions made me write up a short read on how I think about doing computer vision research when there's clear potential for abuse. Alternative title: why I decided to stop working on tracking. Curious about other's thoughts on this. lb.eyer.be/s/cv-ethics....

submitted 169 days ago • 19 comments

I don't know if this was known or not, but if you open your Google search page, type 'Chicxulub' and press enter, something interesting happens. Easter egg? But a funny one!

submitted 168 days ago • 2 comments

Done, Five million. Compressed to 447 mb. Getting more on the main set, it's already at 7+ million. There is a surprisingly good amount of quality data. There will be more curation/variations but won't post more about it unless I manage a billion. huggingface.co/datasets/Ror...

submitted 169 days ago • 0 comments

Very useful looking Python package by BASF on evaluating MLIPs! github.com/basf/mlipx

submitted 168 days ago • 1 comment

today feels like a good day to pick this book up again. be back in a while 👋 also shoutout to @welltypedwit.ch for always teaching me new stuff

submitted 168 days ago • 3 comments

Qi meeting notes https://github.com/drym-org/qi/wiki/Qi-Meeting-Nov-29-2024 -producers, transformers, and consumers -- stream abstractions for deforesting any list-oriented operation -encoding two runtimes - naive and optimized - in "deep" macros -Michael is teaching an exciting new course on DSL…

submitted 168 days ago • 0 comments

A few days ago Lawrence Hollom posted a preprint that solves a problem about posets, informally known as the fishbone conjecture, which I am told was extremely central in the area. 🧵 arxiv.org/abs/2411.16844

submitted 168 days ago • 1 comment

I am running my statistical rethinking course again starting in January. But this time just for Leipzig locals, so I can work with a smaller group this year and track individual progress better. I think that will help me tune the material, espeically homework, better. Materials still open however.

submitted 168 days ago • 7 comments

Every time I'm having a hard time to motivate myself to work on things I just grab my laptop and move to a different room. It kinda resets my brain 'cause the environment has changed. I heard people recommend to rearrange your furniture from time to time for the same reason.

submitted 168 days ago • 23 comments

Dear algorithm, I would like to view: 70% new ML and LLM research and cool results. 10% funny videos with cute animals. 5% sports (but no spoilers if I'm planning to watch the reply later). 5% travel and life hacks. 5% general tech. 5% random. Regards,

submitted 169 days ago • 1 comment

When can a sum of reciprocals of natural numbers sum to a rational number? There are many unsolved problems in this area, by Erdős and others. With Vjeko Kovac, we have been able to resolve some open questions and make progress on others: terrytao.wordpress.com/2024/11/27/o...

submitted 169 days ago • 2 comments

wow, nobody steal this promo link. BSky high trust level looking good

submitted 168 days ago • 1 comment

Hey #EconSky! I’m a postdoc in psychology and economic theory @ Harvard My #EconJMP studies how shared context supports efficient communication and cooperation 🧵⬇️

submitted 171 days ago • 3 comments