Profile avatar
ykl7.bsky.social
PhD candidate at Stony Brook University; Prev: Google Research, AI2 Aristo, Salesforce Research; MS from JHU https://ykl7.github.io
25 posts 1,041 followers 624 following
Regular Contributor
Conversation Starter

I'll do the advisory advertising: @ykl7.bsky.social‬ is a fantastic researcher and is passionate about being in academia. He has this amazing ability to simply get things done! Happy to say more in a letter or over a chat but if you are going to @naaclmeeting.bsky.social (#NAACL2025) ping him.

I'm headed to #NAACL2025 ✈️ in Albuquerque 🏜️Looking for postdoc positions in the US, so if you're hiring (or know someone who is), let's chat at the conference! Also organizing #WNU2025 so make sure to swing by the workshop on May 4

Heads up for anyone who missed this: ARR has moved to 5 cycles per year and the EMNLP deadline will be in May.

First CfP for #EMNLP2025 is live now. Submission deadline to May ARR cycle! Excited to be part of organizing the conference as publicity chair w/ @amuuueller.bsky.social @dallascard.bsky.social so watch out for more updates esp by following the official conf account @emnlpmeeting.bsky.social

🚨 The submission deadline (Feb 17) for the Workshop on Narrative Understanding at #NAACL2025 is approaching us! Excited to see diverse work on studying different aspects of narratives 🤩 Submit here: www.softconf.com/naacl2025/WN... #NLProc #wnu2025

📢 The 7th Workshop on Narrative Understanding (WNU) will happen with #NAACL2025 and is open for submissions. 🌐: tinyurl.com/wnu25 Direct Submission: February 17 Pre-Reviewed (ARR) papers: March 10 Excited to organize this again and hope to see you in Albuquerque 🌵 early this May! #wnu2025 #NLProc

Thanks @mohitbansal.bsky.social for the wonderful Distinguished Lecture on agents and multimodal generation. This got so many of us here at Stony Brook excited for the potential in these areas. Also, thanks for spending time with our students & sharing your wisdom. It was a pleasure hosting you!

Why did @duolingoverde.bsky.social go with the villain vibe for the year in review this year? 🤔

🚨 I am on the faculty job market this year 🚨 I will be presenting at #NeurIPS2024 and am happy to chat in-person or digitally! I work on developing AI agents that can collaborate and communicate robustly with us and each other. More at: esteng.github.io and in thread below 🧵👇

Looking forward to giving this Distinguished Lecture at StonyBrook next week & meeting the several awesome NLP + CV folks there - thanks Niranjan‬ + all for the kind invitation 🙂 PS. Excited to give a new talk on "Planning Agents for Collaborative Reasoning and Multimodal Generation" ➡️➡️ 🧵👇

Lots of posts about LLMs trivially generating websites. How do I prompt them to generate a full (static) website with CSS and JS too? The HTML is fine but I can't fix (a lot of) issues in the generated CSS and JS. I feel like we still need domain expertise to make sure these things work 😅

The question that a reviewer should ask themselves is: Does this paper take a gradient step in a promising direction? Is the community better off with this paper published? If the answer is yes, then the recommendation should be to accept.

Releasing SmolVLM, a small 2 billion parameters Vision+Language Model (VLM) built for on-device/in-browser inference with images/videos. Outperforms all models at similar GPU RAM usage and tokens throughputs Blog post: huggingface.co/blog/smolvlm

A new paper, "Let Me Speak Freely" has been spreading rumors that structured generation hurts LLM evaluation performance. Well, we've taken a look and found serious issue in this paper, and shown, once again, that structured generation *improves* evaluation performance!

Hi everyone! I’m the Director of Machine Learning at the Wikimedia Foundation, the non-profit organization that hosts Wikipedia. My team works on the AI systems that support Wikipedia, impacting both readers and volunteer editors globally.

I noticed a lot of starter packs skewed towards faculty/industry, so I made one of just NLP & ML students: go.bsky.app/vju2ux Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!

📢 NAACL reviews have been released! 🆕 feature alert: ARR now has review issue flagging! Thanks @jkkummerfeld.bsky.social & OR team for help with implementation, and other EiCs for supporting the idea! It'll be live after author response. More details here: aclrollingreview.org/authors#step... /1

I did a starter pack of people in New York (City) working on ML/AI. Please distribute and feel free to self nominate! go.bsky.app/BoEtagz

Great opportunity to see how (your) new coding agent methods stack up real world user tasks

📢 Ultimate test of #NLP bluesky: I need emergency reviewers for NAACL submissions on encoders (one multilingual, one for sentence embeddings). Help a desperate editor abandoned by the ACs! Author response starts tomorrow, so that's a true emergency. If you're my hero, lmk your openreview profile.

Excited to release Tulu 3! We worked hard to try and make the best open post-training recipe we could, and the results are good! I was lucky enough to work on almost every stage of the pipeline in one way or another. Some comments + highlights ⬇️

Meet Tülu 3, a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms. We invented new methods for fine-tuning language models with RL and built upon best practices to scale synthetic instruction and preference data. Demo, GitHub, paper, and models 👇

I've spent the last two years scouring all available resources on RLHF specifically and post training broadly. Today, with the help of a totally cracked team, we bring you the fruits of that labor — Tülu 3, an entirely open frontier model post training recipe. We beat Llama 3.1 Instruct. Thread.

another starter pack, this time for folks (past & current) from Ai2 (@ai2.bsky.social) 😍 go.bsky.app/Qjyc97J

ICLR should not allow public discussion until after decisions. It allows Oscars-style award/accept campaigns by high-profile allies. Even for great papers, that can’t be part of the review process. The only reason it didn’t happen before is because it was against community norms, which are changing.

✨I am on the faculty job market in the 2024-2025 cycle!✨ My research centers on advancing Responsible AI, specifically enhancing factuality, robustness, and transparency in AI systems. If you have relevant positions, let me know! lasharavichander.github.io Please share/RT!

The search function on here needs some work 😅 Can't easily find users that aren't in one of the starter packs going around

TIL about the first Gemini (uploaded May 2022) huggingface.co/describeai/g...