iscienceluvr.bsky.social
PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://bit.ly/3tpAuan
80 posts 2,580 followers 150 following

I restarted my blog a few weeks ago. The first post was "Debunking DeepSeek Delusions," where I discussed five main myths I saw spreading online during the DeepSeek hype. It may be a little less relevant now, but hopefully still interesting to folks. Check it out → www.tanishq.ai/blog/posts/d...

Are folks still here? 😅

Okay so this is so far the most important paper in AI of the year

Anthropic, please add a higher tier plan for unlimited messages 😭🙏

Decentralized Diffusion Models UC Berkeley and Luma AI introduce Decentralized Diffusion Models, a way to train diffusion models on decentralized compute with no communication between nodes. abs: arxiv.org/abs/2501.05450 project page: decentralizeddiffusion.github.io

The GAN is dead; long live the GAN! A Modern Baseline GAN This is a very interesting paper, exploring making GANs simpler and more performant. abs: arxiv.org/abs/2501.05441 code: github.com/brownvc/R3GAN

Happy birthday to my incredible and awesome Mamma! 🥳🎉🎂 To many more years of health and happiness. Tiara (my sister) and I love you very much ❤️❤️❤️

Happy 19th birthday to my amazing sister Tiara Abraham! 🥳🎉 🎂 Proud of you graduating with your Master's degree at 18 and starting your doctorate in music degree this past year! Excited to see what this final teen year holds for you!

Inventors of flow matching have released a comprehensive guide going over the math & code of flow matching! Also covers variants like non-Euclidean & discrete flow matching. A PyTorch library is also released with this guide! This looks like a very good read! 🔥 arxiv: arxiv.org/abs/2412.06264
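The core idea in the guide is simple enough to sketch in a few lines. Below is a minimal, illustrative conditional flow matching loss with linear (rectified-flow) paths, using NumPy and a stand-in "oracle" model; the function names are my own, not the guide's library API:

```python
import numpy as np

def flow_matching_loss(model, x0, x1, t):
    """Conditional flow matching with linear interpolation paths.

    x0: noise samples, x1: data samples, t: times in [0, 1].
    For a straight path x_t = (1-t)*x0 + t*x1, the regression
    target for the velocity field is simply x1 - x0.
    """
    t = t.reshape(-1, 1)
    x_t = (1.0 - t) * x0 + t * x1      # point on the straight path
    target_velocity = x1 - x0          # d/dt of the path (constant)
    pred = model(x_t, t)
    return np.mean((pred - target_velocity) ** 2)

rng = np.random.default_rng(0)
x0 = rng.standard_normal((4, 2))
x1 = rng.standard_normal((4, 2))
t = rng.uniform(size=4)

# A trivial "oracle" that predicts the true velocity exactly,
# just to show the loss vanishes in that case.
oracle = lambda x_t, t: x1 - x0
print(flow_matching_loss(oracle, x0, x1, t))  # → 0.0
```

The released PyTorch library covers the real training loop plus the non-Euclidean and discrete variants; this sketch only shows the basic regression target.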

Normalizing Flows are Capable Generative Models Apple introduces TarFlow, a new Transformer-based variant of Masked Autoregressive Flows. SOTA on likelihood estimation for images, quality and diversity comparable to diffusion models. arxiv.org/abs/2412.06329

Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models "We introduce a simple strategy that makes refusal behavior controllable at test-time without retraining: the refusal token." arxiv.org/abs/2412.06748
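The test-time calibration idea can be sketched as follows. This is a hedged illustration, not the paper's implementation: I'm assuming a special refusal token at a known vocabulary index and a simple probability threshold on the first generated token.

```python
import numpy as np

def softmax(z):
    z = z - z.max()          # numerically stable softmax
    e = np.exp(z)
    return e / e.sum()

def decide(first_token_logits, refuse_id, threshold=0.5):
    """If the model's probability of emitting the refusal token as the
    first token exceeds `threshold`, refuse; otherwise answer.
    Moving `threshold` up or down calibrates refusal frequency at
    test time, with no retraining."""
    p_refuse = softmax(first_token_logits)[refuse_id]
    return "refuse" if p_refuse > threshold else "answer"

logits = np.array([2.0, 0.5, 1.0])   # index 0 plays the [REFUSE] token
print(decide(logits, refuse_id=0, threshold=0.5))  # → refuse
print(decide(logits, refuse_id=0, threshold=0.9))  # → answer
```

The point of the scheme is exactly this knob: one scalar threshold (or logit offset) controls how conservative the deployed model is.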

Can foundation models actively gather information in interactive environments to test hypotheses? "Our experiments with Gemini 1.5 reveal significant exploratory capabilities" arxiv.org/abs/2412.06438

Training Large Language Models to Reason in a Continuous Latent Space Introduces a new paradigm for LLM reasoning called Chain of Continuous Thought (COCONUT): directly feed the last hidden state (a continuous thought) back in as the input embedding for the next step. arxiv.org/abs/2412.06769
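A toy sketch of that feedback loop, with a tanh layer standing in for a transformer step (everything here is illustrative, not the paper's architecture):

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8
W = rng.standard_normal((D, D)) * 0.1

def step(h):
    """Stand-in for one transformer forward pass over hidden state h."""
    return np.tanh(h @ W)

# Standard decoding would project h to vocab logits, pick a token, and
# re-embed it. COCONUT skips the discretization: the last hidden state
# is fed straight back in as the next input embedding.
h = rng.standard_normal(D)
thoughts = []
for _ in range(3):        # three continuous "thoughts"
    h = step(h)           # hidden state -> next input, no token in between
    thoughts.append(h.copy())
print(len(thoughts), thoughts[0].shape)  # → 3 (8,)
```

The appeal is that reasoning steps stay in a continuous space instead of being bottlenecked through discrete tokens at every step.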

[MASK] is All You Need New paper from CompVis group, introduces a new method called Discrete Interpolants that builds on top of discrete flow matching. Achieves SOTA performance on MS-COCO, competitive results on ImageNet 256. arxiv.org/abs/2412.06787

A new tutorial on RL by Kevin Patrick Murphy, a Research Scientist at Google DeepMind who also wrote several comprehensive, well-regarded textbooks on ML/DL. This ought to be a good read 👀 arxiv.org/abs/2412.05265

Birth and Death of a Rose abs: arxiv.org/abs/2412.05278 Generating temporal object intrinsics - temporally evolving sequences of object geometry, reflectance, and texture, such as blooming of a rose - from pre-trained 2D foundation models.

Frontier Models are Capable of In-context Scheming abs: arxiv.org/abs/2412.04984 "Our results show that o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and Llama 3.1 405B all demonstrate in-context scheming capabilities"

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks abs: arxiv.org/abs/2412.04626 project page: bigdocs.github.io BigDocs-7.5M is a high-quality, open-access dataset comprising 7.5 million multimodal documents across 30 tasks.

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling abs: arxiv.org/abs/2412.05271 model: huggingface.co/OpenGVLab/In... Introduces the new InternVL-2.5 models, the first open-source MLLMs to surpass 70% on the MMMU benchmark

NVILA: Efficient Frontier Visual Language Models abs: arxiv.org/abs/2412.04468 NVIDIA introduces NVILA, a family of open VLMs designed to optimize both efficiency and accuracy.

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis abs: arxiv.org/abs/2412.04431 New visual autoregression framework that performs bitwise token prediction w/ an infinite-vocabulary tokenizer & classifier, setting a new record for autoregressive text-to-image models.

🤔 Why do we extract diffusion features from noisy images? Isn’t that destroying information? Yes, it is - but we found a way to do better. 🚀 Here’s how we unlock better features, no noise, no hassle. 📝 Project Page: compvis.github.io/cleandift 💻 Code: github.com/CompVis/clea... 🧵👇

Leading computer vision researchers Lucas Beyer (@giffmana.ai), Alexander Kolesnikov (@kolesnikov.ch), and Xiaohua Zhai have left Google DeepMind to join OpenAI! They were behind recent SOTA vision approaches and open-source models like ViT, SigLIP, PaliGemma

The AI winter has started 😔

the restrictions on post and video length are gonna make it harder to paper-post here ngl

Reverse Thinking Makes LLMs Stronger Reasoners abs: arxiv.org/abs/2411.19865 Train an LLM to generate forward reasoning from the question, a backward question, and backward reasoning from the backward question. Shows an average 13.53% improvement over the student model's zero-shot performance
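The data side of that recipe is easy to picture. Here is a hypothetical shape for one augmented training record; the field names and the worked example are my assumptions, not the paper's exact format:

```python
def make_record(question, forward_reasoning, backward_question, backward_reasoning):
    """Bundle one question with its forward chain plus the reversed
    question/chain used as extra supervision targets."""
    return {
        "input": question,
        "targets": {
            "forward_reasoning": forward_reasoning,
            "backward_question": backward_question,
            "backward_reasoning": backward_reasoning,
        },
    }

rec = make_record(
    "Alice has 3 apples and buys 2 more. How many does she have?",
    "3 + 2 = 5, so Alice has 5 apples.",
    "Alice has 5 apples after buying 2 more. How many did she start with?",
    "5 - 2 = 3, so Alice started with 3 apples.",
)
print(sorted(rec["targets"]))
# → ['backward_question', 'backward_reasoning', 'forward_reasoning']
```

The intuition: being forced to also reconstruct the inverted problem checks the forward chain for consistency, which is where the reported gains come from.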

GaussianSpeech: Audio-Driven Gaussian Avatars abs: arxiv.org/abs/2411.18675 project page: shivangi-aneja.github.io/projects/gau...

some people managed to find AoC-solving code from qianxyz in a GitHub repo that has now been deleted. seems like an automated pipeline using gpt-4o-mini with a pretty basic prompt

how does someone solve Advent of Code problem in 9 seconds??!!

At #XPANSE in Abu Dhabi last week: - Met @anilseth.bsky.social backstage between our talks, discussed studying the nature of consciousness w/ neuroimaging. Appreciated him gifting me a signed copy of his book! - Met @seanmcarroll.bsky.social who gave a great talk about entropy vs. complexity

Many SOTA image generation models use an adversarial loss (the VAE in latent diffusion, for example), which counts, I would say...

My Bluesky follower count (1.6k followers) has now surpassed my Threads follower count (1.1k). I still see a few AI folks on Threads but it seems so much more dead compared to Bluesky.

Every time conference reviews and rebuttals come in we hear complaints about how bad the process is. Which ML conference has the best review process and what's stopping other conferences from improving their processes?

Here is a list of ML OSS & Open Source / Science enthusiasts I found on Bluesky 🦋 go.bsky.app/8MFcfXd Let me know if you find such people here! I'm still new here and the list probably misses many must-add people, so let's build it together💪