fdellaert.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

MASt3R-SLAM code release! github.com/rmurai0610/M... Try it out on videos or with a live camera Work with @ericdexheimer.bsky.social, @ajdavison.bsky.social (Equal Contribution)

submitted 1 day ago • 2 comments

CRA statement about NSF firings cra.org/cuts-to-nsf-...

submitted 8 days ago • 0 comments

Some personal news: as of January I am back full-time at Georgia Tech following a 2-year leave as Verdant Robotics’ CTO. I will continue to be involved with Verdant as part-time Chief AI Officer, thinking strategically about the role of AI in Robotics for Ag.

submitted 8 days ago • 0 comments

Gemini is good but too verbose :-)

submitted 14 days ago • 0 comments

The visual system of a jumping spider is fascinating. Look at those cones behind the fixed main lenses! The retinas are at the end of the cones. youtu.be/gvN_ex95IcE?...

submitted 17 days ago • 1 comment

We've built a simulated driving agent that we trained on 1.6 billion km of driving with no human data. It is SOTA on every planning benchmark we tried. In self-play, it goes 20 years between collisions.

submitted 20 days ago • 22 comments

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos TL;DR: Long videos support; Depth Anything V2 with efficient spatial-temporal head. Temporal consistency loss -> depth gradient (no geometric priors)

submitted 35 days ago • 2 comments

That’s such a cool idea! (Thanks K for flagging this to me)

submitted 36 days ago • 1 comment

First iRIM@GT seminar of 2025: Ani Majumdar! Well attended!

submitted 49 days ago • 0 comments

Sign of the times, in 2025

submitted 50 days ago • 1 comment

Getting myself set up here. I found the Sky Follower Bridge Chrome plugin pretty helpful (thanks @kawamataryo.bsky.social!) chromewebstore.google.com/detail/sky-f...

submitted 52 days ago • 9 comments

3D feed-forward Gaussian Splatting feels like magic, let alone 4D!

submitted 53 days ago • 0 comments

100 years ago today, #OTD in 1925, Edwin Hubble announced that Andromeda and other spiral nebulae were definitely separate galaxies outside the Milky Way, in a paper read to an AAS meeting by H.N. Russell. There was no doubt that the Universe was more than just our little island of stars. 🧪 🔭 ⚛️

submitted 56 days ago • 42 comments

Just WOW: youtu.be/X2UxtKLZnNo?...

submitted 65 days ago • 4 comments

Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks.

submitted 68 days ago • 20 comments

ChatGPT canvas is cool. Maybe a bit slow still.

submitted 70 days ago • 1 comment

Awesome stuff!

submitted 71 days ago • 0 comments

Convolutional Differentiable Logic Gate Networks @FHKPetersen

submitted 75 days ago • 3 comments

Today I used ChatGPT canvas to help me simplify the SE_2(3) exponential map calculation, and its Jacobian. 4o looks dumb in comparison to o1 now, though :-(

submitted 75 days ago • 0 comments

Introducing 👀Stereo4D👀 A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, metric 3D reconstructions, with camera poses and long-term 3D motion trajectories. We used Stereo4D to make a dataset of over 100k real-world 4D scenes.

submitted 76 days ago • 2 comments

Back in college I took a developmental biology class. This field has come so far with advancing technologies like light sheet microscopy. This video is just plain amazing.

submitted 76 days ago • 0 comments

I’m talking differential geometry with ChatGPT o1 like there is no tomorrow. This thing is amazing. On the other hand, I might need my own nuclear power station.

submitted 77 days ago • 3 comments

This is so cool!

submitted 82 days ago • 1 comment

Zhengqi led a really nice new paper that computes really nice camera poses and depth maps from everyday videos, not just the standard SLAM-mable sorts of videos you often see. It's really fast and robust, and I think it's quite neat.

submitted 82 days ago • 1 comment

Fit-NGP: millimetre accurate 3D object pose estimation from RGB images only via Instant NGP. What can we do in manipulation with this level of accuracy? From Marwan and Ignacio from the Dyson Robotics Lab, Imperial College, ICRA24. marwan99.github.io/Fit-NGP/ youtu.be/KQ7yH_em3Qg?...

submitted 83 days ago • 1 comment

I'd like to introduce what I've been working on at @hellorobot.bsky.social: Stretch AI, a set of open-source tools for language-guided autonomy, exploration, navigation, and learning from demonstration. Check it out: github.com/hello-robot/... Thread ->

submitted 85 days ago • 7 comments

A common question nowadays: Which is better, diffusion or flow matching? 🤔 Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.

submitted 86 days ago • 7 comments

MoSh has won the 2024 SIGGRAPH Asia Test-of-Time Award. What’s MoSh? It takes motion capture markers and returns the animation of a realistic 3D human body in #SMPL-X format. I wrote a blog post to explain why MoSh is still relevant after 10 years. perceiving-systems.blog/en/news/moti...

submitted 86 days ago • 4 comments

Turns out aria-glasses are a very useful tool to demonstrate actions to robots: Based on egocentric video we track dynamic changes in a scene graph and use the representation to replay or plan interactions for robots 🔗 behretj.github.io/LostAndFound/ 📄 arxiv.org/abs/2411.19162 📺 youtu.be/xxMsaBSeMXo

submitted 86 days ago • 1 comment

We implemented undo in @rerun.io by storing the viewer state in the same type of in-memory database we use for the recorded data. Have a look (sound on!)

submitted 86 days ago • 2 comments

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving arxiv.org/abs/2411.151... It learns denoising from an anchored Gaussian distribution to a multi-mode driving action distribution, seemingly without many tricks.

submitted 92 days ago • 0 comments

Welcome Andy to Bluesky! Let’s get this flywheel running!

submitted 90 days ago • 1 comment

For my first post on Bluesky: this talk I gave at the recent BMVA one-day meeting on World Models is a good summary of my work on Computer Vision, Robotics and SLAM, and my thoughts on a bigger picture of #SpatialAI. youtu.be/NLnPG95vNhQ?...

submitted 90 days ago • 5 comments

Almost all accounts I am following are the ones that actually have an informative profile, incl. a real first and last name….

submitted 96 days ago • 1 comment

1/ A small glimpse into our work at Intrinsic. Building reliable robotics & perception systems is really hard. AI solutions still struggle with hallucinations & occasional failures, which is a major reason why AI isn’t yet widely used in many industries. But that’s changing.

submitted 97 days ago • 1 comment

I recently wrote a primer on UMAP for Nature Reviews Primers. If you are looking for an overview of the method, a getting started primer, or best practices it is a good place to start. rdcu.be/d0YZT

submitted 97 days ago • 2 comments