davidvos.bsky.social
Responsible #RecSys and #IR in the Generative AI era. PhD Candidate at IRLab Amsterdam, supervised by Maarten de Rijke and Andrew Yates. davidvos.dev
27 posts 589 followers 301 following

Let’s collaborate on democratizing insights from tabular data in Amsterdam! ✨ PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis. Sharing w/ friends appreciated! ⬇️

Today's funny tokenization realization: The GPT4 tokenizer has 1 single token for the entire lower- and uppercase alphabet.

IRLab Amsterdam made it to Bluesky! Go give @irlab-amsterdam.bsky.social a follow for all things RecSys, IR, RAG, Conv AI, etc!

I'll get straight to the point. We trained 2 new models. Like BERT, but modern. ModernBERT. Not some hypey GenAI thing, but a proper workhorse model, for retrieval, classification, etc. Real practical stuff. It's much faster, more accurate, longer context, and more useful. 🧵

Congratulations, Dr. Zihan Wang! It was an honor to be your paranymph.

🚨 PhD position alert! 🚨 I'm hiring a fully funded PhD student to work on mechanistic interpretability at @uva-amsterdam.bsky.social. If you're interested in reverse engineering modern deep learning architectures, please apply: vacatures.uva.nl/UvA/job/PhD-...

I just completed "Historian Hysteria" - Day 1 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/1

Yesterday @ellisamsterdam.bsky.social hosted the yearly NeurIPS-Fest, a pre-party for NeurIPS with a keynote talk, poster session, drinks and bites! 🍺🍻 The keynote was by @canaesseth.bsky.social, who talked about "Diffusion, Flows and other stories", presenting his 5 papers accepted at NeurIPS! 💥

I'm looking for an intern to introduce Sparse Embedding models to Sentence Transformers! If you're passionate about open source, interested in helping practitioners use your tools, and enjoy embedders/retrievers/rerankers, then I'd love to hear from you! Links with details and to apply in 🧵

And we have a #Dagstuhl Report! Evaluation Perspectives of #RecSys. Edited by @christinebauer.bsky.social, @evazangerle.bsky.social, and myself. Written by a whole host of fantastic #recsys people. Too many to mention (pic here www.dagstuhl.de/en/seminars/...) drops.dagstuhl.de/entities/doc...

I'm looking for a textbook to take me through fundamental ML concepts again. Not too applied, and preferably with a Deep Learning angle. Currently considering the new Deep Learning book by Bishop, but maybe my Bluesky following has different recommendations. Let me know :)

Things you need as a PhD student:
- coffee ☕
- a living stipend 💰
- a website 🔗
For the last one, @kiragoldner.bsky.social has good resources: www.kiragoldner.com/blog/website..., www.kiragoldner.com/resources.html

No one can explain stochastic gradient descent better than this panda.

This is such a great start to every week for me. Credits to Sumit for compiling such an easy-to-digest newsletter on #IR, #RAG and #RecSys. open.substack.com/pub/recsys?r...

Creating a 🦋 starter pack for people working in IR/RAG: go.bsky.app/88ULgwY I can’t seem to find everyone though, help definitely appreciated to fill this out (DM or comment)!

Making the switch #BlueSky

As people are slowly moving over, I thought I'd make a list of people on here who work on IR & RecSys in Amsterdam. Who am I missing? Both researchers and engineers :) go.bsky.app/BzPgLK2

decided to create a RecSys (and adjacent) starter pack with @vickiboykis.com @beeonaposy.bsky.social et al who am I missing? go.bsky.app/RGf9BvY

A brief introduction of myself should be a good idea after moving from the other platform to here :) I'm a second-year PhD student at the University of #Amsterdam 🇳🇱, working on Recommender Systems #RecSys and Information Retrieval #IR.

Currently writing up a report on the IR summer school that we organized here in Amsterdam. The lectures are publicly available and provide a great introduction to anyone interested in IR. Thought I'd share: 2024.essir.eu/talks

Pls repost: We, the DEEM Lab at TU Berlin, are hiring a postdoctoral researcher in data engineering for machine learning. Details available at: deem.berlin#jobs-57624 This fully-funded position is part of the Berlin Institute for the Foundations of Learning and Data (BIFOLD). #databs #datasky

‘Based on what you know about me, draw a picture of what you think my current life looks like.’ A @bsky.app introduction by ChatGPT.