rasgaard.com
Building AI Solutions @ Laerdal Medical 👨‍💻 Open Source committee @ DDSC.io MSc. Human-Centered AI from DTU 🎓🇩🇰 I occasionally write on rasgaard.com
104 posts 187 followers 371 following
Prolific Poster

Here we go! The Copenhagen NLP Symposium is starting up with a welcome from @delliott.bsky.social. People are attending from Aarhus, Aalborg, Copenhagen, and outside Denmark, from both academia and industry.

CPH NLP Symposium 🤗🔥 cphnlp.github.io

I feel like this shouldn't be as groundbreaking as it appears to me but being able to simply write uv run --env-file .env <script.py> to manage which env-file is being used for a particular script execution is so nice docs.astral.sh/uv/concepts/...
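A minimal sketch of what that looks like from the script's side, assuming the env-file defines two variables. `APP_ENV` and `API_URL` are hypothetical names picked for illustration, not something from the uv docs:

```python
import os


def describe_environment(env=os.environ):
    """Summarise settings that an env-file would typically provide."""
    # APP_ENV and API_URL are hypothetical variable names (assumptions).
    # The defaults apply when the chosen env-file does not define them.
    app_env = env.get("APP_ENV", "development")
    api_url = env.get("API_URL", "http://localhost:8000")
    return f"{app_env} -> {api_url}"


if __name__ == "__main__":
    print(describe_environment())
```

Saved as `script.py`, this should pick up whichever file is passed, so `uv run --env-file .env script.py` and `uv run --env-file .env.production script.py` can run the same code against different settings.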

20 years ago today (unpossible!)

Recently read Zen and the Art of Motorcycle Maintenance for the first time. It was a bit of a tough and slow read. Had to consult a guide/explainer quite often but wow was it worth it #booksky

Do we still need large language models in the era of agentic AI? Agentic AI systems are driving a wave of applications where language models carry out a limited set of specialized tasks repeatedly, with minimal variation. "Small Language Models are the Future of Agentic AI"

Here's a great little tutorial on getting started with my LLM tool for running local models! One of the best ways you can contribute to any open source project is to publish unofficial guides like this one

I have soft-launched a newsletter of sorts. It's just a few links to things that I have liked enough to note down throughout the week. I like the idea of having a big archive of things that I like and why I like them :) rasgaard.com/newsletter/

I love everything about this mind-boggling analysis of the latest season of #therehearsal on HBO. It makes me appreciate Nathan Fielder's work even more which I didn't think was possible. www.reddit.com/r/TheRehears...

My first podcast appearance is live :) I was a bit nervous but it was a lot of fun! We talked about running machine learning models locally/client-side in the browser.

I have been thinking about this as well. You shouldn't be afraid of creating bespoke tooling for your AI annotations and evals. The time spent finding a tool that exactly matches your workflow is often better spent implementing a tailor-made solution.

I have written a barebones guide to getting started with transformers.js :) I hope that data scientists and web devs can meet somewhere in the middle to take proper advantage of client-side ML inference in the browser #mlsky #ai #dkai rasgaard.com/posts/gettin...

Haven't ordered new books in a good while because I have been using the library so much but now that I got a few gift cards for my birthday I checked out these :) #booksky

how else can you describe the rehearsal other than just “pure art”

Wow: Bluesky drives almost the same traffic to my blog (The Pragmatic Engineer Blog) as Twitter/X. Data from last month. Just incredible that a site with ~33 million users and growing fast (Bluesky) punches as much as one that is supposedly 20x the size (Twitter/X is estimated around 600M users).

All the ways I want the AI debate to be better by Andy Masley https://andymasley.substack.com/p/all-the-ways-i-want-the-ai-debate #AI #tribes

My latest little hobby project: Fine-tuning a tiny 40M Whisper model specifically for Danish. Benchmarks look great! But how is practical performance? Not amazing..:) rasgaard.com/posts/whispe... #dkai #ai #mlsky

Welcome to even more cool tech profiles: @peterdalsgaard.bsky.social @jonasfrich.bsky.social @julielykkes.bsky.social @olifent.bsky.social @petersvarre.bsky.social @sunelehmann.com @adler-nissen.bsky.social Let me know if you know anyone who should be on the list! go.bsky.app/RDnpz5p 1/2

I have been working on bootstrapping AI evals before product launch and came up with something like this. Using QA to inform "problematic on purpose" synthetic data that can validate LLM-as-a-Judge. Would love to chat with people who have experience with this :) #mlsky #buildinpublic

I usually don't really like off-the-shelf AI evaluation tools. Like, what do you even mean when you measure "groundedness"? Using AI Toolkit for VS Code for LLM-as-a-Judge alignment evals has been the first time I have had an okay experience with a tool like this.

EuroSpeech: Massive Multilingual Parliamentary Speech Corpus - 78,100+ hours across 22 European languages - 50,500+ hours of quality-filtered data (CER < 20%) - Robust alignment algorithm for non-verbatim texts - Dramatically expands resources for 19+ languages huggingface.co/datasets/dis...

What a time to be alive

Gemma 3n E2B available through Edge Gallery: github.com/google-ai-ed... Running on my Nothing Phone 1 here. #MLSky

Driving AI at @ida.dk is today. I'm especially looking forward to hearing about: - Region H's experience with AI for handling skin cancer - The environmental impact of generative AI, from the people at ddsc.io - Vision Transformers from Capacit, who gave a maximally nerdy talk about DSPy last year. #dkai #dktech

I'd like to get better at sharing what I'm working on, even if it might be a bit unpolished. Here's the latest: Whisper-large-v3-turbo as a web app, 100% local so your data never leaves your computer. Works best in Chrome! (because of WebGPU support) rasgaard.com/p/transcribe/

I've been toying with an idea of annotation as part of the service: - Simple Whisper app that runs locally -> free with Transformers.js - Focus on UI/UX so it's slick and easy to use (probably the hardest part) - Make it possible to correct errors in the transcripts - Optional: Submit corrections + audio to a database

Low Tech AI!

Happily doing propaganda for small models and the new generation of open source AI startups in a very nice company (Pleias, Moondream, Prime Intellect). globalventuring.com/corporate/eu...

AttentionInfluence: for pretraining data selection. Good data matters, but how do you find it? This paper uses the attention heads from existing models to calculate & rank how valuable the data will be during training. Mask out critical heads and calculate the loss. arxiv.org/abs/2505.07293

[28/30] 86 Likes, 6 Comments, 1 Posts 2504.20752, cs․CL | cs․AI | cs․LG, 07 May 2025 🆕Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Roman Abramov, Felix Steinbauer, Gjergji Kasneci

Denmark has a Microsoft problem.

By the time the EU will make up its mind on LLM transparency regulation, pretraining in its current shape won’t exist anymore.

I'm really grateful that DHH is being invited to share his experience with open source when it comes to Denmark's dependence on American tech #dktech #dkpol His talk here is highly recommended! www.linkedin.com/posts/david-...

It is very easy to make mistakes when creating evals for your AI product. @sh-reya.bsky.social and I run through the most common errors in this talk. 35% discount code to our upcoming course in the video notes youtu.be/GL0XhAj5LPE?...

The “Paper Skygest” is a total validation of the bluesky thesis. Anyone can build a useful, tunable feed. It’s a bit sparse right now but it’ll be amazing once it takes off fully.

I'm impressed by how well Qwen3-0.6B runs with Transformers.js. It all runs locally in the browser(!) This is a minimal home-brewed example to understand a bit of what happens under the hood. @xenova.bsky.social has made a better demo :) Try it here: huggingface.co/spaces/webml...

PyData Copenhagen @ Google Denmark's offices 🐍🔥

Two Bluesky feeds I've been quite happy with lately: ML Ranked Feed: Posts about machine learning & AI ranked the same way as HackerNews bsky.app/profile/did:... Build in Public: People who build apps, services, etc. and post about it. Nice to follow along. bsky.app/profile/did:...

I'm not much of a small-talker. I'm more comfortable with conversations about something personal, important, interesting, or entertaining. But that's not good enough, say the extroverts; we need to small-talk more. So I set out to get better at it. www.zetland.dk/historie/sRn...

My kids and I vibe-coded a simple chore tracking app over the weekend 🙌 It was really fun. Now we'll see whether it actually works 😅 We're going to try it for a week and see how it goes.

Training Whisper-tiny on the Coral dataset. Excited to see how much you can get out of such a small (38M) model. As a bonus it's completely free: it can be done on free-tier GPU compute on Kaggle :)