rasmus1610.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

Super interesting! I'm currently evaluating the new Gemini models for handwritten text extraction, and somehow Gemini 2.0 Flash has problems with actual words. German or English, doesn't matter. Haven't had the same problem with Flash Lite.

submitted 14 days ago • 0 comments

Late last year I converted some of my projects to fastht.ml and have been doing all new projects with it. My productivity and enjoyment have increased significantly. An absolute joy to use. A thousand thanks to @howard.fm et al. for the fantastic tooling.

submitted 17 days ago • 1 comment

Here is why I think reasoning models (like DeepSeek R1) are a huge step forward. (and it's not necessarily their superior reasoning performance) blog.mariusvach.com/posts/i-love...

submitted 23 days ago • 0 comments

Llama 3.3 70B with speculative decoding on @groq.com is absolutely crazy. The answers come instantly.

submitted 25 days ago • 0 comments

The effective use of LLMs IS a skill to be learned, just like using Google effectively is a skill too.

submitted 26 days ago • 0 comments

Is there a Vim mode for Microsoft Word? Vim motions really tend to infect everything you do on your computer.

submitted 28 days ago • 0 comments

chat.deepseek.com seems to have some performance issues right now :D The hype got real.

submitted 28 days ago • 0 comments

There is still room for an intuitive LLM app library like llama_index or langchain. Man, these two libraries are a mess and so bloated. The source code is unreadable. That's the problem when you try to do everything all at once.

submitted 32 days ago • 0 comments

Back from holidays to build cool shit and talk about it :)

submitted 39 days ago • 0 comments

That’s one of my favorite posts by @morganhousel.bsky.social, and there are some striking similarities to the work of Hartmut Rosa on "Resonance". In the modern world we put too much emphasis on being efficient and correct. Super interesting stuff. collabfund.com/blog/intelli...

submitted 50 days ago • 0 comments

It’s amazing how hard it is to beat BM25 for retrieval, especially in realms with specialized language like medicine.

submitted 69 days ago • 0 comments

Super exciting stuff. I hope this will lead to smaller, more capable models!

submitted 70 days ago • 0 comments

I feel like there should be an ML/AI version of this: "Training!" xkcd.com/303/

submitted 70 days ago • 1 comment

editorialmanager.com is another business ready to be disrupted.

submitted 76 days ago • 1 comment

I can now run a GPT-4 class model on my laptop (The exact same laptop that could just about run a GPT-3 class model 20 months ago) The new Llama 3.3 70B is a striking example of the huge efficiency gains we've seen in the last two years simonwillison.net/2024/Dec/9/l...

submitted 77 days ago • 11 comments

This is just some nasty drumming www.youtube.com/watch?v=CQRX...

submitted 78 days ago • 0 comments

Resonates...

submitted 78 days ago • 4 comments

New blog post! How to use Python decorators to define layouts in fastht.ml blog.mariusvach.com/posts/decora...

submitted 79 days ago • 0 comments

After a very stressful AoC 2024 day 6, maybe today we'll do something easier: building a random forest library or some web dev. That problem yesterday stressed me out :D

submitted 79 days ago • 1 comment

AoC animated #adventofcode #aoc

submitted 79 days ago • 1 comment

I love how fast you can spin up a jupyter lab instance using `uv`: `uv run --with jupyter jupyter lab` This `uv run` is so helpful.

submitted 81 days ago • 0 comments

I tried out the new SmolVLM model together with ColPali for a visual RAG pipeline. It's really nice to run everything on 15 GB of VRAM. Unfortunately, the QA abilities of SmolVLM are not as strong as I would have wished. Here is a notebook with the code: colab.research.google.com/drive/10ZnqA...

submitted 88 days ago • 0 comments

That was fast: The makers behind ColPali are already cooking up a version using the new SmolVLM model by @hf.co as the backbone. Exciting stuff! huggingface.co/vidore/colsm...

submitted 88 days ago • 0 comments

This little Python class comes in very handy if you want to have JavaScript-style function chaining like: "Pipe(sample_text).pipe(split_into_words).pipe(convert_to_lowercase).pipe(remove_punctuation).pipe(count_words).value" Python supports a much more functional programming style than I thought.

submitted 88 days ago • 1 comment
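
The post above names a `Pipe` class but does not show it, so here is a minimal sketch of what such a wrapper could look like. Only the names `Pipe`, `.pipe`, `.value`, and the four step functions come from the post; the class body, the bodies of the helper functions, and the sample text are assumptions made for illustration, not the author's actual code.

```python
import string
from collections import Counter


class Pipe:
    """Wrap a value so functions can be chained left to right (assumed design)."""

    def __init__(self, value):
        self.value = value

    def pipe(self, func, *args, **kwargs):
        # Apply func to the wrapped value and wrap the result again,
        # so that another .pipe(...) call can follow.
        return Pipe(func(self.value, *args, **kwargs))


# Hypothetical step functions; the post only gives their names.
def split_into_words(text):
    return text.split()


def convert_to_lowercase(words):
    return [word.lower() for word in words]


def remove_punctuation(words):
    # Strip leading/trailing punctuation and drop tokens that become empty.
    stripped = (word.strip(string.punctuation) for word in words)
    return [word for word in stripped if word]


def count_words(words):
    return Counter(words)


sample_text = "The quick brown fox. The lazy dog!"

word_counts = (
    Pipe(sample_text)
    .pipe(split_into_words)
    .pipe(convert_to_lowercase)
    .pipe(remove_punctuation)
    .pipe(count_words)
    .value
)
```

Returning a new `Pipe` from every call keeps each step free of side effects, and the extra `*args, **kwargs` let a step take additional parameters, for example `Pipe(3).pipe(pow, 2).value`.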

One big advantage of a new social network is that you can curate your feed anew. I realized that I want to read more about AI/ML engineering/research and web development and less about indie hacking. The first one is much more authentic…

submitted 88 days ago • 0 comments

Very exciting release. I love these small LLMs/VLMs recently

submitted 90 days ago • 0 comments

Did I already say that I love alpinejs.dev ? Anyway, I love alpinejs.dev! h/t @calebporzio.bsky.social

submitted 91 days ago • 2 comments

Needed a simple carousel component for a client project. claude.ai needed one prompt to create a React carousel component that does exactly what I need. Five years ago this would have meant fiddling around with obscure packages. Crazy times.

submitted 92 days ago • 0 comments

Having 30 followers here is more satisfying than 500 followers on Twitter, of which 300 are bots and only 20 people actually see your posts.

submitted 92 days ago • 0 comments

All of this generative AI stuff is nice and all, but a lot of ML applications are still "discriminative" and it's a little sad that research on this has moved into the background. Is there any good research on zero-shot/few-shot classification abilities of Llama, Claude etc. vs BERT?

submitted 92 days ago • 1 comment