jimdowling.bsky.social
Co-founder and CEO of Hopsworks. Organizer of the feature store summit. I am writing a book on Building ML Systems for O'Reilly.
34 posts 178 followers 404 following

๐—ก๐—ผ, ๐—ฎ ๐—ฏ๐—ถ๐—ผ๐˜€๐—ถ๐—ด๐—ป๐—ฎ๐˜๐˜‚๐—ฟ๐—ฒ ๐˜„๐—ฎ๐˜€ ๐—ป๐—ผ๐˜ ๐—ท๐˜‚๐˜€๐˜ ๐—ฑ๐—ฒ๐˜๐—ฒ๐—ฐ๐˜๐—ฒ๐—ฑ ๐—ถ๐—ป ๐—ž๐Ÿฎ-๐Ÿญ๐Ÿด๐—ฏ'๐˜€ ๐—ฎ๐˜๐—บ๐—ผ๐˜€๐—ฝ๐—ต๐—ฒ๐—ฟ๐—ฒ. K2-18b is back in the news, now with a bold claim that biosignature molecules (DMS and/or DMDS) have been 'detected at 3ฯƒ'. Most exoplanet astronomers are extremely sceptical about these claims, let's see why (1/n). ๐Ÿ”ญ๐Ÿงช๐Ÿช #exoplanet

This is very cool. Lex and Aleksey show you how you can use an #LLM #assistant to generate code that automatically imports features into the Hopsworks feature store. This makes getting started with the #ai #lakehouse even easier! See youtu.be/wtEPONdKV6Y for another great episode of #showdonttell.

I was pleasantly surprised to see my "From zero to a working ML system with Python, free serverless services + FTI pipelines" talk ranked at 43 in most popular Python talks of 2024. Talk https://www.rfr.bz/be18657 Reference: https://www.rfr.bz/b78475a

I was pleasantly surprised to see my "Build TikTok with Hopsworks" talk ranked at number 6 in most popular Python talks of last year. TikTok with Hopsworks talk: https://www.rfr.bz/beb40a4 Source: https://www.rfr.bz/b58c8ae

This is an interesting dataset and paper by Netflix that in essence says: "By predicting the next interaction, our foundation model must build an internal model of consumer behavior." https://www.rfr.bz/b30d8e7

The top 10 fallacies of MLOps www.hopsworks.ai/post/the-10-...

What tables in the Data Warehouse should be exposed to data scientists to create features from? The gold layer is aggregated features. If you want to create your own aggregations, you need access to the raw events (bronze or silver layers). Reference: https://www.rfr.bz/b793ee3
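
To make the point about raw events concrete, here is a minimal sketch of a custom aggregation that a pre-aggregated gold table would not let you compute. The event schema (`user_id`, `ts`, `amount`) and the function name `spend_in_window` are hypothetical, chosen only for illustration; they are not taken from any Hopsworks API or from the referenced article.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def spend_in_window(events, as_of, window):
    """Sum `amount` per user over raw events with as_of - window < ts <= as_of.

    `events` is an iterable of dicts with hypothetical keys
    'user_id', 'ts' (datetime) and 'amount' (float).
    """
    start = as_of - window
    totals = defaultdict(float)
    for event in events:
        # Keep only events inside the half-open window (start, as_of].
        if start < event["ts"] <= as_of:
            totals[event["user_id"]] += event["amount"]
    return dict(totals)


if __name__ == "__main__":
    raw_events = [
        {"user_id": "u1", "ts": datetime(2024, 1, 2), "amount": 10.0},
        {"user_id": "u1", "ts": datetime(2024, 1, 7), "amount": 5.0},
        {"user_id": "u2", "ts": datetime(2024, 1, 8), "amount": 2.0},
    ]
    print(spend_in_window(raw_events, datetime(2024, 1, 8), timedelta(days=7)))
```

Because the window length is a parameter, the same raw events can yield a 1-day, 7-day, or 30-day feature, which is the flexibility lost when only the aggregated layer is exposed.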

Hopsworks is near the top of Hacker News today with an article on how we migrated our serverless platform from AWS to OVH (a European cloud company).

There is a concerted effort to drive a wedge between Europe and the US by the new US Administration. Apparently we should not be worried about Russia/Ukraine, but free speech and Greta are our big problems. I don't see Europe taking this lying down. www.bbc.com/news/article...

I was on Irish radio, Newstalk, this morning, talking with Joe Lynam about Hopsworks and AI in the EU. open.spotify.com/episode/0yBg...

Musk's Nazi salute was deliberate. He's outflanking Steve Bannon to appeal to the MAGA masses. He's afraid he'll lose access to his cash cow, Trump.

Tesla's proprietary charging infrastructure is now their weak point in Sweden. Massive queues. Unions have stymied rolling out Tesla charging stations. Commodity charging is fine. As Peter Thiel said "Competition Is for Losers". This is the playbook for how to fight back. x.com/i/status/187...

I was curious who Andy referred to as screwing over OtterTune. It's ASCII art in the HTML of their announcement that they were winding up.

Good reproducible benchmark study by @coiled.io. I get this question all the time from MLOps folks: what size of workloads can each framework handle?
* Pandas ~1 GB (I added this one for reference; it is not in the study)
* Polars ~10 GB
* DuckDB ~100 GB to 1 TB
* Dask and PySpark up to 10 TB
lnkd.in/dAxUAWC4

Looking forward to 2025, I have been working together with the #Hopsworks team to try to understand what the upcoming regulations are going to mean for the Financial Services Industry ( #fsi ). www.hopsworks.ai/post/breakin...

We have a strong background in formal methods and system reliability. So we've been surgically applying these techniques to scale our correctness efforts at Feldera. We've found them quite effective at catching bugs early during development, long before they can manifest in production.

Just saw that SageMaker is now rebranded as "SageMaker Lakehouse". However, there can be only one "AI Lakehouse"... Hopsworks www.hopsworks.ai/post/the-ai-...

AWS flexing its control of S3. A new S3 bucket type: Iceberg tables! aws.amazon.com/blogs/aws/ne... They are like S3 buckets, but you write Iceberg tables. AWS will do maintenance for you, make them available in AWS Glue for querying via Athena, etc. $26.5/TB/month for S3 tables. That's cheap.

I want to put together an infra engineering bsky list. Are you posting infrastructure software engineering content? Please reply so I can add you. Also, please repost 🔁 so I can get good reach.

Hadoop can still be repackaged and sold, so long as it is powering AI! Just saw Snowflake have acquired a fork of Apache NiFi (by the founders) that added vector embedding support for data pipelines - datavolo.io/comparing-da...

The single-most valuable tool when doing latency measurements is an ECDF (empirical cumulative distribution function). Collect every single latency sample, plot the ECDF to look at the *entire distribution*, and resist the temptation to compute summaries until you can explain the ECDF.
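
As a rough illustration of the ECDF advice above, here is a minimal standard-library sketch. The function names `ecdf` and `fraction_at_or_below` are hypothetical helpers, and the latency values are made-up sample data, not measurements from any real system.

```python
from bisect import bisect_right


def ecdf(samples):
    """Return (xs, ys): sorted samples and the fraction of samples <= each x."""
    xs = sorted(samples)
    n = len(xs)
    if n == 0:
        raise ValueError("need at least one latency sample")
    # The i-th smallest sample (0-based) has (i + 1) / n of the data at or below it.
    ys = [(i + 1) / n for i in range(n)]
    return xs, ys


def fraction_at_or_below(samples, threshold):
    """Evaluate the ECDF at an arbitrary threshold."""
    xs = sorted(samples)
    if not xs:
        raise ValueError("need at least one latency sample")
    return bisect_right(xs, threshold) / len(xs)


if __name__ == "__main__":
    latencies_ms = [12.0, 15.5, 11.2, 240.0, 13.1, 14.8, 12.9, 980.0]  # made-up
    xs, ys = ecdf(latencies_ms)
    for x, y in zip(xs, ys):
        print(f"{x:8.1f} ms  {y:.3f}")
```

The `(xs, ys)` pairs keep every sample, so they can be handed to any plotting library as a step plot to inspect the whole distribution, including the tail, before reducing it to a mean or a percentile.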