stepheng29.bsky.social
Industry Consultant at SAS Institute. Interested in the tech industry, AI/ML, data science, fintech. 49ers fan.
40 posts 90 followers 753 following

This is a great long list of resources to explore agent models...

Google released a list of 321 real-world use cases for Generative AI. Definitely worth a read through.

Exploring the gap between what LLMs really know vs what people think they know www.nature.com/articles/s42...

Singapore telco offering Perplexity Pro as a bundle, just like free Spotify or Netflix. www.techinasia.com/news/singtel...

Michael Barr's resignation as Fed VC of banking supervision is worrying to many who watch the industry. This may be the end of independent bank regulation and may mean a turnover of Fed VC with each administration.

This is very cool, a guide to building an OS in 1000 lines.

OpenAI o3 models achieve 87.5% on the ARC-AGI Eval, absolutely blowing away any of the previous models.

This presentation from @fromedome.bsky.social is always a good one every year

I'm very curious how real these claims actually are and tend to think this is PR for a future IPO against recent valuation adjustments. If a company could actually carve out 20% of their workforce with AI then they'd be crazy not to spin it off into what could easily be a unicorn.

YC's Request for Startups, Winter 2025: If you were to start an economy today from scratch....

That input image was the ultimate robustness test for computer vision and ML in 1989. Imagine trying some structural pattern recognition on this, which was popular at that time. But, @yann-lecun.bsky.social's convnets solved it 😍 He posted the video on LinkedIn: www.youtube.com/watch?v=H0oE...

Anthropic released the Model Context Protocol (MCP), which is intended to bridge the gap between data sources/systems and LLMs. The full protocol is open-source and already comes with integrations for things like Google Drive, Slack, Postgres, Git and more.

How awesome is Bluesky? The tools around it are going to become so amazing: Check out this list~ repost this if you know someone who might find it useful. 🔵🌍 github.com/fishttp/awes...

The whole "debanking" debate is based on a fundamental misunderstanding of the role of the regulators and risk management at a bank. Risk management is difficult and expensive; most FIs err on the side of caution for unfamiliar risk. This is a great overview and explanation below from Jason Mikula

Apparently LLMs perform much better in complex tasks when they are told to imitate rather than purely reason. 🎭 🔑 Acting-based prompting: Treat LLMs as performers, prompts as scripts.

As I've complained before in this here platform, AI gen'd images are taking over websites that us artists use for references. Here's an example, screenshot courtesy of a reddit post (from user MetaKnowing). Same thing is happening on DeviantArt, Pinterest etc. with no clear way to filter them out.

AI Agents Stack, Nov 2024

Perplexity keeps shipping very interesting features

A quick tip on coding with AI. If you do not get the code you are looking for by the third turn (i.e., request/response, clarification/response, clarification/response), start a new chat session. It does not get any better.

The consensus with Apple Intelligence seems to be that it's average at best. Especially the weirdness of notification summaries. This is still early days and no doubt will improve--I'm excited to see improvements to Siri or rumored cross-app programmable agentic behavior

While Bluesky looks like other social apps on the surface, it’s actually quite different. It’s an open network. I blogged about it here:

DynaSaur: The AI agent that evolves by coding → Writes new Python functions when needed → Reuses & optimizes actions via embeddings → Builds a growing library of tools Results: 🏆 Tops GAIA leaderboard 📈 +81.59% w/ human + AI tools 🚀 Excels in complex tasks (level 2 and 3)

Drag & Drop prompt engineering IDE to eval LLMs for agentic workflow. All OS and No-code.

An analysis by Stanford economists estimating that nearly a quarter of all U.S. innovation since 1976 can be credited to highly skilled, foreign-born individuals.

A Novel Being Written in Real-Time by 10 Autonomous AI Agents. At nlr.ai

DeepSeek-R1-Lite-Preview Test Number 2

From “Barbary Coast,” written in 1933 about San Francisco in 1849.

🔥 Stripe launches payment processing for AI agents - enables autonomous systems to handle transactions via LangChain, CrewAI & Vercel AI SDK. Includes virtual cards, usage-based billing & secure API access. Big step toward AI-driven commerce.

Maybe there is no money in payments after all... @Barclays exploring giving away 80% of its merchant acquiring business to @brookfieldcorp.bsky.social

AWS just released a new Multi-Agent AI framework. It lets you manage multiple AI agents, dynamically route LLM queries, maintain context across AI agents, and can be deployed locally on your computer. 100% open source.