I heard people wanted a million Bluesky posts? So I made an open-source script that allows anyone to scrape the Bluesky firehose and collect everything people post.

This will be a useful resource to anyone who wants to archive this data or train generative AI. Have fun!

https://github.com/deepfates/bsky-scraper

Comments