It took 5 budget VPSes sharing the load for about 2 days to scrape all 11k posts from an artist who died. I adored their work so much that I wanted to archive it on my own non-public file server.
Comments
Same level of complexity, just in a different direction. I spent more time figuring out infrastructure than I did writing the scripts to do the scraping. With Jetstream it's the opposite tbh. I spent more time figuring out how to read the blobs than worrying about infrastructure.