Profile avatar
malcolmbarrett.malco.io
Ph.D., epidemiology. research software engineer @ Stanford Health Policy. living in Ann Arbor. open-source data science. causal inference. f.o.p. (friend of Piglet's). doing poems on aircrafts. approximately Bayesian. formerly Posit, Apple. 心を燃やせ。
438 posts 4,914 followers 620 following
Prolific Poster

"Selection bias: where are we now?" Incredibly excited to announce the next Causal Inference Interest Group seminar with @haidonglu.bsky.social will take place on 10th March 2025 at 3pm GMT (11am EDT). Don't miss this one! Register at: turing-uk.zoom.us/meeting/regi... #EpiSky #CausalSky #CIIG

Government Advances in Statistical Programming conference, 25-26 June 2025 statmodeling.stat.columbia.edu/2025/02/22/g...

🚨 The "penguins" data is coming to the base #rstats{datasets} in 4.5.0! @ellakaye.co.uk & @HeathrTurnr.fosstodon.org.ap.brid.gy r prepared an adapted and a raw version of the data set based on the {palmerpenguins} 📦 by @allisonhorst.bsky.social @apreshill.com and Kristen Gorman. 📈 Scatter plot:

If anyone who receives this would like help writing a suitable prompt injection for when they use an LLM to evaluate responses, I am at your service. DM or @hmason.64 on Signal.

ICPSR at University of Michigan has been one of the longest-running secure data archives in the world and is stepping up to the challenges being presented in the current U.S. environment to keep government data as a resource for researchers and policymakers. isr.umich.edu/giving/suppo...

One more bonus update with two changes: 1. A diverging color palette centered at 4% lets us see which counties are doing great or doing poorly 2. That size legend was spaced funnily, but we can make it more compact with {legendry} #rstats www.andrewheiss.com/blog/2025/02...

Obsidian is now free for work. Starting today, the Obsidian Commercial license is optional. Anyone can use Obsidian for work, for free. Explore the organizations that support Obsidian on our site. obsidian.md/blog/free-fo...

Friendly reminder to get your free Covid tests before they throw them away. Covidtests.gov wapo.st/42VLGQh

Which of these features would you like us to implement first in the nanoparquet R package? See options below. #RStats #parquet nanoparquet.r-lib.org

the single most un-american and anti-constitutional statement ever uttered by an american president

AAAAAAAAAAAAAAAAAAAAHHHHHHHHH

Giving infectious disease research a break, as promised by the MAHA agenda. Let's break down what exactly this Executive Order is *really* saying. Fortunately, I speak fluent anti-vax grifterese & can translate. www.whitehouse.gov/presidential...

On the other hand Roses are red Violets have blue tints You shouldn’t use euphemisms If your goal is causal inference

Not sure where to find datasets that were previously hosted on government websites? The @busph.bsky.social Center for Health Data Science has developed a streamlined interface to let you search for essential public data: www.FindLostData.org Please share!

So, this struck a nerve with people. What would it take to do this nationwide? Invite local press to local speak-outs on research. Tell national press we're doing this across the country. Ideas of who could help with this?

Steve Levitsky, co-author of How Democracies Die "the U.S. is sliding toward a more 21st-century model of autocracy: competitive authoritarianism" www.theatlantic.com/ideas/archiv... "A failure to resist... could pave the way for authoritarian entrenchment" www.foreignaffairs.com/united-state...

JUST IN: Federal judges orders HHS, CDC, and FDA to restore “by no later than 11:59 pm” today their websites and datasets to pre-January 30th status. storage.courtlistener.com/recap/gov.us...

This is how easy 5Calls makes it for you:

AND IT’S ILLEGAL FOLKS: “None of the funds appropriated in this title may be used to modify or implement any change to the indirect cost rates applied to grants & contracts funded by the NIH w/o the prior approval of the Committees on Appropriations of the House of Representatives & the Senate.” 1/

is it woke to want to end cancer?

GT autotheming, anyone?

Here is what the censoring of data.cdc.gov looks like over time

Hey Leipzig people! The one and only Saloni Dattani (@scientificdiscovery.dev) is giving a talk on one of the big questions—what do we really know about the world? Check out the invitation at openscience-leipzig.org, or let me know if you'd like me to send it to you via email!

US folks: Call your Senators to urge them to oppose RFK Jr.'s nomination to Secretary of HHS 5calls.org/issue/robert...

Parallelization just landed in the dev version of purrr: purrr.tidyverse.org/dev/referenc... Really pleased that the mirai framework makes this possible. Huge credit to the tidyverse maintainers @hadley.nz @lionelhenry.bsky.social and @davisvaughan.bsky.social ! #rstats #tidyverse

We're excited to announce Catherine Nelson as keynote speaker at posit::conf(2025)! Author of "Software Engineering for Data Scientists," she'll share how data scientists can benefit from software engineering best practices! Sept 16-18 in Atlanta. pos.it/conf #PositConf2025 #Python #rstats #pydata

There is a lot of bad public health information out there and it's only getting worse. So I'm starting a newsletter to help you distinguish the bad from the good. E is for Epi: coming soon! Link: open.substack.com/pub/epiellie...

Course registration is OPEN! 📢 Join CAUSALab this summer to learn from the #causalinference experts on campus @harvardchanschool.bsky.social or online. Spots are limited! Learn more & register today: causalab.sph.harvard.edu/courses/ #publichealth #healthdata #epidemiology

Yeah so the Constitution isn’t really in effect right now

Is there a mirror of US Census TIGER/Line files somewhere? None of the downloads work from the official website.

gahhh I use these shapefiles all the time IPUMS has a bunch of them still, fortunately usa.ipums.org/usa/volii/bo...

The data at USAID's ForeignAssistance dot gov was removed over the weekend, but it seems to be back (for now). As a backup, I've uploaded the data both as static CSV files and a queryable API - Details: andrewheiss.github.io/foreignassis... - API: foreignassistance-data.andrewheiss.com

Two dates regarding posit::conf(2025): - The call for talks is open until Feb 14 (that's the extended deadline). - The deadline for applications for the Opportunity Scholarships is Feb 21. #dataBS #RStats

Conservative budget wonks (Manhattan Instiutte, Tax Foundation) confirming that this is a consittutional crisis. Not liberal hysteria. Happening right now.

Hearing another Census data purge is coming Prioritize downloads of anything IPUMS does not have They are targeting anything with race and sex

Never dreamed that being friends and colleagues w tons of data nerds would be an asset in the war for democracy

Congressional Reps storm USAID was not on my bingo card for 2025 💪🏼💪🏽💪

I saw this over on LinkedIn and am sharing here in case this information is helpful to anyone - data that was available on the CDC before January 28th has been archived and is available on archive.org. We should probably be making a few extra copies... archive.org/details/2025...

Feed billionaires to Titans www.npr.org/2025/02/03/n...

All of this from @pastpunditry.bsky.social. The crisis is not *legible*. Even politically attentive people have no idea - the media are not conveying the scope and magnitude of what's happening. It has to be made legible through political and institutional resistance, not passivity and resignations.

Hey remember when George Lucas was like “hurr durr Trade Wars” and we were like that’s so silly and he was like no wait that’s how Emperor Palpatine comes to power and we were like you are obviously dumb Jorge.

The Internet Archive has to date downloaded 500 terabytes of US government websites, which it crawls at the end of every presidential term. The whole archive is fully searchable. This effort's housed by a donation-funded nonprofit, not a branch of the US government. blog.archive.org/2024/05/08/e...

Census back up www.census.gov

“A group of researchers and students at the Harvard T.H. Chan School of Public Health is gathered today for a data preservation marathon, scraping and downloading data related to health equity from U.S. government agency websites before they disappear.”

they are stealing data from the american people and depriving us of the transparency that we are entitled to by law. none of this is legal and not a single federal employee has any reason to comply.

The API is up though!