gregoryfaletto.com - Profile | ThreadSky | a Reddit-style client for Bluesky

comment in response to post

When did the topic change to how one can win an election?

submitted 9 hours ago

comment in response to post

Many mostly middle-aged and old white people who watch Fox really didn't like Biden. Show us the day that 2% of the population turned out to protest him?

submitted 10 hours ago

comment in response to post

Now if the researchers subsequently collected completely independent data to "validate" the green jelly bean hypothesis, that would of course be perfectly fine

submitted 1 day ago

comment in response to post

Yes, the fundamental problem is failing to correct for multiple hypothesis testing. The sense in which one could frame the issue as "timing" is if the researcher pretends only after seeing the p-values that actually they were really only interested in green jelly beans all along

submitted 1 day ago

comment in response to post

Well here's one example of what people are talking about when they say HARKing is bad

submitted 1 day ago

comment in response to post

The status quo where anybody who has any business publishing can submit to the most prestigious journals in their field, have a good chance of getting reasonable feedback from experts who otherwise might not have read the work, and maybe get published if it's warranted gives new names an opportunity

submitted 1 day ago

comment in response to post

Nice paper! You touch on related points a bit in S3, but I'm concerned that eliminating peer review takes away one of the few pathways new researchers have to make good work prominent without either being attached to an already-prominent researcher or being good at "marketing."

submitted 1 day ago

comment in response to post

I found this video super helpful, this was actually the first thing that made TMLE click for me. The meat of it is in the second half www.youtube.com/watch?v=8Q9d...

submitted 2 days ago

comment in response to post

would agree if it were june 2023, but why don't they have anything more baked by now? it blew my mind how long it took autocomplete in messages to noticeably improve. no application of LLMs could be more straightforward than literally predicting the next token

submitted 4 days ago

comment in response to post

Yes and unit economics will start to become more of the ballgame than they already are, especially if "reliably replacing a human end-to-end" is looking unachievable for the most part

submitted 5 days ago

comment in response to post

Yeah reasoning is kind of the biggest lift in the last couple of years I guess. I guess we'll see as really huge models get trained, maybe scaling laws will keep up... maybe they won't

submitted 5 days ago

comment in response to post

Yeah we'll see what happens, maybe a year from now we'll look back on current capabilities and feel like they're way worse. But when I look back at say last fall, the improvements feel noticeably slower than the improvements in the 9 months before that.

submitted 5 days ago

comment in response to post

(People keep saying "you can code things without even having to know how to code!" and that keeps... not really being true. Can code in languages that you don't really know if you're already a coder, sure.)

submitted 5 days ago

comment in response to post

One thing I've been thinking about: let's say the transformer architecture kind of plateaus at a point where they're v useful if you have pro-level knowledge in your field (coding etc.), but can't reliably replace a human unsupervised on end-to-end tasks. Could be a pretty nice sweet spot for labor

submitted 5 days ago

comment in response to post

Which model?

submitted 7 days ago

comment in response to post

Let's at least all be clear about who we're arguing about deserving more sympathy.

submitted 10 days ago

comment in response to post

I wish we'd acknowledge more that the biggest "losers" in the 2022 - 2024 economy was about the 70th - 90th percentile of income, who experienced rapid wage growth in the bottom quartile of earners as price increases. (chart is unfairly negative IMO since includes 2020, but couldn't find better one)

submitted 10 days ago

comment in response to post

Citizens have a civic duty to vote for whichever of the major-party candidates they think would be a better leader. Not doing so is a moral failing, and it's good for the country to discourage it. If the candidate you dislike less doesn't win, then the candidate you dislike more will.

submitted 10 days ago

comment in response to post

Vanilla Least Squares (VLS)

submitted 15 days ago

comment in response to post

Pretty unlikely it happened today

submitted 15 days ago

comment in response to post

It is. Try it yourself right now

submitted 15 days ago

comment in response to post

And I didn't bring up the topic, but I also think it's kind of not okay for news outlets to prioritize this topic (which it really feels like we have already gotten to the bottom of) over harm being done in the present that elected officials have the power to stop, but don't.

submitted 15 days ago

comment in response to post

I just disagree that the criticism is "proportional" in any reasonable sense. I guess my starting point for thinking about proportions is units like lives lost that can be directly tied to a specific policy choice, number of people wrongfully detained, number of laws broken, etc.

submitted 15 days ago

comment in response to post

Biden's age issues were the main news topic for almost a full month after the June 27th debate. There were certainly many other days where it was the leading story. In your view, how much additional criticism is needed to be proportional?

submitted 16 days ago

comment in response to post

The "why" is potentially more complicated depending on how deep you want to get into "why." (Looking at the system prompt would probably help.) But "what" is going on is perfectly clear, and IMO the less we lose sight of that (and the more the general public understands that) the better

submitted 17 days ago

comment in response to post

You push specific dough through this special machine, you get noodles. You push specific strings through those matrices, you get some text. Both machines can be quite useful for certain purposes

submitted 17 days ago

comment in response to post

Big numbers are getting pushed through a bunch of very particular matrix multiplications

submitted 17 days ago

comment in response to post

realizing I'm way behind for the day, making myself 9 full bowls of plain spaghetti, then interlacing my fingers and cracking my knuckles as I prepare to get started

submitted 18 days ago

comment in response to post

A personal assistant who is 80% as smart as you and can get stuff done while you get a glass of water is a massively useful tool

submitted 18 days ago

comment in response to post

Not that specific one, but some of these posts seem photoshopped, or at the very least not reproducible. When I see these posts sometimes I'll try the exact query on the exact model and get back a sensible response

submitted 18 days ago

comment in response to post

Let he who has never typed a function from memory and then realized they misremembered the exact name cast the first stone

submitted 18 days ago

comment in response to post

something I think about all the time is how many people there are whose only voluntary interaction with LLMs was trying whatever free OpenAI model was available months/years ago and getting frustrated

submitted 19 days ago

comment in response to post

That's very helpful, thank you!

submitted 20 days ago

comment in response to post

It's very important that we either forbid developers from making bad investments that would lose them money or forbid them from investments that would make them too much money, depending on which one of those would result in housing getting built

submitted 23 days ago

comment in response to post

☹️

submitted 28 days ago

comment in response to post

I can invert a 3 x 3 matrix by hand, doesn't mean it makes sense for me to actually do it!

submitted 33 days ago

comment in response to post

middle manager is one of the most underrated jobs. it is hard, people who are really good at it are rare, if you could snap your fingers and make every middle manager twice as good the world would be noticeably better.

submitted 35 days ago

comment in response to post

Yes, and precise positioning of text, images etc. using the "arrange" tab on the right side of the screen

submitted 36 days ago

comment in response to post

Coriolis hive activated

submitted 38 days ago

comment in response to post

To me it's like, sure calculators shouldn't be allowed on all tests, but it would also be insane to insist not only that kids should never learn how to use calculators, it's in fact shameful if they ever did

submitted 38 days ago

comment in response to post

It's kind of fascinating to me that some people think this is obviously cheating 100% of the time and there's never a place for it. These kids are going to graduate and the ones who didn't learn how to "cheat" like this are going to have employers who are pissed off at them for it.

submitted 38 days ago

comment in response to post

if you have a pressure cooker, this is really good. i substituted tempeh for mushrooms cooking.nytimes.com/recipes/1021...

submitted 39 days ago

comment in response to post

I think the difference to the argument is that instead of this term disappearing asymptotically because it's mean 0, it just wouldn't appear at all

submitted 41 days ago

comment in response to post

Indeed, and I believe the same argument in the post works for ANCOVA I too

submitted 41 days ago

comment in response to post

And how much will the resulting estimator look like vanilla ANCOVA II? It turns out they're asymptotically equivalent--makes sense since they have the same asymptotic properties. In this blog post, I walk through the math & supporting simulations. Check it out! gregoryfaletto.com/2025/05/03/e...

submitted 42 days ago

comment in response to post

So here's a question: what if we use ANCOVA II as the outcome model in a cross-fit AIPW estimator? Since treatment assignment is randomized, the propensity scores are known--errors are already o(1/√n). So even if ANCOVA II is misspecified, we should get consistency and asymptotic normality.

submitted 42 days ago

comment in response to post

If we use the AIPW estimator with cross-fitting, Theorem 5.1 of Chernozhukov et al (2018) guarantees consistent and asymptotically normal estimation of the ATE under unconfoundedness and overlap if the product of the rates of convergence of the propensity and outcome models are o(1/√n).

submitted 42 days ago

comment in response to post

Another setting where we can get consistent and asymptotically normal estimation of the ATE with a misspecified model is the doubly-robust AIPW estimator.

submitted 42 days ago

comment in response to post

In randomized experiments, the OLS coefficient on the treatment dummy is a consistent and asymptotically normal estimator for the average treatment effect (ATE). This holds regardless of whether the linearity assumption is true.

submitted 42 days ago