ericjpedersen.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

comment in response to post

Yep, the coverage statements for CIs for GAMs in mgcv are averaged "across the function" (which is, confusingly, not the same thing as a simultaneous interval for the whole function; @gsimpson.bsky.social's got a nice post on the difference between these fromthebottomoftheheap.net/2016/12/15/s...)

submitted 8 days ago

comment in response to post

The coverage statements for CIs / p-values for penalized terms are averaged across the domain of the penalized regression function though; so e.g. individual CIs for specific random effect levels might be overly conservative or optimistic

submitted 8 days ago

comment in response to post

It is possible to calculate valid p-values with penalization by adjusting degrees of freedom for penalization; see Wood 2006 (onlinelibrary.wiley.com/doi/10.1111/...), and mgcv. It relies on some standard asymptotic arguments for calculating the values, but generally good small sample properties

submitted 8 days ago

comment in response to post

I found the cross-country comparisons of rise and fall in murder rates vs lead blood levels by Kevin Drum in Mother Jones to be some of the most convincing evidence for me www.motherjones.com/kevin-drum/2...

submitted 12 days ago

comment in response to post

Strong argument that it's leaded gasoline driving a lot of that increase and decline

submitted 12 days ago

comment in response to post

The link function is actually the key part of GLM (and related frameworks), even though everyone focuses on the marginal distribution. It also allows for better modelling of dependency (via e.g. basis functions), on the link scale

submitted 12 days ago

comment in response to post

The fact that most of the frauds found are either in high profile journals or just really incompetent frauds makes me think this is a "only 10% of the iceberg is visible" situation

submitted 25 days ago

comment in response to post

I think GenAI is likely accelerating the issue, but even before chatGPT came out I received a lot of pushback from 2nd year undergrads about any writing or quantitative q's. In my first year of teaching I was amazed to hear that students thought a 3-5 sentence question was a "long answer question"

submitted 25 days ago

comment in response to post

I do have a bit of a hard time visualizing this as a dag though. Maybe: Dice distribution -> Roll1 Roll1+->Decision to measure Decision to measure+-> measurement 1 Roll1 +-> measurement 1 Dice distribution -> Roll2 Roll2 +-> measurement 2

submitted 26 days ago

comment in response to post

I haven't seen it framed like this, but regression to the mean seems like a good application of DAGs. Realized roll depends on average, and decision to measure depends on realized roll. Second measurement only depends on realized value.

submitted 26 days ago

comment in response to post

"Was Fëanor the Andrew Tate of the First Age? In this essay, I will..."

submitted 38 days ago

comment in response to post

And the HGAM paper only exists because of social media! It started when @noamross.net tweeted about tensoring random effects and smoothing splines and got us all curious what we could do with them

submitted 42 days ago

comment in response to post

Teaching in French was banned in Manitoba for more than 50 years, and the Quebec government at the time supported Manitoba banning it, because (from what I understand of it) it strengthened the case for Quebec's own language policies

submitted 43 days ago

comment in response to post

The Quebec act was declared one of the Intolerable Acts in part because it gave *too many* rights and freedoms (e.g. freedom to practice Catholicism) to the residents of Quebec after the Seven Years' War

submitted 43 days ago

comment in response to post

Good for me! I appreciate the discussion. Also, it's always nice to get to dig into matrix algebra as a break from grading :)

submitted 43 days ago

comment in response to post

But Julia's original point was that *just because X2 and X3 are strongly correlated, including them both does not necessarily impair the precision of our effect of interest*. I was trying to demonstrate that point.

submitted 43 days ago

comment in response to post

A couple quick corrections (I wrote this too fast): the matrix should be: M = n*[A B] [B' C] and SE(b1)^2 = (SE(res)^2*M^{-1})[1,1] (inverse of M times residual standard error) so SE(b1)^2 = SE(res)^2*1/n(1-BC^{-1}B')^-1) But the same argument still holds

submitted 43 days ago

comment in response to post

But I think that rough intuition is wrong here; just because (X'X)^-1 is hard to invert does not mean that the standard error for any specific parameter is large

submitted 43 days ago

comment in response to post

But if B / B' are "small", then BC^-1B' should be close to zero regardless unless C^-1 is very large (very poorly conditioned)

submitted 43 days ago

comment in response to post

(as a note: A,B, and C should all be multiplied by n above) Inverse of a block matrix can be inverted blockwise (en.wikipedia.org/wiki/Block_m...) and the top left element of that inverse will have a value of: (A - BC^-1B')^-1 3/n

submitted 43 days ago

comment in response to post

Since all variables have mean 0, SD of 1, then X'X is just the correlation matrix Cor(X). We can write it as a block matrix: X'X = [A B] [B'C] (A=1, B = [cor(X1,X2), X1,X3], C = Cor(X2:X3)) SE(b) = (X'X)^-1[1,1], or the first diagonal element. 2/n

submitted 43 days ago

comment in response to post

Even if X'X is nearly non-invertible, that does not imply that the SE for any given parameter is large. To see this: Fit Y ~ b1*X1 +b2*X2 + b3*X2, and we care about the SE(b1). [Y, X1:X3 are all centered and scaled] We assume that Cor(X2, X3) is "large", but Cor(X1, X2) and Cor(X1,X2) is "small"

submitted 43 days ago

comment in response to post

You could just compute moments for each dist and summarize the change in the first few moments

submitted 55 days ago

comment in response to post

A single distance measure might not be the best option here... We'd expect the posterior location (median, mode etc. ) to shift, but the posterior variance should also decrease. Diff distance measures will be sensitive to different components

submitted 55 days ago

comment in response to post

I knew there was a reason I'm holding on to those vga and mini usb cables...

submitted 70 days ago

comment in response to post

God I hate April Fools Day. The "now the internet is unreadable" day

submitted 72 days ago

comment in response to post

Yep. The way I think about it now is "the sample mean and se tell you about how certain you should be if your model is right. The rest of the data is there to explain to you why that assumption is wrong."

submitted 73 days ago

comment in response to post

I still find the idea of sufficiency to be slightly counter-intuitive. "What do you mean all that matters is the sample average? I collected 10,000 data points, should I just ignore them!?"

submitted 73 days ago

comment in response to post

This is not where I expected them to take the Ratatouille sequel

submitted 83 days ago

comment in response to post

He has a lot of practical experience at designing exploding rockets; that has to count for something

submitted 83 days ago

comment in response to post

In 1912, she moved to Berlin to begin working with Leonor Michaelis in enzyme kinetics. With him, she developed the Michaelis-Menten Equation. The equation shows that the rate of reaction rate in enzymes increases to saturation as the substrate concentration increases. 🧵4/10

submitted 83 days ago

comment in response to post

I'd love to give it a try!

submitted 155 days ago

comment in response to post

The article does mention that these inconsistencies were at one point considered evidence of possible variation in the speed of light (I.e. a failure to replicate) rather than ignored systematic errors, which I think reinforces @devezer.bsky.social's original point re: lack of replication

submitted 158 days ago

comment in response to post

Thanks for tracking this down! I realized yesterday there was no reference in that doc for the CIs and I was going nuts trying to track down the source for the intervals

submitted 158 days ago

comment in response to post

By strict standards of replication, the estimated speed of light wasn't replicable based on highly inconsistent CIs, even though the initial estimates were within 1% of the modern value (figure from www.olympus-lifescience.com/en/microscop...)

submitted 159 days ago

comment in response to post

Just to name a few ways: the Culture is deeply non hierarchical, fundamentally pro-trans, anarchist and anti-capitalist. Every member of the Culture would find Elon's day-to-day behavior completely inexplicable

submitted 159 days ago

comment in response to post

By strict standards of replication, the estimated speed of light wasn't replicable based on highly inconsistent CIs, even though the initial estimates were within 1% of the modern value (figure from www.olympus-lifescience.com/en/microscop...)

submitted 159 days ago

comment in response to post

Also, the fact that Musk says he loves the Culture novels shows that he has the reading comprehension of a rock. The books couldn't be more antithetical to everything he's done than if they were written with him in mind

submitted 159 days ago

comment in response to post

This is a great idea! I'd love to be added too

submitted 166 days ago

comment in response to post

submitted 166 days ago