Profile avatar
erikapullum.bsky.social
Engineer, climber, currently working as Head of Data at Hex. www.erikapullum.com
41 posts 285 followers 216 following
Regular Contributor
Active Commenter

I wish there was a way for the dbt Cloud Explore DAG to filter to content that exists *between* two nodes. Anyone know of a fun work-around for this with other tools? #dataBS

What's on my mind this week (as Head of Data at a growth stage startup) * Hiring 🔜 * Upcoming vendor contract negotiations * Priorities pulse check with our stakeholders * Couple lil IC tasks to standardize a metric a bit better #dataBS

Naming is hard. #dataBS

Perpetually flip flopping on self serve analytics Some days it feels like it could be the holy grail - enabling downstream users to do their own exploration and analysis on governed metrics. Fewer ad hoc requests! Reporting automation! Other days: So what? Does that actually move the needle?

Don’t think of it as whether the data has error or not. Most of the time it will. Think of it as whether the error in the data will cause you to make a different decision than if the data was perfectly clean. Most of the time it won’t.

One of the biggest tips I have for anyone doing data analysis, especially data from people, is to spend some time drilling down to the most granular data and just looking at individual records. You will find the craziest shit you never imagined and your analysis will be better for it #databs

My #dataBS new year's resolution is to stop feeling guilty when I tell someone to go self serve their QQ. What's yours?

Dear Santa, All I want for Christmas is for any of the abandoned GitHub repos that half-implemented the Hemingway writing app as a VS Code extension to finish the job. Please put your best TypeScript dev elf on this 🙏

Watching Hex's internal hack week demos is like Christmas early 😍

Has anyone cracked the code on the most efficient way to align on metric definitions with your customers - key word being efficient?

a poem by poetry: poetry run dbt compile command not found: dbt poetry show | grep dbt dbt 1.0.0.38.22 pip freeze | grep dbt dbt==1.0.0.38.22 which dbt dbt not found (look, no one said it was a happy poem) #dataBS

Fascinating! Confirmed this works in DuckDB, but would need some concepts ported to Snowflake to work in our warehouse. Nice work on this! #dataBS

Super #SQL brainteaser for your Tuesday that kept a few folks on our team thinking pretty hard until someone made an elegant solution. The desired behavior per unit: * First event qualifies * Subsequent events qualify only if it's been more than 90 days since the last Sample dates below #dataBS

Continuing my theme of banger PRs -- this one improved documentation for a field confusing to a lot of folks #dataBS

Just had a great chat about being the first data person at a startup. Pros * Building at high velocity is fun * Work with smart people Challenges * Always more than you can do * Hard to know how good to build when Folks that have done it, what's on your pro/challenge list? #datasky #dataBS

dbt job: "Can't parse a JSON blob, I'm out." me: *looks for a needle in the 100M-row haystack* me: "Aha! This looks fine? Guess I'll paste it in a text editor to double check." text editor: "Uh, boss" me: "Is that...is that a tiny red 'BS'?" BS broke my data. I can't make this shit up. #dataBS

This is a great post about a thoughtful data warehouse redesign! My favorite part is where @afioritto.bsky.social talks about how Hex’s schemas were organized before and their names, and how they were organized afterward and their names! #dataBS

Amanda did absolutely fabulous work on this project! We got a clean and shiny data warehouse, you get an interesting read. Everyone wins. #dataBS

Your Monday reminder: Lots of people who seem smart, talented, accomplished, and confident on social media also feel insecure, silly, and stupid sometimes. What matters is that you show up and keep learning, day after day. #dataBS

Them: These numbers don't match Me: Yes, because they are different Them: Oh Tale as old as time, song as old as rhyme, cohorts confuse everyone #dataBS

I am convinced that the YML-based semantic layer was as much invented by a data professional as the stiletto heel was invented by a woman (the stiletto heel was invented by a man) #dataBS

What's on my mind this week (as the Head of Data at a growth stage startup) * Renegotiating a vendor contract * Insights re:channel ROI for marketing * Team member priorities / projects * A blissfully low-meeting week #datasky #dataBS

10x analytics engineer kinda Friday

Data people kinda hate QQs, esp without context or the word "just". The right simple dataset to the right person at the right time unblocks decision making and leads to higher quality decisions. The challenge is sorting out which QQs are levers for business partners and which are a waste of time.

Do I know anyone in #dataBS who has built out a fairly full-scale metrics layer in MetricFlow (dbt) or Cube? Looking for someone who'd connect for ~30 with someone on my team starting the process for us at Hex!

Things on my mind this week (as a data leader at a growth stage startup) * Wrapping up the migration of our instrumentation platform * Next steps for channel viz / ROI for marketing * Checking in with my team on our operating rhythm change * Continuing convo w/ leadership re: our priorities