Profile avatar
zachrenwick.work
Data analytics engineer #SQL #DataModeling #Analytics #BigQuery #WebAnalytics zachrenwick.work
51 posts 125 followers 147 following
Regular Contributor
Active Commenter
comment in response to post
I agree but they've also been positioning Looker/BigQuery as a more open data stack recently. Specifically trying to market Looker modeler (the semantic layer part of Looker) as its own stand alone tool
comment in response to post
Yeah I've been following cube for a while now, lots of great new developments from their product, and love how they are breaking the metrics/semantic layer further out of the BI tools
comment in response to post
I never understood this, it's literally just extra lines of SQL code... because *reasons*
comment in response to post
There's so much value to add anywhere along the data chain. Honestly, blending business context (analyst focus) with data modeling is an extremely impactful role. Yes there's some shit AE models out there, but I've seen some terrible SQL written by data engineers too
comment in response to post
I tried that for Google Analytics 4 issues and it didnt work 😀
comment in response to post
2) Set up reverse ETL data feeds that power machine learning experiences on a website (fit predictor)
comment in response to post
1) Designed and built an ecommerce data mart from scratch, integrating GA4 and GA UA data to provide continuous web analytics reporting before and after the Universal Analytics sunset
comment in response to post
Integrate their product? Or buy the [future] competition?
comment in response to post
how is your neck not absolutely wrecked?
comment in response to post
Very often I will see heavy sampling.. like 2% of available data with GA4 explorations We use the GA4 UI less and less.. and BigQuery / data mart is our "source of truth" for everything.
comment in response to post
damn I've found my people
comment in response to post
andddd that fixed our paid session count bug:
comment in response to post
I fixed it with this little bit of code: CASE WHEN REGEXP_CONTAINS(page_location, r'gclid=.+') AND traffic_source_source = 'google' THEN 'cpc' /* Known GA4 misattribution bug */ ELSE traffic_source_medium END AS traffic_source_medium
comment in response to post
There were records where traffic source medium was "organic", even though that same hit contained a GCLID and the traffic_source name was 'google' This should be impossible, as the GCLID would only populate on an ad (and therefore should be cpc/paid traffic source medium)
comment in response to post
Any data folks from bench accounting, feel free to reach out to me, we’ve got some data roles open in Vancouver
comment in response to post
Your longest ticket took you 48 days
comment in response to post
Oh damn there’s also this: github.com/hafenkran/du...
comment in response to post
One warning: if you are using events_ and events_intraday, prior day data can exist in both datasets for a certain amount of time. Had to build in logic to prefer full day export if exists (was not needed in GA UA)
comment in response to post
The native one (GA4 to BigQuery)? Yes, for events_ and events_intraday (not accurate) Have fun modelling sessions. The schema has also changed over time with new fields added
comment in response to post
comment in response to post
The default chart look and feel is much better than I can do with Python. And frankly this workflow is just so much faster while still be reproducible (enough)
comment in response to post
I did something like: with events_dates as ( select distinct date from ga4_events ) select * from ga4_events union all select * from events_fresh left outer join events_dates on events_fresh.event_date= events_dates .date where events_dates .date is null
comment in response to post
Not disagreeing but one Grafana feature that looked v useful was the ability to define dashboards as code. Tableau versioning and CI/CD is awful, are there other BI tools that are better on this front?
comment in response to post
Our federated data dept is 60+ ppl, so the specialization makes sense. But I definitely see how you would want more of a generalist for smaller teams
comment in response to post
I think AE role being too narrrow depends on org and data team size. I’m an analytics engineer and there’s no shortage of work or value we can help push at the company. I don’t handle ingestion but I model/transform/join everything from raw data to final BI/self serve datasets.
comment in response to post
BigQuery & dataform (like dbt but integrated into GCP)
comment in response to post
Strava guy!