alekseykulikov.bsky.social • 86 days ago

Here's my idea for Vital Score:

• shows % of traffic passing CWVs
• no weights, shows the progress, easy to compare, evolves CWV assessment
• formula: (LCP+CLS+INP+ min(LCP, CLS, INP)) / 4

What do you think? #webperf

Thx @timvereecke.bsky.social for min() trick and @csswizardry.com for CrRRUX
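The formula in the post can be written as a small function. This is a minimal sketch, assuming each input is the percentage (0 to 100) of traffic with a "good" value for that metric; the function name `vital_score` is made up for illustration.

```python
def vital_score(lcp: float, cls: float, inp: float) -> float:
    """Vital Score as posted: (LCP + CLS + INP + min(LCP, CLS, INP)) / 4.

    Each argument is assumed to be the percentage (0-100) of traffic
    with a "good" value for that metric.
    """
    return (lcp + cls + inp + min(lcp, cls, inp)) / 4


print(vital_score(90, 80, 70))  # (90 + 80 + 70 + 70) / 4 = 77.5
```

Adding the minimum as a fourth term counts the weakest metric twice, so it pulls the score down more than a plain average would.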

Comments

kojordan.com•85 days ago

I like the idea, it's nice and simple.

I use a calculation with the established 75th percentiles:
For each metric
- 100% if 75th is Good
- 0% if Poor
- linearly declining if in between
and then (lcp+cls+inp)/3.

Yours rewards pages where more than 75% experience good vitals, which is great.
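The p75-based calculation described in this comment could look roughly like the sketch below. The thresholds are the published "good" and "poor" CWV boundaries; the function names, and the choice to reach exactly 0% at the "poor" boundary, are assumptions and not taken from the comment.

```python
# Published CWV boundaries at the 75th percentile: (good, poor).
THRESHOLDS = {
    "lcp": (2500.0, 4000.0),  # milliseconds
    "cls": (0.1, 0.25),       # unitless
    "inp": (200.0, 500.0),    # milliseconds
}


def linear_score(p75: float, good: float, poor: float) -> float:
    """100 at or below `good`, 0 at or above `poor`, linear in between."""
    if p75 <= good:
        return 100.0
    if p75 >= poor:
        return 0.0
    return 100.0 * (poor - p75) / (poor - good)


def p75_score(lcp_ms: float, cls: float, inp_ms: float) -> float:
    """Plain average of the three per-metric scores: (lcp + cls + inp) / 3."""
    values = {"lcp": lcp_ms, "cls": cls, "inp": inp_ms}
    scores = [linear_score(values[m], *THRESHOLDS[m]) for m in THRESHOLDS]
    return sum(scores) / len(scores)


print(linear_score(3250, 2500, 4000))  # halfway between the boundaries: 50.0
```

Because each metric is capped at 100% once its p75 is "good", this variant stops rewarding further improvement past the threshold, which is the difference the comment points out.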

alekseykulikov.bsky.social•85 days ago

Yes, I went through similar logic.
75 is the magic number for passing CWV. After that, you just make the website faster, which is rewarded with a higher score.

cdaveross.bsky.social•86 days ago

Tested it myself, & it shows promise for quick comparisons of a cohort with CrUX data.
It *does* potentially mask a site that does extremely well on one metric & poorly on the other two (e.g. CLS 91%, LCP 69%, INP 66%; VS = 73%)

But correcting for that would add complexity. 🤔

alekseykulikov.bsky.social•85 days ago

Maybe adjust weight to the min value? Like Johannes suggested: https://bsky.app/profile/powdercloud.bsky.social/post/3lckaxunno22q

screenspan.net•86 days ago

I've been using:

• "% Good CWV" – i.e. the % of all page views that support the CWV which have "good" LCP, CLS and INP. That can be used for different levels of aggregation (URL-level, page types, origin).

• Displaying "Passed" or "Failed" – i.e. if the p75 for the CWV are "good".
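The "% Good CWV" number could be computed from raw page-view records along these lines. This is a sketch under assumptions: the record shape (a dict with `lcp`, `cls`, `inp` keys) is hypothetical, and "support the CWV" is read here as "the page view reported all three metrics".

```python
# Upper bounds of the "good" range for each metric.
GOOD_LIMITS = {"lcp": 2500.0, "cls": 0.1, "inp": 200.0}


def percent_good_cwv(page_views):
    """Percentage of page views where LCP, CLS and INP are all "good".

    Page views missing any of the three metrics are skipped (assumption).
    Returns None when no page view reported all three.
    """
    supported = [
        pv for pv in page_views
        if all(pv.get(m) is not None for m in GOOD_LIMITS)
    ]
    if not supported:
        return None
    good = sum(
        1 for pv in supported
        if all(pv[m] <= GOOD_LIMITS[m] for m in GOOD_LIMITS)
    )
    return 100.0 * good / len(supported)
```

The same function works at any aggregation level, since it only depends on which page views are passed in (one URL, a page type, or a whole origin).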

benschwarz.bsky.social•86 days ago

The scoring calcs seem reasonably sound. What happens when there's a NULL CLS or INP?

I am (slightly) surprised to see people reaching for yet another 'one true metric' given all the historical issues of having such a score.

alekseykulikov.bsky.social•86 days ago

Null values should be ignored. For a null INP: (LCP + CLS + min(LCP, CLS)) / 3

There are reasons for the score:
• compare sites/pages
• LCP/CLS/INP names and meanings are complex to understand for non-tech people
• It's an evolution of CWV assessment (not a new metric)

What do you think are the downsides?
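The null handling described in this reply generalizes naturally: sum the metrics that are present, add their minimum, and divide by the count plus one. A sketch, assuming `None` stands for a missing metric; returning `None` when all three are missing is an assumption the reply does not cover.

```python
from typing import Optional


def vital_score_nullable(
    lcp: Optional[float],
    cls: Optional[float],
    inp: Optional[float],
) -> Optional[float]:
    """Vital Score that ignores missing (None) metrics.

    With all three present this is (LCP + CLS + INP + min) / 4;
    with INP missing it becomes (LCP + CLS + min(LCP, CLS)) / 3.
    """
    present = [v for v in (lcp, cls, inp) if v is not None]
    if not present:
        return None  # assumption: no data means no score
    return (sum(present) + min(present)) / (len(present) + 1)
```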

benschwarz.bsky.social•85 days ago

Hard in 300 chars, but:

* CrUX & RUM won't have the same score
* CWV assessment is binary, not a range/score
* Many incorrectly thought LH score was a ranking factor and will likely do the same with a score (instead of assessment pass/fail)

alekseykulikov.bsky.social•85 days ago

As I remember, ranking is not binary. It goes from poor=0 to good=1 ("needs improvement" is not 0). In this case, the score correlates with ranking, especially if you account for the popularity of pages/groups/origins and that it's subject to change.
Also, it quantifies UX improvements, which is the main goal.

benschwarz.bsky.social•85 days ago

Yep, it looks good 👍

powdercloud.bsky.social•85 days ago

Cool. I like "avoid hiding poor values" - others may like "see the progress". :-)
I think you're taking a weighted average between two methods: an average, and a min, such that the average has a weight of 3/4 and the min has a weight of 1/4. So depending on preference, it could be tweaked, right?

alekseykulikov.bsky.social•85 days ago

Yes, it's indeed a weighted average between min & avg approaches.
Do you think 3/5 and 2/5 or 1/2 and 1/2 would make more sense?
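The weighted-average reading can be made explicit with a tunable weight for the min term. A minimal sketch; `blended_score` and `min_weight` are illustrative names. With `min_weight=0.25` it reproduces the original formula, because (3/4) * avg + (1/4) * min = (LCP + CLS + INP + min) / 4.

```python
def blended_score(lcp: float, cls: float, inp: float,
                  min_weight: float = 0.25) -> float:
    """Weighted average of the plain mean and the minimum of the three metrics.

    min_weight=0.25 matches the posted formula; 0.4 and 0.5 correspond to
    the 3/5 + 2/5 and 1/2 + 1/2 splits mentioned in the thread.
    """
    if not 0.0 <= min_weight <= 1.0:
        raise ValueError("min_weight must be between 0 and 1")
    values = (lcp, cls, inp)
    average = sum(values) / len(values)
    return (1.0 - min_weight) * average + min_weight * min(values)


# Same inputs, increasing weight on the weakest metric.
for w in (0.25, 0.4, 0.5):
    print(w, round(blended_score(91, 69, 66, min_weight=w), 1))
```

A higher `min_weight` makes the score track the weakest metric more closely, at the cost of hiding progress on the other two.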

powdercloud.bsky.social•84 days ago

I probably shouldn't have an opinion on specific weights but rather appreciate that there's an option to tweak, and perhaps even learn what people prefer. :-) Thank you!

rviscomi.dev•86 days ago

How about just min(LCP, CLS, INP) 😈

timvereecke.bsky.social•86 days ago

You want a score that improves/degrades.

If you just use (min) you could go from (95, 97, 72) to (76, 78, 73) and be happy you “improved”
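The example above can be checked numerically. In this sketch the order of the three values is arbitrary, since both min() and the posted formula are symmetric in LCP, CLS and INP; `vital_score` is an illustrative name.

```python
def vital_score(a: float, b: float, c: float) -> float:
    """Posted formula: (sum of the three metrics + their minimum) / 4."""
    return (a + b + c + min(a, b, c)) / 4


before = (95, 97, 72)
after = (76, 78, 73)

# min() alone reports a small improvement...
print(min(before), "->", min(after))  # 72 -> 73

# ...while the blended score reflects the large drop in the other two metrics.
print(vital_score(*before), "->", vital_score(*after))  # 84.0 -> 75.0
```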

andydavies.me•86 days ago

Something I’ve learnt from 14 yrs of trying to produce a score from multiple metrics is that it’s hard, but people love the idea of ‘one’ number, particularly for reporting up

andydavies.me•86 days ago

A weakness with @csswizardry.com’s approach of comparative ranking is that a site’s score can change even when its metrics don’t change, due to changes in the performance of the other sites in the cohort

Lighthouse has this issue which is why the scoring curves haven’t been updated for a while

andydavies.me•86 days ago

I’m tempted to try the Lighthouse scoring mechanism but remove SI and TTI and then adjust the proportions of the other metrics, but I’ve not got around to it

Challenge with all this is you’ve got to look at a lot of data to validate it

csswizardry.com•86 days ago

No, that’s deliberate. I don’t want a standalone ranking; I want a score in context. Including being overtaken.

tomnomnom.com•85 days ago

I think this makes sense if there's an overall upwards trajectory, where you get left behind if you don't keep up with industry improvements... If there's a downwards trajectory however, you can get better by doing nothing 😅

csswizardry.com•85 days ago

A bear chasing you and your pal, you just need to be faster than them.

tomnomnom.com•85 days ago

Exactly! But if your buddy runs towards the bear... 🙃
