Solid OpenAI post-mortem on @kubernetes.io API overload, caused by "per-node telemetry ingestion" https://status.openai.com/incidents/ctrsv3lwd797 🤔

Oddly familiar to a common DaemonSet @prometheus.io / @opentelemetry.io gotcha for metrics scrapes, we talked about in the past:
https://youtu.be/yk2aaAyxgKw?t=768 🙈

Comments