OpenAI says ChatGPT's three-hour outage on December 11 was caused by a new telemetry service configuration deployed to collect Kubernetes control plane metrics
Can Blockchain Offer More Security? OpenAI : API, ChatGPT & Sora Facing Issues PYMNTS.com : OpenAI Says Deployment of Telemetry Service Caused 3-Hour Outage Simon Willison's Weblog : OpenAI's postmortem for API, ChatGPT & Sora Facing Issues (via) OpenAI had an outage across … Bluesky: Corey Quinn / @quinnypig.com : It's that “The Aristocrats!” joke except that the punchline is “Kubernetes!” [embedded post] Mastodon: Garry Knight / @garry@mstdn.social : That's a lot of jargon, but basically, the new telemetry service affected OpenAI's Kubernetes operations, including a resource that many of the company's services rely on for DNS resolution. DNS resolution converts IP addresses to domain names; it's the reason you're able to type “Google.com” instead of “142.250.191.78.” … LinkedIn: Rick Bullotta : But...wait...aren't hosted LLMs supposed to be awesome? Ha ha ha ha. — #ai #ml #generativeai #highavailability #ownyourownai #dontbeafool …