2025-02-22
This example from their paper ( https://pub.sakana.ai/...), which is claimed to have 150x speedup, is actually 3x slower if you bench it... [image]
TechCrunch
Sakana AI walks back claims that its new AI CUDA Engineer can speed up AI training by up to 100x, after complaints about worse-than-average training performance
Kyle Wiggers / TechCrunch :
2025-02-20
This example from their paper ( https://pub.sakana.ai/...), which is claimed to have 150x speedup, is actually 3x slower if you bench it... [image]
Nikkei Asia
Tokyo-based Sakana AI details its AI CUDA Engineer, which it says can speed up AI training and inference by 10x to 100x by “breeding” efficient instructions
TOKYO — Tokyo-based startup Sakana AI says it has developed a system capable of accelerating artificial intelligence development …
2024-06-29
@OpenAI wow, that's pessimistic [image]
Wired
OpenAI details CriticGPT, a GPT-4 model fine-tuned to catch errors in ChatGPT's code output, assisting human trainers tasked with assessing and spotting errors
meet OpenAI's new bug hunter Markus Kasanmascheff / WinBuzzer : OpenAI Introduces CriticGPT for Better AI Training OpenAI : Finding GPT-4's mistakes with GPT-4 Donna Eva / Analytic...
2024-06-28
@OpenAI wow, that's pessimistic [image]
Wired
OpenAI details CriticGPT, a GPT-4 model fine-tuned to catch errors in ChatGPT's code output, assisting human trainers tasked with assessing and spotting errors
Having humans rate a language model's outputs produced clever chatbots. OpenAI says adding AI to the loop could help make them even smarter and more reliable.
2024-03-19
@_akhaliq is it just me or do none of the examples look like they're lipsynced lol
VentureBeat
Google Research details VLOGGER, an AI model that can generate lifelike videos of people speaking, gesturing, and moving, from a single photo and an audio clip
2023-10-20
sooooo is nobody talking about the research paper that got published with this post? [image]
The Verge
OpenAI rolls out DALL-E 3 access to ChatGPT Plus and Enterprise subscribers, after preparing a safety mitigation stack to ready the model for expanded release
all in real-time. https://openai.com/... [image] Greg Brockman / @gdb : DALL·E 3 is now available to all ChatGPT Plus & Enterprise users: @swyx : In a Surprising-for-these-times mo...