2025-11-19
Keep hearing from the GDM imo team @lmthang and @jj_at_brown @quocleix etc that the IMO gold methods are completely general purpose and not IMO specific that is attributed to an improvement in gemini, not some scaffolding. Then I try gemini 3 (first time using gemini since 2.5
matt shumer
Gemini 3 hands-on: a fundamental improvement on daily use, extremely fast, Antigravity IDE is a powerful launch product, and its personality is terse and direct
Gemini 3 is a fundamental improvement on daily use, not just on benchmarks. It feels more consistent and less “spiky” than previous models.
Benchmaxxed. No good at vibeproving
matt shumer
Gemini 3 hands-on: a fundamental improvement on daily use, extremely fast, Antigravity IDE is a powerful launch product, and its personality is terse and direct
Gemini 3 is a fundamental improvement on daily use, not just on benchmarks. It feels more consistent and less “spiky” than previous models.
Keep hearing from the GDM imo team @lmthang and @jj_at_brown @quocleix etc that the IMO gold methods are completely general purpose and not IMO specific that is attributed to an improvement in gemini, not some scaffolding. Then I try gemini 3 (first time using gemini since 2.5
The Information
Gemini co-lead Oriol Vinyals says Gemini 3's gains come from better pre-training and post-training, contradicting the idea that pre-training gains are falling
which we discussed in our NeurIPS '25 talk with @ilyasut and @quocleix—the team delivered a drastic jump. The delta between 2.5 and 3.0 is [image] Andrej Karpathy / @karpathy : I p...
Benchmaxxed. No good at vibeproving
The Information
Gemini co-lead Oriol Vinyals says Gemini 3's gains come from better pre-training and post-training, contradicting the idea that pre-training gains are falling
which we discussed in our NeurIPS '25 talk with @ilyasut and @quocleix—the team delivered a drastic jump. The delta between 2.5 and 3.0 is [image] Andrej Karpathy / @karpathy : I p...