yampeleg · TEXXR

This is Qwen3-Coder-480B-A35B (not the 30B). Roughly the same performance as Claude Sonnet, at 6x the speed. Speechless.

2025-08-02

Cerebras

Cerebras announces the $50/month Code Pro and the $200/month Code Max plans, offering users access to Qwen3-Coder at speeds of up to 2,000 tokens per second

Two interesting examples of inference speed as a flagship feature of LLM services today. Bluesky: Tim Kellogg / @timkellogg.me : Cerebras Code — use models hosted on Cerebras with ...

This is a huge upgrade! Imo codex is one of the best coding tools out there today and my main issue with it was its lack of internet access. I wrote about my experience with codex here, it should be even more incredible now: https://x.com/...

2025-06-04

TechCrunch

Mistral releases Mistral Code, a “vibe coding” client forked from open-source project Continue, in private beta on JetBrains development platforms and VS Code

French AI startup Mistral is releasing its own “vibe coding” client, Mistral Code, to compete with incumbents like Windsurf …

This is a huge upgrade! Imo codex is one of the best coding tools out there today and my main issue with it was its lack of internet access. I wrote about my experience with codex here, it should be even more incredible now: https://x.com/...

2025-06-04

Simon Willison's Weblog

OpenAI updates its coding agent Codex with internet access, which is turned off by default, and expands its availability from ChatGPT Pro to ChatGPT Plus users

New features, fixes, and improvements to Codex in ChatGPT — Agent internet access David Gewirtz / ZDNET : You can use OpenAI's super powerful AI coding agent Codex for just $20 n...

ChatGPT pro coming?
- Unlimited access to o1, o1-mini, GPT-4o
- Unlimited advanced voice
- Access to “o1-pro mode”
- $200 per month
source: https://web.archive.org/... [image]

2024-12-06

The Verge

OpenAI launches ChatGPT Pro, a $200/month plan with unlimited access to o1, GPT-4o, and more, plus an o1 version that uses more compute for better responses

12 Days of OpenAI: Day 1 Alan Velasco / HotHardware : OpenAI Unveils A Turbocharged $200 ChatGPT Pro Tier For AI Power Users Reece Rogers / Wired : Here's What OpenAI's $200 Monthl...

Qwen team somehow destroyed every single commercial model using 32B parameters

2024-11-29

TechCrunch

Alibaba releases 32.5B-parameter QwQ-32B-Preview under Apache 2.0 and claims the “reasoning” AI model beats OpenAI's o1-preview on the AIME and MATH tests

Introduction QwQ-32B-Preview is an experimental research model developed … Ananya Gairola / Benzinga : Alibaba's New AI Model Outperforms OpenAI's o1 In Specific Benchmarks, Now Av...

so I think it is safe to assume that all major players have reached the limits of training longer and collecting more data already.. It is all about data quality now.. which takes time..

2024-11-10

The Information

Sources: the jump in quality from GPT-4 to Orion is far smaller than the jump from GPT-3 to GPT-4; Orion may not outperform predecessors in tasks like coding

The number of people using ChatGPT and other artificial intelligence products is soaring. The rate of improvement …

Heard a leak from one of the frontier labs (not oai tbh), they reached an unexpected HUGE wall of diminishing returns trying to brute-force better results by training longer & using more and more data.. (more severe than what is published publicly)

2024-11-10

The Information

Sources: the jump in quality from GPT-4 to Orion is far smaller than the jump from GPT-3 to GPT-4; Orion may not outperform predecessors in tasks like coding

The number of people using ChatGPT and other artificial intelligence products is soaring. The rate of improvement …

what a symbolic moment..

2024-11-03

Bloomberg

Nvidia will replace Intel in the Dow Jones Industrial Average on November 8, after Amazon replaced Walgreens in February 2024; Intel joined the index in 1999

Intel's 25-year reign has come to an end Varsha Agarwal / DNA India : Nvidia to replace rival Intel in Dow Jones indices by November 8 Kara Greenberg / Investopedia : Nvidia To Tak...

what a symbolic moment..

2024-11-02

Bloomberg

Nvidia will replace Intel in the Dow Jones Industrial Average on November 8, after Amazon replaced Walgreens in February 2024; Intel joined the index in 1999

- Intel, Dow Inc. are set to leave the 30-member index — Nvidia up 3.2% in post-market trading, while Intel is down 2%

AI search battle: SearchGPT vs GeminiGrounded vs Perplexity vs Grok. Live on ThursdAI, come join! [image]

2024-11-01

Bloomberg

OpenAI unveils ChatGPT Search to let paid users search for timely information using GPT-4o, after testing the feature in July 2024, a direct challenge to Google

and I'm shocked by the results Anthony Cuthbertson / The Independent : OpenAI turns ChatGPT into a search engine Hayden Field / CNBC : OpenAI launches ChatGPT search, competing wit...

Note: these are different from the normal quants you know and use every day. First, they used quantization-aware training on the original dataset. Alright, very useful, thank you! BUT they also did it with LoRA. First time I've seen this. VERY interesting idea, lots of potential

2024-10-25

SiliconANGLE

Meta debuts “quantized” versions of Llama 3.2 1B and 3B models, designed to run on low-powered devices and developed in collaboration with Qualcomm and MediaTek

so today we're releasing new quantized versions of Llama 3.2 1B & 3B that deliver up to 2-4x increases in inference speed and, on average, 56% reduction in model size, and 41% redu...

How the hell is Phi-3.5 even possible?
- Phi-3.5-3.8B (Mini) somehow beats LLaMA-3.1-8B.. (trained only on 3.4T tokens)
- Phi-3.5-16x3.8B (MoE) somehow beats Gemini-Flash (trained only on 4.9T tokens)
- Phi-3.5-V-4.2B (Vision) somehow beats GPT-4o (trained on 500B tokens)
how? lol [image]

2024-08-21

VentureBeat

Microsoft releases three Phi-3.5 models designed for basic/fast reasoning and more, available for developers to download, use, and fine-tune on Hugging Face

Microsoft isn't resting its AI success on the laurels of its partnership with OpenAI. — No, far from it.

@AIatMeta this is insane

2024-06-28

VentureBeat

Meta releases LLM Compiler, a family of models built on Code Llama specifically designed for code optimization tasks, available in 7B- and 13B-parameter sizes

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama … Chris Cummins / Meta : Meta Large Language Model Compiler: Foundation Models of Compiler Optimization Rafl...

Mistral just released a new SOTA code model!
TL;DR:
- Outperforms DeepSeek 33B while being much smaller.
- Released under the new Mistral AI Non-Production License. [1]
- The first 22B coming from Mistral, probably the same architecture as the experts in Mixtral-8x22B.
- The

2024-05-30

TechCrunch

Mistral AI releases 22B-parameter Codestral, its first generative AI model for coding, trained on 80+ programming languages and prohibited for commercial use

You need to agree to share your contact information to access this model Deepti Pathak / Fossbytes : Mistral AI Launches Codestral: AI Code Generation Across 80 Programming Languag...

Tried this already, it didn't improve the training for me. It is probably more efficient above a certain number of tokens that I didn't reach. Cool idea and very easy to try!

2024-05-07

VentureBeat

A study by Meta researchers suggests that training LLMs to predict multiple tokens at once, instead of just the next token, results in better and faster models

LLM approach to predict multiple tokens KAN: Kolmogorov-Arnold Networks —"promising alternatives to Multi-Layer Perceptrons" [image] Ethan / @ethan_smith_20 : it was only briefly t...

It is so fun that Meta doesn't try to hide the uploads of LLaMA-3 before the official announcement because it is open source and there are no secrets anyway..

2024-04-19

The Verge

Meta details Llama 3: 8B- and 70B-parameter models, a focus on reducing false refusals, and an upcoming model trained on 15T+ tokens that has 400B+ parameters

What To Know About ‘Llama 3’ Model Marcus Gopolang Moloko / Memeburn : Meta AI with built in Llama 3 is on WhatsApp in South Africa Hamsat Abdurasheed / News.ng : Meta releases Lla...

Meta just dropped a banger: LLaMA 2 Long... The model weights are not out yet. Hopefully Soon! 🙏

2023-09-30

VentureBeat

Meta quietly unveils Llama 2 Long, which has been trained with longer sequences, outperforming GPT-3.5 Turbo and Claude 2 when responding to long user prompts

Meta Platforms showed off a bevy of new AI features for its consumer-facing services Facebook, Instagram and WhatsApp …

Long LLaMA 2
The strongest versions of LLaMA 2 to-date!...
Summary: Amazing work from meta, as always!
Takeaways:
- Do not train with long context from scratch (switch at the 80% mark)
- You do not need long instruct datasets. You can generalize to long context via long pretraining....

2023-09-30

VentureBeat

Meta quietly unveils Llama 2 Long, which has been trained with longer sequences, outperforming GPT-3.5 Turbo and Claude 2 when responding to long user prompts

Meta Platforms showed off a bevy of new AI features for its consumer-facing services Facebook, Instagram and WhatsApp …
