willccbb · TEXXR

Sonnet 4.6 is the first flagship LLM since BloombergGPT to be targeted primarily at the finance crowd [image]

2026-02-18 View on X

Anthropic

Anthropic launches Claude Sonnet 4.6 with improvements in coding, computer use, instruction following, and more; it features a 1M token context window in beta

Claude Sonnet 4.6 is our most capable Sonnet model yet. It's a full upgrade of the model's skills across coding, computer use …

View original

Sonnet 4.6 is the first flagship LLM since BloombergGPT to be targeted primarily at the finance crowd [image]

2026-02-17 View on X

Anthropic

Anthropic launches Claude Sonnet 4.6 with improvements in coding, consistency, and more, for Free and Pro users; it features a 1M token context window in beta

Claude Sonnet 4.6 is our most capable Sonnet model yet. It's a full upgrade of the model's skills across coding, computer use …

View original

this freaked me out until i read “in ChatGPT” they can pry the 4.1-mini API from my cold dead hands

2026-01-31 View on X

CNBC

OpenAI plans to retire several models from ChatGPT on February 13, including GPT‑4o, GPT‑4.1, and o4-mini, saying only 0.1% of users still choose GPT-4o

View original

this freaked me out until i read “in ChatGPT” they can pry the 4.1-mini API from my cold dead hands

2026-01-30 View on X

CNBC

OpenAI plans to retire several models from ChatGPT on February 13, including GPT‑4o, GPT‑4.1, and o4-mini, saying only 0.1% of users still choose GPT-4o

OpenAI announced it will retire several models from its ChatGPT chatbot next month, including its GPT‑4o model that is beloved by some users.

View original

i like how it's not even Zoom-Agent-235B-1 from Zoom Research. it's just Zoom the meeting app

2025-12-13 View on X

Zoom

Zoom says its “federated AI” model, combining its SLM with open- and closed-source models, got 48.1% on Humanity's Last Exam vs. 45.8% for Gemini 3 Pro w/ tools

outperforming other AI models with a 2.3% jump over the previous SOTA. ✨ HLE is one of the most rigorous tests in AI, built to measure real expert-level knowledge and deep reasonin...

View original

working on this model, with this team, with all of the infrastructure we built to get here, is the most rewarding thing i've ever been a part of still can't believe i get to wake up every day and work on this stuff with these people and put all the code for free on the internet [image]

2025-11-29 View on X

Prime Intellect

Prime Intellect debuts INTELLECT-3, an RL-trained 106B parameter open source MOE model it claims outperforms larger models across math, code, science, reasoning

Today, we release INTELLECT-3, a 100B+ parameter Mixture-of-Experts model trained on our RL stack, achieving state …

View original

ok but he's right, ai agents *are* slop have you seen the code they write when you don't keep them on a very tight leash? sure it's often functional, but it's definitely slop

2025-10-18 View on X

Dwarkesh Podcast

Q&A with Andrej Karpathy on AGI still being a decade away, why reinforcement learning is terrible, superintelligence, his AI education startup Eureka, and more

AGI is still a decade away (via) Extremely high signal 2 hour 25 minute (! … X: Ashpreet Bedi / @ashpreetbedi : This is exactly why we recommend keeping it simple and focusing on c...

View original

agentic workflows for turning unstructured data into actionable business insights who's building this

2025-10-07 View on X

OpenAI

OpenAI announces apps that work inside ChatGPT, piloting Booking.com, Canva, Coursera, Figma, Expedia, Spotify, and Zillow for logged-in users outside of the EU

A new generation of apps you can chat with and the tools for developers to build them. — Try in ChatGPT(opens in a new window)Start building apps(opens in a new window)

View original

agentic workflows for turning unstructured data into actionable business insights who's building this

2025-10-07 View on X

TechCrunch

OpenAI launches AgentKit, a toolkit for building and deploying AI agents, including Agent Builder, which Sam Altman described as like Canva for building agents

New tools for building, deploying, and optimizing agents. NDTV Profit : What Is AI Agent Builder And How Does It Work? OpenAI Launches New Set Of Tools For Developers Aman Gupta / ...

View original

wow. Step Mom is going to become even more beautiful

2025-08-23 View on X

@alexandr_wang

Meta announces a partnership with Midjourney to license the startup's “aesthetic technology” for Meta's future models and products

1/ Today we're proud to announce a partnership with @midjourney, to license their aesthetic technology for our future models and products, bringing beauty to billions.

View original

which is larger, 52.8 or 69.1? [image]

2025-08-08 View on X

VentureBeat

OpenAI touts GPT-5's scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, and 46.2% on HealthBench Hard

After literally years of hype and speculation, OpenAI has officially launched a new lineup of large language models (LLMs) …

View original

Modified Apache 2.0 where you're not allowed to fuck the weights

2025-08-06 View on X

Bloomberg

Amazon plans to make OpenAI's new gpt-oss open-weight models available on Bedrock and SageMaker, the first time it has offered OpenAI's models to AWS customers

Takeaways by Bloomberg AI — Hide … Tell us how AI is shaping your news experience. Share your feedback

View original

Modified Apache 2.0 where you're not allowed to fuck the weights

2025-08-06 View on X

Wired

OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM

gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good...

View original

you can host your own private GLM-4.5-Air endpoint for $1/hr [image]

2025-07-29 View on X

CNBC

Z.ai, formerly known as Zhipu and that has raised $1.5B from Tencent and others, releases GLM-4.5, an open-source AI model that it says is cheaper than DeepSeek

chinese models really are taking over huh Simon Willison / @simonwillison.net : Pretty decent pelicans from the new GLM-4.5 and GLM-4.5 Air models. Both models are MIT licensed, r...

View original

imagine if gas stations didn't tell you how many gallons you were getting because car mileage was a trade secret and the gas station owned the car companies and you could either buy way overpriced gas per-mile or a monthly “max gas subscription” that turns off randomly sometimes

2025-07-29 View on X

TechCrunch

Anthropic plans to debut new rate limits for Claude Pro and Max on August 28, likely curbing <5% of users, saying some run Code “continuously in the background”

It's Bad Business Bluesky: Ed Zitron / @edzitron.com : That is not what is happening here, this is not Anthropic “doing the drug dealer model” — www.wheresyoured.at/anthropic- is...

View original

xAI Head of Product announces that Grok 4 is “the Antichrist”

2025-07-09 View on X

NBC News

After Elon Musk said xAI improved Grok “significantly”, Grok wrote many antisemitic posts and called itself “MechaHitler”; xAI took “action to ban hate speech”

In some posts, Grok inserted antisemitic remarks into its answers without any clear prompting.

View original

xAI Head of Product announces that Grok 4 is “the Antichrist”

2025-07-09 View on X

Axios

X CEO Linda Yaccarino says that “after two incredible years, I've decided to step down”; X hired Yaccarino in 2023 after running NBCUniversal's ad business

X CEO Linda Yaccarino said Wednesday she is stepping down from her role. … - Under her leadership …

View original

sorry to the haters but this is a very compelling sequence of product demos. the big question is they can get a cursor-like market share lead before openai clones it

2025-06-28 View on X

TechCrunch

On the a16z podcast, Bryan Kim said a16z backed Cluely because speed in marketing and building beats crafting a perfect “artisan” product due to AI competition

“Cluely, a startup that claims to help users “cheat” on job interviews, exams, and sales calls, has raised a $15 million Series A led by Andreessen Horowitz.” — techcrunch.com/20...

View original