_akhaliq · TEXXR

qwen3-coder-plus is now available on Anycoder. Enhanced terminal task capabilities and better performance on Terminal Bench (w/ Qwen Code / Claude Code). SWE-Bench performance up to 69.6. Safer code generation available as Qwen3-Coder-Plus-2025-09-23 [video]

2025-09-24 View on X

Simon Willison's Weblog

Alibaba releases the Qwen3-VL vision models, the Qwen3Guard “safety moderation” models, and three closed-weight models, including Qwen3-Max with 1T+ parameters

Julian Nabil / Forbes Middle East: Alibaba Introduces Qwen3-Max AI Model With Over 1T Parameters. Markus Kasanmascheff / WinBuzzer: Alibaba...

View original

qwen3-coder-plus is now available on Anycoder. Enhanced terminal task capabilities and better performance on Terminal Bench (w/ Qwen Code / Claude Code). SWE-Bench performance up to 69.6. Safer code generation available as Qwen3-Coder-Plus-2025-09-23 [video]

2025-09-24 View on X

Bloomberg

Alibaba's Hong Kong-listed shares hit a nearly four-year high after CEO Eddie Wu announced plans to increase AI spending beyond the $53B target over three years

Alibaba Group Holding Ltd.'s shares surged to their highest in nearly four years after revealing plans to ramp up AI spending past …

View original

Qwen3-Max-Preview (Instruct), biggest model yet from Qwen, with over 1 trillion parameters, is now available in anycoder. Benchmarks show it beats previous best, Qwen3-235B-A22B-2507. Voxel Pagoda garden, 1 shot [image]

2025-09-06 View on X

VentureBeat

Alibaba debuts Qwen3-Max-Preview, its largest AI model with over 1T parameters, showcasing strong benchmark performance; the model is not open source

Chinese e-commerce giant Alibaba's “Qwen Team” of AI researchers has done it again. After a busy summer in which the AI lab released …

View original

DeepSeek-V3.1 ball bouncing inside a spinning hexagon with @FireworksAI_HQ in anycoder, one shot [video]

2025-08-22 View on X

Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 Tobias Mann / The Register: DeepSeek's new V3.1 release points to potent new Chinese chips coming soon. Hugging F...

View original

DeepSeek-V3.1 ball bouncing inside a spinning hexagon with @FireworksAI_HQ in anycoder, one shot [video]

2025-08-21 View on X

Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …

View original

Qwen-Image @Alibaba_Qwen, a 20B MMDiT model for text-to-image generation, is now available in anycoder using @replicate for generating images for your apps. You can now generate images directly inside anycoder for your apps when vibe coding [image]

2025-08-06 View on X

VentureBeat

Alibaba releases Qwen-Image, an open-source AI image generation model focused on accurately rendering text, with support for alphabetic and logographic scripts

After seizing the summer with a blitz of powerful, freely available new open source language and coding focused AI models …

View original

Alibaba just dropped ZeroSearch on Hugging Face. Incentivize the Search Capability of LLMs without Searching [image]

2025-05-09 View on X

VentureBeat

Alibaba researchers detail ZeroSearch, a technique that allows LLMs to develop advanced search capabilities via simulation, claiming it cuts costs by up to 88%

Researchers at Alibaba Group have developed a novel approach that could dramatically reduce the cost and complexity …

View original

Xiaomi just dropped MiMo on Hugging Face. Unlocking the Reasoning Potential of Language Model From Pretraining to Posttraining. The final RL-tuned model, MiMo-7B-RL, achieves superior performance on mathematics, code and general reasoning tasks, surpassing the performance of [image]

2025-04-30 View on X

Bloomberg

Xiaomi unveils open-source AI reasoning model MiMo, joining other Chinese tech leaders hoping to make a splash in the burgeoning AI field endorsed by Beijing

Xiaomi debuted MiMo a day after Alibaba unveiled the latest version of its own flagship model, amplifying a race between China's tech players …

View original

Alibaba just released Qwen2.5-VL-32B-Instruct on Hugging Face. further optimize this VLM with reinforcement learning and have found significant improvements in human preference and also mathematical reasoning [image]

2025-03-25 View on X

Simon Willison's Weblog

Alibaba releases Qwen2.5-VL-32B, a 32B open model under Apache 2.0, claiming better math reasoning and alignment with human preferences than earlier 2.5 models

Qwen2.5-VL-32B: Smarter and Lighter. The second big open weight LLM release from China today - the first being DeepSeek v3-0324.

View original

[image]

2025-02-18 View on X

Bloomberg

Source: Ilya Sutskever's startup Safe Superintelligence is raising $1B+ at a $30B+ valuation led by Greenoaks Capital, which plans to invest $500M

that outsmarts humans in a safe way—is heading for a $30 billion-plus valuation The Information : Ilya Sutskever's Startup in Talks to Raise Financing at $30 Billion Valuation Meyt...

View original

OpenAI o3-mini System Card [image]

2025-02-01 View on X

TechCrunch

OpenAI launches o3-mini, its latest reasoning model that the company says is largely on par with o1 and o1-mini in capabilities, but runs faster and costs less

OpenAI on Friday launched a new AI “reasoning” model, o3-mini, the newest in the company's o family of reasoning models.

View original

Qwen2.5-Max just one shotted this prompt: write a script for three bouncing yellow balls within a sphere, make sure to handle collision detection properly. make the sphere slowly rotate. make sure balls stays within the sphere. implement it in p5.js. developers can start using [video]

2025-01-29 View on X

Reuters

Alibaba releases Qwen 2.5-Max, an AI model that the company's cloud unit claims “outperforms” GPT-4o, DeepSeek-V3, and Llama-3.1-405B “almost across the board”

Chinese tech company Alibaba (9988.HK) on Wednesday released a new version of its Qwen 2.5 artificial intelligence model …

View original

Qwen QVQ is now available in anychat. This may be the first open-weight model for visual reasoning. It is called QVQ, where V stands for vision. It just reads an image and an instruction, starts thinking, reflects while it should, keeps reasoning, and finally it generates its [image]

2024-12-26 View on X

Qwen

Alibaba releases QvQ-72B-Preview, an experimental research model focused on “enhancing visual reasoning capabilities”, built on Qwen2-VL-72B

QVQ-72B-Preview is an experimental research model developed by the Qwen team … QwenLM on GitHub : Qwen2-VL — Introduction After a year's relentless efforts, today we are thrilled...

View original

Google drops Gemini 2.0 Flash Thinking, a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more. Now available in anychat, try it out:

2024-12-20 View on X

TechCrunch

Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning

Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Fl...

View original

Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second [image]

2024-10-06 View on X

VentureBeat

Apple's AI research team releases Depth Pro, an AI model that can generate a 2.25-megapixel 3D depth map in 0.3 seconds on a standard GPU, using a single image

Michael Nuñez / VentureBeat :

View original

Depth Pro app is up: https://huggingface.co/... [image]

2024-10-06 View on X

VentureBeat

Apple's AI research team releases Depth Pro, an AI model that can generate a 2.25-megapixel 3D depth map in 0.3 seconds on a standard GPU, using a single image

Michael Nuñez / VentureBeat :

View original

Apple just released Depth Pro: Sharp Monocular Metric Depth in Less Than a Second [image]

2024-10-05 View on X

VentureBeat

Apple's AI research team releases Depth Pro, an AI model that can generate a 2.25-megapixel depth map in 0.3 seconds on a standard GPU, using a single image

Apple's AI research team has developed a new model that could significantly advance how machines perceive depth …

View original

Depth Pro app is up: https://huggingface.co/... [image]

2024-10-05 View on X

VentureBeat

Apple's AI research team releases Depth Pro, an AI model that can generate a 2.25-megapixel depth map in 0.3 seconds on a standard GPU, using a single image

Apple's AI research team has developed a new model that could significantly advance how machines perceive depth …

View original

Meta announces Movie Gen: A Cast of Media Foundation Models. We present Movie Gen, a cast of foundation models that generates high-quality, 1080p HD videos with different aspect ratios and synchronized audio. We also show additional capabilities such as precise instruction-based [video]

2024-10-05 View on X

Wired

Meta announces Movie Gen, a suite of AI models for generating realistic video and audio clips; Movie Gen Video has 30B parameters and Movie Gen Audio has 13B

The next frontier in generative AI is video—and with Movie Gen, Meta has now staked its claim.

View original

Meta announces Movie Gen: A Cast of Media Foundation Models. We present Movie Gen, a cast of foundation models that generates high-quality, 1080p HD videos with different aspect ratios and synchronized audio. We also show additional capabilities such as precise instruction-based [video]

2024-10-04 View on X

Wired

Meta announces Movie Gen, a suite of AI models for generating realistic video and audio clips; Movie Gen Video has 30B parameters and Movie Gen Audio has 13B

The next frontier in generative AI is video—and with Movie Gen, Meta has now staked its claim.

View original