/
Navigation
C
Chronicles
Browse all articles
C
E
Explore
Semantic exploration
E
R
Research
Entity momentum
R
N
Nexus
Correlations & relationships
N
~
Story Arc
Topic evolution
S
Drift Map
Semantic trajectory animation
D
P
Posts
Analysis & commentary
P
Browse
@
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
?
Concept Search
Semantic similarity search
!
High Impact Stories
Top coverage by position
+
Sentiment Analysis
Positive/negative coverage
*
Anomaly Detection
Unusual coverage patterns
Analysis
vs
Rivalry Report
Compare two entities head-to-head
/\
Semantic Pivots
Narrative discontinuities
!!
Crisis Response
Event recovery patterns
Connected
Nav: C E R N
Search: /
Command: ⌘K
Embeddings: large
VOICE ARCHIVE

@deepseek_ai

@deepseek_ai
20 posts
2025-12-01
🏆 World-Leading Reasoning 🔹 V3.2: Balanced inference vs. length. Your daily driver at GPT-5 level performance. 🔹 V3.2-Speciale: Maxed-out reasoning capabilities. Rivals Gemini-3.0-Pro. 🥇 Gold-Medal Performance: V3.2-Speciale attains gold-level results in IMO, CMO, ICPC World [image]
2025-12-01 View on X
Bloomberg

DeepSeek releases DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which it calls “reasoning-first models built for agents”, after releasing V3.2-Exp in September

China's DeepSeek unveiled two new versions of an experimental artificial-intelligence model it released weeks ago …

🤖 Thinking in Tool-Use 🔹 Introduces a new massive agent training data synthesis method covering 1,800+ environments & 85k+ complex instructions. 🔹 DeepSeek-V3.2 is our first model to integrate thinking directly into tool-use, and also supports tool-use in both thinking and [image]
2025-12-01 View on X
Bloomberg

DeepSeek releases DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which it calls “reasoning-first models built for agents”, after releasing V3.2-Exp in September

China's DeepSeek unveiled two new versions of an experimental artificial-intelligence model it released weeks ago …

2025-09-29
🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model! ✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context. 👉 Now live on App, Web, and API. 💰 API prices cut by 50%+! 1/n
2025-09-29 View on X
Bloomberg

DeepSeek releases DeepSeek-V3.2-Exp, saying it built the model using a new technique called DeepSeek Sparse Attention, and halves the pricing of its tools

DeepSeek updated an experimental AI model Monday in what it called a step toward next-generation artificial intelligence.

⚡️ Efficiency Gains 🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost. 📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus. 2/n [image]
2025-09-29 View on X
Bloomberg

DeepSeek releases DeepSeek-V3.2-Exp, saying it built the model using a new technique called DeepSeek Sparse Attention, and halves the pricing of its tools

DeepSeek updated an experimental AI model Monday in what it called a step toward next-generation artificial intelligence.

2025-08-22
Model Update 🤖 🔹 V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3 🔹 Tokenizer & chat template updated — new tokenizer config: https://huggingface.co/... 🔗 V3.1 Base Open-source weights: https://huggingface.co/... 🔗 V3.1 Open-source weights:
2025-08-22 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

Introducing DeepSeek-V3.1: our first step toward the agent era!  🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging F...

API Update ⚙️ 🔹 deepseek-chat → non-thinking mode 🔹 deepseek-reasoner → thinking mode 🧵 128K context for both 🔌 Anthropic API format supported: https://api-docs.deepseek.com/guides/ anthropic_api ✅ Strict Function Calling supported in Beta API: https://api-docs.deepseek.com/guides/ function_calling 🚀 More API resources, smoother API experience
2025-08-22 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

Introducing DeepSeek-V3.1: our first step toward the agent era!  🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging F...

Tools & Agents Upgrades 🧰 📈 Better results on SWE / Terminal-Bench 🔍 Stronger multi-step reasoning for complex search tasks ⚡️ Big gains in thinking efficiency 3/5 [image]
2025-08-22 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

Introducing DeepSeek-V3.1: our first step toward the agent era!  🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging F...

Pricing Changes 💳 🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time) 🔹 Until then, APIs follow current pricing 📝 Pricing page: https://api-docs.deepseek.com/ ... 5/5 [image]
2025-08-22 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

Introducing DeepSeek-V3.1: our first step toward the agent era!  🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging F...

Introducing DeepSeek-V3.1: our first step toward the agent era!  🚀 🧠 Hybrid inference: Think & Non-Think — one model, two modes ⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 🛠️ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks
2025-08-22 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

Introducing DeepSeek-V3.1: our first step toward the agent era!  🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging F...

2025-08-21
API Update ⚙️ 🔹 deepseek-chat → non-thinking mode 🔹 deepseek-reasoner → thinking mode 🧵 128K context for both 🔌 Anthropic API format supported: https://api-docs.deepseek.com/ ... ✅ Strict Function Calling supported in Beta API: https://api-docs.deepseek.com/ ... 🚀 More API resources, smoother
2025-08-21 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …

Pricing Changes 💳 🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time) 🔹 Until then, APIs follow current pricing 📝 Pricing page: https://api-docs.deepseek.com/ ... 5/5 [image]
2025-08-21 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …

Model Update 🤖 🔹 V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3 🔹 Tokenizer & chat template updated — new tokenizer config: https://huggingface.co/... 🔗 V3.1 Base Open-source weights: https://huggingface.co/... 🔗 V3.1 Open-source weights:
2025-08-21 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …

Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 🧠 Hybrid inference: Think & Non-Think — one model, two modes ⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 🛠️ Stronger agent skills: Post-training boosts tool use and
2025-08-21 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …

Tools & Agents Upgrades 🧰 📈 Better results on SWE / Terminal-Bench 🔍 Stronger multi-step reasoning for complex search tasks ⚡️ Big gains in thinking efficiency 3/5 [image]
2025-08-21 View on X
Bloomberg

DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19

DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup …

2025-05-29
🚀 DeepSeek-R1-0528 is here! 🔹 Improved benchmark performance 🔹 Enhanced front-end capabilities 🔹 Reduced hallucinations 🔹 Supports JSON output & function calling ✅ Try it now: https://chat.deepseek.com/ 🔌 No change to API usage — docs here: https://api-docs.deepseek.com/ ... 🔗 [image]
2025-05-29 View on X
Bloomberg

DeepSeek says its R1 update can perform mathematics, programming, and general logic better than the previous version, and comes close to o3 and Gemini 2.5 Pro

The Chinese startup DeepSeek said Thursday that its upgraded artificial-intelligence model can perform mathematics, programming …

2025-03-02
🚀 Day 6 of #OpenSourceWeek: One More Thing - DeepSeek-V3/R1 Inference System Overview Optimized throughput and latency via: 🔧 Cross-node EP-powered batch scaling 🔄 Computation-communication overlap ⚖️ Load balancing Statistics of DeepSeek's Online Service: ⚡ 73.7k/14.8k
2025-03-02 View on X
Bloomberg

DeepSeek says its V3 and R1 models' cost of inferencing relative to sales during a 24-hour-period on February 28 put “theoretical” profit margins at 545%

Chinese artificial intelligence phenomenon DeepSeek revealed some financial numbers on Saturday, saying its “theoretical” …

2025-03-01
🚀 Day 6 of #OpenSourceWeek: One More Thing - DeepSeek-V3/R1 Inference System Overview Optimized throughput and latency via: 🔧 Cross-node EP-powered batch scaling 🔄 Computation-communication overlap ⚖️ Load balancing Statistics of DeepSeek's Online Service: ⚡ 73.7k/14.8k
2025-03-01 View on X
Bloomberg

DeepSeek says its V3 and R1 models' cost of inferencing relative to sales during a 24-hour-period on February 28 put “theoretical” profit margins at 545%

Chinese artificial intelligence phenomenon DeepSeek revealed some financial numbers on Saturday, saying its “theoretical” …

2025-02-21
🚀 Day 0: Warming up for #OpenSourceWeek! We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency. These humble building blocks in our online service have been documented,
2025-02-21 View on X
Bloomberg

DeepSeek plans to open source five of its code repositories next week, letting anyone download, build on, or improve the code behind its well-regarded AI models

- The startup will begin making technology available next week  — DeepSeek is pushing harder on its open-source approach

2025-01-20
🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at https://chat.deepseek.com/ today! 🐋 1/n [image]
2025-01-20 View on X
Politico

CoinGecko: Trump's memecoin, with a $9B+ market cap, is still in the top 25 most valuable cryptocurrencies; some in crypto call Official Trump a “horrible look”

Trump's memecoin has taken off since debuting late Friday night.  Not everyone in crypto is thrilled.

2024-12-27
🚀 Introducing DeepSeek-V3! Biggest leap forward yet: ⚡ 60 tokens/second (3x faster than V2!) 💪 Enhanced capabilities 🛠 API compatibility intact 🌍 Fully open-source models & papers 🐋 1/n [image]
2024-12-27 View on X
VentureBeat

DeepSeek releases DeepSeek-V3, an open-source MoE model of 671B total parameters, with 37B activated per token, claiming it outperforms top models like GPT-4o

Chinese AI startup DeepSeek, known for challenging leading AI vendors with its innovative open-source technologies, today released a new ultra-large model: DeepSeek-V3.