/
Navigation
C
Chronicles
Browse all articles
C
E
Explore
Semantic exploration
E
R
Research
Entity momentum
R
N
Nexus
Correlations & relationships
N
~
Story Arc
Topic evolution
S
Drift Map
Semantic trajectory animation
D
P
Posts
Analysis & commentary
P
Browse
@
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
?
Concept Search
Semantic similarity search
!
High Impact Stories
Top coverage by position
+
Sentiment Analysis
Positive/negative coverage
*
Anomaly Detection
Unusual coverage patterns
Analysis
vs
Rivalry Report
Compare two entities head-to-head
/\
Semantic Pivots
Narrative discontinuities
!!
Crisis Response
Event recovery patterns
Connected
Nav: C E R N
Search: /
Command: ⌘K
Embeddings: large
VOICE ARCHIVE

@wordgrammer

@wordgrammer
7 posts
2025-01-27
Meta interview in 2020: Leetcode medium Meta interview in 2024: Leetcode hard Meta interview in 2025: implement a custom loss free load balancing scheduler across 16 GPU nodes, each containing a standard 8xH100 setup. Assume mixture of experts with 4 + 1 experts per node
2025-01-27 View on X
Bloomberg

DeepSeek's iOS app tops the App Store's Top Free Apps chart in the US, beating ChatGPT, stirring doubts in Silicon Valley about the strength of the US' AI lead

- App's lower-cost model upends premise for AI spending boom  — Stocks of chip gear makers ASML and Advantest plunge

Okay. Thanks for the nerd snipe guys. I spent the day learning exactly how DeepSeek trained at 1/30 the price, instead of working on my pitch deck. The tl;dr to everything, according to their papers:
2025-01-27 View on X
VentureBeat

How DeepSeek outpaced OpenAI at a fraction of the cost: open source, pure reinforcement learning, no supervised fine-tuning, and building on DeepSeek-R1-Zero

DeepSeek R1's Monday release has sent shockwaves through the AI community, disrupting assumptions about what's required to achieve cutting-edge AI performance.

Okay. Thanks for the nerd snipe guys. I spent the day learning exactly how DeepSeek trained at 1/30 the price, instead of working on my pitch deck. The tl;dr to everything, according to their papers:
2025-01-27 View on X
Bloomberg

DeepSeek's iOS app tops the App Store's Top Free Apps chart in the US, beating ChatGPT, stirring doubts in Silicon Valley about the strength of the US' AI lead

- App's lower-cost model upends premise for AI spending boom  — Stocks of chip gear makers ASML and Advantest plunge

Meta interview in 2020: Leetcode medium Meta interview in 2024: Leetcode hard Meta interview in 2025: implement a custom loss free load balancing scheduler across 16 GPU nodes, each containing a standard 8xH100 setup. Assume mixture of experts with 4 + 1 experts per node
2025-01-27 View on X
The Information

Sources: Meta set up four war rooms to analyze High-Flyer's DeepSeek, including two for how High-Flyer cut training costs and one on what data it may have used

“Wait, how much are we spending on research of little/no utility to Meta proper?”  —  “Wait, what?  How much?  How many Stanford PhDs did LeCun hire to endlessly fellate his ego?” ...

2025-01-22
The year is 2026. If you want to use AGI, your options are: - $200 per query to Stargate - $.02 per query, but the CCP must get access to your social security number
2025-01-22 View on X
TechCrunch

OpenAI, SoftBank, and Oracle unveil The Stargate Project, a JV to invest in US AI infrastructure, committing $100B now and up to $500B over the next four years

OpenAI says that it will team up with both the Japanese conglomerate SoftBank and with Oracle, along with others, to build multiple data centers for AI in the U.S.

2024-12-27
DeepSeek's new model seems to have proven this For the majority of startups. You cannot build a better datacenter than Microsoft. You cannot fundraise more than Sam or Elon. You cannot get more data than Google. But you actually can write 10x better code than any of them
2024-12-27 View on X
VentureBeat

DeepSeek releases DeepSeek-V3, an open-source MoE model of 671B total parameters, with 37B activated per token, claiming it outperforms top models like GPT-4o

Chinese AI startup DeepSeek, known for challenging leading AI vendors with its innovative open-source technologies, today released a new ultra-large model: DeepSeek-V3.

2024-12-18
You could buy 10 of these for 80 gb of vram (same as the H100), and it would only cost $2.5k (cheaper than a 4090)
2024-12-18 View on X
Tom's Hardware

Nvidia unveils Jetson Orin Nano Super Developer Kit, a $249 compact AI development board that promises 67 TOPS, compared to 40 TOPS of the $499 last-gen kit

The Jetson Orin Nano Super Developer Kit will be available later this month and comes with 8GB of RAM.