2025-10-21
The Decoder
12 related
DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
the new frontier of OCR from @deepseek_ai, exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G), powered by vllm==0.8.5 for day-0 model su...
2024-12-07
OpenAI
9 related
OpenAI expands its Reinforcement Fine-Tuning Research Program to let developers create expert models in specific domains with very little training data
the repo we used to train Tulu 3. Expanding reinforcement learning with verifiable rewards (RLVR) to more domains and with better answer extraction (what OpenAI calls a grader, a Kevin Weil / ...
2024-05-30
TechCrunch
14 related
Mistral AI releases 22B-parameter Codestral, its first generative AI model for coding, trained on 80+ programming languages and prohibited for commercial use
Deepti Pathak / Fossbytes: Mistral AI Launches Codestral: AI Code Generation Across 80 Programming Languages Luke Jones / WinB...