BeClaude
Industry2026-05-07

ZAYA1-8B: An 8B Moe Model with 760M Active Params Matching DeepSeek-R1 on Math

Source: Hacker News

hacker-newsdeepseek