not much happened today

📝 摘要

**nvidia** released **nemotron 3 ultra**, a fully open **550b moe** model with **55b active parameters** and **1m context**, optimized for long-running agent tasks with up to **5x speedup** and **30% cost reduction**. it features hybrid mamba/attention, latentmoe, native mtp, and was pretrained on **20t tokens** using nvfp4 low-precision format. benchmarks show strong performance with **47.7 intelligence index** and **400+ output tokens/sec**. the model is supported across major serving platforms. additionally, **nemotron 3.5 asr** is an open streaming asr model with **0.6b parameters**, supporting **40 language-locale combinations** and sub-100ms latency, designed for voice agents. **anthropic** highlighted early signs of recursive self-improvement (rsi) in ai, with **claude** models authoring **80%+ of merged code** and engineers shipping **8x more code**. claude opus 4 achieved **3x speedup** on training scripts, while mythos preview reached **~52x speedup** and provided better research suggestions than humans **64% of the time**.

✍️ 编辑摘要

这条资讯的核心议题是“not much happened today”。

从当前聚合摘要看，最值得先关注的是：**nvidia** released **nemotron 3 ultra**, a fully open **550b moe** model with **55b active parameters** and **1m context**, optimized for long-running agent tasks with up to **5x speedup** and **30% cost reduction**. it features hybrid mamba/attention, latentmoe, native mtp, and was pretrained on **20t tokens** using nvfp4 low-precision format. benchmarks show strong performance with **47.7 intelligence index** and **400+ output tokens/sec**. the model is supported across major serving platforms. additionally, **nemotron 3.5 asr** is an open streaming asr model with **0.6b parameters**, supporting **40 language-locale combinations** and sub-100ms latency, designed for voice agents. **anthropic** highlighted early signs of recursive self-improvement (rsi) in ai, with **claude** models authoring **80%+ of merged code** and engineers shipping **8x more code**. claude opus 4 achieved **3x speedup** on training scripts, while mythos preview reached **~52x speedup** and provided better research suggestions than humans **64% of the time**.。

如果你只看一遍，这条新闻与后续判断最相关的点是：涉及模型：nemotron-3-ultra、nemotron-3.5-asr、claude-opus-4，适合跟踪模型能力、价格或产品策略变化。

📌 关键信息

**nvidia** released **nemotron 3 ultra**, a fully open **550b moe** model with **55b active parameters** and **1m context**, optimized for long-running agent tasks with up to **5x speedup** and **30% cost reduction**. it features hybrid mamba/attention, latentmoe, native mtp, and was pretrained on **20t tokens** using nvfp4 low-precision format. benchmarks show strong performance with **47.7 intelligence index** and **400+ output tokens/sec**. the model is supported across major serving platforms. additionally, **nemotron 3.5 asr** is an open streaming asr model with **0.6b parameters**, supporting **40 language-locale combinations** and sub-100ms latency, designed for voice agents. **anthropic** highlighted early signs of recursive self-improvement (rsi) in ai, with **claude** models authoring **80%+ of merged code** and engineers shipping **8x more code**. claude opus 4 achieved **3x speedup** on training scripts, while mythos preview reached **~52x speedup** and provided better research suggestions than humans **64% of the time**.

🧭 为什么值得关注

涉及模型：nemotron-3-ultra、nemotron-3.5-asr、claude-opus-4，适合跟踪模型能力、价格或产品策略变化。
涉及公司：nvidia、anthropic、togethercompute，这通常意味着行业竞争、合作或商业化动作值得继续观察。
关联标签：mixture-of-experts、long-context、model-quantization、agentic-ai，可用于继续追踪同主题后续报道。

查看首个原始来源 →

🗂 主题卡片

涉及模型

nemotron-3-ultra nemotron-3.5-asr claude-opus-4 mythos-preview

涉及公司

nvidia anthropic togethercompute baseten modal vllm_project fireworksai_hq ollama wandb cline primeintellect nousresearch

关联标签

mixture-of-experts long-context model-quantization agentic-ai streaming-speech asr low-precision-training benchmarking recursive-self-improvement code-generation model-speedup

← 查看全部资讯 →

📝 摘要

✍️ 编辑摘要

📌 关键信息

🧭 为什么值得关注

🗂 主题卡片

📌 更多资讯