Jun 25 not much happened today

🕐 2d ago 📰 1 个来源 👁 1 阅读

📝 摘要

Z.ai's GLM-5.2 leads in coding and agent benchmarks with top scores like 1595 on Code Arena: Frontend and 34.29% reasoning accuracy with zero failures. Databricks improved GLM-5.2 speed to 392 tok/s using hardware and optimizations. Ornith-1.0, a new MIT-licensed coding model family, spans 9B to 397B parameters with strong benchmark results and a self-improving RL training method. Liquid AI released a small model for low-latency robotics/e-commerce use. Google integrated computer use into Gemini 3.5 Flash with safety controls and developer tools for device control. Startups like Sail and Hyperagent focus on long-running agents with persistent execution and cost efficiency. OpenAI reports growing internal Codex use for complex, cross-functional tasks, highlighting agent skill concurrency.

✍️ 编辑摘要

这条资讯的核心议题是“Jun 25 not much happened today”。

从当前聚合摘要看，最值得先关注的是：Z.ai's GLM-5.2 leads in coding and agent benchmarks with top scores like 1595 on Code Arena: Frontend and 34.29% reasoning accuracy with zero failures. Databricks improved GLM-5.2 speed to 392 tok/s using hardware and optimizations. Ornith-1.0, a new MIT-licensed coding model family, spans 9B to 397B parameters with strong benchmark results and a self-improving RL training method. Liquid AI released a small model for low-latency robotics/e-commerce use. Google integrated computer use into Gemini 3.5 Flash with safety controls and developer tools for device control. Startups like Sail and Hyperagent focus on long-running agents with persistent execution and cost efficiency. OpenAI reports growing internal Codex use for complex, cross-functional tasks, highlighting agent skill concurrency.。

如果你只看一遍，这条新闻与后续判断最相关的点是：这条资讯围绕“Jun 25 not much happened today”展开，建议结合来源列表和相关话题继续跟踪后续进展。

📌 关键信息

Z.ai's GLM-5.2 leads in coding and agent benchmarks with top scores like 1595 on Code Arena: Frontend and 34.29% reasoning accuracy with zero failures. Databricks improved GLM-5.2 speed to 392 tok/s using hardware and optimizations. Ornith-1.0, a new MIT-licensed coding model family, spans 9B to 397B parameters with strong benchmark results and a self-improving RL training method. Liquid AI released a small model for low-latency robotics/e-commerce use. Google integrated computer use into Gemini 3.5 Flash with safety controls and developer tools for device control. Startups like Sail and Hyperagent focus on long-running agents with persistent execution and cost efficiency. OpenAI reports growing internal Codex use for complex, cross-functional tasks, highlighting agent skill concurrency.

🧭 为什么值得关注

这条资讯围绕“Jun 25 not much happened today”展开，建议结合来源列表和相关话题继续跟踪后续进展。

查看首个原始来源 →

← 查看全部资讯 →

📝 摘要

✍️ 编辑摘要

📌 关键信息

🧭 为什么值得关注

📌 更多资讯