AI 速递 2026-06-10

生成时间：2026/6/10 10:08:06（UTC: 2026-06-10T02:08:06.131Z）

论文精选

Agents’ Last Exam

👍 83 · arXiv

Recent AI systems have achieved strong results on a wide range of benchmarks, yet these gains have not translated into economically meaningful deployment across many professional domains. We argue tha…

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

👍 51 · arXiv

Agent systems increasingly use textual skills to encode reusable task procedures, but injecting these skills into the prompt at every step incurs substantial context overhead and exposes skill content…

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

👍 16 · arXiv

Vision-language model (VLM) agents are increasingly deployed in interactive game environments. Yet game benchmarks for VLM agents typically report a single first-attempt score per (agent, game) pair, …

A Geometric Account of Activation Steering through Angle-Norm Decomposition

👍 15 · arXiv

Linear activation steering has gained popularity as a simple and empirically effective way to control language model behavior. More recently, spherical steering paradigms have been proposed to address…

SwiftVR: Real-Time One-Step Generative Video Restoration

👍 12 · arXiv

Real-time video restoration (VR) for live streams requires high-resolution outputs under strict per-frame latency constraints. Existing one-step diffusion-based VR models remain difficult to deploy on…

版本更新

OpenClaw v2026.6.5

Highlights

QQBot now strips model reasoning/thinking scaffolding before native delivery, preventing raw <thinking> content from leaking into channel replies. (#89913, #90132) Thanks @openper…

链接：https://github.com/openclaw/openclaw/releases/tag/v2026.6.5

LangChain langchain==1.3.6

Changes since langchain==1.3.5

release(langchain): 1.3.6 (#38001) fix(langchain): preserve summarization trigger compatibility (#38000)…

链接：https://github.com/langchain-ai/langchain/releases/tag/langchain%3D%3D1.3.6

CrewAI 1.14.7a4

What’s Changed

Features

Migrate @listen/@router runtime to read from FlowDefinition
Add pluggable default backends for memory, knowledge, rag, and flow

Documentation

Update changelo…

链接：https://github.com/crewAIInc/crewAI/releases/tag/1.14.7a4

开发者工具

OpenAI Codex CLI rust-v0.139.0

New Features

Code mode can now call standalone web search directly, including from nested JavaScript tool calls, and receive plaintext search results. (#26719)
Tool and connector input schemas …

链接：https://github.com/openai/codex/releases/tag/rust-v0.139.0