AI Express 2026-06-15
Generated: 2026/6/15 10:34:41 (UTC: 2026-06-15T02:34:41.644Z)
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments
👍 129 · arXiv
Large language model (LLM) agents have achieved strong performance on a wide range of benchmarks, yet most evaluations assume static environments. In contrast, real-world deployment is inherently dyna…
MiniMax Sparse Attention
👍 120 · arXiv
Ultra-long-context capability is becoming indispensable for frontier LLMs: agentic workflows, repository-scale code reasoning, and persistent memory all require the model to jointly attend over hundre…
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces
👍 97 · arXiv
Computer-use agents (CUAs) increasingly operate in runtimes that combine visual desktop control, command-line execution, code editing, browsers, and external tools. Existing benchmarks, however, often…
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning
👍 87 · arXiv
Spatial reasoning, the ability to determine where objects are, how they relate, and how they move in 3D, remains a fundamental challenge for vision-language models (VLMs). Tool-augmented agents attemp…
InterleaveThinker: Reinforcing Agentic Interleaved Generation
👍 77 · arXiv
Recent image generators have demonstrated impressive photorealism and instruction-following capabilities in single-image generation and editing. However, constrained by their architectures, they canno…
OpenClaw v2026.6.8-beta.1
2026.6.8
Highlights
- Telegram and WhatsApp channel delivery are richer and less brittle: Telegram can send structured rich text with tables, lists, expandable blockquotes, prompt-preserving …
Link: https://github.com/openclaw/openclaw/releases/tag/v2026.6.8-beta.1
LangChain langchain-openai==1.3.2
Changes since langchain-openai==1.3.1
release(openai): 1.3.2 (#38130)…
Link: https://github.com/langchain-ai/langchain/releases/tag/langchain-openai%3D%3D1.3.2
OpenAI Codex CLI rust-v0.140.0-alpha.19
Release 0.140.0-alpha.19
…
Link: https://github.com/openai/codex/releases/tag/rust-v0.140.0-alpha.19
As AI companies race to go public, who else is along for the ride?
Startups are trying to “ride that SpaceX IPO wave.”
As Anthropic suspends access to new models, India debates its AI future
Tech leaders debate whether the Anthropic episode is a wake-up call for India’s AI ambitions.
Meta reportedly moves to unwind $2B Manus deal after Beijing’s demand
Meta starts dismantling its $2 billion Manus acquisition after Beijing ordered the deal reversed.
KPMG pulls report on AI usage due to apparent hallucinations
Once again, AI proves to be an unreliable source of information about AI.
Amazon CEO reportedly raised Anthropic model concerns before government crackdown
Amazon CEO Andy Jassy may have been the source of security concerns that led Anthropic to cut off worldwide access to two models on Friday.
OpenAI faces investigation from state attorneys general
It’s not clear which states are involved, but they’re asking about everything from OpenAI’s ad policies to its handling of health data.
Andrew Yang thinks the next big startup opportunity is lowering the cost of living
Andrew Yang made a list of everything Americans overpay for (housing, food, wireless) and thinks the next startup gold rush is giving that money back.
Anthropic’s safety warnings may have just backfired — the government has pulled the plug on its most powerful AI
Anthropic isn’t hiding its frustration. “We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people,” the company wrote in a blog post.