AI 速递 2026-06-03
生成时间:2026/6/3 10:37:21(UTC: 2026-06-03T02:37:21.585Z)
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks
Section titled “A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks”👍 55 · arXiv
As agent capabilities advance, existing benchmarks, such as τ^2-Bench, are becoming increasingly saturated. Yet constructing new benchmark tasks remains complex, costly, and labor-intensive. Moreover,…
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding
Section titled “Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding”👍 32 · arXiv
Speculative decoding accelerates LLM inference by drafting multiple tokens and verifying them in parallel with the target model. However, its practical speedup is constrained by the trade-off between …
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses
Section titled “Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses”👍 31 · arXiv
Search agents are often trained as policies over growing transcripts: the model must decide how to search while also remembering what it has seen, which evidence is useful, which constraints remain op…
Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs
Section titled “Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs”👍 25 · arXiv
Watermarking embeds statistical signatures in AI-generated text for detection and attribution. We reveal a fundamental vulnerability: when users access multiple models (today’s reality), watermarks tr…
LVSA: Training-Free Sparse Attention for Long Video Diffusion
Section titled “LVSA: Training-Free Sparse Attention for Long Video Diffusion”👍 12 · arXiv
Dense self-attention is the compute and quality bottleneck of long-video diffusion inference: cost grows quadratically with the sequence length, and beyond the training horizon the model converges to …
OpenClaw v2026.6.1-beta.2
Section titled “OpenClaw v2026.6.1-beta.2”Highlights
Section titled “Highlights”- Agents and CLI-backed runtimes recover more cleanly from interrupted tool calls, stale session bindings, compaction handoffs, and media delivery retries. (#88129, #88136, #88141, #8…
链接:https://github.com/openclaw/openclaw/releases/tag/v2026.6.1-beta.2
LangChain langchain==1.3.4
Section titled “LangChain langchain==1.3.4”Changes since langchain==1.3.3
release(langchain): 1.3.4 (#37861) fix(langchain): improve HITL rejection guidance (#37859)…
链接:https://github.com/langchain-ai/langchain/releases/tag/langchain%3D%3D1.3.4
Ollama v0.30.2
Section titled “Ollama v0.30.2”What’s Changed
Section titled “What’s Changed”- feat(launch): show and auto-install Cline CLI by @hoyyeva in https://github.com/ollama/ollama/pull/16402
- log template details to aid troubleshooting by @dhiltgen in https://git…
链接:https://github.com/ollama/ollama/releases/tag/v0.30.2
OpenAI Codex CLI rust-v0.137.0-alpha.4
Section titled “OpenAI Codex CLI rust-v0.137.0-alpha.4”Release 0.137.0-alpha.4
…
链接:https://github.com/openai/codex/releases/tag/rust-v0.137.0-alpha.4
More than 6 out of 10 people turn to AI for psychological support
Section titled “More than 6 out of 10 people turn to AI for psychological support”Article URL: https://www.axa.com/en/press/press-releases/2026-mind-health-report Comments URL: https://news.ycombinator.com/item?id=48377854 Points: 58
Comments: 47
Section titled “Comments: 47”AI outperforms law professors in Stanford Law study
Section titled “AI outperforms law professors in Stanford Law study”https://law.stanford.edu/wp-content/uploads/2026/06/salinas_…
Comments URL: https://news.ycombinator.com/item?id=48377761 Points: 104
Comments: 102
Section titled “Comments: 102”Now AI agents need what RSS does
Section titled “Now AI agents need what RSS does”Article URL: https://julienreszka.com/blog/rss-is-back-ai-agents-are-reading-it/ Comments URL: https://news.ycombinator.com/item?id=48375673 Points: 60
Comments: 52
Section titled “Comments: 52”Uber caps employee AI spending after blowing through budget in four months
Section titled “Uber caps employee AI spending after blowing through budget in four months”Article URL: https://techcrunch.com/2026/06/02/uber-caps-employee-ai-spending-after-blowing-through-budget-in-four-months/ Comments URL: https://news.ycombinator.com/item?id=48375544 Points: 61
Comments: 46
Section titled “Comments: 46”Microsoft announces Scout, an autonomous AI agent built on OpenClaw
Section titled “Microsoft announces Scout, an autonomous AI agent built on OpenClaw”https://www.microsoft.com/en-us/microsoft-365/blog/2026/06/0…https://www.404media.co/microsoft-wants-to-make-people-addic…https://www.wired.com/story/meet-microsoft-scout-your-ai-cow… (https://web.archive.org/web/20260602180553/https://www.wired…)
Comments URL: https://news.ycombinator.com/
Trump signs downsized AI order after weeks of reversals
Section titled “Trump signs downsized AI order after weeks of reversals”Comments URL: https://news.ycombinator.com/item?id=48372628 Points: 178
Comments: 127
Section titled “Comments: 127”Americans don’t know how to fight AI so they’re fighting data centers
Section titled “Americans don’t know how to fight AI so they’re fighting data centers”Article URL: https://www.vox.com/future-perfect/490350/data-center-moratoria-ai-backlash Comments URL: https://news.ycombinator.com/item?id=48371592 Points: 114
Comments: 206
Section titled “Comments: 206”AI Doesn’t Have ROI
Section titled “AI Doesn’t Have ROI”Article URL: https://www.wheresyoured.at/ai-doesnt-have-roi/ Comments URL: https://news.ycombinator.com/item?id=48370437 Points: 58