Skip to content

AI 速递 2026-06-03

生成时间:2026/6/3 10:37:21(UTC: 2026-06-03T02:37:21.585Z)

A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks

Section titled “A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks”

👍 55 · arXiv

As agent capabilities advance, existing benchmarks, such as τ^2-Bench, are becoming increasingly saturated. Yet constructing new benchmark tasks remains complex, costly, and labor-intensive. Moreover,…

Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding

Section titled “Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding”

👍 32 · arXiv

Speculative decoding accelerates LLM inference by drafting multiple tokens and verifying them in parallel with the target model. However, its practical speedup is constrained by the trade-off between …

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Section titled “Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses”

👍 31 · arXiv

Search agents are often trained as policies over growing transcripts: the model must decide how to search while also remembering what it has seen, which evidence is useful, which constraints remain op…

Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs

Section titled “Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs”

👍 25 · arXiv

Watermarking embeds statistical signatures in AI-generated text for detection and attribution. We reveal a fundamental vulnerability: when users access multiple models (today’s reality), watermarks tr…

LVSA: Training-Free Sparse Attention for Long Video Diffusion

Section titled “LVSA: Training-Free Sparse Attention for Long Video Diffusion”

👍 12 · arXiv

Dense self-attention is the compute and quality bottleneck of long-video diffusion inference: cost grows quadratically with the sequence length, and beyond the training horizon the model converges to …

  • Agents and CLI-backed runtimes recover more cleanly from interrupted tool calls, stale session bindings, compaction handoffs, and media delivery retries. (#88129, #88136, #88141, #8…

链接https://github.com/openclaw/openclaw/releases/tag/v2026.6.1-beta.2

Changes since langchain==1.3.3

release(langchain): 1.3.4 (#37861) fix(langchain): improve HITL rejection guidance (#37859)…

链接https://github.com/langchain-ai/langchain/releases/tag/langchain%3D%3D1.3.4

链接https://github.com/ollama/ollama/releases/tag/v0.30.2

Release 0.137.0-alpha.4

链接https://github.com/openai/codex/releases/tag/rust-v0.137.0-alpha.4

More than 6 out of 10 people turn to AI for psychological support

Section titled “More than 6 out of 10 people turn to AI for psychological support”

Article URL: https://www.axa.com/en/press/press-releases/2026-mind-health-report Comments URL: https://news.ycombinator.com/item?id=48377854 Points: 58

来源Hacker News AI

AI outperforms law professors in Stanford Law study

Section titled “AI outperforms law professors in Stanford Law study”

https://law.stanford.edu/wp-content/uploads/2026/06/salinas_…

Comments URL: https://news.ycombinator.com/item?id=48377761 Points: 104

来源Hacker News AI

Article URL: https://julienreszka.com/blog/rss-is-back-ai-agents-are-reading-it/ Comments URL: https://news.ycombinator.com/item?id=48375673 Points: 60

来源Hacker News AI

Uber caps employee AI spending after blowing through budget in four months

Section titled “Uber caps employee AI spending after blowing through budget in four months”

Article URL: https://techcrunch.com/2026/06/02/uber-caps-employee-ai-spending-after-blowing-through-budget-in-four-months/ Comments URL: https://news.ycombinator.com/item?id=48375544 Points: 61

来源Hacker News AI

Microsoft announces Scout, an autonomous AI agent built on OpenClaw

Section titled “Microsoft announces Scout, an autonomous AI agent built on OpenClaw”

https://www.microsoft.com/en-us/microsoft-365/blog/2026/06/0…https://www.404media.co/microsoft-wants-to-make-people-addic…https://www.wired.com/story/meet-microsoft-scout-your-ai-cow… (https://web.archive.org/web/20260602180553/https://www.wired…)

Comments URL: https://news.ycombinator.com/

来源Hacker News AI

Trump signs downsized AI order after weeks of reversals

Section titled “Trump signs downsized AI order after weeks of reversals”

https://www.whitehouse.gov/presidential-actions/2026/06/prom…https://www.nytimes.com/2026/06/02/technology/trump-executiv

Comments URL: https://news.ycombinator.com/item?id=48372628 Points: 178

来源Hacker News AI

Americans don’t know how to fight AI so they’re fighting data centers

Section titled “Americans don’t know how to fight AI so they’re fighting data centers”

Article URL: https://www.vox.com/future-perfect/490350/data-center-moratoria-ai-backlash Comments URL: https://news.ycombinator.com/item?id=48371592 Points: 114

来源Hacker News AI

Article URL: https://www.wheresyoured.at/ai-doesnt-have-roi/ Comments URL: https://news.ycombinator.com/item?id=48370437 Points: 58

来源Hacker News AI