AI 速递 2026-05-29
生成时间:2026/5/29 10:00:45(UTC: 2026-05-29T02:00:45.745Z)
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation
Section titled “ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation”👍 76 · arXiv
Proactive Recommender Systems (PRSs) aim to guide user preference shift toward target items by generating paths of intermediate recommendations. Reinforcement learning (RL) provides a principled frame…
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes
Section titled “DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes”👍 39 · arXiv
Reinforcement learning has become a central paradigm for advancing reasoning in large language models, yet most existing methods still depend on stronger teacher models or heavily curated difficult da…
GEM: Generative Supervision Helps Embodied Intelligence
Section titled “GEM: Generative Supervision Helps Embodied Intelligence”👍 35 · arXiv
Embodied Vision-Language Models (VLMs) have demonstrated impressive performance and generalization in robotics, particularly within Vision-Language-Action frameworks. However, a significant gap remain…
MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems
Section titled “MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems”👍 33 · arXiv
Memory is essential for enabling large language models to support long-horizon reasoning, yet existing memory systems remain unreliable and difficult to debug. Tracing memory’s dynamic evolution is cr…
ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence
Section titled “ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence”👍 29 · arXiv
Autonomous research agents produce competitive solutions and professional-looking manuscripts, yet their outputs contain verifiability failures undetectable by surface-level evaluation: fabricated cit…
OpenClaw v2026.5.27
Section titled “OpenClaw v2026.5.27”Highlights
Section titled “Highlights”- Stronger security and content boundaries: group prompt text is kept out of the system prompt, repeated-dot hostnames are normalized, side-effecting command wrappers and unsafe Node …
链接:https://github.com/openclaw/openclaw/releases/tag/v2026.5.27
LangChain langchain-anthropic==1.4.4
Section titled “LangChain langchain-anthropic==1.4.4”Changes since langchain-anthropic==1.4.3
release(anthropic): 1.4.4 (#37757) fix(anthropic): normalize cross-provider tool-call IDs (#37756) test(anthropic): retry integration tests on transient failu…
链接:https://github.com/langchain-ai/langchain/releases/tag/langchain-anthropic%3D%3D1.4.4
CrewAI 1.14.6
Section titled “CrewAI 1.14.6”What’s Changed
Section titled “What’s Changed”Features
Section titled “Features”- Enhance StdioTransport to prevent environment variable leakage
- Enhance planning configuration and observation handling
- Declare env_vars on DatabricksQueryTool
- A…
链接:https://github.com/crewAIInc/crewAI/releases/tag/1.14.6
Goose v1.36.0
Section titled “Goose v1.36.0”✨ Features
Section titled “✨ Features”链接:https://github.com/aaif-goose/goose/releases/tag/v1.36.0
OpenAI Codex CLI rust-v0.135.0
Section titled “OpenAI Codex CLI rust-v0.135.0”New Features
Section titled “New Features”codex doctornow reports richer environment, Git, terminal, app-server, and thread inventory diagnostics for support cases. (#24261, #24311, #24305)/statusshows remote connec…
链接:https://github.com/openai/codex/releases/tag/rust-v0.135.0
Glean’s top line crosses $300M as AI budget-cutting becomes its major selling point
Section titled “Glean’s top line crosses $300M as AI budget-cutting becomes its major selling point”The enterprise AI search startup tripled its annual revenue even as tech giants entered the category.
The internet is being rebuilt for machines
Section titled “The internet is being rebuilt for machines”As AI agents move from experiments to production, AWS, Cloudflare, and others are redesigning cloud infrastructure for a future dominated by machine-generated internet traffic instead of human users.
Asana acquires no-code agent-builder StackAI
Section titled “Asana acquires no-code agent-builder StackAI”Asana will incorporate StackAI into its growing suite of AI workflow tools.
Anthropic raises $65 billion, nears $1T valuation ahead of IPO
Section titled “Anthropic raises $65 billion, nears $1T valuation ahead of IPO”Anthropic has closed a $65 billion Series H round at a $965 billion post-money valuation, marking what could be the AI startup’s final private fundraise before a highly anticipated IPO.
Just like gold and oil, we’ll soon be able to trade AI token futures
Section titled “Just like gold and oil, we’ll soon be able to trade AI token futures”Large exchanges are designing derivative products around AI tokens, which are increasingly being considered less a computational output and more a raw material input, like electricity or bandwidth.
In just 3 weeks, StrictlyVC is coming to Los Angeles
Section titled “In just 3 weeks, StrictlyVC is coming to Los Angeles”StrictlyVC Los Angeles is on June 18. Join for meaningful networking and fireside chats with leaders from Mach Industries, Shinkei Systems, and more. Register today.
Anthropic releases Opus 4.8 with new ‘dynamic workflow’ tool
Section titled “Anthropic releases Opus 4.8 with new ‘dynamic workflow’ tool”The new Opus model comes with a tool called Dynamic Workflows, for coordinating swarms of subagents.
How long is Anthropic’s lease with SpaceX? Opinions vary
Section titled “How long is Anthropic’s lease with SpaceX? Opinions vary”Elon Musk is publicly reframing xAI’s massive Anthropic compute deal as short-term and cancellable, despite SpaceX’s own S-1 filing describing payments through May 2029.