AI 速递 2026-07-03
生成时间:2026/7/3 09:47:58(UTC: 2026-07-03T01:47:58.485Z)
Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models
Section titled “Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models”👍 54 · arXiv
Embodied Vision-Language-Action (VLA) models are typically obtained by fine-tuning powerful pretrained VLMs on robotics data, yet it is unclear how much commonsense and factual knowledge they retain a…
PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception
Section titled “PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception”👍 34 · arXiv
We introduce PerceptionRubrics, a rubric-based evaluation framework that addresses the gap between saturated benchmark scores and real-world brittleness. Shifting evaluation from holistic semantic mat…
SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History
Section titled “SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History”👍 24 · arXiv
Agent skills extend language-model agents with task-specific procedures, scripts, and references, but the tasks and environments they target continually change. Existing methods improve skills in boun…
ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving
Section titled “ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving”👍 19 · arXiv
In prefill-decode (PD) disaggregated LLM serving, each request is assigned to a decode worker after prefill. Existing decode routers balance only load; for mixture-of-experts (MoE) models this is inco…
TurboServe: Serving Streaming Video Generation Efficiently and Economically
Section titled “TurboServe: Serving Streaming Video Generation Efficiently and Economically”👍 19 · arXiv
Streaming video generation is emerging as a new serving workload in which users interact with long-lived sessions that generate video progressively, chunk by chunk. Unlike offline video generation or …
OpenClaw v2026.7.1-beta.1
Section titled “OpenClaw v2026.7.1-beta.1”Highlights
Section titled “Highlights”- OpenAI GPT-5.6 support: OpenClaw now recognizes the GPT-5.6 model family across catalog, capability, and runtime selection paths. (#98333) Thanks @steipete-oai.
- **External har…
链接:https://github.com/openclaw/openclaw/releases/tag/v2026.7.1-beta.1
CrewAI 1.15.2a2
Section titled “CrewAI 1.15.2a2”What’s Changed
Section titled “What’s Changed”Features
Section titled “Features”- Add aiobotocore to the bedrock extra
- Document flow agent options
- Add text helper to flow skill example
- Add text helper for flow CEL prompts
- Add streaming docs…
链接:https://github.com/crewAIInc/crewAI/releases/tag/1.15.2a2
Goose v1.41.0
Section titled “Goose v1.41.0”✨ Features
Section titled “✨ Features”- iFlytek Spark and Astron MaaS providers (#9837)
--editsession flag to edit conversation before forking (#9799)- Fireworks AI declarative provider (#9990)
- OpenRouter request pa…
链接:https://github.com/aaif-goose/goose/releases/tag/v1.41.0
OpenAI Codex CLI rust-v0.143.0-alpha.34
Section titled “OpenAI Codex CLI rust-v0.143.0-alpha.34”Release 0.143.0-alpha.34
…
链接:https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.34
The short leash AI coding method for beating Fable
Section titled “The short leash AI coding method for beating Fable”Article URL: https://blog.okturtles.org/2026/07/short-leash-ai-method/ Comments URL: https://news.ycombinator.com/item?id=48766026 Points: 63
Comments: 63
Section titled “Comments: 63”Claude-real-video - any LLM can watch a video
Section titled “Claude-real-video - any LLM can watch a video”Article URL: https://github.com/HUANGCHIHHUNGLeo/claude-real-video Comments URL: https://news.ycombinator.com/item?id=48766005 Points: 81
Comments: 28
Section titled “Comments: 28”Claude’s AskUserQuestion: “No response after 60s – continued without an answer”
Section titled “Claude’s AskUserQuestion: “No response after 60s – continued without an answer””Article URL: https://github.com/anthropics/claude-code/issues/73125 Comments URL: https://news.ycombinator.com/item?id=48765630 Points: 54
Comments: 59
Section titled “Comments: 59”Weird Al Yankovic Pulled Out of AI Ad Deal: ‘I Can’t Be the Poster Boy for AI’
Section titled “Weird Al Yankovic Pulled Out of AI Ad Deal: ‘I Can’t Be the Poster Boy for AI’”Article URL: https://variety.com/2026/biz/news/weird-al-yankovic-rejected-ai-commercial-money-offer-1236800794/ Comments URL: https://news.ycombinator.com/item?id=48764326 Points: 59
Comments: 33
Section titled “Comments: 33”No LLM Code in Dependencies
Section titled “No LLM Code in Dependencies”Article URL: https://joeyh.name/blog/entry/no_LLM_code_in_dependencies/ Comments URL: https://news.ycombinator.com/item?id=48762008 Points: 114
Comments: 97
Section titled “Comments: 97”AI can’t be listed as inventor on patent applications, Japan’s top court rules
Section titled “AI can’t be listed as inventor on patent applications, Japan’s top court rules”Article URL: https://japannews.yomiuri.co.jp/science-nature/technology/20260306-314930/ Comments URL: https://news.ycombinator.com/item?id=48761536 Points: 360
Comments: 190
Section titled “Comments: 190”AI fake news complaining about how AI fake news is the death of real news
Section titled “AI fake news complaining about how AI fake news is the death of real news”Article URL: https://www.niemanlab.org/2026/07/now-were-getting-ai-fake-news-complaining-about-how-ai-fake-news-is-the-death-of-real-news/ Comments URL: https://news.ycombinator.com/item?id=48760598 Points: 153
Comments: 52
Section titled “Comments: 52”The gauge broke: devs felt 20% faster with AI, measured 19% slower (2025)
Section titled “The gauge broke: devs felt 20% faster with AI, measured 19% slower (2025)”Article URL: https://intrepidkarthi.com/writing/the-gauge-broke/ Comments URL: https://news.ycombinator.com/item?id=48757440 Points: 76