Skip to content

AI 速递 2026-07-03

生成时间:2026/7/3 09:47:58(UTC: 2026-07-03T01:47:58.485Z)

Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models

Section titled “Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models”

👍 54 · arXiv

Embodied Vision-Language-Action (VLA) models are typically obtained by fine-tuning powerful pretrained VLMs on robotics data, yet it is unclear how much commonsense and factual knowledge they retain a…

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

Section titled “PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception”

👍 34 · arXiv

We introduce PerceptionRubrics, a rubric-based evaluation framework that addresses the gap between saturated benchmark scores and real-world brittleness. Shifting evaluation from holistic semantic mat…

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History

Section titled “SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History”

👍 24 · arXiv

Agent skills extend language-model agents with task-specific procedures, scripts, and references, but the tasks and environments they target continually change. Existing methods improve skills in boun…

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

Section titled “ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving”

👍 19 · arXiv

In prefill-decode (PD) disaggregated LLM serving, each request is assigned to a decode worker after prefill. Existing decode routers balance only load; for mixture-of-experts (MoE) models this is inco…

TurboServe: Serving Streaming Video Generation Efficiently and Economically

Section titled “TurboServe: Serving Streaming Video Generation Efficiently and Economically”

👍 19 · arXiv

Streaming video generation is emerging as a new serving workload in which users interact with long-lived sessions that generate video progressively, chunk by chunk. Unlike offline video generation or …

  • OpenAI GPT-5.6 support: OpenClaw now recognizes the GPT-5.6 model family across catalog, capability, and runtime selection paths. (#98333) Thanks @steipete-oai.
  • **External har…

链接https://github.com/openclaw/openclaw/releases/tag/v2026.7.1-beta.1

  • Add aiobotocore to the bedrock extra
  • Document flow agent options
  • Add text helper to flow skill example
  • Add text helper for flow CEL prompts
  • Add streaming docs…

链接https://github.com/crewAIInc/crewAI/releases/tag/1.15.2a2

  • iFlytek Spark and Astron MaaS providers (#9837)
  • --edit session flag to edit conversation before forking (#9799)
  • Fireworks AI declarative provider (#9990)
  • OpenRouter request pa…

链接https://github.com/aaif-goose/goose/releases/tag/v1.41.0

Release 0.143.0-alpha.34

链接https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.34

The short leash AI coding method for beating Fable

Section titled “The short leash AI coding method for beating Fable”

Article URL: https://blog.okturtles.org/2026/07/short-leash-ai-method/ Comments URL: https://news.ycombinator.com/item?id=48766026 Points: 63

来源Hacker News AI

Claude-real-video - any LLM can watch a video

Section titled “Claude-real-video - any LLM can watch a video”

Article URL: https://github.com/HUANGCHIHHUNGLeo/claude-real-video Comments URL: https://news.ycombinator.com/item?id=48766005 Points: 81

来源Hacker News AI

Claude’s AskUserQuestion: “No response after 60s – continued without an answer”

Section titled “Claude’s AskUserQuestion: “No response after 60s – continued without an answer””

Article URL: https://github.com/anthropics/claude-code/issues/73125 Comments URL: https://news.ycombinator.com/item?id=48765630 Points: 54

来源Hacker News AI

Weird Al Yankovic Pulled Out of AI Ad Deal: ‘I Can’t Be the Poster Boy for AI’

Section titled “Weird Al Yankovic Pulled Out of AI Ad Deal: ‘I Can’t Be the Poster Boy for AI’”

Article URL: https://variety.com/2026/biz/news/weird-al-yankovic-rejected-ai-commercial-money-offer-1236800794/ Comments URL: https://news.ycombinator.com/item?id=48764326 Points: 59

来源Hacker News AI

Article URL: https://joeyh.name/blog/entry/no_LLM_code_in_dependencies/ Comments URL: https://news.ycombinator.com/item?id=48762008 Points: 114

来源Hacker News AI

AI can’t be listed as inventor on patent applications, Japan’s top court rules

Section titled “AI can’t be listed as inventor on patent applications, Japan’s top court rules”

Article URL: https://japannews.yomiuri.co.jp/science-nature/technology/20260306-314930/ Comments URL: https://news.ycombinator.com/item?id=48761536 Points: 360

来源Hacker News AI

AI fake news complaining about how AI fake news is the death of real news

Section titled “AI fake news complaining about how AI fake news is the death of real news”

Article URL: https://www.niemanlab.org/2026/07/now-were-getting-ai-fake-news-complaining-about-how-ai-fake-news-is-the-death-of-real-news/ Comments URL: https://news.ycombinator.com/item?id=48760598 Points: 153

来源Hacker News AI

The gauge broke: devs felt 20% faster with AI, measured 19% slower (2025)

Section titled “The gauge broke: devs felt 20% faster with AI, measured 19% slower (2025)”

Article URL: https://intrepidkarthi.com/writing/the-gauge-broke/ Comments URL: https://news.ycombinator.com/item?id=48757440 Points: 76

来源Hacker News AI