Weekly Report 2026-06-29 ~ 2026-07-05

Generated: 2026/7/5 12:58:57 (UTC: 2026-07-05T04:58:57.585Z)

This week's automatic summary was not enabled or the call failed; the raw content is merged below.

Generated: 2026/6/29 10:11:45 (UTC: 2026-06-29T02:11:45.664Z)

DanceOPD: On-Policy Generative Field Distillation

👍 71 · arXiv

Modern image generation demands a single model that unifies diverse capabilities, including text-to-image (T2I), local editing, and global editing. However, these capabilities are rarely naturally ali…

In-Context World Modeling for Robotic Control

👍 51 · arXiv

Modern Vision-Language-Action (VLA) models often fail to generalize to novel setups, such as altered camera viewpoints or robot morphologies, because they are typically conditioned only on current obs…

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

👍 46 · arXiv

Outcome-based reinforcement learning provides a stable optimization backbone for language agents, but its sparse trajectory-level rewards provide little guidance on which intermediate decisions should…

ShutterMuse: Capture-Time Photography Guidance with MLLMs

👍 45 · arXiv

Real-world photography requires capture-time guidance for both camera framing and subject pose. Yet existing aesthetic cropping benchmarks mainly evaluate post-hoc crop prediction and overlook subject…

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

👍 42 · arXiv

While text-to-image (T2I) models have achieved remarkable progress, they struggle with real-world requests that are often underspecified, implicit, or dependent on up-to-date knowledge. We identify th…

  • More capable channel control: Slack relay mode, native Mattermost /oc_queue, and per-DM model overrides make channel operations easier to automate and tune. (#94707, #95546, #…

Link: https://github.com/openclaw/openclaw/releases/tag/v2026.6.11-beta.2

  • Initialize Git repositories for generated projects (#6364)
  • Require explicit CrewAI project definitions (#6358)
  • Open deployment page after CLI deploy (#6343)

Link: https://github.com/crewAIInc/crewAI/releases/tag/1.15.1

Release 0.143.0-alpha.29

Link: https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.29

Ford rehires ‘gray beard’ engineers after AI falls short

“Mistakenly we thought that by just introducing artificial intelligence … that would produce a high-quality product.”

Source: TechCrunch AI

Why Wall Street thinks US memory maker Micron is the next Nvidia

Eager to find more public AI-related companies that may do as well as Nvidia, Wall Street investors think they’ve found a winner with Micron.

Source: TechCrunch AI

SoftBank’s CEO isn’t the only one with questions about Elon Musk’s orbital data center hype

Not everyone is buying Elon Musk’s vision for orbital data centers.

Source: TechCrunch AI

Apple Vision Pro exec is reportedly leaving for OpenAI

Paul Meade, the Apple vice president in charge of the Vision Pro headset, is reportedly leaving the company to join OpenAI’s hardware team.

Source: TechCrunch AI

The fittest founder in the room got cancer. Here’s how he used AI to fight back.

When confronted with cancer, Connor Christou fed everything tied to his regime (blood results, scan data, wearable output, journal entries) into Claude.

Source: TechCrunch AI

Asian AI startups launch Mythos-like models as Anthropic’s export ban drags on

New models are launching in Asia that promise Mythos-like capabilities without fear of an export ban. U.S. AI labs may never recover this enormous market.

Source: TechCrunch AI

Trump Admin releases Anthropic Mythos to be used by more than 100 US companies, agencies

Over 100 companies and government agencies are reportedly authorized to use Mythos 5, including their non-American employees.

Source: TechCrunch AI

OpenAI limits GPT-5.6 rollout after government request, says restrictions shouldn’t be the norm

“We don’t believe this kind of government access process should become the long-term default,” says OpenAI. “It keeps the best tools from users, developers, enterprises, cyber defenders, and global partners who need them.”

Source: TechCrunch AI


Generated: 2026/6/30 10:06:48 (UTC: 2026-06-30T02:06:48.813Z)

In-Context World Modeling for Robotic Control

👍 57 · arXiv

Modern Vision-Language-Action (VLA) models often fail to generalize to novel setups, such as altered camera viewpoints or robot morphologies, because they are typically conditioned only on current obs…

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

👍 49 · arXiv

Outcome-based reinforcement learning provides a stable optimization backbone for language agents, but its sparse trajectory-level rewards provide little guidance on which intermediate decisions should…

Formalizing Latent Thoughts: Four Axioms of Thought Representation in LLMs

👍 43 · arXiv

We introduce an axiomatic evaluation framework for latent thought representations in LLMs, comprising metrics that are independent of downstream benchmark scores and reveal representational failures t…

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

👍 43 · arXiv

A classical intuition holds that verifying a solution is easier than producing one. For today’s coding agents, this intuition is being inverted: as foundation models develop stronger reasoning capabil…

PhysisForcing: Physics Reinforced World Simulator for Robotic Manipulation

👍 41 · arXiv

Video generation models have emerged as a promising paradigm for embodied world simulation. However, both general-domain video generators and robot-specific data fine-tuned models can still produce ph…

  • More capable channel control: Slack relay mode, native Mattermost /oc_queue, and per-DM model overrides make channel operations easier to automate and tune. (#947…

Link: https://github.com/openclaw/openclaw/releases/tag/v2026.6.11-beta.2

Changes since langchain-openrouter==0.2.4

release(openrouter): 0.2.5 (#38553) fix(openrouter): deduplicate repeated finish metadata (#38552) fix(openrouter): strip Responses reasoning IDs (#38383)…

Link: https://github.com/langchain-ai/langchain/releases/tag/langchain-openrouter%3D%3D0.2.5

This release features 571 commits from 256 contributors (77 new)!

  • MiniMax-M3: Added support for the new MiniMax-M3 model (#45381), with a …

Link: https://github.com/vllm-project/vllm/releases/tag/v0.24.0

Link: https://github.com/ollama/ollama/releases/tag/v0.30.12-rc0

  • No user-facing changes were identified for this release.

Full Changelog: https://github.com/openai/codex/compare/rust-v0.142.3...rust-v0.142.4

Link: https://github.com/openai/codex/releases/tag/rust-v0.142.4

Micro-Agent: Beat Frontier Models with Collaboration Inside Model API

Article URL: https://vllm.ai/blog/2026-06-29-micro-agent-frontier-models Comments URL: https://news.ycombinator.com/item?id=48722802 Points: 53

Source: Hacker News AI

Article URL: https://htmx.org/essays/working-with-ai/ Comments URL: https://news.ycombinator.com/item?id=48720064 Points: 90

Source: Hacker News AI

Article URL: https://tidal.com/ai-policy Comments URL: https://news.ycombinator.com/item?id=48718840 Points: 292

Source: Hacker News AI

Anthropic CEO: Open-Source AI is getting dangerous (2023)

Article URL: https://xcancel.com/coinbureau/status/2071330294452666695 Comments URL: https://news.ycombinator.com/item?id=48716750 Points: 51

Source: Hacker News AI

Herdr: Agent multiplexer that lives in your terminal

Article URL: https://github.com/ogulcancelik/herdr Comments URL: https://news.ycombinator.com/item?id=48714802 Points: 145

Source: Hacker News AI

AI boom risks global financial crash, warn central bankers

Article URL: https://www.telegraph.co.uk/business/2026/06/28/ai-boom-risks-global-financial-crash-central-bankers-warn/ Comments URL: https://news.ycombinator.com/item?id=48713697 Points: 155

Source: Hacker News AI

Article URL: https://betterimagesofai.org/ Comments URL: https://news.ycombinator.com/item?id=48713051 Points: 55

Source: Hacker News AI

We need tech news sources which exclude AI

It's now clear that we need to preserve tech press for non-AI-related things. Techmeme, for example, is now completely overrun with AI stories. HN is getting closer to that every day. If AI kickback deals, phony new model ratings, high RAM prices and your surprise at how you think you coded something wit

Source: Hacker News AI


Generated: 2026/7/1 10:11:31 (UTC: 2026-07-01T02:11:31.610Z)

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

👍 121 · arXiv

LLM agents are expected to act over multiple turns, using search, browsing interfaces, and terminal tools to complete user goals. Yet not every goal is well specified or achievable in the available en…

LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing

👍 72 · arXiv

Streaming video editing has made rapid progress, yet practical deployment is still limited by two core issues: maintaining stable backgrounds and non-edited regions over time, and achieving the low la…

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

👍 67 · arXiv

We introduce Agents-A1, a 35B Mixture-of-Experts Agentic Model that reaches trillion-parameter-level performance by scaling the agent horizon. We investigate agent-horizon scaling from two perspective…

TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

👍 44 · arXiv

As large language models and harness frameworks continue to advance, agents operating in terminals are increasingly capable of performing a broader range of general computer-use tasks beyond coding. H…

ReFreeKV: Towards Threshold-Free KV Cache Compression

👍 43 · arXiv

To reduce memory consumption during LLM inference, a handful of methods have been proposed for KV cache pruning. While these techniques can accomplish lossless memory reduction on many datasets, they …

We heard the feedback. v2026.6.11 focuses on the rough edges that make OpenClaw feel less dependable, with fixes for misplaced replies, stuck sends, reconnects, model setup failures, and safer admin d…

Link: https://github.com/openclaw/openclaw/releases/tag/v2026.6.11

Changes since langchain-openrouter==0.2.4

release(openrouter): 0.2.5 (#38553) fix(openrouter): deduplicate repeated finish metadata (#38552) fix(openrouter): strip Responses reasoning IDs (#38383)…

Link: https://github.com/langchain-ai/langchain/releases/tag/langchain-openrouter%3D%3D0.2.5

This release features 571 commits from 256 contributors (77 new)!

  • MiniMax-M3: Added support for the new MiniMax-M3 model (#45381), with a …

Link: https://github.com/vllm-project/vllm/releases/tag/v0.24.0

Link: https://github.com/ollama/ollama/releases/tag/v0.31.1

  • Repoint template commands to crewAIInc-fde org
  • Support inline skill definitions
  • Define stream frame protocol for flows
  • Add type tool and app in CrewDefinition -…

Link: https://github.com/crewAIInc/crewAI/releases/tag/1.15.2a1

  • Prevented full Responses WebSocket request payloads from being written to trace logs. (#30771)

Full Changelog: https://github.com/openai/codex/compare/rust-v0.142.4…ru

Link: https://github.com/openai/codex/releases/tag/rust-v0.142.5

Wayve launches $85M employee tender offer at $8.5B valuation

Wayve’s offering is part of a growing trend of AI startups using employee tenders as a strategic tool to attract and retain talent.

Source: TechCrunch AI

OpenClaw is finally available on Android and iOS

The free open source agentic program is finally invading your phone.

Source: TechCrunch AI

The DeepMind trio who built a poker AI are now making money for quant hedge funds

EquiLibre Technologies, a Prague-based AI lab founded by three ex-DeepMind researchers, is now valued at more than $500 million.

Source: TechCrunch AI

Google introduces a faster, cheaper image generator with Nano Banana 2 Lite

Google is updating its image generator to make it faster and cheaper, making it a more useful tool for creators looking to make AI content.

Source: TechCrunch AI

Nvidia competitor Etched hits $5B valuation, $1B in sales for AI chip

Nvidia AI chip competitor Etched says it has already booked $1 billion under contract for the inference systems powered by its chip.

Source: TechCrunch AI

Anthropic launches Claude Sonnet 5 as a cheaper way to run agents

Anthropic’s Claude Sonnet 5 brings stronger agentic capabilities, lower pricing, and improved safety, positioning the model as a cheaper alternative to Opus, GPT-5.5, and Gemini Pro.

Source: TechCrunch AI

Acti puts AI agents directly into your smartphone keyboard

Acti is betting the smartphone keyboard is the next home for AI assistants. The startup’s new keyboard for iOS and Android works across apps and lets users create custom AI-powered shortcuts using natural language.

Source: TechCrunch AI

Anthropic’s Claude Science bets on workflow, not a new model, to win over scientists

Anthropic’s Claude Science is a workbench that gives scientists one environment to do computational research, saving them from the need to bounce between databases, pipelines, and tools.

Source: TechCrunch AI


Generated: 2026/7/2 10:03:51 (UTC: 2026-07-02T02:03:51.040Z)

👍 177 · arXiv

We introduce Orca, an initial instantiation of a general world foundation model. Orca learns a unified world latent space from multimodal world signals and exposes it through multimodal readout interf…

Dockerless: Environment-Free Program Verifier for Coding Agents

👍 86 · arXiv

Program verifiers play a central role in training coding agents, including selecting trajectories for supervised fine-tuning (SFT) and providing rewards for reinforcement learning (RL). Standard execu…

👍 76 · arXiv

On-policy distillation (OPD) offers superior capacity transfer by supervising student-sampled trajectories with dense token-level signals. To furnish high-quality supervision sources and thereby eleva…

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

👍 67 · arXiv

Speculative decoding accelerates inference by using a lightweight draft model to generate candidate tokens in parallel, which are then verified by the target model, enabling lossless acceleration. Recen…

Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models

👍 41 · arXiv

Embodied Vision-Language-Action (VLA) models are typically obtained by fine-tuning powerful pretrained VLMs on robotics data, yet it is unclear how much commonsense and factual knowledge they retain a…

We heard the feedback. v2026.6.11 focuses on the rough edges that make OpenClaw feel less dependable, with fixes for misplaced replies, stuck sends, reconnects, model setup failures, and safer admin d…

Link: https://github.com/openclaw/openclaw/releases/tag/v2026.6.11

Link: https://github.com/ollama/ollama/releases/tag/v0.31.1

  • Add aiobotocore to the bedrock extra
  • Document flow agent options
  • Add text helper to flow skill example
  • Add text helper for flow CEL prompts
  • Add streaming docs…

Link: https://github.com/crewAIInc/crewAI/releases/tag/1.15.2a2

Release 0.143.0-alpha.33

Link: https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.33

SpaceX has an AI device prototype, and it sure sounds phone-ish

SpaceX reportedly showed investors a “handset-like” AI device before going public. It could be another signal SpaceX wants to expand into wireless.

Source: TechCrunch AI

Ashton Kutcher leaving Sound Ventures to launch new VC firm with Morgan Beller

Sound built its reputation on concentrated, high-conviction bets in category-leading AI labs, while Kutcher’s new fund appears to be chasing the layer underneath those companies — the infrastructure and energy that power them.

Source: TechCrunch AI

Cloudflare’s new policy pushes AI companies to pay for publishers’ content

Cloudflare is giving AI companies until September 15 to separate web crawlers used for search from those used for AI training and agents, or risk being blocked by default on many publisher sites.

Source: TechCrunch AI

Venice AI becomes a unicorn with $65M Series A as its privacy-first AI platform takes off

Venice AI is already profitable, with annualized run-rate revenues of over $70 million, CEO Erik Voorhees said.

Source: TechCrunch AI

Gemini Spark, Google’s agentic assistant, is now available on Mac

Google’s 24/7 agentic assistant, Gemini Spark, comes to Mac alongside other improvements, like real-time tracking and support for more apps.

Source: TechCrunch AI

Builders Stage agenda revealed: Practical strategies for scaling startups at TechCrunch Disrupt 2026

The Builders Stage is returning to TechCrunch Disrupt 2026, bringing together 10,000+ founders, startup operators, and investors for practical conversations and Q&A on what it takes to build and scale successful companies. Register now to save up to $330.

Source: TechCrunch AI

Meta, like SpaceX, looks to turn excess AI compute into cash

Meta is developing plans for a cloud infrastructure business, selling access to AI compute power and models. The move would pit it against the big cloud providers like Amazon Web Services, Google Cloud, and Microsoft Azure.

Source: TechCrunch AI

The ‘Father of the Internet’ is finally retiring

Vinton Cerf, one of the creators of the protocols underlying the internet, will step down as Google’s chief internet evangelist next week.

Source: TechCrunch AI


Generated: 2026/7/3 09:47:58 (UTC: 2026-07-03T01:47:58.485Z)

Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models

👍 54 · arXiv

Embodied Vision-Language-Action (VLA) models are typically obtained by fine-tuning powerful pretrained VLMs on robotics data, yet it is unclear how much commonsense and factual knowledge they retain a…

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

👍 34 · arXiv

We introduce PerceptionRubrics, a rubric-based evaluation framework that addresses the gap between saturated benchmark scores and real-world brittleness. Shifting evaluation from holistic semantic mat…

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History

👍 24 · arXiv

Agent skills extend language-model agents with task-specific procedures, scripts, and references, but the tasks and environments they target continually change. Existing methods improve skills in boun…

ELDR: Expert-Locality-Aware Decode Routing for PD-Disaggregated MoE Serving

👍 19 · arXiv

In prefill-decode (PD) disaggregated LLM serving, each request is assigned to a decode worker after prefill. Existing decode routers balance only load; for mixture-of-experts (MoE) models this is inco…

TurboServe: Serving Streaming Video Generation Efficiently and Economically

👍 19 · arXiv

Streaming video generation is emerging as a new serving workload in which users interact with long-lived sessions that generate video progressively, chunk by chunk. Unlike offline video generation or …

  • OpenAI GPT-5.6 support: OpenClaw now recognizes the GPT-5.6 model family across catalog, capability, and runtime selection paths. (#98333) Thanks @steipete-oai.
  • **External har…

Link: https://github.com/openclaw/openclaw/releases/tag/v2026.7.1-beta.1

  • Add aiobotocore to the bedrock extra
  • Document flow agent options
  • Add text helper to flow skill example
  • Add text helper for flow CEL prompts
  • Add streaming docs…

Link: https://github.com/crewAIInc/crewAI/releases/tag/1.15.2a2

  • iFlytek Spark and Astron MaaS providers (#9837)
  • --edit session flag to edit conversation before forking (#9799)
  • Fireworks AI declarative provider (#9990)
  • OpenRouter request pa…

Link: https://github.com/aaif-goose/goose/releases/tag/v1.41.0

Release 0.143.0-alpha.34

Link: https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.34

The short leash AI coding method for beating Fable

Article URL: https://blog.okturtles.org/2026/07/short-leash-ai-method/ Comments URL: https://news.ycombinator.com/item?id=48766026 Points: 63

Source: Hacker News AI

Claude-real-video - any LLM can watch a video

Article URL: https://github.com/HUANGCHIHHUNGLeo/claude-real-video Comments URL: https://news.ycombinator.com/item?id=48766005 Points: 81

Source: Hacker News AI

Claude’s AskUserQuestion: “No response after 60s – continued without an answer”

Article URL: https://github.com/anthropics/claude-code/issues/73125 Comments URL: https://news.ycombinator.com/item?id=48765630 Points: 54

Source: Hacker News AI

Weird Al Yankovic Pulled Out of AI Ad Deal: ‘I Can’t Be the Poster Boy for AI’

Article URL: https://variety.com/2026/biz/news/weird-al-yankovic-rejected-ai-commercial-money-offer-1236800794/ Comments URL: https://news.ycombinator.com/item?id=48764326 Points: 59

Source: Hacker News AI

Article URL: https://joeyh.name/blog/entry/no_LLM_code_in_dependencies/ Comments URL: https://news.ycombinator.com/item?id=48762008 Points: 114

Source: Hacker News AI

AI can’t be listed as inventor on patent applications, Japan’s top court rules

Article URL: https://japannews.yomiuri.co.jp/science-nature/technology/20260306-314930/ Comments URL: https://news.ycombinator.com/item?id=48761536 Points: 360

Source: Hacker News AI

AI fake news complaining about how AI fake news is the death of real news

Article URL: https://www.niemanlab.org/2026/07/now-were-getting-ai-fake-news-complaining-about-how-ai-fake-news-is-the-death-of-real-news/ Comments URL: https://news.ycombinator.com/item?id=48760598 Points: 153

Source: Hacker News AI

The gauge broke: devs felt 20% faster with AI, measured 19% slower (2025)

Article URL: https://intrepidkarthi.com/writing/the-gauge-broke/ Comments URL: https://news.ycombinator.com/item?id=48757440 Points: 76

Source: Hacker News AI


Generated: 2026/7/4 09:47:36 (UTC: 2026-07-04T01:47:36.452Z)

Program-as-Weights: A Programming Paradigm for Fuzzy Functions

👍 58 · arXiv

Many everyday programming tasks resist clean rule-based implementation, such as alerting on important log lines, repairing malformed JSON, or ranking search results by intent, and are increasingly out…

AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents

👍 41 · arXiv

Memory for a long-horizon LLM agent is a contract about what each future decision is allowed to see. The simplest contract appends past observations, tool calls, and reflections to every prompt, which…

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

👍 40 · arXiv

Autonomous agents are increasingly expected to improve executable policies through feedback, yet existing evaluations often collapse this process into a final score or confound it with open-ended soft…

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

👍 36 · arXiv

We introduce PerceptionRubrics, a rubric-based evaluation framework that addresses the gap between saturated benchmark scores and real-world brittleness. Shifting evaluation from holistic semantic mat…

👍 31 · arXiv

Hybrid attention models improve long-context efficiency by retaining only a subset of full-attention layers and replacing the remaining layers with linear attention. However, the effectiveness of Tran…

  • OpenAI GPT-5.6 support: OpenClaw now recognizes the GPT-5.6 model family across catalog, capability, and runtime selection paths. (#98333) Thanks @steipete-oai.
  • **External har…

Link: https://github.com/openclaw/openclaw/releases/tag/v2026.7.1-beta.1

  • iFlytek Spark and Astron MaaS providers (#9837)
  • --edit session flag to edit conversation before forking (#9799)
  • Fireworks AI declarative provider (#9990)
  • OpenRouter request pa…

Link: https://github.com/aaif-goose/goose/releases/tag/v1.41.0

Release 0.143.0-alpha.35

Link: https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.35

Kagi Changelog (July 2): Heads, tails, and an AI toggle

Article URL: https://kagi.com/changelog#10959 Comments URL: https://news.ycombinator.com/item?id=48779352 Points: 51

Source: Hacker News AI

AI saves about 3% of your hours, and almost none of it reaches the money

Article URL: https://okaneland.com/study/ai-productivity-roi-at-work/ Comments URL: https://news.ycombinator.com/item?id=48777257 Points: 70

Source: Hacker News AI

Instead of banning AI, I made a classroom contract with my students

Article URL: https://www.science.org/content/article/instead-banning-ai-i-made-classroom-contract-my-students Comments URL: https://news.ycombinator.com/item?id=48775499 Points: 69

Source: Hacker News AI

Article URL: https://www.elenaverna.com/p/please-stop-the-ai-confidence-theater Comments URL: https://news.ycombinator.com/item?id=48774414 Points: 222

Source: Hacker News AI

AI Data Centers Use More Water Than Most Tech Giants Report

Article URL: https://www.wsj.com/tech/ai/ai-data-centers-water-use-901e2902 Comments URL: https://news.ycombinator.com/item?id=48774246 Points: 54

Source: Hacker News AI

Alibaba to ban Claude Code in workplace over alleged backdoor risks, source says

Article URL: https://www.reuters.com/world/china/alibaba-ban-claude-code-workplace-over-alleged-backdoor-risks-source-says-2026-07-03/ Comments URL: https://news.ycombinator.com/item?id=48772443 Points: 314

Source: Hacker News AI

AI coding is a nightmare. Am I the only one experiencing this?

Here are my biggest gripes with AI coding assistants right now: Obsessed with reinventing the wheel. You’ll often find it writing three duplicate functions for the exact same feature in a single file. Why? Because it’s terrified of blowing up the context window, so it only reads a fraction of a larg

Source: Hacker News AI

Article URL: https://righttointelligence.org/ Comments URL: https://news.ycombinator.com/item?id=48768951 Points: 496

Source: Hacker News AI