Skip to content

AI 速递 2026-07-04

生成时间:2026/7/4 09:47:36(UTC: 2026-07-04T01:47:36.452Z)

Program-as-Weights: A Programming Paradigm for Fuzzy Functions

Section titled “Program-as-Weights: A Programming Paradigm for Fuzzy Functions”

👍 58 · arXiv

Many everyday programming tasks resist clean rule-based implementation, such as alerting on important log lines, repairing malformed JSON, or ranking search results by intent, and are increasingly out…

AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents

Section titled “AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents”

👍 41 · arXiv

Memory for a long-horizon LLM agent is a contract about what each future decision is allowed to see. The simplest contract appends past observations, tool calls, and reflections to every prompt, which…

EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments

Section titled “EvoPolicyGym: Evaluating Autonomous Policy Evolution in Interactive Environments”

👍 40 · arXiv

Autonomous agents are increasingly expected to improve executable policies through feedback, yet existing evaluations often collapse this process into a final score or confound it with open-ended soft…

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

Section titled “PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception”

👍 36 · arXiv

We introduce PerceptionRubrics, a rubric-based evaluation framework that addresses the gap between saturated benchmark scores and real-world brittleness. Shifting evaluation from holistic semantic mat…

👍 31 · arXiv

Hybrid attention models improve long-context efficiency by retaining only a subset of full-attention layers and replacing the remaining layers with linear attention. However, the effectiveness of Tran…

  • OpenAI GPT-5.6 support: OpenClaw now recognizes the GPT-5.6 model family across catalog, capability, and runtime selection paths. (#98333) Thanks @steipete-oai.
  • **External har…

链接https://github.com/openclaw/openclaw/releases/tag/v2026.7.1-beta.1

  • iFlytek Spark and Astron MaaS providers (#9837)
  • --edit session flag to edit conversation before forking (#9799)
  • Fireworks AI declarative provider (#9990)
  • OpenRouter request pa…

链接https://github.com/aaif-goose/goose/releases/tag/v1.41.0

Release 0.143.0-alpha.35

链接https://github.com/openai/codex/releases/tag/rust-v0.143.0-alpha.35

Kagi Changelog (July 2): Heads, tails, and an AI toggle

Section titled “Kagi Changelog (July 2): Heads, tails, and an AI toggle”

Article URL: https://kagi.com/changelog#10959 Comments URL: https://news.ycombinator.com/item?id=48779352 Points: 51

来源Hacker News AI

AI saves about 3% of your hours, and almost none of it reaches the money

Section titled “AI saves about 3% of your hours, and almost none of it reaches the money”

Article URL: https://okaneland.com/study/ai-productivity-roi-at-work/ Comments URL: https://news.ycombinator.com/item?id=48777257 Points: 70

来源Hacker News AI

Instead of banning AI, I made a classroom contract with my students

Section titled “Instead of banning AI, I made a classroom contract with my students”

Article URL: https://www.science.org/content/article/instead-banning-ai-i-made-classroom-contract-my-students Comments URL: https://news.ycombinator.com/item?id=48775499 Points: 69

来源Hacker News AI

Article URL: https://www.elenaverna.com/p/please-stop-the-ai-confidence-theater Comments URL: https://news.ycombinator.com/item?id=48774414 Points: 222

来源Hacker News AI

AI Data Centers Use More Water Than Most Tech Giants Report

Section titled “AI Data Centers Use More Water Than Most Tech Giants Report”

Article URL: https://www.wsj.com/tech/ai/ai-data-centers-water-use-901e2902 Comments URL: https://news.ycombinator.com/item?id=48774246 Points: 54

来源Hacker News AI

Alibaba to ban Claude Code in workplace over alleged backdoor risks, source says

Section titled “Alibaba to ban Claude Code in workplace over alleged backdoor risks, source says”

Article URL: https://www.reuters.com/world/china/alibaba-ban-claude-code-workplace-over-alleged-backdoor-risks-source-says-2026-07-03/ Comments URL: https://news.ycombinator.com/item?id=48772443 Points: 314

来源Hacker News AI

AI coding is a nightmare. Am I the only one experiencing this?

Section titled “AI coding is a nightmare. Am I the only one experiencing this?”

Here are my biggest gripes with AI coding assistants right now: Obsessed with reinventing the wheel. You’ll often find it writing three duplicate functions for the exact same feature in a single file. Why? Because it’s terrified of blowing up the context window, so it only reads a fraction of a larg

来源Hacker News AI

Article URL: https://righttointelligence.org/ Comments URL: https://news.ycombinator.com/item?id=48768951 Points: 496

来源Hacker News AI