周报 2026-06-08 ~ 2026-06-14

生成时间：2026/6/14 13:52:07（UTC: 2026-06-14T05:52:07.694Z）

本周自动总结未启用或调用失败，以下为原始内容合并。

2026-06-08

生成时间：2026/6/8 10:31:33（UTC: 2026-06-08T02:31:33.927Z）

论文精选

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

👍 72 · arXiv

Code language models need repository-level context to resolve imports, APIs, and project conventions. Existing methods inject this knowledge as long inputs (retrieved through RAG or dependency analysi…

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

👍 45 · arXiv

Role-playing language agents (RPLAs) should play characters whose values and behavior evolve as the story progresses, not maintain a fixed persona. Existing benchmarks measure factual recall at a give…

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

👍 38 · arXiv

Agents are widely deployed as assistants over documents, tools, and code. However, they typically act only on explicit user requests, which surface only the problems the user has noticed, while many o…

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

👍 37 · arXiv

Planning for real-world problems by language models often involves both world and user constraints, which may not be fully specified upfront and are progressively disclosed through interaction. Howeve…

RobotValues: Evaluating Household Robots When Human Values Conflict

👍 24 · arXiv

While household robots are often evaluated based on task completion, everyday domestic environments involve value-conflicting situations in which robots are expected to choose actions that prioritize …

版本更新

OpenClaw v2026.6.5-beta.2

Highlights

QQBot now strips model reasoning/thinking scaffolding before native delivery, preventing raw <thinking> content from leaking into channel replies. (#89913, #90132) Thanks @openper…

链接：https://github.com/openclaw/openclaw/releases/tag/v2026.6.5-beta.2

Ollama v0.30.7-rc1

What’s Changed

docs: update Zod example to use native toJSONSchema by @fuleinist in https://github.com/ollama/ollama/pull/14746
launch: use native Windows Hermes config path by @BruceMacD in …

链接：https://github.com/ollama/ollama/releases/tag/v0.30.7-rc1

开发者工具

OpenAI Codex CLI rust-v0.138.0-alpha.6

Release 0.138.0-alpha.6

…

链接：https://github.com/openai/codex/releases/tag/rust-v0.138.0-alpha.6

行业动态

Is this the dawn of the Tokenpocalypse?

We’re likely to see more price increases as the big AI companies plan to go public.

来源：TechCrunch AI

Notion restores access to Anthropic after service disruption

Notion’s head of product said he was “astonished” at “the amount of people RT-ing this.”

来源：TechCrunch AI

OpenAI is still working on that ‘super app’

“Chat is dead” — at least, according to a senior OpenAI employee.

来源：TechCrunch AI

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

Even with Lockdown Mode, ChatGPT could be still vulnerable to prompt injections, but the goal is to reduce the likelihood that sensitive data gets shared in the process.

来源：TechCrunch AI

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

Apple’s WWDC nears: Here’s what you can look forward to.

来源：TechCrunch AI

Sriram Krishnan is leaving his role as White House AI advisor

Krishnan is reportedly starting a new institution to continue shaping Trump’s AI policy.

来源：TechCrunch AI

The Trump administration might take an equity stake in OpenAI

President Donald Trump said he’s discussing deals “where the American people can benefit from the success of AI.”

来源：TechCrunch AI

Startup Battlefield 200 applications officially close in 3 days

Applications for Startup Battlefield 200 officially close on June 8, 11:59 p.m. PT. Don’t wait any longer. Secure your shot at competing on the Disrupt Stage at TechCrunch Disrupt 2026 this October at San Francisco’s Moscone West.

来源：TechCrunch AI

2026-06-09

生成时间：2026/6/9 09:56:03（UTC: 2026-06-09T01:56:03.617Z）

论文精选

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

👍 71 · arXiv

Large language models exhibit impressive zero-shot capabilities across a wide range of downstream tasks. However, they struggle to function as off-the-shelf embedding models, leading to suboptimal per…

SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations

👍 43 · arXiv

Evaluating LLM mediators remains challenging, as mediation unfolds as a real-time trajectory shaped by disputants’ shifting emotions, intentions, and context. Existing testbeds rely on a few expert-au…

GENEB: Why Genomic Models Are Hard to Compare

👍 42 · arXiv

Progress in genomic foundation models is difficult to assess due to fragmented benchmarks, incompatible evaluation protocols, and task-specific reporting. As a result, claims of superiority or general…

MMAE: A Massive Multitask Audio Editing Benchmark

👍 39 · arXiv

We introduce MMAE, a Massive Multitask Audio Editing benchmark, serving as the first comprehensive evaluation testbed designed for general-purpose instruction-based audio editing. Spurred by the shift…

AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization

👍 24 · arXiv

Despite being a pivotal frontier, interactive world modeling remains underexplored in terms of the versatile controllability required by practical scenarios. To bridge this gap, we present AnchorWorld…

版本更新

OpenClaw v2026.6.5-beta.5

Highlights

QQBot now strips model reasoning/thinking scaffolding before native delivery, preventing raw <thinking> content from leaking into channel replies. (#89913, #90132) Thanks @openper…

链接：https://github.com/openclaw/openclaw/releases/tag/v2026.6.5-beta.5

LangChain langchain-core==1.4.2

Changes since langchain-core==1.4.1

release(core): 1.4.2 (#37968) feat(core): deprecate problematic dict() method (#31685)…

链接：https://github.com/langchain-ai/langchain/releases/tag/langchain-core%3D%3D1.4.2

Ollama v0.30.7

Ollama Launch now supports Hermes Desktop, a native desktop interface for the Hermes agent. Run it alongside your Hermes agent to get a visual interface for managing conversations, integrations, and m…

链接：https://github.com/ollama/ollama/releases/tag/v0.30.7

开发者工具

OpenAI Codex CLI rust-v0.138.0

New Features

The /app command can now hand off the current CLI thread into Codex Desktop on macOS and native Windows, and Windows workspace launches can open directly into Desktop instead of s…

链接：https://github.com/openai/codex/releases/tag/rust-v0.138.0

行业动态

Apple reveals new AI architecture built around Google Gemini models

Article URL: https://www.macrumors.com/2026/06/08/apple-reveals-new-ai-architecture/ Comments URL: https://news.ycombinator.com/item?id=48450142 Points: 332

周报 2026-06-08 ~ 2026-06-14

2026-06-08

论文精选

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

RobotValues: Evaluating Household Robots When Human Values Conflict

版本更新

OpenClaw v2026.6.5-beta.2

Highlights

Ollama v0.30.7-rc1

What’s Changed

开发者工具

OpenAI Codex CLI rust-v0.138.0-alpha.6

行业动态

Is this the dawn of the Tokenpocalypse?

Notion restores access to Anthropic after service disruption

OpenAI is still working on that ‘super app’

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

Sriram Krishnan is leaving his role as White House AI advisor

The Trump administration might take an equity stake in OpenAI

Startup Battlefield 200 applications officially close in 3 days

2026-06-09

论文精选

Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations

GENEB: Why Genomic Models Are Hard to Compare

MMAE: A Massive Multitask Audio Editing Benchmark

AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization

版本更新

OpenClaw v2026.6.5-beta.5

Highlights

LangChain langchain-core==1.4.2

Ollama v0.30.7

开发者工具

OpenAI Codex CLI rust-v0.138.0

New Features

行业动态

Apple reveals new AI architecture built around Google Gemini models

Comments: 315

Apple Core AI Framework

Comments: 39

Ask HN: What are tools you have made for yourself since the advent of AI?

Comments: 274

Siri AI

Comments: 360

AI is slowing down

Comments: 415

SDSU Wired Its Dorms with 1,300 AI Cameras Without Telling Students

Comments: 17

DeepSeek V4 Pro beats GPT-5.5 Pro on precision

Comments: 215

The OnlyFans Economy of American AI

Comments: 203

2026-06-10

论文精选

Agents’ Last Exam

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

A Geometric Account of Activation Steering through Angle-Norm Decomposition

SwiftVR: Real-Time One-Step Generative Video Restoration

版本更新

OpenClaw v2026.6.5

Highlights

LangChain langchain==1.3.6

CrewAI 1.14.7a4

What’s Changed

Features

Documentation

开发者工具

OpenAI Codex CLI rust-v0.139.0

New Features

行业动态

AI misidentification results in wrongful arrest; man seeks justice

Comments: 30

If Claude Fable stops helping you, you’ll never know

Comments: 242

Apple’s AI Can Now Change Your Passwords. What Could Possibly Go Wrong?