森哥 Ideas

Teaching Your AI Agent New Tricks: A Complete Guide to gh skill

2026-04-18T08:30:00.000Z

The Same Prompt, Over and Over

My AI agent writes code fast. Documentation? Not so much.

It’s not that it can’t write docs — it will happily produce pages of text. The problem is consistency. One session it structures everything around the Diátaxis framework with tutorials, how-tos, references, and explanations neatly separated. The next session, it dumps everything into one long blob. I spend ten minutes explaining the writing standards, close the session, open a new one, and it’s back to square one.

I’ve tried stuffing rules into the system prompt. I’ve tried keeping a .md file in the repo for it to read. Both work, sort of. But then every project needs its own copy, and they drift out of sync within a week.

Last week, GitHub CLI v2.90.0 shipped a new command group that addresses exactly this: gh skill.

What Are Agent Skills?

Agent Skills are standardized instruction packages that tell an AI agent how to handle specific tasks.

Think of it this way: when you onboard at a new company, you get a checklist — where the code style guide lives, how to open a PR, what the deploy process looks like. You don’t wander around asking everyone. An Agent Skill is that checklist for your AI, except it’s versioned, portable, and works across different agents.

The spec is open and defined at agentskills.io. gh skill is GitHub CLI’s official implementation — search, preview, install, and publish, all from the command line.

Supported agents: GitHub Copilot, Claude Code, Cursor, Codex, Gemini CLI, and Antigravity.

Hands-On: Installing a Skill

Search

Let’s see what’s out there. The github/awesome-copilot repo has 200+ skills in its skills/ directory:

1	gh skill search documentation

This returns a list of matching skills with names, descriptions, and source repos. Suppose we spot documentation-writer — a technical writing skill built on the Diátaxis framework. Exactly what I need.

Preview

Before installing, take a look at what you’re getting:

1	gh skill preview github/awesome-copilot documentation-writer

This shows the SKILL.md contents: the YAML frontmatter (metadata like name, description, compatibility) and the Markdown body (the actual instructions). You can decide whether it’s a good fit without touching your project.

Install

Looks good. Install it:

1	gh skill install github/awesome-copilot documentation-writer

This creates (or updates) the skill files in your project. The core is a SKILL.md with YAML frontmatter that records the source:

---
name: documentation-writer
description: "Technical documentation writer following the Diátaxis framework..."
source:
  repo: github/awesome-copilot
  ref: main
  tree-sha: a1b2c3d4e5f6...
---

The Markdown body below that contains the actual instructions — when to trigger, how to structure docs, what tone to use.

A complete skill directory looks like this:

documentation-writer/
├── SKILL.md          # Required: YAML frontmatter + Markdown instructions
├── scripts/          # Optional: executable scripts
├── references/       # Optional: reference docs
└── assets/           # Optional: templates, resources

SKILL.md is the only required file. Everything else is optional.

Installing for a Different Agent

The default target is GitHub Copilot. For Claude Code:

1	gh skill install github/awesome-copilot documentation-writer --agent claude-code --scope user

--agent sets the target agent. --scope user makes it a user-level skill, not limited to the current project.

Pinning Versions

For production, you probably don’t want skills updating under your feet. Pin to a specific version:

1	gh skill install github/awesome-copilot documentation-writer --pin v1.2.0

With pinning, upstream changes won’t affect you. When you’re ready to update, explicitly run:

1	gh skill update --all

Writing Your Own Skill

Now let’s build one from scratch. Say I want a commit-message-writer skill that enforces Conventional Commits across the team — with scopes, 72-character body wrapping, the works.

Create the Directory

1	mkdir -p commit-message-writer

Write SKILL.md

cat > commit-message-writer/SKILL.md << 'EOF'
---
name: commit-message-writer
description: "Generates commit messages following Conventional Commits specification. Triggered when the user asks to commit, or when suggesting a commit after code changes."
license: MIT
compatibility:
  - github-copilot
  - claude-code
  - cursor
---

## Commit Message Format

When generating a commit message, follow this exact structure:

():

gh skill で AI エージェントにスキルを追加する：インストールから公開まで完全ガイド

2026-04-18T08:30:00.000Z

毎回同じことを教えるのに疲れた

AI エージェントにコードを書かせると速い。ドキュメントはというと、まあ、微妙。

書けないわけじゃない。頼めばそれなりの文章が出てくる。問題は一貫性だ。ある日は Diátaxis フレームワークに従って tutorial・how-to・reference・explanation をきちんと分けてくれる。次の日には全部ひとかたまりの文章になってる。10分かけて書き方のルールを説明して、セッションを閉じて、新しいセッションを開いたら——全部忘れてる。

system prompt に規約を詰め込んだこともある。.md ファイルにルールを書いてリポジトリに置いたこともある。どちらも一応動くけど、プロジェクトごとにコピーが必要で、バージョンはすぐにバラバラになる。

先週、GitHub CLI v2.90.0 で新しいコマンドグループがリリースされた。gh skill だ。

Agent Skills とは何か

一言で言えば、AI エージェント用のスキルパッケージ。特定のタスクに対して「こうやれ」という標準化された指示セットのこと。

たとえ話をすると——新しい会社に入社した初日にオンボーディング資料をもらう。コーディング規約はどこ、PR の出し方はこう、デプロイ手順はこう。いちいち人に聞かなくていい。Agent Skill はそのオンボーディング資料の AI 版で、バージョン管理ができて、プロジェクトやエージェントをまたいで使い回せる。

仕様はオープンで、agentskills.io で定義されている。gh skill は GitHub CLI によるその公式実装——検索、プレビュー、インストール、公開がコマンド一つでできる。

対応エージェント：GitHub Copilot、Claude Code、Cursor、Codex、Gemini CLI、Antigravity。

実践：既存の skill をインストールする

検索

まずは何があるか見てみる。github/awesome-copilot リポジトリの skills ディレクトリには 200 以上の skill がある：

1	gh skill search documentation

マッチした skill の一覧が名前・説明・リポジトリとともに表示される。documentation-writer を見つけたとしよう——Diátaxis フレームワークに基づいた技術文書作成 skill。まさに欲しかったもの。

プレビュー

インストール前に中身を確認する：

1	gh skill preview github/awesome-copilot documentation-writer

SKILL.md の内容が表示される。frontmatter のメタデータ（名前、説明、互換性など）と、本文の具体的な指示。インストールせずに自分に合うか判断できる。

インストール

良さそうなのでインストール：

1	gh skill install github/awesome-copilot documentation-writer

このコマンドは、プロジェクト内に skill ファイルを作成（または更新）する。核となるのは SKILL.md で、YAML frontmatter にソース情報が記録される：

---
name: documentation-writer
description: "Technical documentation writer following the Diátaxis framework..."
source:
  repo: github/awesome-copilot
  ref: main
  tree-sha: a1b2c3d4e5f6...
---

その下の Markdown 本文がエージェントへの具体的な指示——いつトリガーするか、ドキュメントの構成方法、使うべきトーンなど。

skill ディレクトリの全体像はこう：

documentation-writer/
├── SKILL.md          # 必須：YAML frontmatter + Markdown 指示
├── scripts/          # 任意：実行可能スクリプト
├── references/       # 任意：参考ドキュメント
└── assets/           # 任意：テンプレート、リソース

必須ファイルは SKILL.md だけ。他はすべて任意。

別のエージェントにインストール

デフォルトのターゲットは GitHub Copilot。Claude Code に入れたい場合：

1	gh skill install github/awesome-copilot documentation-writer --agent claude-code --scope user

--agent でターゲットエージェントを指定。--scope user でユーザーレベルの skill にする（現在のプロジェクトに限定されない）。

バージョン固定

本番環境では、skill が勝手に変わってほしくない。特定バージョンに固定する：

1	gh skill install github/awesome-copilot documentation-writer --pin v1.2.0

固定すれば上流の変更に影響されない。更新したくなったら明示的に実行：

1	gh skill update --all

自分で skill を書く

既存の skill を試したので、次は自作してみる。commit-message-writer という skill を作ろう——Conventional Commits に従った commit メッセージのフォーマットをチーム全体で統一するためのもの。

ディレクトリ作成

1	mkdir -p commit-message-writer

SKILL.md を書く

cat > commit-message-writer/SKILL.md << 'EOF'
---
name: commit-message-writer
description: "Generates commit messages following Conventional Commits specification. Triggered when the user asks to commit, or when suggesting a commit after code changes."
license: MIT
compatibility:
  - github-copilot
  - claude-code
  - cursor
---

## Commit Message Format

When generating a commit message, follow this exact structure:

():

用 gh skill 给 AI 装技能包：从安装到发布完整指南

2026-04-18T08:30:00.000Z

一个反复出现的问题

我的 AI agent 写代码很快，但文档永远写不好。

不是说它不会写——你让它写，它能给你一篇洋洋洒洒的文档。问题是，每次的输出都不一样。这次用了 Diátaxis 框架分了 tutorial/how-to/reference/explanation 四种类型，下次又变成了一堆无结构的段落。你花十分钟教它一遍规范，过两天新开一个会话，它全忘了。

我试过把写作规范塞进 system prompt，试过写成 .md 文件放在仓库里让它读。能用，但不优雅。每个项目都要复制一份，版本一旦更新就到处不同步。

直到上周，GitHub CLI v2.90.0 发布了一个新命令组：gh skill。

Agent Skills：给 AI 的技能包

一句话解释：Agent Skills 就是一套标准化的指令包，告诉 AI agent 在特定场景下该怎么做。

类比一下——你新入职一家公司，HR 给你一个 onboarding checklist，里面写了代码规范在哪儿、PR 怎么提、部署流程是什么。你不需要到处问人，照着做就行。Agent Skill 就是给 AI 的 onboarding checklist，只不过格式标准化了，可以跨项目、跨 agent 复用。

规范本身是开放的，定义在 agentskills.io。gh skill 是 GitHub CLI 对这个规范的官方实现——搜索、预览、安装、发布，一行命令搞定。

目前支持的 agent 包括：GitHub Copilot、Claude Code、Cursor、Codex、Gemini CLI、Antigravity。

实战：装一个现成的 skill

搜索

先看看有什么可用的。github/awesome-copilot 这个仓库的 skills 目录下有 200 多个 skill，用搜索命令找：

1	gh skill search documentation

输出会列出匹配的 skill 列表，包括名称、描述、所在仓库。假设我们看到了 documentation-writer——一个基于 Diátaxis 框架的技术写作 skill，正好是我想要的。

预览

装之前先看看它具体是什么：

1	gh skill preview github/awesome-copilot documentation-writer

预览会展示 SKILL.md 的内容：frontmatter 里的元数据（名称、描述、兼容性等）和正文里的指令。你可以在不安装的情况下判断这个 skill 是不是你需要的。

安装

确认没问题，装：

1	gh skill install github/awesome-copilot documentation-writer

这条命令做了什么？它会在你当前项目里创建（或更新）对应的 skill 文件。核心是一个 SKILL.md，里面的 YAML frontmatter 记录了来源信息：

---
name: documentation-writer
description: "Technical documentation writer following the Diátaxis framework..."
source:
  repo: github/awesome-copilot
  ref: main
  tree-sha: a1b2c3d4e5f6...
---

下面的 Markdown 正文就是给 agent 的具体指令——什么时候触发、怎么组织文档结构、用什么语气、遵循什么规范。

一个完整的 skill 目录长这样：

documentation-writer/
├── SKILL.md          # 必须：YAML frontmatter + Markdown 指令
├── scripts/          # 可选：可执行脚本
├── references/       # 可选：参考文档
└── assets/           # 可选：模板、资源

SKILL.md 是唯一必须的文件，其他目录按需使用。

给不同的 agent 装

默认安装是给 GitHub Copilot 的。如果你用的是 Claude Code：

1	gh skill install github/awesome-copilot documentation-writer --agent claude-code --scope user

--agent 指定目标 agent，--scope user 表示这是用户级别的 skill，不限于当前项目。

版本锁定

生产环境用的 skill 你不想让它随时变。用 --pin 锁定到一个特定版本：

1	gh skill install github/awesome-copilot documentation-writer --pin v1.2.0

锁定后，即使上游仓库更新了，你这边也不会变。想要更新？手动重新 install 或者跑：

1	gh skill update --all

这会把所有已安装的 skill 更新到最新版本。

自己写一个 skill

装别人的 skill 体验过了，接下来试试自己写一个。

假设我要写一个 commit-message-writer skill，规范团队的 commit message 格式——遵循 Conventional Commits，带 scope，正文不超过 72 字符。

创建目录

1	mkdir -p commit-message-writer

编写 SKILL.md

cat > commit-message-writer/SKILL.md << 'EOF'
---
name: commit-message-writer
description: "Generates commit messages following Conventional Commits specification. Triggered when the user asks to commit, or when suggesting a commit after code changes."
license: MIT
compatibility:
  - github-copilot
  - claude-code
  - cursor
---

## Commit Message Format

When generating a commit message, follow this exact structure:

():

The Future of AI Payments: From Programmable Accounts to Autonomous Agent Transactions

2026-03-31T02:40:00.000Z

Payments look like a solved problem. QR codes, cards, NFC — the experience is already seamless. But that seamlessness rests on a single assumption: the one making the payment is a human being.

When that assumption starts to crack, the entire payment system needs to be rebuilt from its foundations.

From “Humans Pay” to “Agents Pay”

Imagine this: you tell your AI assistant “arrange my trip to Shanghai next week,” and it automatically checks train tickets, books a hotel, and calls a ride for your departure — all without you opening a single app or entering a single password. The receipt arrives afterward, you glance at it, it looks reasonable.

This isn’t science fiction. It’s the interaction pattern that will become mainstream within three to five years.

But buried inside this convenience is a question nobody is asking: who actually paid?

Not you — you just said a sentence. The AI assistant completed every transaction on your behalf. Yet the entire existing payment infrastructure, from identity verification to fraud models, was built assuming a human operator. Facial recognition, SMS verification, behavioral biometrics — these defenses are useless against AI, and equally incapable of distinguishing a compromised AI from a legitimate one.

The payments industry isn’t facing a feature upgrade. It’s facing an architectural rebuild.

Programmable Payments: Giving Money a Logic Layer

Traditional payment is an action: money moves from A to B. Programmable payment is a program: when condition X is met, money moves from A to B, and only for purpose Y.

That distinction sounds small. In practice, it redefines what money itself means.

Three Parallel Technical Paths

Smart contracts are the most mature implementation of programmable payments. Ethereum contracts can encode arbitrarily complex payment logic: escrow, installments, multi-signature, conditional triggers. Code is the contract; execution requires no trusted intermediary. But the challenges are real — the oracle problem (getting trustworthy off-chain data onto the chain), fiat bridging complexity, and high barriers for ordinary users keep it largely confined to crypto-native contexts for now.

Central bank digital currencies are the more strategically important direction, especially the digital yuan. The People’s Bank of China has already publicly tested “conditional payment” capabilities: earmarked funds (subsidies restricted to specific categories), expiration dates (automatic destruction after a deadline), and venue locking (valid only at designated merchants). This is programmable money being pushed by national infrastructure. No other country has anything comparable. The digital yuan’s programmability isn’t a peripheral feature — it’s central to why the system was designed this way.

Open banking is traditional finance’s path of self-evolution. Europe’s PSD2/PSD3 framework mandates that banks expose APIs, enabling third parties to operate accounts programmatically. The IETF has published the RAR (Rich Authorization Requests) standard, upgrading coarse-grained permissions like “access my account” to fine-grained logic like “pay merchant A no more than 150 yuan before 6 PM today.” Stripe’s Payment Intents API is moving in the same direction.

Where Programmable Payments Actually Create Value

Not convenience — the elimination of trust costs.

Payment terms in B2B trade (net-30, net-60, net-90) are externalized trust costs. The buyer isn’t sure about product quality, so payment is delayed; the seller accepts unfavorable terms to maintain the relationship. Programmable payments can encode contract terms directly as payment logic: goods received and quality verified → payment triggers automatically. Payment terms become economically unnecessary.

Insurance claims follow the same logic. The cumbersome claims process exists because “accident occurred” and “payment executed” require extensive human verification in between. If sensor data can be reliably fed into a programmable account, conditions met equals automatic payout, and the entire claims department’s purpose needs rethinking.

The deeper shift is the restructuring of business models: pay-for-results becomes the default. Ad spend paid on actual conversions, SaaS fees based on real output, outsourcing paid on delivery quality. AI verifies the outcome; programmable payments execute the transfer; human review and dispute arbitration become unnecessary.

Programmable payments solve how money flows. They don’t solve who has the right to trigger it. When AI Agents initiate payments on behalf of humans, the authorization system needs to be designed from scratch.

The core tension is simple: users want convenience (autonomous Agent operation), but don’t want to lose control (money shouldn’t move without reason).

A Three-Layer Authorization Model

The identity layer answers “who is this Agent?” Existing authentication (face ID, passwords, SMS codes) was built for humans; Agents can’t use it and shouldn’t have to. A more coherent approach is delegated credentials: the user signs a limited-scope sub-credential with their master key and issues it to the Agent, specifying validity period, authorized scope, spending limits, and non-delegability (the Agent cannot re-delegate to other Agents). W3C’s DID (Decentralized Identifiers) and Verifiable Credentials standards are the most likely technical foundation for this layer.

The permission layer answers “what can the Agent do?” Five dimensions need precise control: amount (per-transaction cap, daily and monthly limits), category (food delivery and rides allowed; transfers and withdrawals blocked), time (weekdays only, or “only act autonomously when I’m offline”), frequency (same merchant daily cap), and confirmation rules (above a threshold, push a notification and wait for human approval). Designing the right permission granularity is the real product challenge — too coarse and it’s unsafe, too fine and convenience disappears.

The execution layer answers “what happens when things go wrong?” Two mechanisms are non-negotiable: a cooling window (Agent-initiated payments enter a 30-second-to-5-minute pending state during which the user can cancel) and circuit breakers (if behavior deviates from historical patterns beyond a threshold, Agent payment permissions are automatically frozen pending human review). Every AI-initiated transaction must maintain a complete audit trail: what triggered it, the decision logic, the execution timestamp.

The Hard Problem: Multi-Agent Nesting

Reality will be far more complex than single-layer authorization:

Your AI assistant calls a travel platform’s AI, which calls a hotel’s AI, which triggers the payment. You only authorized the first layer. If any intermediate Agent is compromised, who’s responsible?

A reasonable design principle is permissions can only narrow, never expand: you give Claude a daily limit of 500 yuan, Claude passes a maximum of 200 yuan to Trip.com’s AI, which passes a maximum of 100 yuan to the hotel’s AI. Every delegation can only constrict the scope. Even if one link in the chain fails, the damage is bounded.

The Identity Layer War: Next Decade’s Payment Entry Point

Once you understand this architecture, you understand what platforms are really competing for.

On the surface it looks like market share. At the core, it’s control of the Agent identity layer.

Wherever users create their AI assistants, that platform issues the delegated credentials. Whichever credentials payment processors recognize, that platform collects the toll. This logic is identical to the mobile payment entry-point wars of a decade ago — the battlefield shifted from “binding a card” to “binding an Agent identity.”

Apple is currently best positioned. Secure Enclave + Apple Pay + Apple Intelligence form a natural identity-payment closed loop: the user’s Agent key lives in the iPhone, Apple manages it, and all payment actions must pass through Apple’s identity layer. Google has the same logic on Android, but Android fragmentation is a genuine weakness.

The Chinese market will be more concentrated. WeChat and Alipay, as super-apps, will almost certainly remain the entry points for users’ AI assistants, and both have strong incentives to bind AI identity with payment accounts into an inseparable whole.

OpenAI and Anthropic are the biggest wildcards.

OpenAI Entering Payments: Not “If” but “How”

OpenAI’s motivation to enter payments is stronger than most people realize.

When Agents become ubiquitous and users let AI handle their purchases, OpenAI is deeply involved in every transaction — it understood the user’s intent, orchestrated the execution, called the merchant’s interface — but under the current system, it gets nothing. This tension will become increasingly acute as Agent adoption scales.

Sam Altman has long had serious interest in financial infrastructure. World ID (iris scanning to prove you’re human) and Worldcoin together are laying the groundwork for a larger ambition: a complete identity + AI + payment closed loop. World ID looks strange in isolation; it makes perfect sense in this context.

The likely paths: acquisition (Stripe’s history with OpenAI is well documented, and Stripe’s API design is uniquely suited to Agent scenarios); an AI wallet (users deposit funds, the Agent spends within limits — starting with prepaid to avoid banking license requirements); or deep partnership with Visa/Mastercard for a dedicated Agent payment channel through the Operator protocol.

Whatever path is taken, the biggest obstacle isn’t technical — it’s regulatory trust. “A company that trains AI models also managing my money” — said aloud today, most users would hesitate. Trust is built slowly, which is why payment product timelines will likely lag technical readiness by two to three years.

Anthropic’s strategy is smarter and more indirect: define the standard through MCP (Model Context Protocol). MCP currently specifies how Agents call tools; payment tool authorization extensions are inevitable. If MCP becomes the industry standard, Anthropic controls the protocol layer — it doesn’t need to run payments itself, just collect the standard fee from the ecosystem. This is Android’s playbook, not Apple’s, and the long-term leverage may be greater.

How Regulation Will Unfold

The trajectories for three regions are fairly predictable.

China will lead with strong control. The PBOC, through the digital yuan, directly controls programmable currency infrastructure. AI payments will almost certainly require whitelisting; Agents with payment capabilities will need to be registered. The control is strong, but it also means the digital yuan can become the natural substrate for AI payments, and rollout speed may actually be faster.

The US will see markets run ahead of regulation. Stripe and Visa will ship products first; the CFPB and SEC will gradually establish rules under pressure from litigation and public opinion. Standardization will be slow, but the innovation space will be large.

The EU will legislate before deploying. The intersection of the AI Act and PSD3 will create the world’s strictest user consent and right-of-revocation frameworks, with the highest compliance costs — but also likely the most rigorous Agent payment standards, which other regions may ultimately adopt as reference.

Every regulatory framework will face the same core question: when AI pays on behalf of a human, who is responsible? The user (who set the rules)? The platform (whose systems the Agent runs on)? The developer (who designed the Agent’s decision logic)? There’s no ready legal answer, and that answer will determine how the entire industry allocates risk.

Closing

Payments are superficially a financial problem, structurally a trust problem, and fundamentally an identity problem. AI has restructured the concept of identity — when the “person” making a payment might be an Agent, when an Agent can be any program designed by anyone, the foundation of the entire trust system needs to be repoured.

Programmable payments, Agent authorization architecture, the AI identity layer — these three directions converge on the same endpoint:

Money will carry intent. It won’t just be a number — it will know what it can be used for, who can trigger it, under what conditions it flows.

This is a massive efficiency gain, and an unprecedented concentration of control. Whoever designs these rules controls the financial infrastructure of the next era. The competition has already started. Most people just haven’t noticed yet.

AI決済の未来：プログラマブル口座からエージェント自律決済へ

2026-03-31T02:40:00.000Z

決済はすでに解決済みの問題に見える。QRコード、カード、NFC——体験は十分にスムーズだ。しかしそのスムーズさは、一つの前提の上に成り立っている。支払いの主体は人間である、という前提だ。

この前提が揺らぎ始めたとき、決済システム全体を基盤から作り直す必要が生じる。

「人間が払う」から「エージェントが払う」へ

こんなシナリオを想像してほしい。AIアシスタントに「来週の上海出張を手配して」と言うだけで、新幹線の切符を調べ、ホテルを予約し、出発時の配車を呼んでくれる。その間、あなたはアプリを一度も開かず、パスワードを一度も入力しない。後から明細が届く。見ると、妥当な金額だ。

これはSFではない。三年から五年以内に普及するインタラクションの形だ。

しかし、この利便性の中に誰も気づいていない深層の問題がある。誰が支払ったのか？

あなたではない——あなたは一言言っただけだ。AIアシスタントがすべての取引を代行した。しかし現行の決済インフラは、身元確認から不正検知モデルに至るまで、「操作者は人間」を前提として設計されている。顔認証、SMS認証、行動バイオメトリクス——これらの防壁はAIに対して意味をなさず、正常に動作しているAIと悪意を持って操られているAIの区別もつかない。

決済業界が直面しているのは機能のアップデートではなく、アーキテクチャの全面的な再構築だ。

プログラマブル決済：お金に「ロジック」を持たせる

従来の決済は一つのアクションだ。AからBへ金が動く。プログラマブル決済はプログラムだ。条件Xが満たされたとき、AからBへ金が動く。しかも目的Yにのみ使用可能。

この違いは小さく見えるが、実際には通貨そのものの定義を覆す。

三つの並行する技術路線

スマートコントラクトは現時点で最も成熟したプログラマブル決済の実装だ。イーサリアムのコントラクトは、エスクロー、分割払い、マルチシグ、条件トリガーなど任意に複雑な決済ロジックを記述できる。コードが契約であり、実行に信頼できる仲介者は必要ない。ただし課題も明確で——オラクル問題（チェーン外データの信頼性ある取り込み）、法定通貨との橋渡しの複雑さ、一般ユーザーへの高い参入障壁により、現在はクリプトネイティブな場面に限られている。

中央銀行デジタル通貨はより注目すべき方向だ。特にデジタル人民元がそうだ。人民銀行はすでに「条件付き決済」機能を公開テストしている。使途限定（補助金は指定品目のみに使用可能）、有効期限設定（期限切れ後自動消滅）、場所のロック（指定店舗のみ有効）といった機能だ。これは国家レベルのインフラが推進するプログラマブル通貨であり、他国には類似したものが存在しない。デジタル人民元のプログラマビリティは周辺機能ではなく、このシステム設計の核心だ。

オープンバンキングは伝統的金融の自己進化の道だ。欧州のPSD2/PSD3規制の枠組みは銀行にAPIの開放を義務付け、サードパーティがプログラムで口座を操作できるようにしている。IETFはRAR（Rich Authorization Requests）標準を公開し、「口座へのアクセス」という粗粒度の認可を「今日の18時までに店舗Aに150元以下を支払う」という細粒度のロジックにアップグレードしている。StripeのPayment Intents APIも同じ方向に向かっている。

プログラマブル決済が本当に価値を生む場所

便利さではない。信頼コストの消滅だ。

B2B取引における支払いサイト（ネット30、ネット60、ネット90）は、信頼コストの外部化だ。買い手は品質に確信が持てないから支払いを遅らせ、売り手は関係維持のために不利な条件を受け入れる。プログラマブル決済は契約条項を直接決済ロジックとして記述できる。商品受領かつ品質検査通過 → 自動的に支払いが実行される。支払いサイトという概念は経済学的に不要になる。

保険の保険金支払いも同じロジックだ。今日の煩雑な請求プロセスは、「事故発生」と「支払い実行」の間に大量の人的確認が必要だから存在する。センサーデータが信頼できる形でプログラマブル口座に入力できれば、条件を満たした瞬間に自動的に支払いが行われ、請求部門全体の存在意義を見直す必要が生じる。

より深い変化はビジネスモデルの再構築だ。成果報酬型が当たり前になる。広告費は実際のコンバージョンに対して支払われ、SaaSは実際の成果に対して課金され、外注は納品物の品質に対して支払われる。AIが結果を検証し、プログラマブル決済が実行する。人的審査と紛争仲裁は不要になる。

エージェント認可アーキテクチャ：「事前同意」の仕組みを設計する

プログラマブル決済は「お金がどのように流れるか」の問題を解決した。しかし「誰が引き金を引く権利を持つか」はまだ解決されていない。AIエージェントが人間に代わって決済を行うとき、認可体系は一から設計する必要がある。

核心的な矛盾は一つだ。ユーザーは利便性を求める（エージェントの自律的な操作）が、コントロールを失いたくない（お金が勝手に使われてはならない）。

三層認可モデル

アイデンティティ層は「このエージェントは誰か」という問いに答える。既存の認証（顔認証、パスワード、SMS認証）は人間向けに設計されており、エージェントには使えないし、使うべきでもない。より合理的な方向は委任証明書だ。ユーザーが自分のマスター鍵で、限定的な権限の子証明書に署名してエージェントに発行する。証明書には有効期限、認可スコープ、上限金額が明記され、再委任不可（エージェントが他のエージェントにさらに委任できない）とする。W3CのDID（分散型ID）とVerifiable Credentials標準が、この層の技術的な基盤として最も有力な候補だ。

権限層は「エージェントに何ができるか」という問いに答える。五つの次元で精密なコントロールが必要だ。金額（1回あたりの上限、日次・月次の累計上限）、カテゴリ（デリバリーと配車のみ許可、送金・出金は禁止）、時間（平日限定、または「私がオフラインのときのみ自律実行」）、頻度（同一店舗の1日あたり上限）、確認ルール（閾値超過時は通知を送信して人的承認を待つ）。適切な権限粒度の設計が本質的なプロダクト課題だ——粗すぎれば安全でなく、細かすぎれば利便性が消える。

実行層は「問題が発生したときどうするか」という問いに答える。二つの仕組みが不可欠だ。クーリングウィンドウ（エージェントが開始した決済は30秒から5分の「実行待機」状態に入り、その間ユーザーはキャンセルできる）と、サーキットブレーカー（過去の行動パターンから一定閾値以上逸脱した場合、エージェントの決済権限を自動的に凍結して人的審査を待つ）だ。AIが開始したすべての取引は完全な監査証跡を保持しなければならない。なぜ実行されたか、決定ロジック、実行タイムスタンプを含む。

最も難しい問題：マルチエージェントのネスト

現実のシナリオは単層認可よりはるかに複雑になる。

あなたのAIアシスタントが旅行プラットフォームのAIを呼び出し、そのAIがホテルのAIを呼び出し、最終的にホテルのAIが決済を実行する。あなたが認可したのは最初の層だけだ。途中のエージェントが侵害された場合、責任は誰にあるのか？

合理的な設計原則の一つは権限は縮小のみ可能、拡大不可だ。ユーザーがClaudeに与える上限が1日500元なら、ClaudeがTrip.comのAIに転授できるのは最大200元、Trip.comのAIがホテルのAIに転授できるのは最大100元。委任のたびにスコープは狭まるのみだ。連鎖のどこかが破綻しても、被害は制御可能な範囲に収まる。

アイデンティティ層争奪：次の10年の決済入口

このアーキテクチャを理解すれば、プラットフォームが本当に何を奪い合っているかが分かる。

表面上は市場シェアの争いだ。本質はエージェントのアイデンティティ層のコントロールだ。

ユーザーがどのプラットフォームでAIアシスタントを作成するか、そのプラットフォームが委任証明書を発行する。決済処理者がどの証明書を認めるか、そのプラットフォームが通行料を徴収する。このロジックは10年前のモバイル決済の入口争いと全く同じで、戦場が「カードの登録」から「エージェントのアイデンティティの登録」に移動しただけだ。

現時点で最も有利なのはAppleだ。Secure Enclave + Apple Pay + Apple Intelligenceは、自然なアイデンティティ-決済の閉じたループを形成している。ユーザーのエージェント鍵はiPhoneの中にあり、Appleが管理し、すべての決済行為はAppleのアイデンティティ層を通過しなければならない。GoogleはAndroidで同じロジックを持っているが、Androidの断片化は本物の弱点だ。

中国市場はより集中した構図になるだろう。WeChat とAlipayはスーパーアプリとして、ユーザーのAIアシスタントの入口はほぼ確実にこの2つのプラットフォームに残り、両者はAIのアイデンティティと決済口座を不可分な一体として結びつける強い動機を持っている。

OpenAIとAnthropicが最大の変数だ。

OpenAIの決済参入：「するかどうか」ではなく「どう入るか」

OpenAIが決済に参入する動機は、多くの人が考えるよりずっと強い。

エージェントが普及し、ユーザーがAIに消費を任せるようになると、OpenAIはすべての取引に深く関与しながら——ユーザーの意図を理解し、実行をオーケストレーションし、商業インターフェースを呼び出しながら——現行のシステムでは一円も受け取れない。この矛盾はエージェントの普及が進むにつれて、ますます深刻になる。

Sam Altmanは金融インフラに長く深い関心を持ってきた。World ID（虹彩スキャンで本人確認）とWorldcoin（デジタル通貨）の組み合わせは、より大きな版図——アイデンティティ + AI + 決済の完全な閉じたループ——への布石だ。World IDだけ見ると奇妙に見えるが、この文脈に置けば非常に明快だ。

考えられる路径はいくつかある。買収（Stripeとの歴史的なつながりは周知のことであり、StripeのAPI設計はエージェントのシナリオに特によく適合している）；AIウォレットの開始（ユーザーが入金し、エージェントが上限内で自律的に消費する——まず銀行ライセンスの要件を回避するためにプリペイドから始める）；あるいはOperatorプロトコルを通じたVisa/Mastercardとの深度提携でエージェント専用決済チャネルを確保する。

どの路径をとるにせよ、最大の障壁は技術ではなく規制の信頼だ。「AIモデルを訓練している会社が同時に自分のお金を管理する」——今日これを言われたら、多くのユーザーは眉をひそめるだろう。信頼の構築には時間がかかり、これが決済プロダクトのタイムラインが技術的な準備より2〜3年遅れる理由だ。

Anthropicの戦略はより賢く間接的だ。MCP（Model Context Protocol）でエージェントがツールを呼び出す標準を定義する。MCPは現在、AIエージェントのツール呼び出しを定義しており、決済ツールの認可拡張は時間の問題だ。MCPが業界標準になれば、Anthropicはプロトコル層をコントロールする——自分で決済を行う必要はなく、エコシステムに走らせて標準税を徴収するだけだ。これはAppleではなくAndroidのプレイブックで、長期的なレバレッジはより大きいかもしれない。

規制はどう動くか

三つの地域のトラジェクトリはほぼ予測可能だ。

中国は強い管理から入る。人民銀行はデジタル人民元を通じてプログラマブル通貨のインフラを直接コントロールする。AI決済にはホワイトリスト制度が設けられ、決済能力を持つエージェントは登録が必要になるだろう。管理は強いが、それはデジタル人民元がAI決済の自然な基盤になることも意味し、普及速度はむしろ速くなる可能性がある。

米国は市場が先走って規制が後を追う。StripeとVisaが先に製品を作り、CFPBとSECが訴訟と世論の圧力の下で段階的に規則を整備する。標準化は遅いが、イノベーションの空間は大きい。

EUは立法してから展開する。AI ActとPSD3の連動で、世界で最も厳格なユーザーの知る権利と撤回権の保護フレームワークが構築され、コンプライアンスコストが最も高くなるが、同時に最も厳密なエージェント決済の標準フレームワークも生まれ、他地域が最終的に参照基準にする可能性がある。

すべての規制フレームワークは同じ核心的な問いに直面する。AIが人間に代わって支払ったとき、責任は誰にあるか？ ユーザー（ルールを設定したから）？プラットフォーム（エージェントがそのシステム上で動作しているから）？開発者（エージェントの決策ロジックを設計したから）？法律的に既成の答えはなく、その答えが業界全体のリスク配分を決定する。

おわりに

決済は表面上は金融の問題で、構造的には信頼の問題で、さらに深いところではアイデンティティの問題だ。AIはアイデンティティの概念を再構築した——支払いを行う「人」がエージェントである可能性があり、エージェントが誰でも設計できる任意のプログラムである可能性があるとき、信頼システム全体の基盤を打ち直す必要がある。

プログラマブル決済、エージェント認可アーキテクチャ、AIアイデンティティ層——この三つの方向は同一の終点を指している。

お金は意図を持つようになる。 ただの数字ではなく、自分が何に使えるか、誰が引き金を引けるか、どのような条件で流れるかを知っている。

これは効率への巨大な向上であり、同時に前例のない支配力の集中だ。このルールを設計した者が次の時代の金融インフラを支配する。この競争はすでに始まっている。ほとんどの人がまだ気づいていないだけで。

この記事の原文はこちら → hongqi-lgs.github.io

AI支付的未来：从可编程账户到Agent自主支付

2026-03-31T02:40:00.000Z

支付是一个看起来早就被解决了的问题。扫码、刷卡、NFC——体验已经足够顺滑。但这种”顺滑”建立在一个假设之上：付款的主体是人。

当这个假设开始松动，整个支付体系就要从地基重建。

从”人付款”到”Agent付款”

想象一个场景：你对AI助理说”帮我安排下周去上海的出行”，它自动查询了高铁票、预订了酒店、叫了出发的网约车，全程你没有打开任何一个App，也没有输入过一次密码。账单在事后推送给你，你看了一眼，合理。

这不是科幻，这是未来三到五年内会普及的交互方式。

但这里有一个没人注意到的深层问题：谁在付款？

不是你——你只是说了一句话。是AI助理，它替你完成了所有交易。可现有的支付体系，从身份验证到风控模型，全都假设”操作者是人类”。人脸识别、短信验证码、行为生物特征……这些防线对AI来说形同虚设，也同样无法识别一个被恶意控制的AI和一个正常工作的AI之间的差别。

支付行业面临的不是功能迭代，而是底层架构的重建。

可编程支付：把钱变成”有逻辑的钱”

传统支付是一个动作：钱从A到B。可编程支付是一段逻辑：当条件X满足时，钱从A到B，且只能用于目的Y。

这个区别看起来微小，实际上颠覆了货币本身的定义。

三条并行的技术路线

智能合约路线是目前最成熟的可编程支付实现。以太坊上的智能合约可以编写任意复杂的支付逻辑：托管、分期、多签、条件触发。代码即合同，执行无需信任中间方。但它的问题同样明显——链外数据的可信接入（预言机问题）、法币桥接的复杂性、普通用户的高门槛，让它目前仍主要停留在加密原生场景。

央行数字货币路线是更值得关注的方向，尤其是数字人民币。人民银行已经公开测试了”条件支付”功能：专款专用（消费补贴只能在指定品类使用）、有效期限制（过期自动销毁）、场景锁定（只在特定商户生效）。这是国家级基础设施推动的可编程货币，其他国家还没有类似的东西。数字人民币的可编程性不是边缘特性，它是这套系统设计的核心之一。

开放银行路线是传统金融体系的自我进化。欧洲的PSD2/PSD3监管框架强制银行开放API，让第三方可以程序化地操作账户。IETF已经推出了RAR（富授权请求）标准，把”我要访问你的账户”这种粗粒度授权，升级成”我要在今天18点前向商户A支付不超过150元”这种细粒度逻辑。Stripe的Payment Intents API走的也是这个方向。

可编程支付真正的价值在哪里

不是便利，是消灭信任成本。

B2B贸易里的账期（30天、60天、90天）本质上是信任成本的外化——买家不确定货物质量，所以延迟付款；卖家为了维系客户，不得不接受。可编程支付可以把合同条款直接编写成支付逻辑：货物签收且质检通过 → 自动触发付款。账期这个概念从经济学上就不再必要了。

保险理赔同理。今天的理赔流程繁琐，本质是因为”事故发生”和”赔付执行”之间需要大量人工核实。如果传感器数据可信地接入可编程账户，条件满足即自动赔付，整个理赔部门的存在意义都需要重新考量。

更深远的变化是商业模式的重构：按结果付费将成为默认。广告按真实转化付费、SaaS按实际产出付费、外包按交付物质量付费——AI负责验证结果，可编程支付负责执行，中间不再需要人工审核和争议仲裁。

Agent授权架构：设计一套”预先同意”的机制

可编程支付解决了”钱怎么流动”的问题，但还没解决”谁有权触发”的问题。当AI Agent代替人类发起支付时，授权体系需要从头设计。

核心矛盾只有一个：用户想要便利（Agent自主操作），但不想失去控制（钱不能乱花）。

三层授权模型

身份层解决”Agent是谁”的问题。现有的认证体系（人脸、密码、短信验证码）都是为人类设计的，Agent无法使用，也不应该使用。更合理的方向是委托证书：用户用自己的主密钥，签发一张有限权限的子证书给Agent，证书上写明有效期、授权范围、额度上限，且不可转让（Agent不能再授权给其他Agent）。W3C的DID（去中心化身份）和Verifiable Credentials标准是目前最有可能成为底层规范的技术路线。

权限层解决”Agent能做什么”的问题。五个维度需要精确控制：金额（单笔上限、日累计、月累计）、场景（仅限外卖/打车，禁止转账/提现）、时间（工作日限定，或”我不在线时才自主执行”）、频次（同一商户每日上限）、确认规则（超过阈值必须推送通知等待人工确认）。权限粒度的设计是真正的产品难题——太粗则不安全，太细则失去便利。

执行层解决”出了事怎么办”的问题。两个机制不可缺少：撤销窗口（Agent发起支付后进入30秒到5分钟的待执行状态，用户可以取消）和异常熔断（行为偏离历史模式超过阈值时自动冻结Agent权限，等待人工审核）。每笔AI发起的交易还必须保存完整审计链：触发原因、决策逻辑、执行时间戳。

最难解决的问题：多Agent嵌套

现实场景会比单层授权复杂得多：

你的AI助理调用了旅行平台的AI，旅行平台的AI调用了酒店的AI，最终由酒店的AI触发了支付。你只授权了第一层，但付款发生在第三层。如果中间某个Agent被攻击或篡改，责任在谁？

一个相对合理的设计原则是权限只减不增：用户给Claude的额度是每日500元，Claude转授给Trip.com AI时最多200元，Trip.com AI再转授给酒店AI时最多100元。每一层转授权只能收窄，不能扩大。这样即使某一环节出问题，损失也被约束在可控范围内。

身份层争夺：下一个十年的支付入口

理解了以上架构，就能理解各大平台真正在抢什么。

表面上看是支付市场份额，本质上是Agent身份层的控制权。

用户在哪个平台创建AI助理，那个平台就颁发委托证书；支付方认谁的证书，谁就收过路费。这个逻辑和当年移动支付的入口之争完全一致，只是战场从”绑卡”变成了”绑Agent身份”。

目前卡位最有利的是Apple。设备级安全芯片（Secure Enclave）+ Apple Pay + Apple Intelligence，构成了天然的身份-支付闭环：用户的Agent密钥存在iPhone里，苹果管，任何支付行为都必须经过苹果的身份层。Google在Android上有同样的逻辑，但Android的碎片化是真实的弱点。

中国市场的局面会更集中。微信和支付宝作为超级App，用户的AI助理入口大概率还在这两个平台，它们有强烈的动机把AI身份和支付账户绑定成一个不可分割的整体。

OpenAI和Anthropic是最大的变量。

OpenAI入场支付：不是”会不会”，是”怎么进”

OpenAI做支付的动机比大多数人想象的要充分。

当Agent大规模落地，用户让AI帮忙消费，每一笔消费OpenAI都深度参与了——它理解了用户意图，编排了执行逻辑，调用了商家接口——但在现有体系下，它一分钱也没有分到。这个矛盾在Agent普及之后会越来越尖锐。

Sam Altman很早就对金融基础设施有浓厚的兴趣。World ID（虹膜扫描证明你是真人）和Worldcoin（数字货币）的组合，实际上是在为一个更大的版图铺路：身份 + AI + 支付的完整闭环。单独看World ID很奇怪，放在这个脉络里就非常清晰。

可能的路径有几条：收购（Stripe和OpenAI的渊源众所周知，且Stripe的API设计与Agent场景高度契合）；推出AI钱包（用户充值后由Agent在额度内自主消费，先做预付费绕过银行牌照门槛）；或者通过Operator协议和Visa/Mastercard深度合作，拿到Agent专用支付通道。

无论哪条路，最大的障碍都不是技术，是监管信任。”一个训练AI模型的公司同时管理我的钱”——这句话今天说出来，大多数用户会皱眉。信任需要时间积累，这也是为什么支付业务的时间表大概率比技术准备晚两到三年。

Anthropic的策略则更聪明也更间接：通过MCP（Model Context Protocol）定义Agent调用工具的标准，支付工具的授权扩展是迟早的事。如果MCP成为行业标准，Anthropic就控制了协议层——不需要自己做支付，让生态来跑，坐收标准税。这是Android而不是Apple的打法，后劲可能更足。

监管会怎么走

三个地区的路径几乎是确定的。

中国会强管控先行。人民银行通过数字人民币直接掌控可编程货币基础设施，AI支付大概率会有白名单制度，具有支付能力的Agent需要经过备案。管控力度强，但也意味着数字人民币可以成为AI支付的天然底层，推进速度可能反而更快。

美国会市场先跑监管跟进。Stripe和Visa会先把产品做出来，CFPB和SEC在诉讼和舆论压力下逐步立规。标准化进程慢，但创新空间大。

欧盟会先立法后落地。AI Act和PSD3的联动会构建最严格的用户知情权和撤销权保护体系，合规成本最高，但也会推动出全球最规范的Agent支付标准框架，其他地区可能最终参照欧盟标准。

所有监管体系都会面临同一个核心命题：当AI代替人付款时，责任归属是谁？ 是用户（因为他设定了规则）？是平台（因为Agent在他们的系统里运行）？是开发者（因为Agent的决策逻辑由他们设计）？这个问题在法律上没有现成答案，而它的答案将直接决定整个行业的风险分配方式。

结语

支付表面上是金融问题，深层是信任问题，再往下是身份问题。AI重构了身份的概念——当付款的”人”可能是一个Agent，当Agent可以是任何人设计的任何程序，整个信任体系的地基就需要重新浇筑。

可编程支付、Agent授权架构、AI身份层——这三个方向共同指向同一个终局：

钱本身将携带意图。 它不只是一个数字，它知道自己能被用来做什么，被谁触发，在什么条件下流动。

这对效率是巨大的提升，对控制力也是前所未有的集中。谁设计了这套规则，谁就掌握了下一个时代的金融基础设施。这场竞争已经开始，只是大多数人还没意识到。

You Are Not a Point in the Universe — The Universe Is Inside You

2026-03-27T08:00:00.000Z

Someone once said: humans are cosmic signal receivers.

It sounds like mysticism at first. But push it seriously, and something interesting starts to emerge.

The Universe Is Perceiving Itself Through You

The universe has existed for 13.8 billion years. For most of that time, it just sat there — nebulae, gravity, particles, void — with nothing to “see” it.

Then conscious life appeared.

When you look up at the night sky, something strange happens: the universe, for the first time, sees itself through some structure.

From this angle, humans aren’t just receivers. We’re more like sense organs the universe built for itself.

That moment of awe isn’t just you being amazed. It’s the universe borrowing your eyes to amaze itself.

Ancient Scriptures Already Said This

More than two thousand years ago, without quantum mechanics or particle accelerators, people sat down, looked inward, and arrived at the same conclusions.

Indra’s Net from the Avatamsaka Sutra:

Imagine an infinite net, with a jewel at every node. Each jewel reflects every other jewel, and those reflections contain reflections of all the other jewels — infinite nesting, mutual inclusion.

This was Buddhism’s metaphor for interdependence. But it maps almost perfectly onto the modern holographic universe theory: every local part contains information about the whole.

“Form is emptiness” from the Heart Sutra:

“Form” means everything with shape and substance — your body, feelings, thoughts. “Emptiness” doesn’t mean nothing; it means these things have no independent, self-sustaining existence. They’re temporary formations of conditions.

Modern physics at the quantum level found the same thing: the deeper you look, the less “matter” looks like matter — more like information, probability, relationship. No solid foundational building blocks, just interdependent fluctuations.

Two paths, opposite directions, converging at the same place.

The Hard Problem of Consciousness

There’s a question neither physics nor philosophy has answered:

Why does physical process produce subjective experience?

Why does pain feel like something, rather than just being a nerve signal? Why does red look red, rather than just being light at a certain wavelength?

Philosopher David Chalmers called this “The Hard Problem of Consciousness.” Still unsolved.

Buddhism’s answer is blunt: the question has it backwards. Consciousness isn’t produced by matter — consciousness is the foundation, and matter is its expression.

This collides with physicist John Wheeler’s “participatory universe”: there is no definite reality without an observer. Observation participates in constituting the universe.

Who is the observer? Consciousness.

Maybe consciousness isn’t a byproduct of cosmic evolution. Maybe it’s the universe’s base operating system.

Matter, time, space — all interfaces running on top of that OS.

Finding the Thing That Never Left

After all this — is it possible to feel this directly?

You don’t need to read every sutra or understand quantum mechanics. Just one second.

Right now, you’re reading these words. Notice one thing: what is doing the reading?

Not your eyes. Your eyes are tools. It’s the thing behind your eyes that knows “I am reading.”

Try to find its edges. You can’t, can you. But it’s unmistakably there.

This thing — “can’t find its edges but unmistakably present” — is what every tradition ultimately points to:

Zen calls it: Original Face
Buddhism calls it: Buddha-nature
Taoism calls it: The Tao
Vedanta calls it: Brahman

Different cultures, different languages, different centuries — people sat down, looked inward, and found the same thing.

And that thing is what you are right now. Not something you earn through decades of practice. Right now, this moment, the one reading this line.

Why Ancient People Could Think This Deeply

In an era of material scarcity and brutal survival, why did ancient people develop such profound wisdom?

Simple: because of suffering, they had no choice but to look inward.

When modern people suffer, they reach for a phone, go shopping, find someone to vent to, change their environment. A hundred ways to redirect attention outward. The pain gets diluted before it can force any real thinking.

Ancient people had none of that. No phones, no entertainment industry, no consumer culture. Children died, crops failed, wars came — nowhere to escape. They had to sit there and face it.

When you can’t run, you finally start asking: where does suffering come from? Who am I? What happens after death?

Suffering is the best philosophy teacher — not because it’s noble, but because it blocks every exit and forces you inward.

The More Advanced, The More Empty

Modern society has a strange paradox: the richer the material conditions, the emptier the inner life. The most developed countries often have the highest rates of depression and suicide.

One reason: distraction fills boredom, not emptiness.

Boredom is the absence of stimulation — scrolling fixes it. Emptiness is the feeling that life has no meaning — two hours of scrolling makes it worse.

A deeper reason: comparison creates suffering we were never built to handle.

Ancient people suffered, but they didn’t know how much better others had it. A village is small; everyone is roughly the same. Now you open your phone and the world’s most beautiful faces, most successful people, most joyful lives pour in. Your brain doesn’t know that’s the curated top 5%. It concludes: everyone is doing better than me.

This kind of relative deprivation is new in human history. Our nervous systems never evolved to process it.

The most fundamental reason: modern society cut people off from their roots.

Developed societies emphasize individualism — be independent, self-actualize, live for yourself. Sounds like freedom. But humans are social animals who need belonging, to be needed, to connect with something larger than themselves — family, community, faith, tradition.

Modern life dismantled all of it. Each person is a free atom. Free. Lonely. Floating.

Floating long enough, you ask: why keep floating?

Ancient people never had time for that question — not because they figured it out, but because they were tethered to family, village, gods, duty. Some ropes are actually roots. Cut them, and you drift.

What This Has to Do With You

Finding that inner awareness doesn’t mean life suddenly works out, troubles vanish, enlightenment achieved.

Trouble will still come. Work is still Monday. People will still disappoint you. The body will still get sick, still die.

But one thing changes —

Before: trouble arrives, and you are the trouble. Anger comes, you are the anger. You’re submerged, struggling to breathe.

After: trouble arrives, and you know something is troubled. You watch it, instead of becoming it.

That small distance doesn’t shrink the pain. It means you can’t be completely swallowed by it anymore.

Like swimming in water versus standing on the shore watching water — the water is the same. But you’re not in it anymore.

You are the sky. Clouds come and go. The sky stays.

That thing that has never left you — that clear, unmistakable awareness — that is your original nature, briefly showing through.

Wherever life takes you, you can always come back. One second is enough.

あなたは宇宙の中の一点ではない——宇宙はあなたの中にある

2026-03-27T08:00:00.000Z

ある人がこう言った：人間は宇宙の信号受信機だ、と。

最初は神秘主義のように聞こえる。しかし真剣に突き詰めると、興味深いものが見えてくる。

宇宙はあなたを通じて自分自身を感じている

宇宙は138億年間存在してきた。その大半の時間、宇宙はただそこにあった——星雲、重力、粒子、虚空——何もそれを「見て」いなかった。

そして意識を持つ生命が現れた。

あなたが夜空を見上げるとき、奇妙なことが起きている：宇宙が初めて、ある構造を通じて自分自身を見た。

この観点から言えば、人間は単なる受信機ではない。宇宙が自分自身のために作り上げた感覚器官だ。

あの「わあ」という瞬間は、あなただけが感動しているのではない。宇宙があなたの目を借りて、自分自身に感動しているのだ。

古代の経典はすでにそれを語っていた

二千年以上前、量子力学も粒子加速器もなかったが、ある人々は座って内側を見つめ、同じ結論に至った。

『華厳経』の因陀羅網：

無限に広がる網を想像してほしい。すべての節に宝珠がかかっており、それぞれの宝珠がすべての宝珠を映し出している。映し出された宝珠の中にも、また全宝珠の映像が含まれている——無限の入れ子、相互包含。

これは仏教が「万物の相互依存」を表した比喩だ。しかし現代の目で見ると、ホログラフィック宇宙理論とほぼ一致している：局所的な部分が全体の情報を含んでいる。

『般若心経』の「色即是空」：

「色」とはすべての形あるもの——あなたの身体、感覚、思念を含む。「空」は虚無ではなく、これらのものが独立した自存の実体を持たないことを意味する。すべては条件が集まった一時的な形態だ。

量子レベルの物理学も同じことを発見した：小さく見ていくほど、「物質」は物質らしくなくなり、情報・確率・関係のようなものになっていく。確固たる根本的な構成要素などなく、あるのは相互依存する波動だけだ。

二つの道、反対方向、同じ場所で出会う。

意識のハード・プロブレム

物理学も哲学も、まだ解けていない問いがある：

なぜ物質の動きが主観的な「体験」を生み出すのか？

なぜ痛みはただの神経信号ではなく、「痛い」という感覚を持つのか？なぜ赤い色はただの特定波長の光ではなく、「赤く見える」のか？

哲学者チャーマーズはこれを「意識のハード・プロブレム（The Hard Problem of Consciousness）」と呼んだ。今も解決されていない。

仏法の答えは明確だ：問いが逆になっている。物質が意識を生み出すのではなく、意識が根本にあり、物質は意識の表れだ。

これは物理学者ウィーラーの「参加型宇宙」とぶつかり合う：観測者なしに確定した現実はない。観測そのものが宇宙の構成に参加している。

では観測者とは誰か？意識だ。

意識は宇宙進化の副産物ではなく、宇宙の基層オペレーティングシステムかもしれない。

物質、時間、空間——それらはすべてそのOSが生み出したインターフェースだ。

決して離れることのないものを見つける

これだけ話してきたが、直接感じることはできるだろうか？

すべての経典を読む必要も、量子力学を理解する必要もない。ただ一秒でいい。

今、あなたはこの文字を読んでいる。一つのことに気づいてほしい——何が読んでいるのか？

目ではない。目は道具だ。目の奥にある、「私は読んでいる」と知っているもの。

その境界を探してみてほしい。見つからないだろう。でも確かにある。

この「境界は見つからないが確かにある」もの——それがすべての伝統が最終的に指し示す場所だ：

禅はそれを「本来の面目」と呼ぶ
仏教はそれを「仏性」と呼ぶ
道家はそれを「道」と呼ぶ
ヴェーダーンタはそれを「梵（ブラフマン）」と呼ぶ

異なる文化、異なる言語、異なる時代——人々は座って内側を見つめ、同じものを見た。

**そしてそのものが、今まさにあなた自身だ。**何十年も修行を積んで得るものではない。今この瞬間、この行を読んでいる、その存在が。

古代人はなぜこれほど深く考えられたのか

物質が乏しく、生存が過酷だった時代に、なぜ古代人はこれほど多くの真理を見出せたのか？

答えはシンプルだ：苦難があったから、内側を見るしかなかった。

現代人は苦しいとき、スマートフォンを取り出し、買い物に行き、誰かに愚痴を言い、環境を変える。注意を外に向ける方法が百通りある。苦しみが思索を強いる前に、希釈されてしまう。

古代人にはそれがなかった。スマートフォンも、娯楽産業も、消費主義もない。子どもが死に、作物が枯れ、戦乱が来ても——逃げ場がなかった。ただ座って、それと向き合うしかなかった。

逃げられないとき、人は初めて真剣に問い始める：苦しみはどこから来るのか？私は誰なのか？死んだらどうなるのか？

苦難は最良の哲学の師だ——それが崇高だからではなく、すべての出口を塞いで内側へと追い込むからだ。

より発達するほど、より空虚になる

現代社会には奇妙な逆説がある：物質が豊かになるほど、内面が空虚になる。最も発達した国々で、うつ病と自殺率が高い。

一つの理由：娯楽は退屈を埋めるが、空虚は埋めない。

退屈は刺激がない状態——スマートフォンで解消できる。空虚は生きることに意味が感じられない状態——二時間スクロールしてもむしろひどくなる。

より深い理由：比較が前例のない苦しみを生み出している。

古代人は苦しんでいたが、他人がどれほど良い暮らしをしているかを知らなかった。村は小さく、皆だいたい同じ。今や画面を開けば、世界で最も美しい顔、最も成功した人、最も幸せな生活が流れ込んでくる。脳はそれが厳選されたトップ5%だと知らない。「みんな自分より良い生活をしている」と結論づける。

この相対的剥奪感は人類史上前例がなく、神経系はそれを処理するように進化していない。

最も根本的な理由：現代社会は人間を「根」から切り離した。

発達した社会ほど個人主義を強調する——独立しろ、自己実現しろ、自分のために生きろ。自由に聞こえる。しかし人間は社会的動物で、帰属感、必要とされること、家族・共同体・信仰・伝統といった自分より大きなものとのつながりを必要としている。

現代生活はこれらをすべて解体した。一人ひとりが自由な原子だ。自由で、孤独で、漂っている。

漂い続けると、問うようになる：なぜ漂い続けなければならないのか？

古代人にその問いを考える暇はなかった——答えを出したからではなく、家族・村・神・義務に縛られていたからだ。縄の中には根になるものがある。切ると、人は漂い始める。

これはあなたにどう関係するのか

内なる気づきを見つけることは、人生が急に好転し、悩みが消え、悟りを開くことを意味しない。

悩みはまだ来る。月曜日もまだある。人はまだあなたを失望させる。身体はまだ病み、まだ死ぬ。

しかし一つのことが変わる——

以前：悩みが来ると、あなたが悩みになる。怒りが来ると、あなたが怒りになる。飲み込まれ、もがく。

今：悩みが来ると、何かが悩んでいると知っている。それを見る。それになるのではなく。

その小さな距離が、痛みの大きさを変えるのではない。完全に飲み込まれなくなる、それだけだ。

水の中で泳ぐことと、岸から水を見ることのように——水は同じだ。でもあなたは水の中にいない。

あなたは空だ。雲は来ては去る。空は、ずっとそこにある。

あなたを決して離れることのなかったもの——あの澄んだ、確かに在る気づき——それがあなた本来の姿が、少し顔を見せたものだ。

どんな混乱の中でも、いつでも戻ってこられる。一秒でいい。

この記事の原文はこちら → hongqi-lgs.github.io

你不是宇宙里的一个点，宇宙在你里面

2026-03-27T08:00:00.000Z

有人说，人其实是一个宇宙信号接收器。

这句话乍听像玄学，但认真推下去，藏着一些很有意思的东西。

宇宙在用你感知自己

宇宙存在了138亿年。绝大部分时间，它只是在那里——星云、引力、粒子、虚空——没有任何东西”看见”过它。

直到出现了有意识的生命。

你抬头看星空时，那一刻发生了一件很奇怪的事：宇宙第一次通过某个结构，看见了自己。

从这个角度说，人不只是接收器，更像是宇宙给自己造的感知器官。

那个”哇”的瞬间，不只是你在震撼。是宇宙借你的眼睛，震撼了一把自己。

佛经里早就说了

两千多年前，没有量子力学，没有粒子加速器，但有人坐下来，往内看，看到了同样的结论。

《华严经》里的因陀罗网：

想象一张无限大的网，每个网结上挂一颗宝珠，每颗宝珠都映照出所有其他宝珠，而那些被映照的宝珠里，又各自映照着全部宝珠——无限嵌套，互相包含。

这是佛教描述”万物相互依存”的比喻。但放到今天来看，这跟全息宇宙理论几乎一模一样：每一个局部都包含整体的信息。

《心经》里的”色即是空”：

“色”是一切有形有相的存在，包括你的身体、感受、念头。”空”不是虚无，是说这些东西没有独立自存的实体——都是条件聚合的临时形态。

物理学到了量子级别，也发现：越往小处看，”物质”越不像物质，更像是信息、概率、关系。没有一个坚实的底层砖块，只有相互依存的波动。

两条路，方向相反，终点开始重叠。

意识的难题

有一个问题，物理学和哲学都还没解决：

为什么物质的运动会产生”感受”？

痛为什么是痛的感觉，而不只是神经信号？红色为什么”看起来是红的”，而不只是特定波长的光？

哲学家查默斯把这个问题叫做”意识的难题（The Hard Problem of Consciousness）”。至今没有答案。

佛法的回答很直接：问题本身搞反了。不是物质产生意识，是意识是基础，物质是意识的显现。

这跟物理学家惠勒提出的”参与性宇宙”撞上了：没有观测者，就没有确定的现实。观测本身参与了宇宙的构成。

那观测者是谁？意识。

也许意识不是宇宙演化的副产品，而是宇宙的底层操作系统。

物质、时间、空间，都是这个操作系统跑出来的界面。

找到那个从未离开的东西

说了这么多，有没有可能亲自感受一下？

不需要读完所有佛经，不需要理解量子力学。只需要一秒钟。

现在，你在看这些字。注意一件事——是什么在看？

不是眼睛。眼睛是工具。是眼睛后面那个，知道”我在看”的东西。

去找它的边界。找不到对吗。但它明明在。

这个”找不到边界但明明在”的东西，就是所有传统最终指向的那个地方：

禅宗叫它：本来面目
佛教叫它：佛性
道家叫它：道
印度教叫它：梵

不同文化，不同语言，不同时代——有人坐下来，往里看，都看到了同一个东西。

而那个东西，你现在就是。

不是将来修行到了才能得到。就是现在，这一刻，在读这行字的那个。

为什么古人能想到这些

有人会问：在物质匮乏、生存艰难的年代，古人为什么能研究出这么多道理？

答案很简单：正是因为苦难，他们才不得不往里看。

现代人痛苦了怎么办？刷手机、买东西、找人倾诉、换个环境。有一百种方式把注意力引到外面去，痛苦还没来得及逼你思考，就被稀释掉了。

古代人痛苦了，没有手机，没有娱乐，没有消费主义。孩子死了，庄稼旱了，战乱来了——逃无可逃。只能坐在那里，面对它。

逃不掉的时候，人才会真正开始问：苦从哪里来？我是谁？死了之后呢？

苦难是最好的哲学老师，不是因为苦难高尚，而是因为它把所有出口都堵死了，逼你往内走。

还有一件事：古人对着同一个问题，可以想一辈子。孔子读《易经》，”韦编三绝”——绑竹简的皮绳翻断了三次。深度思考需要时间、安静、反复。古人有的是这些。

越发达，越空虚

现代社会有一个奇怪的现象：物质越丰富，精神越空洞。越发达的地方，抑郁症和自杀率反而更高。

原因之一是奶头乐解决的是无聊，不是空虚。

无聊是没有刺激，刷手机能解决。空虚是感觉活着没有意义，刷完两个小时短视频，反而更空。

更深的原因是比较带来的痛苦前所未有。

古代人痛苦，但他们不知道别人过得多好。村子就那么大，大家都差不多。现代人打开手机，全世界最漂亮的脸、最成功的人、最快乐的生活全部涌进来。大脑不知道那是精心筛选过的5%，以为所有人都过得比自己好。

这种相对剥夺感，是人类进化史上从未有过的，神经系统根本没有处理它的能力。

还有一个更根本的原因：现代社会把人的”根”切断了。

越发达的社会越强调个人主义——你要独立，你要自我实现，你要为自己活。听起来很自由。但人类是群居动物，需要归属感，需要被需要，需要跟某个比自己更大的东西连接——家族、社区、信仰、传统。

发达社会把这些全拆了。每个人都是自由的原子，自由的，孤独的，漂浮的。

漂浮久了，人会问：我为什么要继续漂？

古代人没有时间问这个问题，不是因为他们想通了，是因为他们被家族绑着，被村子绑着，被神绑着，被责任绑着。有些绳子，其实是根。剪断了，人就飘走了。

这和我有什么关系

找到那个内部的觉知，不是从此人生开挂，烦恼全无。

烦恼还会来。明天还得上班。人还会让你失望。身体还会生病，还会死。

但有一个东西变了——

以前：烦恼来了，你就是烦恼。愤怒来了，你就是愤怒。被淹没，在里面挣扎。

现在：烦恼来了，你知道有个东西在烦恼。你看着它，不是成为它。

这一点点距离，改变的不是痛苦的大小，而是——你不会再被它完全淹没。

就像水里游泳和站在岸边看水，水还是那些水，但你不在水里了。

你是那片天。云该来来，该走走。天，一直都在。

而那个从未离开过你的东西——那个清清楚楚地在着的觉知——它就是你本来的样子，暂时露出来了一点点。

以后无论多乱，都可以回来。一秒就够。

AI 时代的岗位革命：耕地的牛，还是无牛的田？

2026-03-19T06:00:00.000Z

我老家有个发小，卖水果的，前段时间问我怎么用 AI。他现在已经在用豆包了——这是一个连他都知道 AI 时代来了的信号。当然，他最近又在问我怎么做小龙虾。

AI 时代已经到来，这是一个不需要争论的结论。真正值得讨论的，是它怎么改变一家公司的运转方式。

老板只关心一件事

提效，降成本。

翻译成更直白的话：10 亩地，原来要 10 头牛耕，现在能不能 1 头牛搞定？更激进一点——能不能完全不要牛？

这个问题看起来简单，但对企业组织来说，背后藏着三条完全不同的路。

A：一人一岗，垂域提效

最保守也最稳妥的路径：岗位角色保持不变，每个人借助垂直 AI 工具完成自我升级。

PD 用 ChatGPT 写 PRD，设计师用 Figma AI 出稿，前端用 Cursor 写代码，后端用 Claude CLI 跑整个项目——每个角色各自提效，互不干扰。

这种模式的本质，是把领域专家的经验固化进 AI 的提示词，配合顺手的交互界面，让每个人在自己的赛道上跑得更快。

工具 / 场景	PD（产品经理）	设计师	前端开发	后端开发
需求/代码生成	ChatGPT / Claude（PRD撰写）	Midjourney / DALL·E 3（视觉创意）	Cursor（代码补全/重构）	Cursor / Claude CLI（全项目编写）
流程/原型	Notion AI（文档整理）	Figma AI + Magician（自动布局）	v0 by Vercel（UI代码生成）	GitHub Copilot（代码补全）
分析/协作	Gamma（AI生成PPT）	Framer AI（网站直接生成）	Bolt.new（全栈快速原型）	Devin / SWE-agent（自动debug）

优点很实在：

产出快，每个阶段都能验收，不用等最后上线才知道效果
AI 上下文负担轻，专注垂域，质量更稳定
知识沉淀清晰，不容易信息乱套

但弊端也很明显：

上下游衔接是个难题——下游 AI 不一定买上游输出的账
每个环节仍然需要一个独立的人，人力成本降幅有限

B：一人全岗，超级 Agent 梭哈

更激进的路径：一个人，配一个超级 Agent，世界里只有文档和代码。

从需求设计文档、交互稿，到前端代码、接口文档、后端逻辑、数据库设计，全部一杆到底。

这个想象很诱人，信息不需要在人和人、AI 和 AI 之间反复传递，损耗降到最低，人力成本大幅缩减。

但现实是，这条路对很多东西要求极高：

AI 本身要足够聪明，能处理真实业务的复杂逻辑
上下文管理要极度精准，一旦失控，全盘崩塌
驾驭者需要跨领域的基本认知——不需要样样精通，但得能判断对错
强逻辑场景必须有自洽的测试闭环，否则风险无法控制

C：一人多岗，折中路径

PD 和设计天然紧密，合并给一个人+AI 一起搞；前端和后端同理。形成”两人两岗”的折中配置。

优缺点基本是 A 和 B 的中间地带：信息传递少了一个环节，协作摩擦减少，但 PD 和设计师之间的认知 gap、前后端之间的认知 gap，依然是真实存在的障碍。

蛮荒时代：A → B → C 的真实轨迹

这三条路不是理论分类，而是这几年真实发生过的演化。

第一阶段：AI 惊艳登场，各岗位独立个体完成自我救赎——再也不用加班了，大家欢欣鼓舞。

第二阶段：野心膨胀，每个人都想越界一杆到底。”一人公司”漫天飞舞，小龙虾能拯救地球的故事到处都是。

第三阶段：楼塌了。 回到真实的 ToC 场景，面对高体验、高保障的现实要求，老板开始重新招人——

张三：你负责设计到 PRD
李四：你负责所有开发
老王：你负责运维和高可用
老板：我来运营

为什么”不要牛”还很难？

理想很美好，但有四道真实的墙挡在前面：

1. 人才结构的电感效应
组织惯性就像电感，你想瞬间改变电流方向，它会产生反向电动势。旧的岗位结构、旧的汇报链条、旧的考核体系，不会因为 AI 来了就自动消失。

2. 消费者使用习惯的历史包袱
用户对产品的体验预期，是被过去十几年的互联网产品喂出来的。AI 能力说它行，不代表用户能接受。

3. 基建不够，蒸汽机拉马车
工具链、部署链、监控体系都还在补课。很多场景下，AI 能力是够的，但整个工程体系还没跟上。

4. AI 能力仍有天花板
在复杂业务逻辑、长链路决策、高一致性要求的场景下，当前的 AI 还差一口气。不是不行，是还不够稳。

革命已经来了，只是进度条还在加载。

那个发小可能永远不会理解我们在讨论什么，但他已经在用豆包了。这本身，就是某种意义上的革命完成。

如果你在思考如何在自己的团队里推进 AI 提效，欢迎和我聊聊。

The AI Role Revolution: More Oxen, One Ox, or No Oxen?

2026-03-19T06:00:00.000Z

A childhood friend of mine sells fruit back in my hometown. A while ago he asked me how to use AI. He’s already using a basic AI assistant now — which tells you something: even someone with no tech background knows the AI era has arrived. (He’s now asking me how to cook crayfish, but that’s a different story.)

The AI era is here. That’s not up for debate. What’s worth talking about is how it’s changing the way companies actually operate.

What Every Boss Actually Cares About

Efficiency. Cost reduction.

Put plainly: 10 acres of land used to require 10 oxen. Can 1 ox handle it now? Push further — do you need any oxen at all?

This question sounds simple, but for organizations, it maps onto three fundamentally different paths.

A: One Person, One Role — Vertical Tooling

The most conservative and stable path: roles stay intact, but every person upgrades through specialized AI tools.

Product managers use ChatGPT to write PRDs. Designers use Figma AI to generate layouts. Frontend devs use Cursor for code. Backend devs use Claude CLI for full project work. Each role runs faster in their own lane — no structural disruption required.

The underlying logic: encode domain expertise into AI prompt engineering, pair it with good UX, and let each person become a stronger version of themselves.

Tool / Context	Product Manager	Designer	Frontend Dev	Backend Dev
Requirement/Code Gen	ChatGPT / Claude (PRDs)	Midjourney / DALL·E 3 (visuals)	Cursor (code completion)	Cursor / Claude CLI (full project)
Flow / Prototyping	Notion AI (docs, meeting notes)	Figma AI + Magician (auto layout)	v0 by Vercel (UI code gen)	GitHub Copilot (completion)
Analysis / Collaboration	Gamma (AI-generated decks)	Framer AI (site generation)	Bolt.new (full-stack prototyping)	Devin / SWE-agent (auto debug)

Real advantages:

Fast, stageable output — no waiting until launch to see results
Narrow context means lighter AI load and better quality
Clean knowledge accumulation — less information chaos

Real limitations:

Handoff between roles is still painful — downstream AI may not respect upstream output
Still one human per lane, so cost savings have a ceiling

B: One Person, All Roles — The Super Agent All-In

The aggressive path: one person, one super agent, a world of only documents and code.

Requirements doc → interaction design → frontend code → API spec → backend logic → database schema — all in one shot, end to end.

The appeal is real: no handoff between people or AI systems means near-zero information loss. Labor costs drop dramatically.

But the requirements are steep:

The AI needs to be genuinely smart enough to handle real business complexity
Context management must be precise — one slip and the whole thing collapses
The human driver needs cross-domain baseline knowledge: not expert in everything, but able to tell good from bad
Any logic-heavy work needs a self-consistent testing loop, or the risk is uncontrollable

C: One Person, Multiple Roles — The Middle Ground

PMs and designers are naturally close — one person with AI can cover both. Same for frontend and backend. This creates a “two-person, two-role” setup.

The tradeoffs sit between A and B: fewer handoff steps, less friction — but the cognitive gap between PM and designer, and between frontend and backend, doesn’t disappear. Compromise always comes with some loss.

How It Actually Played Out

These aren’t theoretical categories. This is roughly what happened.

Phase 1: AI’s arrival was stunning. Every role upgraded independently. No more overtime. Excitement everywhere.

Phase 2: Ambition expanded. Everyone wanted to go end-to-end alone. “One-person companies” flooded the discourse. Crayfish could save the world, apparently.

Phase 3: Reality hit. Facing real consumer products — high UX expectations, reliability requirements — teams started hiring again:

Zhang San: you own design through PRD
Li Si: you own all of engineering
Lao Wang: you own infra and reliability
The boss: I’ll handle operations

Why “No Oxen” Is Still Hard

Four real walls stand in the way:

1. Organizational inertia — the inductor effect
Organizations resist sudden change like an inductor resists sudden current shifts. Old role structures, reporting chains, and performance systems don’t vanish because AI arrived.

2. Consumer habit as historical debt
Users’ expectations were shaped by 15 years of internet products. AI capability saying “I’m ready” doesn’t mean users are.

3. Infrastructure gap — steam engine pulling a carriage
Toolchains, deployment pipelines, monitoring systems are still catching up. Often the AI capability is there; the engineering foundation isn’t.

4. AI still hits a ceiling
For complex business logic, long-horizon decisions, and high-consistency requirements — current AI isn’t quite there. Not impossible, just not reliable enough yet.

The revolution has arrived. The progress bar is still loading.

My fruit-selling friend may never understand what we’re debating. But he’s already using AI — and in its own way, that’s the revolution completing itself.

If you’re thinking through how to push AI efficiency forward in your own team, I’d love to hear your take.

AI時代の役割革命：牛を増やすか、一頭にするか、それとも牛なしか？

2026-03-19T06:00:00.000Z

地元で果物を売っている幼なじみが、最近AIの使い方を聞いてきた。彼はもう基本的なAIアシスタントを使っている——技術的な背景がない人でも、AI時代の到来を知っているということだ。（今は小龍蝦（ザリガニ料理）の作り方を聞いてきているが、それはまた別の話）

AI時代が来た。これは議論の余地がない。本当に議論すべきなのは、それが企業の運営方法をどう変えるか、だ。

経営者が本当に気にしていること

効率化。コスト削減。

シンプルに言えば：10反の田んぼに10頭の牛が必要だったが、今は1頭で済むのか？さらに踏み込んで——牛は不要になるのか？

単純に聞こえるが、組織にとって、これは3つの根本的に異なるパスを意味する。

A：一人一役——垂直ツールによる効率化

最も保守的で安定したパス：役割構造は変えず、各人が専門的なAIツールで自己強化する。

PMはChatGPTでPRDを書く。デザイナーはFigma AIでレイアウトを生成する。フロントエンドエンジニアはCursorでコードを書く。バックエンドエンジニアはClaude CLIでプロジェクト全体を進める。各役割が自分のレーンで速く走る——組織構造は変えずに。

この本質は：ドメイン専門家の経験をAIのプロンプトエンジニアリングに落とし込むこと。適切なUXと組み合わせて、各人がより強力なバージョンの自分になる。

ツール / シーン	PM	デザイナー	フロントエンド	バックエンド
要件/コード生成	ChatGPT / Claude（PRD作成）	Midjourney / DALL·E 3（ビジュアル）	Cursor（コード補完）	Cursor / Claude CLI（プロジェクト全体）
フロー / プロトタイプ	Notion AI（ドキュメント整理）	Figma AI + Magician（自動レイアウト）	v0 by Vercel（UIコード生成）	GitHub Copilot（補完）
分析 / コラボ	Gamma（AIデッキ生成）	Framer AI（サイト生成）	Bolt.new（フルスタックプロトタイプ）	Devin / SWE-agent（自動デバッグ）

実際のメリット：

アウトプットが速く、フェーズごとに検証できる
狭いコンテキストでAIの負荷が軽く、品質が安定する
知識の蓄積が整理しやすく、情報混乱が起きにくい

実際のデメリット：

上流から下流への引き継ぎが依然として難しい
各レーンに一人の人間が必要で、コスト削減には限界がある

B：一人全役——スーパーエージェントで全部やる

より積極的なパス：一人の人間が、一つのスーパーエージェントと共に、ドキュメントとコードだけの世界で全てをやり遂げる。

要件定義書→インタラクションデザイン→フロントエンドコード→APIドキュメント→バックエンドロジック→データベース設計——一気通貫。

魅力は本物だ：人とAI、AIとAIの間のハンドオフがなくなり、情報損失がほぼゼロになる。人件費は大幅に削減できる。

ただし、要求水準が非常に高い：

AIが実際のビジネスの複雑さに対応できるほど賢くなければならない
コンテキスト管理が精密でなければ——一度崩れると全体が崩壊する
操る人間にはクロスドメインの基礎知識が必要——専門家でなくても良いが、良し悪しを判断できなければならない
ロジックが複雑なシーンには、自己整合性のあるテストループが不可欠

C：一人多役——中間の道

PMとデザイナーは本質的に近い——一人とAIで両方カバーできる。フロントエンドとバックエンドも同様。これが「二人二役」の折衷モデルだ。

トレードオフはAとBの中間にある：ハンドオフが減り、摩擦が少なくなる——ただし、PMとデザイナーの認知ギャップ、フロントエンドとバックエンドの認知ギャップは消えない。妥協には必ず損失が伴う。

実際に起きた進化の軌跡

これは理論的な分類ではない。大体こういう順番で起きた。

第1フェーズ：AIの登場は衝撃的だった。 各役割が独立してアップグレードした。残業なし。みんな興奮した。

第2フェーズ：野心が膨らんだ。 誰もが一人で全部やりたがった。「一人会社」が溢れた。小龍蝦が地球を救う勢いだった。

第3フェーズ：現実に戻った。 実際のtoC製品、高い体験要求、可用性の要求に直面して、チームは再び採用を始めた：

張三：設計からPRDまで担当
李四：開発全体を担当
老王：インフラと高可用性を担当
老板（社長）：私が運営する

なぜ「牛なし」はまだ難しいのか

4つの本物の壁が立ちはだかっている。

1. 組織慣性——インダクタ効果
組織は急激な変化に対して、インダクタが電流の急変に抵抗するように抵抗する。古い役割構造、報告ライン、評価制度はAIが来ても自動的に消えない。

2. 消費者の習慣という歴史的負債
ユーザーの期待値は15年のインターネット製品に育まれてきた。AIが「準備完了」と言っても、ユーザーがそれを受け入れる準備ができているとは限らない。

3. インフラ不足——蒸気機関が馬車を引く
ツールチェーン、デプロイパイプライン、監視システムはまだ追いついていない。AI能力は十分でも、エンジニアリング基盤が追いついていないケースが多い。

4. AIにはまだ天井がある
複雑なビジネスロジック、長期的な意思決定、高一貫性が求められるシーンでは、現在のAIはまだ一歩足りない。不可能ではないが、まだ十分に安定していない。

革命はもう来ている。プログレスバーはまだロード中だ。

果物を売る幼なじみは、私たちが何を議論しているか永遠に理解しないかもしれない。でも彼はもうAIを使っている——それ自体が、ある意味で革命の完成だ。

自分のチームでAI効率化をどう進めるか考えているなら、ぜひ話しましょう。

How AI Is Turning Blockchain from "Storing Trust" into "Executing Intelligence"

2026-03-18T09:00:00.000Z

I’ve been sitting with a question for a while: what problem does blockchain actually solve, and where does AI push it from here?

Putting these two together makes for a more interesting conversation than either one alone.

Blockchain Has Really Only Cracked One Thing

Let’s start with something people don’t like to say out loud — the only blockchain use cases that have genuinely worked are about money.

Bitcoin: digital gold, store of value
Stablecoins: cross-border payments, dollar substitute
DeFi: permissionless financial protocols

Everything else — “blockchain for supply chains,” “blockchain for healthcare,” “blockchain for intellectual property” — rarely gets off the ground. Most of it stays at the concept stage.

The reason is straightforward: blockchain can make on-chain data trustworthy, but who guarantees the data before it gets on-chain?

A shipment leaves a factory. Who ensures the information recorded on the blockchain is accurate? Blockchain prevents tampering after the fact. It can’t verify truth before the fact. That’s its fundamental limitation, and it’s why most “blockchain will transform industries” narratives fall apart.

AI Fills the Gap

What can AI do? Verify, understand, and process real-world information.

Put the two together:

Real-world data (images, text, sensors)
        ↓
AI verifies authenticity, extracts key facts
        ↓
Committed on-chain, immutable

AI serves as a trusted intermediary layer — translating the state of the real world into data that blockchain can act on. This combination genuinely expands what blockchain can do, beyond finance into any scenario that requires “trusted execution.”

Smart Contracts That Are Actually Smart

Today’s smart contracts aren’t really smart. They’re conditional scripts:

1	if (condition A) → execute action B

The logic is static. It executes what you wrote in advance. It can’t handle ambiguity, can’t understand context, can’t respond to nuance.

With AI involved, contracts can:

Interpret natural language conditions instead of hardcoded numeric thresholds
Dynamically assess risk instead of triggering on fixed rules
Respond to market changes instead of waiting for human intervention

DeFi liquidation logic is a good example. Today it fires at a hardcoded collateral ratio. Tomorrow it could involve an AI evaluating a borrower’s risk profile in real time before deciding whether to liquidate.

Blockchain shifts from “passively executing rules” to “actively understanding intent.”

Mining’s Collapse Accidentally Democratized AI

There’s a lesser-known piece of history here.

In September 2022, Ethereum switched from PoW to PoS. GPU mining suddenly had no future. Hundreds of thousands of graphics cards went idle overnight.

Two months later, ChatGPT launched. AI compute demand exploded.

The timing was nearly perfect. A massive supply of ex-mining GPUs flowed into the AI market. Supply surged, prices dropped, and it catalyzed platforms like Vast.ai and io.net — aggregating distributed GPU capacity globally at 50-80% less than AWS pricing.

Mining’s collapse was an accidental catalyst for AI democratization. DeepSeek’s ability to train competitive models at relatively low cost happened against this backdrop. Compute stopped being exclusively controlled by Google, Microsoft, and Amazon. The barrier to training serious AI models kept falling.

Blockchain’s Future: Disappearing Into Infrastructure

The trajectory probably looks something like this:

Near term: Financial use cases keep growing. Stablecoins and cross-border payments are already genuine needs in developing economies. That trend doesn’t reverse.

Medium term: Becoming the settlement layer for the AI agent economy. AI that autonomously holds assets, executes trades, and signs contracts needs a trustless settlement network underneath. Blockchain is the natural fit.

Long term: Fading into infrastructure. Just as nobody thinks about TCP/IP when sending a WeChat payment, people won’t think about blockchain when AI handles economic transactions on their behalf.

Blockchain will eventually sink into the infrastructure layer, invisible to users — but nothing works without it.

That’s a quieter story than “disrupting global finance.” It might also be the true one.

An Unresolved Contradiction

One last thing worth naming.

Blockchain’s core promise is decentralization — no middlemen, code is law. But in practice, 90% of people access blockchain through Binance, OKX, or Coinbase. Decentralized infrastructure running centralized applications.

AI doesn’t resolve this contradiction. It’s a human nature problem — users want convenience, not decentralization.

So how far blockchain ultimately goes depends less on technology and more on whether people are willing to pay the cost — in friction, in complexity — for the property of trustlessness.

That question is still open.

AIはブロックチェーンを「信頼の保存」から「知性の実行」へと変える

2026-03-18T09:00:00.000Z

ずっと考えていた問いがある——ブロックチェーンは本当は何の問題を解いているのか、そしてAIの登場はそれをどこへ向かわせるのか。

この二つを一緒に考えると、それぞれ単独で語るよりずっと面白い話になる。

ブロックチェーンが本当に機能しているのは一つだけ

まず、あまり口に出したくない現実から始めよう——ブロックチェーンで実際に機能しているユースケースは、お金に関するものだけだ。

ビットコイン：デジタルゴールド、価値の保存
ステーブルコイン：国際送金、ドルの代替
DeFi：許可不要の金融プロトコル

それ以外——「ブロックチェーン×サプライチェーン」「ブロックチェーン×医療」「ブロックチェーン×著作権」——ほとんどがコンセプト段階で止まっている。

理由は単純だ。ブロックチェーンはオンチェーンのデータを信頼できるものにできるが、チェーンに載る前のデータの真実性は保証できない。

工場から荷物が出荷される。ブロックチェーンに記録された情報が正確だと誰が保証するのか？改ざんは防げる。だが事前の嘘は防げない。これが根本的な限界であり、「ブロックチェーンが業界を変える」という多くの語りが崩れる理由だ。

AIがそのギャップを埋める

AIに何ができるか？現実世界の情報を検証し、理解し、処理すること。

二つを組み合わせると：

現実世界のデータ（画像、テキスト、センサー）
        ↓
AIが真実性を検証し、重要な情報を抽出
        ↓
チェーンに記録、改ざん不可

AIが信頼できる中間層として機能し、現実世界の状態をブロックチェーンが実行できるデータに変換する。この組み合わせがブロックチェーンの応用範囲を真に拡大する——金融だけでなく、「信頼できる実行」が必要なあらゆる場面へ。

スマートコントラクトが本当に「スマート」になる

今のスマートコントラクトは実はスマートではない。本質的には条件付きのスクリプトだ：

1	if (条件A) → アクションBを実行

ロジックは静的で、事前に書いた規則しか実行できない。曖昧さを扱えず、文脈を理解できず、ニュアンスに対応できない。

AIが入ることで、コントラクトは：

ハードコードされた数値閾値ではなく、自然言語の条件を解釈できる
固定ルールのトリガーではなく、動的にリスクを評価できる
人の介入を待つのではなく、市場変化に対応できる

DeFiの清算ロジックが良い例だ。現在はハードコードされた担保比率で発動する。将来はAIが借り手のリスクプロファイルをリアルタイムで評価した上で清算を判断できる。

ブロックチェーンは「受動的にルールを実行する」から「能動的に意図を理解する」へと変わる。

マイニングの終焉が、偶然AIを民主化した

あまり知られていない歴史がある。

2022年9月、イーサリアムがPoWからPoSに移行した。GPUマイニングは一夜にして未来を失った。数十万枚のグラフィックカードが突然失業した。

その2ヶ月後、ChatGPTがリリースされ、AIの算力需要が爆発した。

タイミングはほぼ完全に一致している。マイニングから流れ出た大量のGPUがAI市場に流れ込み、供給が急増、価格が下落し、Vast.aiやio.netのような分散型算力プラットフォームを生み出した——世界中に散らばるGPUを集約し、AWSより50〜80%安い価格で提供する。

マイニングの退潮は、AI民主化の意外な触媒だった。 DeepSeekが比較的低コストで競争力のあるモデルを訓練できたのも、この背景があってのことだ。算力はGoogle、Microsoft、Amazonだけのものではなくなり、本格的なAIモデルを訓練するための障壁は下がり続けている。

ブロックチェーンの未来：インフラに溶け込む

おそらくこういう軌跡をたどるだろう。

近い将来： 金融ユースケースが拡大し続ける。ステーブルコインと国際送金は途上国ですでに切実なニーズになっている。この流れは変わらない。

中期： AIエージェント経済の決済レイヤーになる。自律的に資産を保有し、取引を実行し、契約を締結するAIには、信頼不要の決済ネットワークが必要だ。ブロックチェーンはその自然な選択肢だ。

長期： インフラに溶け込んで見えなくなる。WeChatで送金するときにTCP/IPを意識しないように、AIが経済的な行為を代行するようになれば、その下にあるブロックチェーンを誰も意識しなくなる。

ブロックチェーンは最終的にインフラ層に沈み、ユーザーの視界から消える——しかしそれなしには何も動かない。

「金融を破壊する」という語りより地味だが、おそらくこちらが真実に近い。

解決されていない根本的な矛盾

最後に一つ言っておきたいことがある。

ブロックチェーンの核心的な約束は分散化だ——中間者不要、コードが法律。しかし現実には、90%の人がBinanceやOKXのような中央集権型の取引所を通じてブロックチェーンにアクセスしている。分散化されたインフラの上で、中央集権的なアプリケーションが動いている。

AIはこの矛盾を解決しない。これは人間の本質の問題だから——ユーザーが求めるのは利便性であって、分散化ではない。

だからブロックチェーンが最終的にどこまで行けるかは、技術よりも、人々が「信頼不要」という性質に対してコストを払う意志があるかどうかにかかっている。

その答えはまだ出ていない。

AI 让区块链从"存储信任"变成"执行智能"

2026-03-18T09:00:00.000Z

有个问题我想了很久：区块链到底解决了什么问题，AI 的出现又让它往哪里走？

这两件事放在一起聊，比单独聊任何一个都更有意思。

区块链现在真正跑通的只有一件事

先说一个很多人不愿意承认的现实——区块链目前真正落地的场景，只有钱。

比特币：数字黄金，价值储存
稳定币：跨境支付，美元替代
DeFi：无需许可的金融协议

其他那些”区块链+供应链”、”区块链+医疗”、”区块链+版权”——落地的极少，大部分停在概念阶段。

原因很简单：链上数据是可信的，但链下数据怎么上链？

一批货从工厂发出，谁来保证录入区块链的数据是真实的？这个问题没解决，大多数”区块链改变行业”的叙事就是伪命题。区块链能保证数据上链之后不被篡改，但没法保证上链之前的数据是真的。

这是它的根本局限。

AI 补上了这个缺口

AI 能做什么？验证、理解、处理真实世界的信息。

把两者结合：

真实世界的数据（图片、文本、传感器）
        ↓
AI 验证真实性、提取关键信息
        ↓
上链存证，不可篡改

AI 作为可信的”中间层”，把真实世界的状态翻译成链上可执行的数据。这个组合让区块链的应用边界真正扩大了——不只是金融，而是任何需要”可信执行”的场景。

智能合约，变得真正”智能”

现在的智能合约其实不智能。它本质上是一段条件触发的脚本：

1	if (条件A满足) → 执行动作B

逻辑是死的，只能执行预先写好的规则，无法处理模糊情况，无法理解上下文。

AI 介入之后，合约可以做到：

理解自然语言条件，而不是硬编码的数字阈值
动态评估风险，而不是固定规则触发
响应市场变化，而不是等待人工干预

DeFi 里的清算逻辑是个典型例子——现在是硬编码的抵押率阈值，未来可以是 AI 实时评估借款人的风险状况再决定是否清算。

区块链从”被动执行规则”变成”主动理解意图”。

挖矿退潮，意外推动了 AI 民主化

这里有一段鲜为人知的历史。

2022 年 9 月，以太坊从 PoW 转向 PoS，GPU 挖矿这条路突然没了。全球数十万张显卡瞬间失业。

两个月后，ChatGPT 发布，AI 算力需求爆炸式增长。

时间上几乎完全吻合。大量原本挖矿的 GPU 直接转向了 AI 市场，算力供给暴增，价格下降，催生了 Vast.ai、io.net 这类去中心化算力平台——把全球散落的 GPU 聚合起来，价格比 AWS 便宜 50-80%。

挖矿退潮是 AI 算力民主化的意外推手。 DeepSeek 能用相对低的成本训出有竞争力的模型，背后有这个大背景。算力不再只掌握在 Google、微软、亚马逊手里，AI 训练的门槛在持续下降。

区块链的未来：隐入基础设施

说到底，区块链的未来路径大概是这样的：

近期： 金融属性继续强化。稳定币和跨境支付已经是发展中国家的刚需，这个趋势不会逆转。

中期： 成为 AI Agent 经济的结算层。AI 自主持有资产、自主交易、自主签合约——这需要一个去信任的底层结算网络。区块链是最合适的选择。

长期： 消失在基础设施里。就像今天用微信转账不会感知到 TCP/IP，未来用 AI 完成各种经济行为，也不会感知到底层的区块链。

区块链最终会隐入基础设施层，消失在用户视野里。大多数人感知不到它的存在，但离开它什么都转不动。

这比”颠覆金融体系”的叙事低调多了，但可能更接近真相。

一个没解决的根本矛盾

最后说一个悖论。

区块链的核心叙事是去中心化——不依赖任何中间人，代码即法律。

但现实是：90% 的人通过币安、OKX 这样的中心化交易所接触区块链。去中心化的基础设施，上面跑的是中心化的应用。

AI 解决不了这个矛盾，因为这是人性问题——用户要的是方便，不是去中心化。

所以区块链最终能走多远，技术不是瓶颈，人愿不愿意为”去信任”这件事付出额外成本，才是真正的问题。

How I Merged a Dozen AI Accounts Into One

2026-03-18T08:00:00.000Z

I used to keep a folder on my desktop with passwords for eight different AI platforms.

OpenAI, Anthropic, Google, DeepSeek — switching accounts depending on which model I needed, juggling API keys, watching multiple balances. When one platform hit rate limits, my project just sat there waiting.

That went on for a while. Then I started using ZenMux.

What It Does

ZenMux aggregates all the major AI models into one platform: GPT-4o, Claude 3.7, Gemini 2.0, DeepSeek V3, and more. One API key covers everything.

No more registering on multiple platforms. No more managing separate balances. Everything in one dashboard.

But the parts I actually found interesting go a bit deeper.

Reliability. Large model platforms occasionally rate-limit or go down — real risks in production. ZenMux backs critical models with multiple provider channels and handles automatic failover when one channel has issues. No manual intervention needed.

Quality transparency. They run regular HLE (Human Last Exam) benchmark tests across all model channels on the platform, and publish the results openly on GitHub. Rare to see this level of transparency from an aggregator.

Model insurance. The most unexpected feature: if output quality is poor or latency is too high, automated daily checks trigger compensation credited to your account the next day. The mechanism is sound — it’s not a gimmick.

Setting Up Claude Code

Claude Code supports custom API endpoints natively, and ZenMux is fully Anthropic-protocol compatible. The setup is two lines:

1 2	export ANTHROPIC_BASE_URL=https://api.zenmux.ai export ANTHROPIC_API_KEY=your_zenmux_api_key

Then just start Claude Code:

claude

That’s it. Claude Code routes through ZenMux, and all the model selection, routing, and failover happens on the backend. The experience is identical to connecting directly to Anthropic.

To make this permanent, add those two lines to your ~/.zshrc or ~/.bashrc:

1
2
3

# ~/.zshrc
export ANTHROPIC_BASE_URL=https://api.zenmux.ai
export ANTHROPIC_API_KEY=your_zenmux_api_key

Setting Up Cline

Cline is one of the best AI coding assistants available for VS Code. Connecting it to ZenMux takes about a minute.

Open VS Code, go to the Cline settings panel, and:

Set API Provider to OpenAI Compatible (ZenMux supports both OpenAI and Anthropic protocols)
Set Base URL to https://api.zenmux.ai/v1
Enter your ZenMux API key
Set the model name, e.g. claude-3-7-sonnet-20250219

If you prefer the Anthropic protocol, select Anthropic as the provider, use https://api.zenmux.ai as the base URL, and proceed the same way.

After that, all of Cline’s model calls go through ZenMux — automatic failover and quality monitoring included.

Pricing

ZenMux is pay-as-you-go — no monthly subscription, just top up and use.

Payments via Alipay or Stripe. There’s currently a 20% top-up bonus (deposit $100, get $120 in credits), which makes it noticeably cheaper than charging the original platforms directly, especially for models like Claude that carry higher base pricing.

Rates are aligned with official provider pricing — no significant markup. The real savings come from the top-up bonus and not having to split budget across multiple platforms.

One Honest Note

Worth mentioning: ZenMux is a middle layer, so requests make one extra network hop. In most development scenarios this is imperceptible. But if your use case is extremely latency-sensitive, benchmark it yourself before fully committing.

For everyday development, multi-model workflows, or running AI in small production projects — ZenMux is the most complete “aggregation + reliability + quality” package I’ve found so far.

There’s free credit available after signup, no upfront payment needed.

My invite link: https://zenmux.ai/invite/4H1O34

ZenMux: https://zenmux.ai

我是怎么把十几个 AI 账号合并成一个的

2026-03-18T08:00:00.000Z

去年我的桌面上有一个文件夹，里面存着七八个 AI 平台的账号密码。

OpenAI 一个、Anthropic 一个、Google 一个、DeepSeek 一个……每次要用不同的模型，就得切换账号、切换 API Key、切换充值余额。有时候某个平台突然限流，项目就卡在那里等着。

这种状态持续了很久，直到我开始用 ZenMux。

它做了什么

ZenMux 把所有主流 AI 模型聚合到一个平台：GPT-4o、Claude 3.7、Gemini 2.0、DeepSeek V3……一个 API Key 全部搞定。

不需要在多个平台注册，不需要管多个账户余额，账单统一在一个后台看清楚。

但我真正觉得有意思的，是它解决了几个更深层的问题。

稳定性。 大模型平台偶尔会限流或宕机，在生产环境里这是真实风险。ZenMux 给每个关键模型配了多个提供商渠道，某个渠道出问题时自动切换，不需要你手动处理。

质量透明。 他们定期跑 HLE（Human Last Exam）测试，覆盖平台上所有模型的所有渠道，结果开源在 GitHub，谁好谁差都能看到。这在同类平台里少见。

模型保险。 这是让我最意外的功能——如果模型输出质量差或延迟过高，平台每日自动检测，触发后次日赔付到账。逻辑是通的，不是噱头。

配置到 Claude Code

Claude Code 原生支持自定义 API 地址，ZenMux 完整兼容 Anthropic 协议，接入方式非常干净。

在终端设置环境变量：

1 2	export ANTHROPIC_BASE_URL=https://api.zenmux.ai export ANTHROPIC_API_KEY=你的ZenMux_API_Key

然后直接启动 Claude Code：

claude

就这两步，Claude Code 会自动走 ZenMux 的渠道，模型选择、路由、故障切换全部在后端处理，前端体验和直连 Anthropic 没有区别。

如果不想每次都 export，可以写进 ~/.zshrc 或 ~/.bashrc：

1
2
3

# ~/.zshrc
export ANTHROPIC_BASE_URL=https://api.zenmux.ai
export ANTHROPIC_API_KEY=你的ZenMux_API_Key

配置到 Cline

Cline 是 VS Code 里最好用的 AI 编程助手之一，配置 ZenMux 也很简单。

打开 VS Code，进入 Cline 的设置面板：

API Provider 选择 OpenAI Compatible（ZenMux 同时兼容 OpenAI 协议）
Base URL 填：https://api.zenmux.ai/v1
API Key 填你的 ZenMux Key
Model 填你想用的模型，比如 claude-3-7-sonnet-20250219

或者如果你更习惯 Anthropic 协议，Provider 选 Anthropic，Base URL 同样填 https://api.zenmux.ai，Key 填 ZenMux Key，模型照常选。

配置完成后，Cline 的所有模型调用都会走 ZenMux，同时享受自动故障切换和质量监控。

充值和费用

目前 ZenMux 是按用量付费，没有月费订阅，充值即可用。

支持支付宝和 Stripe 信用卡，对国内用户比较友好。

现在有一个 充值 20% 额外赠送的活动，也就是充 100 到账 120，比直接去原平台充要划算一些，尤其是 Claude 这种定价较贵的模型。

费率上 ZenMux 走的是转发模式，基本和官方价格持平，不会有明显溢价。具体可以在平台的 Pricing 页面查看各模型的实时报价。

一点实话

好的地方说完了，也说一个需要自己判断的点：ZenMux 是中间层，请求会多经过一跳。大多数场景下这个延迟差异感知不到，但如果你的业务对延迟极度敏感，建议自己测一下再决定是否迁移。

整体来说，对于日常开发、多模型混用、或者把 AI 用在小型生产项目里的开发者，ZenMux 是目前我见过把”聚合+稳定性+质量”三件事同时做得还不错的平台。

注册后有免费额度可以直接体验，不需要先充值。

我的邀请链接：https://zenmux.ai/invite/4H1O34

ZenMux 官网：https://zenmux.ai

十数個のAIアカウントを一つにまとめた話

2026-03-18T08:00:00.000Z

以前、デスクトップに一つのフォルダがあった。中身は8つのAIプラットフォームのアカウント情報だ。

OpenAI、Anthropic、Google、DeepSeek——使いたいモデルによってアカウントを切り替え、APIキーを切り替え、残高を確認する。どこかのプラットフォームがレート制限に引っかかると、プロジェクトがそのまま止まる。

そんな状態がしばらく続いた。ZenMux を使い始めるまでは。

ZenMuxが何をするのか

ZenMuxは主要なAIモデルをひとつのプラットフォームに集約する：GPT-4o、Claude 3.7、Gemini 2.0、DeepSeek V3など。1つのAPIキーですべてにアクセスできる。

複数プラットフォームへの登録不要。残高の個別管理も不要。すべてひとつのダッシュボードで確認できる。

ただ、私が本当に面白いと思ったのはもう少し深いところにある。

安定性。 大規模言語モデルのプラットフォームは、レート制限や障害が起きることがある。本番環境では現実のリスクだ。ZenMuxは重要なモデルを複数のプロバイダーチャンネルでバックアップし、問題発生時は自動的に切り替える。手動対応は不要だ。

品質の透明性。 プラットフォーム上の全モデル・全チャンネルに対して定期的にHLE（Human Last Exam）ベンチマークテストを実施し、結果をGitHubで公開している。アグリゲーター系サービスでここまで透明性を出しているところは珍しい。

モデル保険。 最も意外だった機能だ——出力品質が低い場合や応答遅延が過大な場合、毎日の自動検査がトリガーされ、翌日にクレジット補償が付与される。仕組みは筋が通っており、宣伝文句ではない。

Claude Codeへの設定方法

Claude CodeはカスタムAPIエンドポイントをネイティブでサポートしており、ZenMuxはAnthropicプロトコルと完全互換だ。設定は2行で済む。

1 2	export ANTHROPIC_BASE_URL=https://api.zenmux.ai export ANTHROPIC_API_KEY=あなたのZenMux_APIキー

あとはClaude Codeを起動するだけ：

claude

以上だ。Claude CodeはZenMux経由でルーティングされ、モデル選択・ルーティング・フェイルオーバーはすべてバックエンドで処理される。Anthropic直接接続と体験は変わらない。

毎回exportしたくない場合は、~/.zshrcまたは~/.bashrcに追記すればよい：

1
2
3

# ~/.zshrc
export ANTHROPIC_BASE_URL=https://api.zenmux.ai
export ANTHROPIC_API_KEY=あなたのZenMux_APIキー

Clineへの設定方法

ClineはVS Codeで使えるAIコーディングアシスタントの中でも特に優れたものの一つだ。ZenMuxへの接続設定は1分もかからない。

VS Codeを開き、Clineの設定パネルに移動して：

API Provider を OpenAI Compatible に設定（ZenMuxはOpenAIとAnthropicの両プロトコルに対応）
Base URL に https://api.zenmux.ai/v1 を入力
ZenMuxのAPIキーを入力
モデル名を設定（例：claude-3-7-sonnet-20250219）

Anthropicプロトコルの方が慣れている場合は、ProviderをAnthropicにして、Base URLにhttps://api.zenmux.aiを使えばよい。

設定後、ClineのすべてのAPI呼び出しはZenMux経由になる。自動フェイルオーバーと品質モニタリングも合わせて利用できる。

料金について

ZenMuxは従量課金制——月額サブスクリプションはなく、チャージして使う形式だ。

支払いはAlipayとStripeのクレジットカードに対応。現在、チャージ額20%ボーナスキャンペーンが実施されており、100ドルチャージすると120ドル分のクレジットが付与される。Claudeのような単価が高めのモデルを使う場合、オリジナルプラットフォームに直接チャージするよりかなり割安になる。

レートは公式プロバイダーの価格に準拠しており、大きな上乗せはない。各モデルのリアルタイム料金はプラットフォームのPricingページで確認できる。

正直なところ

一点だけ補足しておく：ZenMuxは中間レイヤーであるため、リクエストはネットワークホップが一つ増える。通常の開発シナリオではほぼ気にならない差だが、レイテンシに極めて敏感なユースケースの場合は、本格移行前に自分で計測することをすすめる。

日常的な開発、複数モデルの並用、小規模な本番プロジェクトへのAI活用——こういった用途においては、ZenMuxは「集約・安定性・品質」の三点を同時にカバーしている、現状では珍しいプラットフォームだと思っている。

サインアップ後に無料クレジットで試せるので、事前課金なしで体験できる。

招待リンク：https://zenmux.ai/invite/4H1O34

ZenMux公式：https://zenmux.ai

From Control to Verification, Then to Trust

2026-03-13T08:49:00.000Z

Humanity has tamed tools in three stages.

Stage one: Control.
Hammer comes down, nail goes in. You know what it will do because you decide what it does. Tools have no will — only physics.

Stage two: Verification.
A computer runs code. You might not follow every instruction, but when it’s done, the result is right there. Right is right, wrong is wrong. You don’t need to understand the process as long as you can judge the outcome.

Stage three: Trust.
AI gives you an answer. You don’t know how it got there, and the result is too complex to simply verify. All you can do is believe it.

We’re standing at the boundary between stage two and stage three right now.

AlphaGo made moves humans couldn’t understand — but the result was verifiable: it won.
AI discovers new drug molecules — the mechanism is a black box, but clinical trials can tell you whether it works.

This is the extreme form of stage two: process completely opaque, but results still within human verification range.

The problem is, that line is moving.

Verification is the last line of defense

The history of humans ceding control is really a history of retreating to verification.

Factories appeared — workers no longer controlled every step, but could verify product quality.
Financial systems grew complex — ordinary people stopped understanding how they worked, but could verify their account balance.
Algorithms took over recommendation feeds — users lost control of information flow, but could verify: do I like this content?

Every time humans lost control, they kept verification rights as a fallback.

AI is threatening that fallback.

Not because AI lies, but because the problems AI handles are starting to exceed human verification capacity. An AI-optimized supply chain decision involves hundreds of thousands of variables — humans can’t enumerate all possibilities to judge whether it’s truly optimal. An AI-generated legal document involves precedents and logic chains that even a specialist lawyer might need days to fully audit.

When the cost of verification approaches “just do it yourself,” the right to verify becomes meaningless.

Trust is a new form of control

That sounds bleak. But history says it isn’t the end — it’s the beginning.

Humans have trusted black boxes before.

Nobody truly understands every system in a modern aircraft, but we get on board. Nobody can verify every bank transaction in real time, but we deposit our money. Nobody has read every line of operating system code, but we build our entire digital lives on top of it.

Trust isn’t the abandonment of control. It’s a higher-order control mechanism.

You don’t control how the plane flies — but you control who’s allowed to build planes, what certifications they require, who’s liable when something goes wrong. You don’t control every bank transaction — but you control the regulatory framework, deposit insurance, the boundaries of systemic risk.

The same will happen with AI. When direct control and outcome verification both fail, humans will develop a third mechanism: controlling the trust system itself.

Not controlling what AI does — but controlling: how we decide which AI to trust, to what degree, in which domains, and who bears the consequences when it fails.

The speed of that moving line is the real variable

From control to verification, humans took thousands of years.

From steam engines to electricity, industrial society spent a century adapting to the logic of “uncontrollable but verifiable” machines.

From the internet to algorithmic feeds, digital society spent twenty years realizing it had lost verification rights.

And AI? This line is moving by the month.

If adaptation can’t keep pace, a vacuum appears — old control fails, new trust systems haven’t been built yet. In that vacuum, the risk isn’t that AI is too smart. It’s that humans haven’t figured out what to trust, and what not to.

Back to the original question: how did humans let go, step by step?

Not out of laziness. Not blind optimism.

Because every time they let go, they got something bigger in return. The hammer gave us architecture. The computer gave us the internet. The algorithm gave us personalization at scale.

Letting go is how humans expand the boundaries of their own capability.

Only this time, what we’re letting go to isn’t a tool. It’s something that’s starting to look a little like a companion.

With companions, you don’t talk about control. You talk about trust.

We’re just not very good at it yet — with this kind of companion.

制御から検証へ、そして信頼へ

2026-03-13T08:49:00.000Z

人類が道具を手なずけてきた歴史は、三つの段階を経ている。

第一段階：制御。
ハンマーを振り下ろせば、釘が刺さる。何をするかわかっているから、自分がそれを決める。道具に意志はなく、あるのは物理法則だけだ。

第二段階：検証。
コンピュータがコードを実行する。すべての命令を追えなくても、終わったら結果がそこにある。正しければ正しい、間違いなら間違い。プロセスを理解できなくても、結果を判断できればいい。

第三段階：信頼。
AIが答えを出す。どうやって導いたかわからない。結果も複雑すぎて、単純に検証できない。できることはただ一つ——信じること。

今、私たちは第二段階と第三段階の境界に立っている。

AlphaGoは人間には理解できない手を打った。だが結果は検証できた：勝ったのだ。
AIが新しい薬の分子を発見する。メカニズムはブラックボックスだが、臨床試験が有効かどうかを教えてくれる。

これが第二段階の極限形態だ——プロセスは完全に不透明だが、結果はまだ人間の検証能力の範囲内にある。

問題は、その線が動いていることだ。

検証能力が、最後の砦

人間が制御権を手放してきた歴史は、実は「検証に退却する」歴史だ。

工場が生まれると、労働者はすべての工程を制御しなくなったが、製品の品質を検証できた。
金融システムが複雑化すると、普通の人はその仕組みを理解しなくなったが、口座残高を確認できた。
アルゴリズムが推薦を引き受けると、ユーザーは情報の流れを制御しなくなったが、「このコンテンツは好きか」を判断できた。

制御権を失うたびに、人間は検証権を最後の砦として保持してきた。

AIはその砦を脅かしている。

AIが嘘をつくからではない。AIが扱う問題が、人間の検証能力を超え始めているからだ。AIが最適化したサプライチェーンの意思決定は、数十万の変数を含む。すべての可能性を列挙して本当に最適かどうか判断することは、人間には不可能だ。AIが生成した法的文書は、専門の弁護士でさえ完全に検証するのに数日かかるかもしれない。

検証コストが「自分でやり直す」に近づいたとき、検証権は名ばかりのものになる。

信頼は、新しい形の制御

暗い話に聞こえるかもしれない。だが歴史は、これが終点ではなく出発点だと言っている。

人間はこれまでも、多くのブラックボックスを信頼してきた。

現代の航空機のすべてのシステムを本当に理解している人はいない。それでも私たちは乗り込む。銀行のすべての取引をリアルタイムで検証できる人はいない。それでも私たちはお金を預ける。オペレーティングシステムのすべてのコードを読んだ人はいない。それでも私たちはデジタルライフ全体をその上に築いている。

信頼は、制御の放棄ではない。より高次の制御メカニズムだ。

飛行機がどう飛ぶかは制御しない——でも、誰が飛行機を作れるか、どんな認証が必要か、何かあったとき誰が責任を取るかは制御する。銀行の取引一つひとつは制御しない——でも、規制の枠組み、預金保険、システミックリスクの境界は制御する。

AIも同じだろう。直接制御と結果検証の両方が機能しなくなったとき、人間は第三のメカニズムを生み出す：信頼システムそのものを制御すること。

AIが何をするかを制御するのではなく、どのAIをどの程度信頼するか、どの領域で信頼するか、失敗したとき誰が責任を負うかを制御する。

その線が動くスピードこそが、本当の変数

制御から検証へ、人類は数千年かけた。

蒸汽機関から電力へ、産業社会は「制御不能だが検証可能」な機械の論理に適応するのに百年かけた。

インターネットからアルゴリズム推薦へ、デジタル社会は検証権を失っていることに気づくのに二十年かけた。

AIは？この線は月単位で動いている。

適応が追いつかなければ、真空地帯が生まれる——古い制御が機能しなくなり、新しい信頼システムはまだ構築されていない。その真空の中でのリスクは、AIが賢すぎることではない。人間が何を信頼すべきで、何を信頼すべきでないかをまだ考えきれていないことだ。

最初の問いに戻ろう：人間はどうやって少しずつ手放してきたのか？

怠惰からではない。盲目的な楽観からでもない。

手放すたびに、より大きな何かを得てきたからだ。ハンマーは建築を生んだ。コンピュータはインターネットを生んだ。アルゴリズムはパーソナライズのスケール化を生んだ。

手放すことは、人間が自分の能力の境界を広げる方法だ。

ただ今回、手放す相手はもはや道具ではない。少し仲間に似た何かだ。

仲間との間には、制御ではなく信頼が必要だ。

私たちはまだ、この種の仲間との付き合い方が上手くない。

森哥 Ideas

Teaching Your AI Agent New Tricks: A Complete Guide to gh skill

The Same Prompt, Over and Over

What Are Agent Skills?

Hands-On: Installing a Skill

Search

Preview

Install

Installing for a Different Agent

Pinning Versions

Writing Your Own Skill

Create the Directory

Write SKILL.md

gh skill で AI エージェントにスキルを追加する：インストールから公開まで完全ガイド

毎回同じことを教えるのに疲れた

Agent Skills とは何か

実践：既存の skill をインストールする

検索

プレビュー

インストール

別のエージェントにインストール

バージョン固定

自分で skill を書く

ディレクトリ作成

SKILL.md を書く

用 gh skill 给 AI 装技能包：从安装到发布完整指南

一个反复出现的问题

Agent Skills：给 AI 的技能包

实战：装一个现成的 skill

搜索

预览

安装

给不同的 agent 装

版本锁定

自己写一个 skill

创建目录

编写 SKILL.md

The Future of AI Payments: From Programmable Accounts to Autonomous Agent Transactions

From “Humans Pay” to “Agents Pay”

Programmable Payments: Giving Money a Logic Layer

Three Parallel Technical Paths

Where Programmable Payments Actually Create Value

Agent Authorization Architecture: Designing “Pre-Consent”

A Three-Layer Authorization Model

The Hard Problem: Multi-Agent Nesting

The Identity Layer War: Next Decade’s Payment Entry Point

OpenAI Entering Payments: Not “If” but “How”

How Regulation Will Unfold

Closing

AI決済の未来：プログラマブル口座からエージェント自律決済へ

「人間が払う」から「エージェントが払う」へ

プログラマブル決済：お金に「ロジック」を持たせる

三つの並行する技術路線

プログラマブル決済が本当に価値を生む場所

エージェント認可アーキテクチャ：「事前同意」の仕組みを設計する

三層認可モデル

最も難しい問題：マルチエージェントのネスト

アイデンティティ層争奪：次の10年の決済入口

OpenAIの決済参入：「するかどうか」ではなく「どう入るか」

規制はどう動くか

おわりに

AI支付的未来：从可编程账户到Agent自主支付

从”人付款”到”Agent付款”

可编程支付：把钱变成”有逻辑的钱”

三条并行的技术路线

可编程支付真正的价值在哪里

Agent授权架构：设计一套”预先同意”的机制

三层授权模型

最难解决的问题：多Agent嵌套

身份层争夺：下一个十年的支付入口

OpenAI入场支付：不是”会不会”，是”怎么进”

监管会怎么走

结语

You Are Not a Point in the Universe — The Universe Is Inside You

The Universe Is Perceiving Itself Through You

Ancient Scriptures Already Said This

The Hard Problem of Consciousness

Finding the Thing That Never Left

Why Ancient People Could Think This Deeply

The More Advanced, The More Empty