简介

AgentPhone 是连接 AI 代理与物理世界的桥梁。它将标准安卓设备暴露为一组 MCP(Model Context Protocol)工具,让大语言模型能够像真人一样触摸屏幕、发送消息、管理文件。

什么是 AgentPhone?

与依赖脆弱无障碍 API 或 Root 技巧的纯软件自动化不同,AgentPhone 使用专用硬件层和定制安卓 ROM 来提供:

  • 稳定性: 代理作为系统服务运行,免受后台杀进程影响。
  • 身份标识: 真实 SIM 卡,支持正规短信和语音通话。
  • 合规性: 无需 Root;使用签名 OEM 系统权限。

架构概览

系统由三个主要组件构成:

1. AgentPhone 设备

运行自定义守护进程的物理安卓硬件。

2. AgentHub

管理设备连接和供电的 USB/网络桥接设备。

3. MCP 服务器

大语言模型的交互接口。将工具调用转换为设备命令。

Claude Desktop 集成

要在 Claude Desktop 中使用 AgentPhone,请将以下 StreamableHTTP 配置添加到你的 claude_desktop_config.json:

{
  "mcpServers": {
    "agentphone": {
      "url": "https://agenphone-api.yunke.ai/mcp/",
      "headers": {
        "Authorization": "Bearer YOUR_API_KEY"
      }
    }
  }
}

OpenClaw 集成

通过以下步骤,在 OpenClaw 中完成 AgentPhone 的安装与配置:

1安装 Skillhub CLI
根据 https://skillshub.yunke.ai/omni-cli/npm/skillshub.md 安装 Skillhub商店,但是只安装CLI。
2启用 agenphone 技能
启用 agenphone 技能
3登录
登录
4发送验证码
发送验证码
5查看 agenphone 能力
帮我列举 agenphone 能力
OpenClaw 使用教程示例

Manus 集成

Manus 原生支持自定义 MCP 服务器。按以下步骤连接 AgentPhone:

  1. 前往 SettingsIntegrationsCustom MCP Servers.
  2. 点击 Add Custom MCP Server.
  3. 服务器名称填写 AgentPhone,URL 填写 https://agenphone-api.yunke.ai/mcp/,认证 Header 填写 Authorization: Bearer YOUR_API_KEY.
  4. 点击 Test Connection 验证通过后保存,即可在 Manus 工作流中使用 AgentPhone 工具。

ZeroClaw & Perplexity

对于 Perplexity 或 ZeroClaw 等搜索型代理,AgentPhone 提供「MCP」接口。

在代理的工具定义设置中配置 MCP 端点。代理即可像搜索网页一样「搜索」你手机的内容(短信、照片、文件)。

移动应用安装

需要安装 AgentPhone 伴侣应用才能将物理设备连接到云端。

Android APK QR Code

或扫码下载

* 注意:需要 Android 11 或更高版本。建议使用专用设备以获得最佳性能。

MCP 工具参考

AgentPhone MCP 服务器暴露的全部工具完整参考。调用功能工具前,请先通过 create_task 获取 task_id。

任务管理

create_taskCreate an AI task and obtain a task_id. Must be called before invoking any functional tool. Params: task_type (e.g. "phone_call", "data_collection", "outbound_call"), description (optional). Returns JSON with task_id and status.
complete_taskMark a task as finished. Params: task_id, status ("completed" / "failed" / "cancelled"). Returns JSON with the updated task status.

App 管理

open_appOpen an application by package name or URL
list_packagesList all installed applications on the device
get_foreground_appGet the currently active foreground application
get_app_infoRetrieve metadata and info for a specific app
force_stop_appForce stop a running application
uninstall_appUninstall an application from the device
clear_app_dataClear all data and cache for an application

通话与短信

callInitiate a phone call to a number
hang_upEnd the current active call
answer_callAnswer an incoming call
list_contactsRetrieve all contacts from the device
list_call_logGet recent call history
search_contactsSearch contacts by name or phone number
send_smsSend an SMS message to a contact
read_smsRead SMS messages from the inbox

AI 外呼

ai_outbound_call_taskInitiate an AI outbound call (non-blocking). Sends the call command and waits for device acknowledgment (dialing state), then returns immediately. Params: phone (target number), prompt (AI conversation prompt), task_id, use_custom_prompt (optional, auto-set when prompt is provided). Returns JSON with request_id and task_id. Use query_request_events to poll for results.

屏幕与触控

tapSimulate a tap at specific screen coordinates
swipePerform a swipe gesture on the screen
dragDrag an element from one position to another
input_textType text into the focused input field
long_pressPerform a long press at specific coordinates
global_actionExecute a global Android action (home, back, recents)
screenshotCapture a screenshot of the current screen
phone_ai_observeCapture a screenshot and analyze it with an AI vision model. Returns next-step action suggestions with pixel coordinates. Params: instruction (task description), task_id, language (default "Chinese"). Returns JSON with thought, action_raw, and structured action (click / type / scroll / finished).
get_screen_hierarchyGet the full UI element hierarchy of the current screen
find_elementFind a UI element by attribute or selector
click_by_textClick an element matching specific text
click_by_view_idClick a UI element by its view resource ID
unlock_deviceUnlock the device screen
set_brightnessSet the screen brightness level

设备信息与操作

get_statusGet the current device status overview
get_device_infoRetrieve detailed device hardware and software info
get_batteryGet current battery level and charging state
get_volumeGet the current volume level
get_locationGet the device's current GPS location
get_lock_statusCheck if the device screen is locked
rebootReboot the device
shutdownShut down the device
set_volumeSet the device volume level
vibrateTrigger a device vibration pattern
phone_custom_commandSend a custom control command to the device for operations not covered by standard tools. Params: action (custom action name), task_id, params (JSON string, default "{}"), wait_response (default true). Returns command execution result.
phone_command_historyQuery device command execution history. Params: limit (default 10, max 50), status (optional filter: "pending" / "sent" / "success" / "failed" / "timeout"). Returns JSON array of command records.

剪贴板与音频

get_clipboardGet the current clipboard content
set_clipboardSet text content to the clipboard
play_audioPlay an audio file on the device
stop_audioStop currently playing audio

文件与媒体

list_filesList files in a specified directory
read_fileRead the content of a file
write_fileWrite or create a file with specified content
delete_fileDelete a file from the device
analyze_storageAnalyze device storage usage
find_large_filesFind large files consuming storage space
index_media_metadataIndex and retrieve metadata for media files
search_mediaSearch for media files by metadata or content

通知与悬浮窗

read_notificationsRead current notifications from the status bar
post_notificationPost a notification to the device
dismiss_notificationDismiss a specific notification
dismiss_all_notificationsDismiss all current notifications
show_toastShow a brief toast message on screen
show_overlayDisplay a floating overlay window
hide_overlayHide a specific overlay window
hide_all_overlaysHide all active overlay windows
list_overlaysList all active overlay windows

TTS 语音

speak_ttsConvert text to speech and play it on the device

任务查询

query_request_eventsQuery outbound call events by request_id. Core tool for polling AI call results. Params: request_id, event_type (optional: "status" / "dialog" / "task_summary"). Returns JSON with request status, is_completed flag, and events list. The call is considered ended when is_completed=true or a "task_summary" event appears.
query_task_statusQuery current AI task status, including execution progress, latest event, and statistics. Params: task_id. Returns JSON with task info, is_completed flag, call_state, progress (request/event counts), and the latest event and request.
query_task_historyQuery complete AI task history with all requests and events for tracing execution. Params: task_id, event_type (optional: "status" / "dialog" / "task_summary"), limit (default 50, max 200). Returns JSON with task info, all requests, and all events in reverse chronological order.

定价方案

选择适合你代理规模的方案。

月付

$140 / 月

  • 300 分钟语音通话
  • 社区支持

季付

$280 / 季

  • 1,800 分钟语音通话
  • 优先支持

年付

$830 / 年

  • 3,000 分钟语音通话
  • 专属客户经理
  • SLA 保障