1
0
Fork 0
claude-code-complete-agent/examples/11-computer-use/prompt.md
Kjell Tore Guttormsen 0d0b83f98c feat: make examples cumulative with carry-forward chain and capstone
Add three new sections to all 14 examples:
- "Carry Forward": what output feeds into later examples (01-10)
- "The Cumulative Path": alternative prompt building on previous output (02-10)
- "Now Try It Yourself": personalized template with transferable pattern (all)
- "Building On" callout connecting back to previous examples (02-10)

Add Example 14: Build Your Personal Agent - capstone that guides reader
through writing their own CLAUDE.md, creating a personal skill, connecting
a messaging channel, setting up automation, and testing end-to-end.

Update README with cumulative path diagram, two usage modes, and example 14.
Update GETTING-STARTED.md with cross-references to relevant examples.

17 files changed, 703+ lines added. The examples now form a coherent
learning path from "see what it can do" to "build your own agent."

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-26 21:14:35 +01:00

74 lines
2.5 KiB
Markdown

# Example 11: Computer Use
Demonstrate Claude Code's ability to control your desktop: open
apps, click buttons, fill forms, take screenshots. This is the
capability that made people say "Anthropic just killed OpenClaw."
**OpenClaw equivalent:** Browser automation + macOS/iOS/Android
companion apps with screen control.
**Requirements:**
- Claude Code Desktop app (not CLI-only)
- macOS with Accessibility and Screen Recording permissions granted
- Computer Use enabled in Settings > Desktop app > General
- Pro or Max plan
## The prompt
```
Open Safari, navigate to Hacker News, take a screenshot of the
front page, then open TextEdit and write a summary of the top
5 stories with their point counts. Save the file as
hn-summary-today.txt on the Desktop.
```
## What happens
1. Claude takes control of your screen (golden border appears)
2. Opens Safari, navigates to news.ycombinator.com
3. Takes a screenshot to read the page content
4. Opens TextEdit (or creates a new document)
5. Types the summary with story titles and point counts
6. Saves the file to your Desktop
## How this compares to OpenClaw
OpenClaw controls the browser via CDP/Playwright (programmatic).
Its macOS/iOS companion apps can interact with the desktop.
Claude Code Computer Use controls the screen like a human:
screenshots, mouse clicks, keyboard input. It is slower but
works with any application, not just browsers.
## Limitations (honest)
- Research preview. Expect rough edges.
- macOS only (for now)
- Slower than Playwright for browser-only tasks
- Cannot interact with apps that block screen recording
- Some actions cannot be undone (the macOS permission warning
is real)
For browser-only automation, Playwright MCP (example 04) is
faster and more reliable. Computer Use shines when you need to
interact with native desktop apps that have no API or CLI.
---
## Now Try It Yourself
Think about a desktop task you repeat that involves no API or CLI:
```
Open [application], navigate to [where], and [do what].
Take a screenshot before and after. Save the result as
[filename] on the Desktop.
```
**The pattern you just learned:** application + navigation + action + capture. Computer Use works with anything visible on screen. The golden rule: if you can do it by clicking, Claude can do it too.
Ideas worth trying:
- Open your email client, screenshot the inbox, and list unread subjects
- Fill out a form in a desktop app with data from a file
- Take a screenshot of a dashboard and summarize the metrics
- Open a PDF, extract key sections, and save them as markdown