smth is a local MCP server that lets your agent screenshot pages, inspect the DOM, check fonts and colors, and interact with forms — all in a persistent browser session.
Read any page without touching it. Screenshots at any viewport, visible text as Markdown, DOM through focused lenses, fonts, colors, and WCAG contrast ratios — all returned as structured data.
Drive the browser like a user. Click elements by id or CSS selector, type into form fields, hover to reveal tooltips and dropdowns. Every interaction waits for navigation to settle.
Understand what changed. Capture named DOM snapshots before and after any interaction, then diff them line by line. Or use fetch_dom_content to auto-strip shared chrome and return only unique page content.
smth runs as a persistent local service. One docker compose up, then add four lines of JSON to any project.
One-click install via Claude Desktop Extensions. Pending Anthropic review — star the repo to follow along.
★ Star on GitHub