alibaba/page-agent
Original summary
JavaScript in-page GUI agent. Control web interfaces with natural language.

Page Agent: the GUI agent living in your webpage.

https://github.com/user-attachments/assets/a1f2eae2-13fb-4aae-98cf-a3fc1620a6c2

✨ Features
🎯 Easy integration: no browser extension, Python, or headless browser needed. Just in-page JavaScript; everything happens in your web page.
📖 Text-based DOM manipulation: no screenshots, and no multi-modal LLMs or special permissions needed.
🧠 Bring your own LLMs.
🐙 Optional Chrome extension for multi-page tasks, plus an MCP Server (Beta) to control it from outside.

💡 Use Cases
SaaS AI Copilot: Ship an AI copilot in your product in a few lines of code. No backend rewrite.
Smart Form Filling: Turn 20-click workflows into one sentence. Perfect for ERP, CRM, and admin systems.
Accessibility: Make any web app accessible through natural language. Voice commands, screen readers, zero barrier.
Multi-page Agent: Extend your own web agent's reach across browser tabs with the Chrome extension.
MCP: Allow your agent clients to control your browser.

🚀 Quick Start
One-line integ…
📋 This text is the original RSS summary from GitHub Trending Daily RSS and has not been processed by AI. For full context, refer to the original source.