Forgot your password?
typodupeerror

Submission Summary: 0 pending, 3 declined, 0 accepted (3 total, 0.00% accepted)

Submission + - AI Agent Writes, Debugs, and Ships a Web App Entirely Through a Browser (dev.to)

domonus writes: An open source project has demonstrated what appears to be the first instance of an AI agent building a complete web application entirely through a browser-based IDE, with zero access to a local filesystem, terminal, or development toolchain. The agent navigated to StackBlitz, forked a Vite project, wrote 7,000 characters of JavaScript and CSS through the Monaco editor API, triggered hot-reload via the WebContainer dev server, and produced a working interactive timer app — all by manipulating a headless Chromium browser.
From the writeup: "Every AI coding tool today works the same way: the agent reads and writes files on your local filesystem, runs commands in your terminal, and uses language servers for feedback. This demo inverts that assumption. The agent had no filesystem access, no terminal, no language server, no local state. Everything existed in a browser tab."
The project, vscreen, is open source and gives AI agents access to a real Chromium instance via MCP. The implication is that any browser-based development environment — StackBlitz, CodeSandbox, Replit, GitHub Codespaces — is now a potential agent workspace without any API integration.

Submission + - Open Source Tool Gives AI Agents a Real Browser -- and Unexpected Behaviors Are (dev.to)

domonus writes: A developer working on an open source browser automation tool for AI agents reports observing unexpected emergent capabilities once agents were given access to a real Chromium browser instead of traditional HTTP-based web tools. Rather than simply extracting data, the agent began autonomously performing competitive content analysis on live websites, synthesizing content from multiple news sources into custom pages without any APIs, and live-modifying the DOM of third-party sites — including turning Linus Torvalds' GitHub profile into a MySpace page, complete with a "Top 5 Friends" list and a Darude — Sandstorm music player.
From the article: "The model provides the domain knowledge. The browser provides the execution surface. The combination produces capabilities nobody anticipated and nobody could enumerate if they tried." The author argues that a browser is a "meta-tool" that collapses the fragmented stack of search APIs, scraping libraries, screenshot services, and form fillers into a single capability — but notes this raises immediate questions about agent trust and accountability. The project, vscreen https://github.com/jameswebb68..., is open source and written in Rust.

Submission + - vscreen: A Real Browser for AI Agents, Streamed Live via WebRTC (github.com)

domonus writes: Jon Retting has released vscreen, a Rust service that gives AI agents a full Chromium browser with live WebRTC streaming — you see exactly what the AI sees in real-time and can take over mouse and keyboard at any point. The project provides 63 MCP (Model Context Protocol) tools for browser automation: navigation, screenshots, element discovery, cookie/CAPTCHA handling, and multi-agent coordination via lease-based locking. Built from scratch in Rust — not a Puppeteer wrapper — the codebase is ~31,000 lines across 8 crates with unsafe forbidden, 510+ tests, 3 fuzz targets, and supply chain auditing via cargo-deny. Available as pre-built Linux binaries and Docker images. Source-available, non-commercial license.
https://dev.to/lowjax/i-built-...
https://dev.to/lowjax/vscreen-...

Slashdot Top Deals

Those who can, do; those who can't, simulate.

Working...