AI Browser Agent: Gemini 2.5
Execute autonomous web tasks using Google's Gemini 2.5 computer use model

TypeScript
Source codenpx create-browser-app --template gemini-cua
Python
Source codeuvx create-browser-app --template gemini-cua
Run autonomous browser tasks with Gemini 2.5
Execute web tasks autonomously using Google's Gemini 2.5 computer use agent. Give the agent a natural language instruction and it navigates pages, performs searches, clicks through flows, and returns structured results, all without manual scripting. Built on Browserbase for reliable cloud browser execution.
Steps
- Uses Stagehand Agent with the Gemini 2.5 Computer Use to control the browser
- Navigates the web and performs the given instruction autonomously
- Returns the final result after completing all steps