rtrvr.ai logo
rtrvr.ai
PricingBlogDashboard

Core Agent

Getting StartedWeb AgentSheets Workflows

Building Blocks

Recordings & GroundingTool CallingKnowledge Base (RAG)

Platform Access

API OverviewAgent APIScrape APIBrowser as API/MCP

Automation

ShortcutsTriggers & WebhooksSchedules

Account & Security

Cookie SyncPermissions & Privacy
DocsWeb Agent

Web Agent

The core DOM-native intelligence engine that powers every rtrvr.ai surface — extension, cloud, and API.

3 min read

rtrvr.ai is a DOM-only AI agent designed to understand, navigate, and extract data from the web with human-like precision. Unlike screenshot-based tools that rely on vision models, rtrvr.ai reads the actual page structure — making it faster, more accurate, and far harder for sites to detect.

Agentic Architecture

When you issue a command, the Planner agent orchestrates 20+ specialized sub-agents to complete the task:

  • Action Agent — Handles clicks, typing, and navigation
  • Extraction Agent — Identifies and structures data from the DOM
  • Crawl Agent — Manages pagination and multi-page discovery
  • PDF / File Agent — Reads and fills complex forms, uploads documents
For repeated tasks, skip the planning step entirely by using Shortcuts or Replay for deterministic, consistent results.

Example Prompts

text
"Go to Amazon and find the price of iPhone 15 Pro"

"Fill out this contact form with my information"

"Add everyone on this event page as a professional connection"

"Extract all email addresses from this company's team page"

"Go to ChatGPT, ask about top restaurants in SF, extract the response"
The agent handles complex multi-step workflows and will ask for clarification when needed. It operates identically whether running in the Chrome Extension, on Cloud headless browsers, or via the API.

Multi-Tab Orchestration

The agent can reference multiple open tabs simultaneously — comparing data across sites, moving information between systems, or processing dozens of URLs in parallel via Sheets Workflows.

Deterministic Replay

Once a multi-step workflow succeeds, you can replay it to bypass the planning phase entirely. Replay produces 100% consistent results and is the foundation for Shortcuts, Schedules, and Triggers.

Free with Gemini

Save credits by using your own Gemini API key. In the extension, type /add-gemini-key and the agent will walk you through getting a free key from Google AI Studio and configuring it automatically. The system falls back to platform credits if Gemini encounters issues.

Files & PDFs

Drag and drop files into the chat or attach them via the toolbar. The agent has specialized capabilities for PDFs: reading content, filling form fields, and generating new documents. It can also upload attached files to web pages when prompted (e.g., "fill this job application and upload my resume").

Context Tips

Everything in your chat — messages, files, images — becomes context for the AI. Keeping context clean leads to better results.

  • Start fresh chats for new tasks — stale context can confuse the planner
  • Only enable the MCP/custom tools you actually need for the current task
  • Use Personal Context in settings to store persistent info (resume, company details) the agent remembers across all chats
  • Click the ✨ Enhance button (Cmd/Ctrl+P) to have a specialized model rewrite your prompt for clarity before sending
Enhancing prompts lets you co-plan with the agent. It's especially powerful for complex multi-step tasks where precise instructions matter.

Cloud & Mobile

The Chrome Extension is where you build and test workflows. Export successful workflows to rtrvr.ai Cloud for 24/7 headless execution — no laptop required. Recordings, tools, and KB context travel with the workflow automatically.

  • Export any workflow from the extension to Cloud with one click
  • Scale to thousands of parallel URLs via the API
  • Connect your account to WhatsApp — send text or voice messages to trigger automations and receive results on your phone
  • Share your best automations via link — recipients import with one click, including recordings, tools, and all context

Platform Availability

CapabilityExtensionCloudAPI
Natural-language planning✅✅✅
Multi-tab orchestration✅✅✅
Authenticated site access✅ (your sessions)✅ (via Cookie Sync)✅ (via Cookie Sync)
Real-time feedback & steering✅——
Deterministic replay✅✅✅
Files & PDF handling✅✅✅
WhatsApp trigger & results—✅—
Share & import workflows✅✅✅
Previous
Getting Started
Next
Sheets Workflows

On this page

Agentic ArchitectureExample PromptsMulti-Tab OrchestrationDeterministic ReplayFree with GeminiFiles & PDFsContext TipsCloud & MobilePlatform Availability

Ready to automate?

Join teams using rtrvr.ai to build playful, powerful web automation workflows.