The recommended way to get started is the CLI. The
Dify plugin is an optional, advanced integration for teams building
workflows and agents inside Dify.
Why Use Actionbook in Dify
- Start from verified selectors and action manuals instead of guessing from raw HTML.
- Combine Actionbook search/get tools with live browser automation in one Dify workflow.
- Recover from selector drift by taking a fresh accessibility
snapshotand continuing the run.
Included Tools
| Tool | What it does | Best for |
|---|---|---|
search_actions | Search Actionbook for verified page areas and selectors | Finding the right element on a site |
get_action_by_area_id | Fetch the full action manual for a selected area ID | Getting precise CSS, XPath, and ARIA selectors |
browser_create_session | Start a managed cloud browser session | Opening a browser before multi-step automation |
browser_operator | Navigate, click, fill, snapshot, and inspect pages | Executing browser steps in a workflow |
browser_stop_session | Stop the cloud browser session and release resources | Cleanup at the end of a workflow |
Installation
Install the plugin directly from the official Dify Marketplace:- Open the Actionbook plugin page.
- Click
Install. - Enable the plugin in your Dify workspace.
- Configure the plugin credentials before running browser workflows.
Credentials
Actionbook API Key
- Optional for
search_actionsandget_action_by_area_id. - Leave it empty to use the free tier, or add a key for higher quotas.
- Get a key from actionbook.dev.
Hyperbrowser API Key
- Required for
browser_create_session,browser_operator, andbrowser_stop_session. - Get a key from Hyperbrowser.
Recommended Workflow
The most reliable Dify flow is:search_actionsto find a relevant page area.get_action_by_area_idto retrieve the full selector manual.browser_create_sessionto start a cloud browser session.browser_operatorfor navigation and page interaction.browser_stop_sessionto release the session when the workflow finishes.
browser_operatoraccepts eithersession_idorcdp_url; for multi-step workflows, pass both for the best recovery behavior.browser_create_sessionreturnsws_endpoint, which should be passed tobrowser_operatorascdp_url.snapshotis a fallback step when selectors fail or page state changes, not a mandatory success-path step.browser_stop_sessionshould be called after each browser workflow to release the remote session cleanly.
Browser Operator Tips
- Use
fillfor form inputs when possible. It clears and writes the value in a single step. - Use
snapshotafter navigation, after major page changes, or whenever a selector fails. - Snapshot responses include accessibility refs like
[ref=e3]; these are often the most reliable selectors for follow-upclickandfillcalls. - Always call
browser_stop_sessionat the end of the workflow to avoid unnecessary browser billing.
Example Use Cases
- Search or filter content on dynamic web apps.
- Combine Actionbook search/get tools with browser automation for agentic task execution.
- Recover from selector drift by taking a fresh
snapshotand continuing the workflow.