The web-agent-interface library provides tools that let LLM agents running in the browser interact with different websites.
Note: The project is still in development. It has limited coverage, and its APIs may change.
- General DOM Operations: Get page content and user selection.
- Overleaf: Edit Overleaf documents.
- Google Calendar (GCal): Read and create events on Google Calendar.
To install and build the library, follow these steps:
- Clone the repository:
    git clone https://github.com/mlc-ai/web-agent-interface.git
    cd web-agent-interface
- Install dependencies and build the project:
    npm install
    npm run build
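// Create the state object that tool implementations receive.
import { State } from '@mlc-ai/web-agent-interface';
const state = new State();

// Describe the available tools in the system prompt.
// `tool` is indexed by name below, so iterate over its values here.
import { tool, retriever, action } from '@mlc-ai/web-agent-interface';
const system_prompt = `
You are a helpful AI agent.
You have the following tools to use:
${Object.values(tool).map((t) => JSON.stringify(t.schema)).join(",\n")}
`;

// Run the tool the LLM asked for. `extractToolCall` and `llm_response` are not
// defined in this snippet; supply your own parser and model output.
const { tool_name, parameters } = extractToolCall(llm_response);
const observation = tool[tool_name].implementation(state, parameters);
console.log("Got observation:", observation);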
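The snippet above calls `extractToolCall` without defining it. The following is a minimal sketch of one way such a helper could look, assuming the LLM is prompted to reply with a JSON object of the form `{"tool_name": ..., "parameters": ...}`. That reply format, the helper itself, and the `mockTool` registry with its `echoSelection` entry are all hypothetical and are not part of the library; the registry only mirrors the `tool[tool_name].implementation(state, parameters)` shape so the example runs without a browser.

```javascript
// Hypothetical helper: the JSON reply format is an assumption for illustration,
// not something the library defines.
function extractToolCall(llmResponse) {
  // Take the outermost {...} span so any surrounding prose is ignored.
  const start = llmResponse.indexOf("{");
  const end = llmResponse.lastIndexOf("}");
  if (start === -1 || end <= start) {
    throw new Error("No tool call found in LLM response");
  }
  const call = JSON.parse(llmResponse.slice(start, end + 1));
  if (typeof call.tool_name !== "string") {
    throw new Error("Tool call is missing a string tool_name");
  }
  // Default to an empty parameter object when none is given.
  return { tool_name: call.tool_name, parameters: call.parameters ?? {} };
}

// Stand-in registry (made up) with the same shape as the usage above.
const mockTool = {
  echoSelection: {
    implementation: (state, parameters) =>
      `${state.selection}${parameters.suffix}`,
  },
};

// Stand-in for a real State instance and a real model reply.
const mockState = { selection: "hello" };
const llm_response =
  'I will call a tool. {"tool_name": "echoSelection", "parameters": {"suffix": "!"}}';

const { tool_name, parameters } = extractToolCall(llm_response);

// Reject names the registry does not own before dispatching.
if (!Object.prototype.hasOwnProperty.call(mockTool, tool_name)) {
  throw new Error(`Unknown tool: ${tool_name}`);
}
const observation = mockTool[tool_name].implementation(mockState, parameters);
console.log("Got observation:", observation);
```

Checking the tool name against the registry before dispatching keeps a malformed or hallucinated tool call from turning into an opaque `undefined` property error.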