---
title: FARA - Browser Use Agent
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: docker
pinned: true
license: mit
short_description: Microsoft Fara-7B Browser Use Demo inspired by CUA2
app_port: 7860
tags:
  - computer-use
  - browser-automation
  - ai-agent
  - vision-language-model
---

# 🤖 FARA - Computer Use Agent Demo

FARA (Fara Agent for Real-world Automation) is an AI agent that can browse the web and complete tasks autonomously.

## Features

- **Autonomous Web Navigation** - The agent can browse websites on its own
- **Web Search** - Search for information across the web
- **Form Filling** - Fill out forms automatically
- 🖱️ **Point and Click** - Click buttons, links, and elements
- ⌨️ **Text Input** - Type text into fields
- **Page Scrolling** - Scroll through content

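The point-and-click, text-input, and scrolling features listed above each correspond to a single low-level browser call. The following is a minimal sketch of how one agent action might be dispatched to a Playwright-style page object. The action dictionary schema (`type`, `x`, `y`, `text`, `url`, `dy`) and the `execute_action` helper are hypothetical and chosen for illustration only; they are not Fara-7B's actual output format or this demo's implementation. The calls `page.goto`, `page.mouse.click`, `page.keyboard.type`, and `page.mouse.wheel` are the only Playwright methods assumed.

```python
from typing import Any, Mapping


def execute_action(page: Any, action: Mapping[str, Any]) -> str:
    """Apply one agent action to a Playwright-style page and return a log line.

    The action schema here is a hypothetical example, not Fara-7B's real format.
    """
    kind = action["type"]
    if kind == "goto":
        # Navigate the current tab to a URL.
        page.goto(action["url"])
        return f"navigated to {action['url']}"
    if kind == "click":
        # Click at viewport pixel coordinates.
        page.mouse.click(action["x"], action["y"])
        return f"clicked at ({action['x']}, {action['y']})"
    if kind == "type":
        # Send keystrokes to whichever element currently has focus.
        page.keyboard.type(action["text"])
        return f"typed {action['text']!r}"
    if kind == "scroll":
        # Scroll vertically; positive dy scrolls down.
        page.mouse.wheel(0, action["dy"])
        return f"scrolled by {action['dy']} px"
    raise ValueError(f"unsupported action type: {kind!r}")
```

With Playwright installed, `page` would be the object returned by `browser.new_page()` from `playwright.sync_api`. Because the function only relies on duck typing, it can also be exercised with a stand-in object that records calls, which keeps it testable without launching a browser.
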
## How to Use

1. Enter a task in natural language (e.g., "Search for the latest news about AI")
2. Click "Run Task" and watch the agent work!
3. View the screenshots to see each step the agent takes

## Powered By

- **Microsoft Fara-7B** - Vision-Language Model for computer use
- **Playwright** - Browser automation framework
- **Modal** - Model hosting and inference

## Links

- [GitHub Repository](https://github.com/microsoft/fara)

## License

MIT License