Self-Evolving
Crystallizes each task's execution path into a Skill. Capabilities grow with every use, forming your personal skill tree.
Through 9 atomic tools + a ~100-line Agent Loop, GenericAgent grants any LLM
system-level control of a local computer — browser, terminal, filesystem, keyboard/mouse,
screen vision, and mobile devices (ADB).
Don't preload skills — evolve them.
# One-line install (Linux / macOS)
$ GLOBAL=1 bash -c "$(curl -fsSL \
http://fudankw.cn:9000/files/ga_install.sh)"
# Or: developer mode
$ git clone https://github.com/lsdefine/GenericAgent.git
$ uv pip install -e ".[ui]"
$ python launch.pyw
✓ Agent ready · waiting for your first task▋
Sets up an isolated Python env, Git, and the desktop app — no system pollution.
GLOBAL=1 bash -c "$(curl -fsSL \
http://fudankw.cn:9000/files/ga_install.sh)"
powershell -ExecutionPolicy Bypass -c "$env:GLOBAL=1; `
irm http://fudankw.cn:9000/files/ga_install.ps1 | iex"
Clone the source, install core + UI deps, and add your LLM API key.
git clone https://github.com/lsdefine/GenericAgent.git
cd GenericAgent
uv venv
uv pip install -e ".[ui]"
cp mykey_template.py mykey.py # add your API key
python launch.pyw
⚙ Python 3.11 / 3.12 recommended (do not use 3.14).
A minimal seed, strong execution, and capabilities that grow as you use it — an agent that hands complexity to evolution.
Crystallizes each task's execution path into a Skill. Capabilities grow with every use, forming your personal skill tree.
~3K lines of core code; the Agent Loop is ~100 lines. No heavy dependencies, zero deployment overhead.
Injects into a real browser (keeps your login session). 9 atomic tools take direct control — browser, terminal, keyboard/mouse, vision, ADB.
Supports Claude / Gemini / Kimi / MiniMax and other major models. Cross-platform on Windows / macOS / Linux.
Under 30K context window — a fraction of other agents' 200K–1M. Less noise, fewer hallucinations, higher success rate.
Everything in this repo — from installing Git and git init to every commit — was done autonomously by GenericAgent. The author never opened a terminal.
Three pillars work together to complete complex tasks while continuously accumulating experience.
Perceive environment → reason → call tools → write experience to memory → loop.
code_run Run any code
file_read Read files
file_write Write files
file_patch Patch files
web_scan Perceive web
web_execute_js Control browser
ask_user Human-in-the-loop
update_working_checkpoint Working notepad
start_long_term_update Distill long-term memory
Via code_run, install packages, write scripts, and call external APIs — crystallizing temporary abilities into permanent tools.
This is what fundamentally sets GenericAgent apart from other agent frameworks.
After a few weeks, your agent will have a skill tree no one else in the world has — all grown from 3K lines of seed code.
From food delivery to stock screening — it really drives your apps and system.




| Feature | GenericAgent | OpenClaw | Claude Code |
|---|---|---|---|
| Codebase | ~3K lines | ~530,000 lines | Open-sourced (large) |
| Deployment | pip install + API Key | Multi-service orchestration | CLI + subscription |
| Browser Control | Real browser (session preserved) | Sandbox / headless | Via MCP plugin |
| OS Control | Mouse/kbd, vision, ADB | Multi-agent delegation | File + terminal |
| Self-Evolution | Autonomous skill & tool growth | Plugin ecosystem | Stateless between sessions |
| Out of the Box | Few core files + starter skills | Hundreds of modules | Rich CLI toolset |
Baselines include Claude Code, OpenAI CodeX, and OpenClaw — evaluated on Claude Sonnet 4.6 / Opus 4.6 / GPT-5.4 / MiniMax M2.7 backbones.
Can GA complete hard tasks more cheaply? · SOP-Bench, Lifelong AgentBench, RealFin
Can a minimal atomic toolset replace specialized ones? · Tool Efficiency Benchmark
Does condensed hierarchical memory beat redundant memory & embedding retrieval? · LoCoMo, 20-skill stress test
Can it distill reusable SOPs without intervention? · 9-round LangChain longitudinal study
Does density-driven design survive the open web? · WebCanvas, BrowseComp-ZH
If this project helped you, please leave a Star 🙏