Getting started¶
This page walks you through installing and running White Collar Agent locally (Conda) or via Docker.
Prerequisites¶
- Python 3.9+
git,conda, andpip- An API key for your chosen LLM provider (OpenAI or Gemini)
Run locally (Conda)¶
1) Clone the repository¶
git clone https://github.com/zfoong/WhiteCollarAgent.git
cd WhiteCollarAgent
````
### 2) Create the Conda environment
```bash
conda env create -f environment.yml
3) Activate the environment¶
If you’re not sure what the environment is called, list them and activate the one created from environment.yml:
conda env list
conda activate <ENV_NAME>
4) Set your API key¶
Pick one provider:
export OPENAI_API_KEY="<YOUR_KEY_HERE>"
or:
export GOOGLE_API_KEY="<YOUR_KEY_HERE>"
5) Start the agent (CLI)¶
python -m core.main
Once it launches, you can:
- chat with the agent,
- ask it to perform tasks,
- run
/helpinside the interface to see available commands.
Run with Docker¶
1) Build the image¶
From the repository root:
docker build -t white-collar-agent .
2) Run the container¶
Run interactively:
docker run --rm -it white-collar-agent
If you want to supply environment variables via a file (for example, based on .env.example):
cp .env.example .env
docker run --rm -it --env-file .env white-collar-agent
3) Enable GUI / screen automation (optional)¶
GUI actions (mouse/keyboard events, screenshots) require an X11 server. Choose one approach:
A) Use the host display (Linux + X11)
docker run --rm -it \
-e DISPLAY=$DISPLAY \
-v /tmp/.X11-unix:/tmp/.X11-unix \
-v $(pwd)/data:/app/core/data \
white-collar-agent
B) Run headlessly with a virtual display (Xvfb)
docker run --rm -it --env-file .env white-collar-agent \
bash -lc "Xvfb :99 -screen 0 1920x1080x24 & export DISPLAY=:99 && exec python -m core.main"
Notes¶
- GUI mode is experimental, so expect issues if/when the agent decides to switch to GUI mode.
-
If you run into setup problems, double-check:
-
your environment is activated,
- your API key is set,
- you’re launching with
python -m core.main.