Long

A single-process, multi-user LLM agent runtime on Elixir/OTP — for a household or a small team. Phoenix for the UI, Ash for the data layer, Oban for scheduled tasks, ReqLLM for provider abstraction.

Long started as a port of the Python GenericAgent to Elixir, borrowing its core shape — one session → ReAct loop → tools + memory + skills. The design has since diverged substantially: on the BEAM it gets real concurrency, fault tolerance, and long-lived push messaging natively (one supervised GenServer per session rather than a bolted-on Python process model), and the agent's capability layer has been rebuilt on mature, standard technology rather than a bespoke tool protocol — most notably GraphQL as the agent's primary skill (see below).

It's web-first: you don't run a CLI to talk to it. Open the browser, and chat, configuration, memory, channels, and scheduled tasks are all just pages.

Design philosophy

Long is built to install and run like a personal CLI tool, not like server infrastructure. Everything below follows from that one decision.

One self-contained binary. mix release bundles the Erlang VM (ERTS) and every BEAM dependency into a single tarball. The target machine needs neither Erlang nor Elixir installed — just untar and run. No Docker image to build, no base image to track.
Installs into one directory, owns nothing else. curl | bash drops everything under ~/.long/ — the binary, the VM, the SQLite database, the config, the agent workspace, the skills. Uninstall is rm -rf ~/.long. Upgrades wipe only bin/ lib/ releases/ erts-* and preserve your env, long.db, and agent/ data.
No external services to provision. Storage is SQLite (a file) plus the filesystem (skills, workspace). No Postgres, no Redis, no message broker. The whole runtime is one OS process — the BEAM — internally supervising sessions, bots, the scheduler, and even the headless-browser subprocess.
Userspace, no root. Install and autostart run entirely as your user. ~/.long/service install wires up launchd (macOS) or a systemd user unit (Linux) so the agent survives reboot — you never write a unit file or sudo anything.
No language runtime to provision. code_run executes TypeScript/JavaScript in a sandboxed Deno binary that the app downloads and manages itself on first use — no Python, Node, or uv to install (bash is available for shell commands). The optional headless browser, Obscura, is fetched the same way.
LAN-first, not internet-hardened. Binds 0.0.0.0, check_origin off, no forced SSL — so a freshly installed node is reachable from any device on your home network by IP. Internet exposure is opt-in (LONG_CHECK_ORIGIN, a reverse proxy, etc.), never the default that locks you out on first run. In the same spirit, code_run's bash mode runs with the server's full host access (only the Deno engine — the default — is sandboxed per-member); that's fine for a trusted household, not for untrusted members.
Open-source-friendly delivery. The installer pulls release tarballs straight from GitHub Releases over plain curl — no gh CLI, no GitHub account, no auth. Anyone can install with one line.

The result: getting Long onto a Mac mini or a Linux box in the corner is curl | bash + paste an API key, and it behaves like an appliance from there.

GraphQL as the agent's primary skill

Most agent frameworks invent a bespoke tool protocol — one narrow tool per capability (schedule_task, remember_fact, update_checkpoint, …), each with a hand-maintained schema the model has to be taught.

Long takes a different route: the agent's main capability is a single graphql tool over the entire Ash data layer — sessions, messages, both memory tiers, working checkpoints, scheduled tasks, secrets, LLM/search configs — for read and write through one uniform interface.

The model already speaks it. GraphQL is in every model's pretraining; there's no custom DSL to explain.
It's self-describing. The schema is introspectable ({ __schema { queryType { fields { name } } } }), so the agent discovers its own capabilities at runtime instead of us maintaining a wall of tool descriptions.
One tool replaces ~10. Adding a new Ash resource automatically grants the agent CRUD over it — no new tool to write, register, or document.

This is the core bet: lean on mature, introspectable, widely-understood technology (GraphQL) as the agent's capability surface, and let the model's existing fluency do the rest. File-based Skills (SKILL.md + scripts, below) remain for packaged, code-carrying capabilities; GraphQL is how the agent reads and writes its own world.

Web UI

Everything is a page — there's no separate CLI you have to learn to operate the agent day to day.

Page	What it is
`/chat`	the agent. Streaming replies, live tool-call display, a memory side-rail, AI-named sessions.
`/manage`	everything else. LLMs, memory, skills, groups & members, sessions, search providers, channels, scheduled tasks, phrases (i18n) — each a LiveView.

Features

GraphQL capability layer — one introspectable graphql tool gives the agent read/write over its whole data world (see above).
Groups & members (multi-tenant) — a deployment hosts one or more groups (a family, a small team). Each member links their own WeChat / Telegram by sending /bind <code>, gets an isolated per-member code workspace + personal skills, and can message other members in the group ("notify …"). One owner, many members.
Sandboxed code execution — code_run runs TypeScript/JavaScript on a managed Deno binary, sandboxed (read/write) to the caller's per-member workspace; bash is there for shell. No Python — Deno auto-downloads on first use.
Per-chat language (i18n) — every bot string resolves through a fallback chain (channel → member → group → platform-detected → system default) and is overridable at /manage/phrases; set a system-wide default language in one click.
Web-first LiveView UI — /chat (streaming output + tool-call display + live memory side-rail + AI-generated session titles) and /manage for everything else.
Four-tier memory:
- L1 WorkingCheckpoint (one key_info row per session)
- L2 GlobalMemory / SessionMemory (fact / preference / goal / decision, with importance + recency decay)
- L3 Anthropic-compatible Skills (SKILL.md + scripts/references/assets), scoped per-member or shared group-wide (promote a personal skill to global, or view a skill's full SKILL.md, at /manage/skills); the filesystem is the source of truth, a watcher drives an ETS index
- L4 SessionArchive (session archival + LLM summary)
Multi-provider LLM routing — ReqLLM speaks 20+ providers natively (openai / anthropic / google / groq / deepseek / openrouter / mistral / ollama / xai / bedrock / …); wire protocol is configurable, one alias is the default.
Unified admin at /manage — LLM configs, memory editing, skill browsing, groups & members, channels, phrases (i18n), session management, search providers, scheduled tasks, secrets — all LiveView, no ash_admin dependency.
Scheduled tasks — Oban-driven; the LLM schedules its own via GraphQL createScheduledTask, or you create them by hand at /manage/scheduled.
Multi-account channels — host several WeChat accounts and/or Telegram bots at once, each bound to a different member; inbound auto-tags the sender, outbound routes back through the exact account the chat arrived on. Onboarded at /manage/credentials.
Web search aggregation — Tavily / Brave API + SERP scrapers, RRF multi-source merge, providers configured at /manage/search.
Real headless browsing — the Obscura CLI (a Rust Chromium fork) powers the web_scan / web_execute_js tools.
Error observability — ErrorTracker dashboard, a :logger crash backstop, transparent exponential-backoff retry on LLM calls.
In-conversation control — /clear wipes a session, /status asks what the agent is doing, /btw <note> interjects mid-run, /bind <code> links this chat to a member.

Architecture

┌─────────────────────────────────────────────────────────────────┐
│  Phoenix LiveView ─ /chat ─ /manage ─ navigation hub            │
├─────────────────────────────────────────────────────────────────┤
│  Long.Agent.Server (GenServer per session)  ──►  ReAct loop     │
│   │                                │                            │
│   ├── persist messages → DB       ├── tools (file/web/memory/…) │
│   └── PubSub stream → LiveView    └── ReqLLM streaming          │
├─────────────────────────────────────────────────────────────────┤
│  Memory  L1 WorkingCheckpoint   L2 Global + Session             │
│          L3 Skill.Store (FS + watcher + ETS)                    │
│          L4 SessionArchive                                      │
├─────────────────────────────────────────────────────────────────┤
│  Long.Agent.Bots ─ WeChat │ Telegram                           │
│  Oban ─ ScheduledTask runner    ErrorTracker ─ /errors          │
├─────────────────────────────────────────────────────────────────┤
│  Storage:  SQLite (Ash) + filesystem (skills, workspace)        │
└─────────────────────────────────────────────────────────────────┘

Core stack:

Dependency	Role
Elixir 1.15+, Phoenix 1.8, Ash 3, AshSqlite	App / data layer
Oban + AshOban + Oban Web	Background job scheduling
ReqLLM	Unified multi-provider LLM access
Jido + Jido.AI	Tool system (Zoi schema + auto JSONSchema)
Mishka Chelekom	70+ Tailwind LiveView components
Obscura	Rust headless-browser CLI
ErrorTracker	In-app exception aggregation

Quick start

One-line install (macOS / Linux)

Prebuilt releases cover macos-arm64, linux-x64, and linux-arm64:

curl -fsSL https://raw.githubusercontent.com/mjason/long/main/install.sh | bash

The script:

pulls the latest tarball from GitHub Releases and extracts it to ~/.long/,
on first run generates ~/.long/env (with an auto-generated SECRET_KEY_BASE, DATABASE_PATH, …),
writes the ~/.long/run launcher and the ~/.long/service autostart controller.

$EDITOR ~/.long/env     # usually nothing to change
~/.long/run             # start; open http://localhost:4000

Make it start on boot (no root, no unit-file editing):

~/.long/service install     # enable autostart (launchd / systemd-user)
~/.long/service status      # is it registered + running?
~/.long/service logs        # tail run.log
~/.long/service uninstall   # disable autostart

Installer environment variables:

Variable	Default	Meaning
`LONG_INSTALL_DIR`	`~/.long`	install target
`LONG_VERSION`	latest	pin a version, e.g. `v0.2.9`

Docker

A self-contained image bakes in the mix release plus Deno and Obscura — nothing is downloaded on first run. The repo ships a ready docker-compose.yml:

services:
  long:
    image: ghcr.io/mjason/long:latest     # or build locally: build: .
    ports: ["4000:4000"]
    environment:
      SECRET_KEY_BASE: ${SECRET_KEY_BASE}  # generate once: openssl rand -base64 48
      PHX_HOST: ${PHX_HOST:-localhost}
      # LONG_CHECK_ORIGIN: "true"          # set when exposing to the internet
    volumes: ["long_data:/data"]          # SQLite DB + skills + workspace + memory
    restart: unless-stopped
volumes:
  long_data:

From a clone of the repo:

export SECRET_KEY_BASE=$(openssl rand -base64 48)
docker compose up -d          # build + run
# → http://localhost:4000

Data (SQLite DB, skills, agent workspace, memory) persists in the long_data volume across restarts and upgrades. Prefer a bare image? docker build -t long ..

Run from source

Requirements:

Elixir 1.15+ / Erlang 26+
SQLite 3
(Deno and the optional Obscura browser are auto-downloaded at runtime — nothing to pre-install.)

git clone https://github.com/mjason/long.git
cd long
mix setup            # deps.get + ecto.create + migrate + seeds + asset build
mix phx.server

Open http://localhost:4000/ and enter /chat or /manage/llms from the navigation hub.

Configure your first LLM

Open /manage/llms → New LLM:

Field	Example
Alias	`claude_main`
Provider	`anthropic`
Wire protocol	`anthropic_messages`
Model	`claude-sonnet-4`
API base	`https://api.anthropic.com` (or a proxy)
API key	`sk-ant-…`, or leave blank to use `api_key_env_var`
Set as default	✓

Save, go back to /chat, and new sessions bind to this alias automatically. The same flow works for OpenAI / Google / Groq / DeepSeek / any ReqLLM-supported provider.

Install your first Skill (optional)

mkdir -p priv/agent/skills/hello-world/scripts

cat > priv/agent/skills/hello-world/SKILL.md <<'MD'
---
name: hello-world
description: Demo skill — takes a `name` arg, returns a greeting string.
tags: [demo]
---

# hello-world

Run `scripts/hello.py "<name>"`.
MD

cat > priv/agent/skills/hello-world/scripts/hello.py <<'PY'
import json, sys
name = (json.loads(sys.argv[1]) if len(sys.argv) > 1 else {}).get("name", "world")
print(json.dumps({"greeting": f"hello, {name}"}, ensure_ascii=False))
PY

mix long.skill reindex   # or restart the server; the watcher also picks it up

Next conversation, the LLM sees hello-world under # Available skills and can skill_read then code_run it. The format is fully compatible with Anthropic Agent Skills, so you can git clone https://github.com/anthropics/skills priv/agent/skills/ to grab the official repo wholesale.

Channels & members

A deployment serves one or more groups; each group has members (people), and each member links their own chat accounts. The owner sets channels up once at /manage/credentials — no env vars, no restart:

WeChat — click 扫码登录 (scan to log in) for an inline QR and host an account via Tencent's iLink bot API (no desktop hook). Multiple accounts are supported — host one per member/role; the worker hot-reloads on connect.
Telegram — paste a @BotFather token; the bot starts long-polling immediately. Replies render as Telegram HTML, with typing indicators and inbound/outbound media. Multiple bots are supported, one per member.

Members then link themselves: each member has a /bind <code> (shown at /manage/groups); they send it to the shared bot and that chat account is tied to them. Once bound, members can address each other — "notify my spouse / 通知老张 …" — and the agent delivers to every channel that member has linked, in their language (see i18n above). Outbound always routes back through the exact account a chat arrived on.

CLI tools

Command	Purpose
`mix phx.server`	start the web server (default port 4000)
`mix long.skill list / reindex / remove NAME`	skill index management
`mix long.wechat.login`	WeChat QR login + buf persistence
`iex -S mix`	REPL: `Long.Agent.list_sessions()`, etc.
`mix test`	test suite
`mix precommit`	`compile --warnings-as-errors + format + test`

Configuration

Configure from the web, not from files. Almost everything — LLMs, search providers, channels, scheduled tasks, memory, secrets — lives in the DB and is edited at /manage. There's no config file to redeploy for day-to-day changes.

The only file-level config is a handful of filesystem roots, under :long, Long.Agent in config/config.exs (the installed release reads these from ~/.long/env instead):

config :long, Long.Agent,
  memory_root: "priv/agent/memory",      # legacy GenericAgent-compatible path
  skill_root:  "priv/agent/skills",      # L3 skill dir (SKILL.md lives here)
  workspace_root: "priv/agent/workspace" # root for code_run / file_* tools

Everything else is a page in /manage (or, if you prefer, an IEx call like Long.Agent.register_llm/1).

Development

mix test                                # unit + LiveView tests
mix test test/long/jido                 # run one group
mix format
mix precommit                           # compile --warnings-as-errors + format + test
mix usage_rules.docs Ash.Resource       # look up dependency docs

CLAUDE.md carries project-level AI-agent guidance (usage rules + skill entry points), auto-loaded if you use Claude Code / Cursor / similar IDE agents.

Status

Beta — small multi-user. Runs as a daily AI assistant for a household / small workgroup (a few members, one box on the LAN).

Multi-tenant by group + member — per-member code workspaces, personal/shared skills, per-channel binding — but no hard auth/RBAC yet: members are trusted, and bash runs unsandboxed (only the Deno engine is per-member sandboxed). Fine for a family/team; not for untrusted members or internet exposure.
The schema changes occasionally; no backward-compat promise.
Some paths (mixin LLM, full WeChat / Telegram chains) have light test coverage.
Deployment docs are single-node only.

Issues and PRs welcome, but there's no committed release cadence.

Roadmap

Near-term:

Multi-user via groups + members (per-member workspace, personal/shared skills, channel binding)
Group-level RBAC — member roles enforced at the data layer (for workgroups / semi-trusted members)
Consolidate the dual ReAct loop + tool families (retire the legacy Long.Agent.Loop / Long.Agent.Tools)
Pin + SHA-verify the Deno / Obscura binary downloads (supply chain)

Longer-term:

PubSub reconnect replay — buffer recent turn events so a reconnecting client doesn't miss mid-stream output
Timezone-aware scheduling (currently UTC-only)
Observability page — tool error rates, LLM retries, Deno / Obscura install status

Backlog (nice-to-have, not roadmap):

Framework-driven /manage forms (Mishka <.text_field> / AshPhoenix.Form) instead of hand-rolled <input>s

Acknowledgements

The Python GenericAgent was the original starting point.
Ash Framework lets data + domain logic be described together.
ReqLLM makes onboarding a new LLM provider a one-line alias.
Mishka Chelekom provides the full LiveView component library.
Anthropic Agent Skills defines the SKILL.md format.
Obscura forked Chromium into a clean CLI.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github/workflows		.github/workflows
assets		assets
config		config
lib		lib
priv		priv
rel/overlays/bin		rel/overlays/bin
test		test
.dockerignore		.dockerignore
.formatter.exs		.formatter.exs
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
docker-compose.yml		docker-compose.yml
install.sh		install.sh
mix.exs		mix.exs
mix.lock		mix.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Long

Design philosophy

GraphQL as the agent's primary skill

Web UI

Features

Architecture

Quick start

One-line install (macOS / Linux)

Docker

Run from source

Configure your first LLM

Install your first Skill (optional)

Channels & members

CLI tools

Configuration

Development

Status

Roadmap

Acknowledgements

License

About

Uh oh!

Releases 29

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Long

Design philosophy

GraphQL as the agent's primary skill

Web UI

Features

Architecture

Quick start

One-line install (macOS / Linux)

Docker

Run from source

Configure your first LLM

Install your first Skill (optional)

Channels & members

CLI tools

Configuration

Development

Status

Roadmap

Acknowledgements

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 29

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages