AllArkive v0.1 — alpha

A self-hostable, offline knowledge ark — Wikipedia, repair guides, books, and a local AI that can search them, all running on your own hardware.

What this is

AllArkive bundles three things that already exist into something one person can install in an afternoon.

  1. An offline knowledge archive — Wikipedia, Stack Exchange, Project Gutenberg, iFixit, and more, packaged as Kiwix ZIM files.
  2. A local AI — an open-weight LLM via Ollama and Open WebUI, running entirely on your machine. No cloud calls.
  3. A retrieval pipeline — RAG that lets the AI answer questions using the local archive, with citations back to the source.

Run it on a laptop, a home server, or a Raspberry Pi. Use it as a private research assistant. Keep it as a fallback for when the open web gets worse.

Why

The infrastructure of shared knowledge is more centralised, more surveilled, and more hostile to users than it has ever been. The tools to run a useful piece of it on cheap hardware already exist — they just hadn't been packaged together. We packaged them.

This is not a survival kit, a prepper bunker, or a doomsday cache. It's a library and a search tool, built on open weights and open content, that runs on your own machine. The framing is censorship resistance, privacy, and educational access.

What's in the default bundle

The balanced bundle, recommended for most laptops — roughly 23 GB of ZIM data plus a 5 GB model.

Smaller (minimal) and larger (comprehensive) bundles are documented in the bundle docs.

Install

git clone https://github.com/clupai8o0/allarkive.git
cd allarkive
cp compose/.env.example compose/.env
./scripts/bootstrap.sh

After install, open http://localhost:8080 for the local landing page.

Heads-up: after the stack starts, the RAG indexer keeps running in the background, embedding every ZIM chunk through your local Ollama. On CPU expect several hours for the balanced bundle, less for minimal. Leave it running — it's resumable and idempotent. Kiwix browsing at http://localhost:8081 works immediately; RAG answers improve as coverage grows.

System requirements

Architecture

Three layers, each replaceable.

┌──────────────────────────────────────────┐
│  Glue: landing page, docker-compose,     │
│  bootstrap, RAG pipeline, docs           │
├──────────────────────────────────────────┤
│  Local AI: Ollama + Open WebUI + RAG     │
├──────────────────────────────────────────┤
│  Archive: Kiwix serving ZIM files        │
└──────────────────────────────────────────┘

Full breakdown in the architecture doc.

Documentation

Pick your starting point. Each doc has its own page with a sidebar and on-page table of contents.

Read first

Install

Deployment patterns

Contributing and governance

Security and honesty

Status

v0.1 is alpha. First public release lands alongside our BSides Melbourne talk in May 2026.

Co-built by Sam and Sham. Standing on the shoulders of Kiwix, Ollama, Open WebUI, and the people who maintain the open archives we bundle.