title: Runbook: ML Model Loading Failure
description: Diagnose and resolve failures when a model fails to load in AI-Box.
icon: material/cpu-64-bit

Model Loading Failure

Impact: Critical — service start failure

Failure to load an optional model (e.g., S1 when S1_BACKEND=hf, or a reranker) can prevent startup or cause a crash on first use.

Triage (≤5 minutes)

1. Inspect container logs:
   ```
   docker compose logs --tail=200 ai-box
   ```
   Look for Python tracebacks that mention model IDs or paths.
2. Identify the failing component:
   - S1 (AIB-15): controlled by ENABLE_AIB_15, S1_BACKEND, S1_MODEL_ID
   - Reranker: controlled by ENABLE_RERANK, RERANK_MODEL_ID
3. Check env/config:
   - Confirm that the model paths/IDs are correct and that heavy dependencies are required only by the features you have enabled.
   - The default image may not include transformers/torch; using S1_BACKEND=hf without them will fail by design.
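The first triage pass over the logs can be partly scripted. The following is a minimal sketch and not part of AI-Box: `classify_failure` is a hypothetical helper, and the patterns it matches are assumptions about what the traceback contains, so adjust them to what your logs actually show.

```shell
# Hypothetical helper: reads log text on stdin and prints a hint about
# which part of this runbook to try first. All patterns are assumptions.
# Usage: docker compose logs --tail=200 ai-box 2>&1 | classify_failure
classify_failure() {
  log="$(cat)"
  if printf '%s\n' "$log" | grep -Eq "No module named '?(transformers|torch)"; then
    # Heavy dependencies are absent from the image.
    echo "missing-deps: image lacks transformers/torch (check S1_BACKEND)"
  elif printf '%s\n' "$log" | grep -Eq 'FileNotFoundError|404 Client Error'; then
    echo "A: model not found"
  elif printf '%s\n' "$log" | grep -Eiq 'out of memory|MemoryError|Killed'; then
    echo "C: insufficient memory"
  else
    echo "unknown: read the full traceback"
  fi
}
```

The output is only a hint. A match on one pattern does not rule out the other causes, so still read the traceback itself.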

A) Model not found (FileNotFoundError / Hub 404)

If you mount local models, confirm the volume mapping:

services:
  ai-box:
    volumes:
      - ./models:/models
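A mapping that is correct in the compose file can still point at a host directory that is missing or empty. As a rough sketch, the hypothetical `check_model_dir` helper below checks only that a directory exists and has at least one entry. It does not validate the model files themselves.

```shell
# Hypothetical helper: succeeds only if DIR exists and contains at least one entry.
check_model_dir() {
  dir="$1"
  if [ ! -d "$dir" ]; then
    echo "missing: $dir"
    return 1
  fi
  if [ -z "$(ls -A "$dir")" ]; then
    echo "empty: $dir"
    return 1
  fi
  echo "ok: $dir"
}
# On the host:           check_model_dir ./models
# Inside the container:  docker compose exec ai-box ls -A /models
```

If the host check passes but the listing inside the container is empty, the volume is probably not mounted where the service expects it.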

B) Corrupted cache

Remove the local Hugging Face cache inside the container, then restart the service:

docker compose exec ai-box bash -lc 'rm -rf ~/.cache/huggingface/*'
docker compose restart ai-box
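The cleanup command assumes the cache lives at the default location. huggingface_hub resolves its cache root from `HF_HOME`, falling back to `$XDG_CACHE_HOME/huggingface` and then `~/.cache/huggingface`. Whether the AI-Box image sets either variable is not stated here, so treat the hypothetical `hf_cache_dir` helper below as a sketch for checking the path before deleting anything.

```shell
# Hypothetical helper: prints the directory huggingface_hub would use as its
# cache root, based on the environment variables described above.
hf_cache_dir() {
  printf '%s\n' "${HF_HOME:-${XDG_CACHE_HOME:-$HOME/.cache}/huggingface}"
}
# Same expansion evaluated inside the container:
#   docker compose exec ai-box bash -lc 'echo "${HF_HOME:-${XDG_CACHE_HOME:-$HOME/.cache}/huggingface}"'
```

If the printed path is not `~/.cache/huggingface`, point the `rm -rf` at that path instead.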

C) Insufficient memory (OOM / exit 137)
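As background for the exit code in this heading: a process killed by a signal is reported with exit code 128 plus the signal number, so 137 corresponds to signal 9 (SIGKILL), which is what the kernel OOM killer sends. The sketch below is illustrative only. `signal_from_exit` is a hypothetical helper, and the commented `docker inspect` line assumes you substitute the real container name.

```shell
# Hypothetical helper: decodes an exit code above 128 into the signal number
# that terminated the process; other codes are printed unchanged.
signal_from_exit() {
  code="$1"
  if [ "$code" -gt 128 ]; then
    echo "signal $((code - 128))"
  else
    echo "exit $code"
  fi
}
# To see whether Docker recorded an OOM kill for a stopped container:
#   docker inspect --format '{{.State.OOMKilled}} {{.State.ExitCode}}' <container>
```

Exit 137 alone is not proof of memory exhaustion, since any SIGKILL produces it. The `OOMKilled` field is the more direct indicator.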

D) Optional feature — disable to restore service