Skip to content

docs(environment-setup): local-disk requirement + datasets on a network mount#58

Open
saadqbal wants to merge 1 commit into
mainfrom
docs/743-nfs-storage
Open

docs(environment-setup): local-disk requirement + datasets on a network mount#58
saadqbal wants to merge 1 commit into
mainfrom
docs/743-nfs-storage

Conversation

@saadqbal

@saadqbal saadqbal commented Jun 16, 2026

Copy link
Copy Markdown
Contributor

What

Document the storage rules introduced for tracebloc/backend#743: the workspace database must be on a local disk, and large datasets can live on a network (NFS) mount.

Changes

  • setup-guide: a <Warning> that HOST_DATA_DIR must be a local disk (the installer preflight fails fast on NFS/CIFS/SMB); datasets can use a network mount via HOST_DATASET_DIR.
  • configuration: HOST_DATA_DIR row note + a new HOST_DATASET_DIR row; a Storage-section <Note> on the MySQL-local / datasets-on-NFS split and running ingestion as the mount owner uid (root_squash-safe).
  • troubleshooting: a "MySQL CrashLoopBackOff → data dir on NFS" entry.

No docs.json nav change (existing pages only).

Documents

The tracebloc/client (installer + chart) and tracebloc/client-runtime (ingestor uid) changes for backend#743.

🤖 Generated with Claude Code


Note

Low Risk
Documentation-only updates to environment-setup pages; no runtime or installer behavior changes in this repo.

Overview
Documents backend#743 storage rules: workspace MySQL must live on local disk, while large datasets can use a separate network mount.

setup-guide adds a <Warning> that HOST_DATA_DIR must be local (installer preflight rejects NFS/CIFS/SMB) and points readers to HOST_DATASET_DIR for network-mounted datasets.

configuration expands the installer table (HOST_DATA_DIR local-disk note, new HOST_DATASET_DIR row) and adds a Storage <Note> on keeping DB/logs local vs relocating only the dataset volume (hostPath.datasetPath / /tracebloc-data) and running ingestion as the mount owner uid for NFS root_squash.

troubleshooting adds a local k3d entry for MySQL CrashLoopBackOff when the data dir is on a network filesystem, with the same local vs HOST_DATASET_DIR split.

Reviewed by Cursor Bugbot for commit 9b6bb39. Bugbot is set up for automated code reviews on this repo. Configure here.

…rk mount

Documents the storage rules introduced for backend#743:

- setup-guide: Warning that HOST_DATA_DIR must be a local disk (the installer
  preflight fails fast on NFS/CIFS/SMB); large datasets can live on a network
  mount via HOST_DATASET_DIR.
- configuration: HOST_DATA_DIR row note + new HOST_DATASET_DIR row; a Storage
  section Note on the MySQL-local / datasets-on-NFS split and running ingestion
  as the mount owner uid (root_squash-safe).
- troubleshooting: MySQL CrashLoopBackOff -> data dir on NFS entry.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@LukasWodka

Copy link
Copy Markdown
Contributor

👋 Heads-up — Code review queue is at 18 / 8

Above the WIP limit. The team convention is to review existing PRs before opening new work.

Open PRs currently in Code review (oldest first):

  • .github#57 — fix(fr-gate): pass items at or beyond the required stage · author: @aptracebloc · no reviewer assigned
  • backend#812 — fix(datasets): subset label check for test-dataset compatibility (#811) · author: @saadqbal · no reviewer assigned
  • cli#78 — fix(dataset rm): delete staging files from a uid-65532 pod, not jobs-manager (#259) · author: @LukasWodka · no reviewer assigned
  • cli#79 — chore(schema): re-sync vendored ingest.v1.json from data-ingestors master · author: @LukasWodka · no reviewer assigned
  • client#261 — feat(installer): fail fast when HOST_DATA_DIR is on a network filesystem · author: @saadqbal · no reviewer assigned
  • client#262 — feat(installer,chart): place datasets on a network mount while MySQL stays local · author: @saadqbal · no reviewer assigned
  • client-runtime#108 — fix(authz): match ingest table prefixes at a segment boundary (close cross-tenant straddle) · author: @LukasWodka · no reviewer assigned
  • client-runtime#114 — fix(jobs): cap training Job backoffLimit to stop crashloops starving the cluster · author: @saadqbal · no reviewer assigned
  • client-runtime#115 — feat(ingestion): run the ingestion pod as the host uid for datasets on NFS · author: @saadqbal · no reviewer assigned
  • data-ingestors#270 — docs(releasing): correct ingestor rollout — floating tag + imagePullPolicy=Always, not INGESTOR_IMAGE_DIGEST rewrite · author: @saadqbal · no reviewer assigned

Pull from review before opening new work. (This is a nudge from the kanban WIP check, not a block.)

@mintlify

mintlify Bot commented Jun 16, 2026

Copy link
Copy Markdown
Contributor

Preview deployment for your docs. Learn more about Mintlify Previews.

Project Status Preview Updated (UTC)
tracebloc 🟢 Ready View Preview Jun 16, 2026, 2:49 PM

💡 Tip: Enable Workflows to automatically generate PRs for you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants