No description

Find a file

Jeremie Fraeys 260e18499e feat: Research features - narrative fields and outcome tracking Add comprehensive research context tracking to jobs: - Narrative fields: hypothesis, context, intent, expected_outcome - Experiment groups and tags for organization - Run comparison (compare command) for diff analysis - Run search (find command) with criteria filtering - Run export (export command) for data portability - Outcome setting (outcome command) for experiment validation Update queue and requeue commands to support narrative fields. Add narrative validation to manifest validator. Add WebSocket handlers for compare, find, export, and outcome operations. Includes E2E tests for phase 2 features.		2026-02-18 21:27:05 -05:00
.forgejo/workflows	chore: update configurations and deployment files	2026-02-16 20:38:19 -05:00
.gitea	ci: migrate from GitHub to Forgejo/Gitea	2026-02-12 12:05:00 -05:00
.windsurf	chore: add review workflow and test updates	2026-02-16 20:39:09 -05:00
build	chore: update Docker build configuration	2026-02-16 20:38:50 -05:00
cli	feat: Research features - narrative fields and outcome tracking	2026-02-18 21:27:05 -05:00
cmd	security: implement comprehensive secrets protection	2026-02-18 16:18:09 -05:00
configs	chore: update configurations and deployment files	2026-02-16 20:38:19 -05:00
db	feat: add GitHub workflows and development tooling	2025-12-04 16:56:25 -05:00
deployments	feat: add TUI SSH usability testing infrastructure	2026-02-18 17:48:02 -05:00
docs	refactor: complete maintainability phases 1-9 and fix all tests	2026-02-17 20:32:14 -05:00
examples	Slim and secure: move scripts, clean configs, remove secrets	2025-12-07 13:57:51 -05:00
internal	feat: Research features - narrative fields and outcome tracking	2026-02-18 21:27:05 -05:00
monitoring	chore(ops): reorganize deployments/monitoring and remove legacy scripts	2026-01-05 12:31:26 -05:00
native	feat: implement C++ native libraries for performance-critical operations	2026-02-16 20:38:04 -05:00
podman	refactor: reorganize podman directory structure	2026-02-18 16:40:46 -05:00
redis	Slim and secure: move scripts, clean configs, remove secrets	2025-12-07 13:57:51 -05:00
scripts	chore: move detect_native.go and setup_monitoring.py to dev/	2026-02-18 17:57:57 -05:00
tests	feat: Research features - narrative fields and outcome tracking	2026-02-18 21:27:05 -05:00
tools	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
.dockerignore	chore(repo): add dockerignore, changelog, and ignore local artifacts	2026-01-05 12:30:57 -05:00
.env.example	chore(repo): add dockerignore, changelog, and ignore local artifacts	2026-01-05 12:30:57 -05:00
.flake8	feat: initialize FetchML ML platform with core project structure	2025-12-04 16:52:09 -05:00
.gitignore	chore: gitignore generated SSH test keys	2026-02-18 17:48:48 -05:00
.golangci.yml	ci: align workflows, build scripts, and docs with current architecture	2026-01-05 12:34:23 -05:00
.golintrc	Fix multi-user authentication and clean up debug code	2025-12-06 12:35:32 -05:00
AGENTS.md	chore(cleanup): remove legacy artifacts and add tooling configs	2026-02-12 12:06:09 -05:00
CHANGELOG.md	refactor: replace panic with error returns and update maintenance	2026-02-18 14:44:21 -05:00
DEVELOPMENT.md	docs: comprehensive documentation updates	2026-02-12 12:05:27 -05:00
go.mod	refactor(dependency-hygiene): Fix Redis leak, simplify TUI wrapper, clean go.mod	2026-02-17 21:13:49 -05:00
go.sum	refactor(dependency-hygiene): Fix Redis leak, simplify TUI wrapper, clean go.mod	2026-02-17 21:13:49 -05:00
LICENSE	ci: align workflows, build scripts, and docs with current architecture	2026-01-05 12:34:23 -05:00
Makefile	refactor: reorganize scripts into categorized structure	2026-02-18 17:56:59 -05:00
pyproject.toml	feat: initialize FetchML ML platform with core project structure	2025-12-04 16:52:09 -05:00
README.md	docs: update README and CHANGELOG	2026-02-16 20:38:57 -05:00
SECURITY.md	refactor: extract domain types and consolidate error system (Phases 1-2)	2026-02-17 12:34:28 -05:00

README.md

FetchML

A lightweight ML experiment platform with a tiny Zig CLI and a Go backend. Designed for homelabs and small teams.

Installation (recommended)

FetchML publishes pre-built release artifacts (CLI + Go services) on GitHub Releases.

If you prefer a one-shot check (recommended for most users), you can use:

./scripts/verify_release.sh --dir . --repo <org>/<repo>

Download the right archive for your platform
Verify checksums.txt signature (recommended)

The release includes a signed checksums.txt plus:

checksums.txt.sig
checksums.txt.cert

Verify the signature (keyless Sigstore) using cosign:

cosign verify-blob \
  --certificate checksums.txt.cert \
  --signature checksums.txt.sig \
  --certificate-identity-regexp "^https://github.com/jfraeysd/fetch_ml/.forgejo/workflows/release-mirror.yml@refs/tags/v.*$" \
  --certificate-oidc-issuer https://token.actions.githubusercontent.com \
  checksums.txt

Verify the SHA256 checksum against checksums.txt
Extract and install

Example (CLI on Linux x86_64):

# Download
curl -fsSLO https://github.com/jfraeysd/fetch_ml/releases/download/<tag>/ml-linux-x86_64.tar.gz
curl -fsSLO https://github.com/jfraeysd/fetch_ml/releases/download/<tag>/checksums.txt
curl -fsSLO https://github.com/jfraeysd/fetch_ml/releases/download/<tag>/checksums.txt.sig
curl -fsSLO https://github.com/jfraeysd/fetch_ml/releases/download/<tag>/checksums.txt.cert

# Verify
cosign verify-blob \
  --certificate checksums.txt.cert \
  --signature checksums.txt.sig \
  --certificate-identity-regexp "^https://github.com/jfraeysd/fetch_ml/.forgejo/workflows/release-mirror.yml@refs/tags/v.*$" \
  --certificate-oidc-issuer https://token.actions.githubusercontent.com \
  checksums.txt
sha256sum -c --ignore-missing checksums.txt

# Install
tar -xzf ml-linux-x86_64.tar.gz
chmod +x ml-linux-x86_64
sudo mv ml-linux-x86_64 /usr/local/bin/ml

ml --help

Quick start

# Clone and run (dev)
git clone <your-repo>
cd fetch_ml
make dev-up

# Or build the CLI locally
cd cli && make all
./zig-out/bin/ml --help

What you get

Zig CLI (ml): Tiny, fast local client. Uses ~/.ml/config.toml and FETCH_ML_CLI_* env vars.
Go backends: API server, worker, and a TUI for richer remote features.
TUI over SSH: ml monitor launches the TUI on the server, keeping the local CLI minimal.
CI/CD: Cross‑platform builds with zig build-exe and Go releases.

CLI usage

# Configure
cat > ~/.ml/config.toml <<EOF
worker_host = "127.0.0.1"
worker_user = "dev_user"
worker_base = "/tmp/ml-experiments"
worker_port = 22
api_key = "your-api-key"
EOF

# Core commands
ml status
ml queue my-job
ml cancel my-job
ml dataset list
ml monitor  # SSH to run TUI remotely

Phase 1 (V1) notes

Task schema supports optional snapshot_id (opaque identifier) and dataset_specs (structured dataset inputs). If dataset_specs is present it takes precedence over legacy datasets / --datasets args.
Snapshot restore (S1) stages verified snapshot_id into each task workspace and exposes it via FETCH_ML_SNAPSHOT_DIR and FETCH_ML_SNAPSHOT_ID. If snapshot_store.enabled: true in the worker config, the worker will pull <prefix>/<snapshot_id>.tar.gz from an S3-compatible store (e.g. MinIO), verify snapshot_sha256, and cache it under data_dir/snapshots/sha256/<snapshot_sha256>.
Prewarm (best-effort) can fetch datasets for the next queued task while another task is running. Prewarm state is surfaced in ml status --json under the optional prewarm field.
Env prewarm (best-effort) can build a warmed Podman image keyed by deps_manifest_sha256 and reuse it for later tasks.

Changelog

See CHANGELOG.md.

Build

Native C++ Libraries (Optional)

FetchML includes optional C++ native libraries for performance. See docs/src/native-libraries.md for detailed build instructions.

Quick start:

make native-build        # Build native libs
make native-smoke        # Run smoke test
export FETCHML_NATIVE_LIBS=1  # Enable at runtime

Standard Build

# CLI (Zig)
cd cli && make all      # release-small
make tiny              # extra-small
make fast              # release-fast

# Go backends
make cross-platform    # builds for Linux/macOS/Windows

Deploy

Dev: docker-compose up -d
Prod: Use the provided systemd units or containers on Rocky Linux.

Docs

See docs/ for detailed guides:

docs/src/native-libraries.md – Native C++ libraries (build, test, deploy)
docs/src/zig-cli.md – CLI reference
docs/src/quick-start.md – Full setup guide
docs/src/deployment.md – Production deployment

Source code

The FetchML source code is intentionally not hosted on GitHub.

The canonical source repository is available at: <SOURCE_REPO_URL>.

License

FetchML is source-available for transparency and auditability. It is not open-source.

See LICENSE.

README.md Unescape Escape