History

Jeremie Fraeys 3187ff26ea refactor: complete maintainability phases 1-9 and fix all tests Test fixes (all 41 test packages now pass): - Fix ComputeTaskProvenance - add dataset_specs JSON output - Fix EnforceTaskProvenance - populate all metadata fields in best-effort mode - Fix PrewarmNextOnce - preserve prewarm state when queue empty - Fix RunManifest directory creation in SetupJobDirectories - Add ManifestWriter to test worker (simpleManifestWriter) - Fix worker ID mismatch (use cfg.WorkerID) - Fix WebSocket binary protocol responses - Implement all WebSocket handlers: QueueJob, QueueJobWithSnapshot, StatusRequest, CancelJob, Prune, ValidateRequest (with run manifest validation), LogMetric, GetExperiment, DatasetList/Register/Info/Search Maintainability phases completed: - Phases 1-6: Domain types, error system, config boundaries, worker/API/queue splits - Phase 7: TUI cleanup - reorganize model package (jobs.go, messages.go, styles.go, keys.go) - Phase 8: MLServer unification - consolidate worker + TUI into internal/network/mlserver.go - Phase 9: CI enforcement - add scripts/ci-checks.sh with 5 checks: * No internal/ -> cmd/ imports * domain/ has zero internal imports * File size limit (500 lines, rigid) * No circular imports * Package naming conventions Documentation: - Add docs/src/file-naming-conventions.md - Add make ci-checks target Lines changed: +756/-36 (WebSocket fixes), +518/-320 (TUI), +263/-20 (Phase 8-9)		2026-02-17 20:32:14 -05:00
..
benchmarks	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
lib	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
maintenance	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
testing	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
ci-checks.sh	refactor: complete maintainability phases 1-9 and fix all tests	2026-02-17 20:32:14 -05:00
ci-test.sh	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
manage-artifacts.sh	docs: update README and CHANGELOG	2026-02-16 20:38:57 -05:00
README.md	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
setup_monitoring.py	chore(ops): reorganize deployments/monitoring and remove legacy scripts	2026-01-05 12:31:26 -05:00
smoke-test-native.sh	docs: add native libraries documentation and smoke tests	2026-02-16 20:38:46 -05:00
smoke-test.sh	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
track_performance.sh	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00
verify_release.sh	chore(build): update build system, scripts, and additional tests	2026-02-12 12:05:55 -05:00

README.md

Scripts Directory

This directory contains setup and utility scripts for FetchML.

Production Scripts

`setup-prod.sh`

Purpose: Automated production setup for Rocky Linux bare metal deployment
Usage: sudo ./scripts/setup-prod.sh [base_path] [user] [group]
What it does:

Creates system user and groups
Sets up directory structure (/data/ml-experiments/*)
Installs dependencies (Go, Podman, Redis)
Configures GPU support for Podman
Creates systemd service files
Sets up log rotation

Example:

sudo ./scripts/setup-prod.sh /data/ml-experiments ml-user ml-group

Configuration validation

Validate configs using the built-in config lint targets:

make configlint
make worker-configlint

Cleanup Recommendation

These legacy scripts can be removed or archived. The current production setup only needs:

setup-prod.sh

Usage Workflow

First-Time Production Setup

# 1. Run production setup
sudo ./scripts/setup-prod.sh

# 2. Copy and configure
sudo cp configs/api/prod.yaml /etc/fetch_ml/config.yaml
sudo cp configs/workers/worker-prod.toml /etc/fetch_ml/worker.toml
sudo vim /etc/fetch_ml/config.yaml  # Update API keys, etc.

# 3. Build and install
make prod
sudo make install

# 4. Validate
./bin/configlint --schema configs/schema/api_server_config.yaml /etc/fetch_ml/config.yaml
./bin/configlint --schema configs/schema/worker_config_schema.yaml /etc/fetch_ml/worker.toml

# 5. Start services
sudo systemctl start fetchml-api fetchml-worker
sudo systemctl enable fetchml-api fetchml-worker

Development Setup (macOS)

# Use docker-compose for local development
docker-compose up -d

# Or run components directly
make dev
./bin/api-server -config configs/api/dev.yaml

Script Maintenance

When adding new scripts:

Add executable permission: chmod +x scripts/new-script.sh
Add header comment with purpose and usage
Update this README
Use consistent error handling and logging