fetch_ml/cmd
Jeremie Fraeys 0b5e99f720
refactor(scheduler,worker): improve service management and GPU detection
Scheduler enhancements:
- auth.go: Group membership validation in authentication
- hub.go: Task distribution with group affinity
- port_allocator.go: Dynamic port allocation with conflict resolution
- scheduler_conn.go: Connection pooling and retry logic
- service_manager.go: Lifecycle management for scheduler services
- service_templates.go: Template-based service configuration
- state.go: Persistent state management with recovery

Worker improvements:
- config.go: Extended configuration for task visibility rules
- execution/setup.go: Sandboxed execution environment setup
- executor/container.go: Container runtime integration
- executor/runner.go: Task runner with visibility enforcement
- gpu_detector.go: Robust GPU detection (NVIDIA, AMD, Apple Silicon, CPU fallback)
- integrity/validate.go: Data integrity validation
- lifecycle/runloop.go: Improved runloop with graceful shutdown
- lifecycle/service_manager.go: Service lifecycle coordination
- process/isolation.go + isolation_unix.go: Process isolation with namespaces/cgroups
- tenant/manager.go: Multi-tenant resource isolation
- tenant/middleware.go: Tenant context propagation
- worker.go: Core worker with group-scoped task execution
2026-03-08 13:03:15 -04:00
..
api-server refactor(go): Update Go commands and TUI controller 2026-02-23 14:13:14 -05:00
audit-verifier feat(audit): Tamper-evident audit chain verification system 2026-02-23 19:43:50 -05:00
configlint ci: align workflows, build scripts, and docs with current architecture 2026-01-05 12:34:23 -05:00
data_manager refactor(dependency-hygiene): Move path functions from config to storage 2026-02-17 21:15:23 -05:00
db-utils refactor(utilities): update supporting modules for scheduler integration 2026-02-26 12:07:15 -05:00
errors feat: implement research-grade maintainability phases 1,3,4,7 2026-02-18 15:27:50 -05:00
gen-keys refactor(go): Update Go commands and TUI controller 2026-02-23 14:13:14 -05:00
performance-regression-detector chore(build): update build system, scripts, and additional tests 2026-02-12 12:05:55 -05:00
profiler chore(build): update build system, scripts, and additional tests 2026-02-12 12:05:55 -05:00
scheduler refactor(scheduler,worker): improve service management and GPU detection 2026-03-08 13:03:15 -04:00
tui security: improve audit, crypto, and config handling 2026-03-04 13:23:42 -05:00
user_manager feat: implement Argon2id hashing and Ed25519 manifest signing 2026-02-19 15:34:20 -05:00
worker feat(worker): integrate scheduler endpoints and security hardening 2026-02-26 12:06:16 -05:00