fetch_ml/cmd
Jeremie Fraeys 43e6446587
feat(scheduler): implement multi-tenant job scheduler with gang scheduling

Add new scheduler component for distributed ML workload orchestration:
- Hub-based coordination for multi-worker clusters
- Pacing controller for rate limiting job submissions
- Priority queue with preemption support
- Port allocator for dynamic service discovery
- Protocol handlers for worker-scheduler communication
- Service manager with OS-specific implementations
- Connection management and state persistence
- Template system for service deployment

Includes comprehensive test suite:
- Unit tests for all core components
- Integration tests for distributed scenarios
- Benchmark tests for performance validation
- Mock fixtures for isolated testing

Refs: scheduler-architecture.md
2026-02-26 12:03:23 -05:00
api-server refactor(go): Update Go commands and TUI controller 2026-02-23 14:13:14 -05:00
audit-verifier feat(audit): Tamper-evident audit chain verification system 2026-02-23 19:43:50 -05:00
configlint ci: align workflows, build scripts, and docs with current architecture 2026-01-05 12:34:23 -05:00
data_manager refactor(dependency-hygiene): Move path functions from config to storage 2026-02-17 21:15:23 -05:00
db-utils ci: align workflows, build scripts, and docs with current architecture 2026-01-05 12:34:23 -05:00
errors feat: implement research-grade maintainability phases 1,3,4,7 2026-02-18 15:27:50 -05:00
gen-keys refactor(go): Update Go commands and TUI controller 2026-02-23 14:13:14 -05:00
performance-regression-detector chore(build): update build system, scripts, and additional tests 2026-02-12 12:05:55 -05:00
profiler chore(build): update build system, scripts, and additional tests 2026-02-12 12:05:55 -05:00
scheduler feat(scheduler): implement multi-tenant job scheduler with gang scheduling 2026-02-26 12:03:23 -05:00
tui refactor(go): Update Go commands and TUI controller 2026-02-23 14:13:14 -05:00
user_manager feat: implement Argon2id hashing and Ed25519 manifest signing 2026-02-19 15:34:20 -05:00
worker feat(worker): add integrity checks, snapshot staging, and prewarm support 2026-01-05 12:31:13 -05:00