fetch_ml/cmd
Jeremie Fraeys 6580917ba8
refactor: extract domain types and consolidate error system (Phases 1-2)
Phase 1: Extract Domain Types
=============================
- Create internal/domain/ package with canonical types:
  - domain/task.go: Task, Attempt structs
  - domain/tracking.go: TrackingConfig and MLflow/TensorBoard/Wandb configs
  - domain/dataset.go: DatasetSpec
  - domain/status.go: JobStatus constants
  - domain/errors.go: FailureClass system with classification functions
  - domain/doc.go: package documentation

- Update queue/task.go to re-export domain types (backward compatibility)
- Update TUI model/state.go to use domain types via type aliases
- Simplify TUI services: remove ~60 lines of conversion functions

Phase 2: Delete ErrorCategory System
====================================
- Remove deprecated ErrorCategory type and constants
- Remove TaskError struct and related functions
- Remove mapping functions: ClassifyError, IsRetryable, GetUserMessage, RetryDelay
- Update all queue implementations to use domain.FailureClass directly:
  - queue/metrics.go: RecordTaskFailure/Retry now take FailureClass
  - queue/queue.go: RetryTask uses domain.ClassifyFailure
  - queue/filesystem_queue.go: RetryTask and MoveToDeadLetterQueue updated
  - queue/sqlite_queue.go: RetryTask and MoveToDeadLetterQueue updated

Lines eliminated: ~190 lines of conversion and mapping code
Result: Single source of truth for domain types and error classification
2026-02-17 12:34:28 -05:00
..
api-server feat(api): refactor websocket handlers; add health and prometheus middleware 2026-01-05 12:31:07 -05:00
configlint ci: align workflows, build scripts, and docs with current architecture 2026-01-05 12:34:23 -05:00
data_manager ci: align workflows, build scripts, and docs with current architecture 2026-01-05 12:34:23 -05:00
db-utils ci: align workflows, build scripts, and docs with current architecture 2026-01-05 12:34:23 -05:00
performance-regression-detector chore(build): update build system, scripts, and additional tests 2026-02-12 12:05:55 -05:00
profiler chore(build): update build system, scripts, and additional tests 2026-02-12 12:05:55 -05:00
tui refactor: extract domain types and consolidate error system (Phases 1-2) 2026-02-17 12:34:28 -05:00
user_manager Fix multi-user authentication and clean up debug code 2025-12-06 12:35:32 -05:00
worker feat(worker): add integrity checks, snapshot staging, and prewarm support 2026-01-05 12:31:13 -05:00