Changelog

All notable changes to FlameIQ are documented here.

The format follows Keep a Changelog. FlameIQ adheres to Semantic Versioning.

1.0.0 — 2026-03-01

The first stable release of FlameIQ.

Core engine

Deterministic baseline vs. current snapshot comparison (compare_snapshots())
Configurable per-metric thresholds with direction-aware evaluation (latency.* → higher-is-worse; throughput → lower-is-worse; custom.* → absolute deviation)
Warning zone detection: metrics within 5 percentage points of their threshold emit a WARNING without failing the build
Typed exception hierarchy with 12 exception classes (flameiq.core.errors)
Machine-readable ComparisonResult.to_dict() for CI JSON output

Schema v1

Baseline management

Three strategies: last_successful, rolling_median, tagged
Local filesystem storage: JSON baseline + append-only JSONL history (BaselineStore)
Zero external services — fully offline

Statistical engine

Optional Mann-Whitney U test (non-parametric, distribution-free) (mann_whitney_compare())
Cohen’s d effect size with verbal labels (negligible / small / medium / large)
Noise-resistant median filter with warmup discard (noise_filter_median())
Configurable confidence level (default: 95%)

CLI

Providers

HTML report

Tooling

Documentation (https://docs.flameiq.dev)

Getting Started: installation, quick start, CI integration
User Guides: configuration, baseline strategies, custom providers
CLI Reference (all commands)
Architecture: overview, layers, schema design
Specifications: Schema v1, Statistical Methodology, Threshold Algorithm, Exit Codes
API Reference: all public modules
Contributing: development setup, RFC process, testing standards

Changes to be included in the next release.

Note

Add items here during development. They will be moved to a versioned section at release time.