Design and Visualization Notes
Why these graphs exist and why they were chosen. This page explains design intent, not interpretation rules.
Why KDE Instead of Histograms
Histogram bins impose hard boundaries that can distort GPX sampling structure. KDE gives a continuous density estimate where each observation contributes smoothly to nearby regions.
- continuous distribution shape (no bin-edge jumps)
- local influence from each point
- better visibility of mixed sampling regimes
Why Gaussian Kernels and Log-Scaled Axes
Time and distance deltas are positive and usually span multiple orders of magnitude. Linear spacing compresses dense near-origin structure and over-emphasizes long-tail outliers.
- KDE is computed in log space for better near-origin resolution
- tails remain visible without clipping
- values are still shown in linear units; only axis spacing is logarithmic
Why Bandwidth Is User-Controlled
There is no single correct smoothing level for raw GPX streams. Different analysis goals require different bandwidths.
- higher bandwidth for broad regime trends
- lower bandwidth for sharp fixed-interval peaks
- manual control keeps the smoothing tradeoff explicit
Why These Plot Types
Each view answers a distinct question.
Time/Distance KDE: dominant scales and multimodalityRug support: where observations exist along the axisTime-Distance scatter: joint behavior and isolated outliersSequence-aware scatter: local continuity by GPX index
Why Sequence-Aware Scatter Exists
Global scatter treats points as exchangeable and hides local temporal structure. Sequence-aware plotting preserves order and makes local behavior legible.
- distinguishes true pauses from sparse logging gaps
- shows stationary logging under fixed cadence
- exposes time-local transitions in recording behavior
Coverage caveat: Sequence-aware scatter is rendered from valid joint pairs only (where both time and distance deltas are usable). If the plot appears sparse or visually inconsistent, cross-check Track Details / Deep Inspector for anomalies and excluded points.
What These Plots Do Not Do
- do not prove GPX correctness
- do not score activity quality
- do not classify behavior categories
- do not infer causality
The visualization layer is observation-first: it surfaces structure and leaves interpretation to downstream analysis.