09-EFT.WP.Core.Density v1.0 | Appendix D — Formula Compendium and Sketch Derivations

Appendix D — Formula Compendium and Sketch Derivations

I. How to Use This Table and the Numbering Scheme

This appendix consolidates the book’s core formulas and gives the shortest derivation highlights. Numbering follows the unified scheme P91-*, S92-*, Mx-9*, I90-*. Any integral must name its measure and domain explicitly; any path-dependent quantity must state gamma(ell) and the path element d ell.
Variables, operators, and functions are marked inline with backticks, for example: rho(x,t), p(x), kde_h(x), S_xx(f), U_w, ENBW_Hz. Displayed formulas are presented as plain text.

II. Axioms and Measure Normalization (P91 series)

P91-1 (measure explicitness)
∫ p(x) dx = 1 ; ( ∫_V rho(x,t) dV ) defines a physical total. dx, dV are base measure elements.
P91-2 (unit/dim conservation)
dim( ∫ rho dV ) = dim(rho) * dim(V) ; never mix dimensionless p(x) with dimensional rho(x,t).
P91-3 (normalization discipline)
In discrete settings sum_i p_i = 1 ; histogram density p_hat = count / ( N * Delta ) (see S92-10).
Derivation highlights
- Leverages additivity and linearity of Lebesgue integrals.
- Uses dimensional homogeneity via check_dim(expr) to verify conservation.

III. Continuity and Conservation (S92-1, S92-2)

S92-1 (continuity with sources/sinks)
∂_t rho(x,t) + ∇•J(x,t) = s(x,t)
S92-2 (total-balance criterion)
d/dt ( ∫V rho dV ) = - ( ∮{∂V} J•n dS ) + ( ∫_V s dV )
Derivation highlights
- Integrate S92-1 over V and apply the divergence theorem ( ∫_V ∇•J dV ) = ( ∮_{∂V} J•n dS ).
- With s=0 and J•n|_{∂V}=0, the total M = ( ∫_V rho dV ) is constant in time.

IV. Probability Density, Likelihood, and Fisher Information (S92-3, S92-4)

S92-3 (likelihood for independent samples)
L(theta) = ∏_{i=1}^N p( x_i | theta )
S92-4 (Fisher information)
I_F(theta) = E[ ( ∂_theta log p(X|theta) )^T ( ∂_theta log p(X|theta) ) ]
Derivation highlights
- ∂_theta log L = ∑ ∂_theta log p(x_i|theta) ; the MLE satisfies ∂_theta log L = 0.
- For regular families, I_F = - E[ ∂^2_{theta} log p ]; variance lower bounds follow S92-17.

V. Kernel Density Estimation and Error (S92-5, S92-6)

S92-5 (1D KDE)
kde_h(x) = ( 1 / ( N h ) ) * ∑_{i=1}^N K( ( x - x_i ) / h )
S92-6 (mean integrated squared error, MISE)
MISE(h) = Var_term(h) + Bias_term(h)
AMISE(h) ≈ ( R(K) / ( N h ) ) + ( ( mu2(K)^2 / 4 ) * h^4 * R( f'' ) )
Derivation highlights
- E[ kde_h(x) ] = ( K_h * f )(x), so Bias ≈ ( mu2(K) * h^2 / 2 ) * f''(x).
- Var[ kde_h(x) ] ≈ ( f(x) * R(K) ) / ( N h ).
- Setting d(AMISE)/dh = 0 yields the optimal rate h* ∝ N^{-1/5} in 1D.

VI. Spatial / Spatio-temporal Intensity (S92-7)

S92-7 (intensity to counts)
Lambda(A) = ( ∫_A lambda(x) dV ), with N(A) ~ Poisson( Lambda(A) ) for Poisson processes.
Hawkes intensity (see Chapter 5)
lambda(t) = mu + ( ∑_k H(t - t_k) ), where the kernel H(•) satisfies stability ( ∫ H dt ) < 1.
Derivation highlights
- Campbell’s theorem: E[ N(A) ] = Lambda(A).
- Hawkes mean intensity: E[lambda] = mu / ( 1 - ∫ H ).

VII. Spectral Density and Window Normalization (S92-8, S92-9)

S92-8 (equivalent noise bandwidth)
ENBW_Hz = fs * ( ∑{n=0}^{N-1} w[n]^2 ) / ( ∑{n=0}^{N-1} w[n] )^2
S92-9 (window power coefficient)
U_w = ( 1 / N ) * ∑_{n=0}^{N-1} w[n]^2
Energy consistency (one-sided PSD, empirical)
∫_{0}^{fs/2} S_xx(f) df ≈ var(x) under proper window normalization and de-meaning.
Derivation highlights
- Welch’s method maps in-window variance to PSD via U_w and ENBW_Hz.
- For one-sided spectra, do not double the DC and Nyquist bins (volume-wide rule).

VIII. Discretization and Histogram Conservation (S92-10, S92-11)

S92-10 (histogram density)
p_hat[j] = count[j] / ( N * Delta )
S92-11 (voxelized total conservation)
mass_preserve = ( ∑_i rho_i * V_i ) (should equal the numerical approximation of the continuous integral).
Derivation highlights
- ∑_j p_hat[j] * Delta = 1.
- Grid refinement/coarsening must preserve ∑ rho_i V_i (see Mx-96).

IX. Regularization and Maximum Entropy (S92-12, S92-13)

S92-12 (variational sparse–smooth hybrid)
argmin_rho || A rho - y ||_2^2 + lambda * || ∇rho ||_1
S92-13 (MaxEnt solution with moment constraints)
p*(x) ∝ exp( ∑_{k=1}^m lambda_k * phi_k(x) )
Derivation highlights
- S92-12 uses first-order differences to approximate ∇rho; ADMM or PDHG are typical solvers.
- For S92-13, construct the Lagrangian L = -∫ p log p dx + ∑ lambda_k ( ∫ p phi_k dx - c_k ) + alpha ( ∫ p dx - 1 ) and take the variational derivative w.r.t. p to obtain the exponential family.

X. Normalization & Calibration (S92-14, S92-15) and the Two Arrival-Time Forms

S92-14 (standardization)
z = ( x - mu_x ) / sigma_x
S92-15 (change of variables with Jacobian)
p_Y(y) = p_X( x(y) ) * | det( ∂x / ∂y ) |
Two arrival-time forms (cross-volume anchor; see Core.Sea Chapter 8)
T_arr = ( 1 / c_ref ) * ( ∫{gamma(ell)} n_eff d ell )
T_arr = ( ∫{gamma(ell)} ( n_eff / c_ref ) d ell )
delta_form = | ( 1 / c_ref ) * ( ∫ n_eff d ell ) - ( ∫ ( n_eff / c_ref ) d ell ) |
Derivation highlights
- S92-15 follows from probability conservation P( x ∈ dx ) = P( y ∈ dy ).
- delta_form records implementation-disparity; it must be propagated through the dataflow (see Mx-98).

XI. Uncertainty and Information Bounds (S92-16, S92-17) and Propagation

S92-16 (Fisher information, matrix form)
I_F(theta) = E[ ( ∂_theta log p )^T ( ∂_theta log p ) ]
S92-17 (Cramer–Rao lower bound)
cov( theta_hat ) ≥ I_F(theta)^{-1}
Propagation and combination (metrological)
u_c^2 = ∑i ( c_i^2 * u(x_i)^2 ) + 2 * ∑{i<j} c_i c_j * cov(x_i,x_j)
U = k * u_c
Derivation highlights
- From zero-mean score and the information identity we obtain S92-17.
- The linearized Delta method for g(x) yields sensitivity coefficients c_i = ∂g/∂x_i.

XII. Quick Constants and Equivalences (Kernels / Windows)

Kernels (see Appendix C): mu2(K) and R(K) for Gaussian, Epanechnikov, Biweight, etc., used in S92-6 (AMISE).
Windows (see Appendix C): U_w, ENBW_Hz, coherent gain CG, used in S92-8/9 for energy consistency and amplitude correction.

XIII. Workflow Hooks (Mx-9 and I90-)**

Mx-93 (bandwidth selection): minimize a cross-validation score CV(h) as an AMISE proxy; output K(•), h, and error report.
Mx-95 (spectrum → energy checker): verify ∫ S_xx df ≈ var(x) given U_w and ENBW_Hz.
Mx-96 (grid refine/coarsen): preserve mass_preserve.
Mx-98 (multi-source alignment): record and propagate delta_form.
I90 key interfaces: kde_build/..., spectral_density/..., hist_density/..., change_of_variables/..., crlb/... must bind to their corresponding S92-* formulas and metadata.

XIV. Pre-Publication Consistency Checklist

Measure explicitness, unit homogeneity, and normalization closure; ∫ p dx = 1, ∑ p_i = 1, ∑ rho_i V_i conserved.
KDE reports K(•), h, CV(h) and the error regime (S92-5/6).
PSD reports window, U_w, ENBW_Hz, CG, and provides an energy reconciliation (S92-8/9).
Variable transforms include the Jacobian (S92-15); CRLB includes I_F and lower bounds (S92-16/17).
Any metric tied to arrival time includes both forms and delta_form, with an explicit gamma(ell) path reference.