Probability Model and Symbol Mapping
This page defines the SPIDER probabilistic model in symbols, then maps each symbol to configuration keys.
Equations intentionally use symbols only (no config key names inside math).
Quick equation summary
Use this block as a one-screen reference for the full model:
where \(\rho(\cdot)\) is selected from Gaussian, Laplace, Student-\(t\), or Huber.
Operational phase policy:
Phase 1 (MAP warmup) uses robust independent likelihoods to damp outlier influence and support outlier identification.
Phases 2-4 (sampling) use the correlated Gaussian likelihood with shared-event structure.
Quick-summary notation:
\(d_n\): observed differential time for datum \(n\)
\(\hat d_n\): model-predicted differential time for datum \(n\)
\(r_n\): residual, \(r_n=d_n-\hat d_n\)
\(u_n\): standardized residual, \(u_n=r_n/\sigma_n\)
\(\sigma_n\): phase-dependent scale for datum \(n\)
\(\rho(\cdot)\): per-datum robust penalty (Gaussian, Laplace, Student-\(t\), or Huber form)
\(B\): number of observations in the current likelihood batch
\(g\): station-phase group index in the correlated model
\(\mathbf{r}_g\): residual vector for group \(g\)
\(\boldsymbol{\Sigma}_g\): group covariance in the correlated model
\(N\): total number of observations in the full dataset
\(\Delta \mathbf{Z}\): stacked event perturbations across all events
Notation conventions
The symbols below are used throughout the page:
Symbol |
Definition |
|---|---|
\(n\) |
Observation index (differential-time row) |
\(i,j\) |
Event indices |
\(M\) |
Number of events |
\(B\) |
Number of observations in a batch used by the likelihood term |
\(N\) |
Total number of observations in the full dataset |
\(\mathbf{z}_i\) |
Event state for event \(i\) (space + origin-time component) |
\(\Delta \mathbf{z}_i\) |
Perturbation for event \(i\) |
\(\Delta \mathbf{Z}\) |
Collection of all event perturbations \(\{\Delta \mathbf{z}_i\}_{i=1}^M\) |
\(\mathbf{x}_i\) |
Spatial part of event state for event \(i\) |
\(t_i\) |
Origin-time part of event state for event \(i\) |
\(T(\mathbf{x}, s, \varphi)\) |
Travel-time surrogate evaluated at event location \(\mathbf{x}\), receiver \(s\), phase \(\varphi\) |
\(\sigma_P,\sigma_S\) |
Phase-specific residual scales |
\(\tau_P,\tau_S\) |
Phase-specific shared-event random-effect scales |
\(\mathbf{I}\) |
Identity matrix of appropriate dimension |
1) Forward model and residuals
For each differential-time datum \(n\), let:
\((i_n, j_n)\) be the event pair
\(s_n\) be the receiver
\(\varphi_n \in \{P,S\}\) be the phase
\(d_n\) be the observed differential time
Write each event state as \(\mathbf{z}_i=[\mathbf{x}_i^\top\; t_i]^\top\), where \(\mathbf{x}_i\) is spatial location and \(t_i\) is origin time.
Event state is represented as:
Predicted differential time:
Residual:
Phase-dependent scale:
2) Independent residual likelihood family (Phase 1 robust path)
Define standardized residual \(u_n = r_n / \sigma_n\).
The per-observation negative log-likelihood is:
Gaussian
Laplace
Student-\(t\) (fixed degrees of freedom \(\nu\))
where \(C(\nu)\) is the Student-\(t\) normalization constant (depends only on \(\nu\)).
Huber (threshold \(\delta_H\))
with
Batch-average independent likelihood term:
4) Priors
Event perturbation prior (default diagonal Gaussian)
Optional centroid prior
with
Optional hierarchical event precision prior
where \(k(i)\) maps event \(i\) to its cluster index, \(\mathbf{V}_0\) is constructed from scale hyperparameters, and \(\nu_0\) is the Wishart degrees of freedom.
5) Training objective (negative log posterior)
SPIDER uses a per-observation normalized objective:
where:
\(\mathcal{L}_{\text{data}} = \mathcal{L}_{\text{ind}}\) for independent-likelihood runs
\(\mathcal{L}_{\text{data}} = \mathcal{L}_{\text{corr}}\) when collapsed shared-event correlation is enabled
\(N\) is the total number of observations in the full dataset
6) Symbol-to-config mapping
Likelihood symbols
Symbol |
Meaning |
Config key(s) |
|---|---|---|
\(\sigma_P, \sigma_S\) |
Phase-dependent residual scales |
|
\(\nu\) |
Student-\(t\) degrees of freedom |
|
\(\delta_H\) |
Huber threshold |
|
Likelihood family selector |
Choice among Gaussian/Laplace/Student-\(t\)/Huber |
|
Correlated sampling likelihood selector |
Enables correlated Gaussian sampling path |
|
Prior symbols
Symbol |
Meaning |
Config key(s) |
|---|---|---|
\(s_x,s_y,s_z,s_t\) |
Event prior standard deviations |
|
\(c_x,c_y,c_z,c_t\) |
Centroid prior standard deviations |
|
\(\nu_0\) |
Wishart hyperprior degrees of freedom |
|
\(\mathbf{V}_0\) scale controls |
Wishart scale hyperparameters |
|
Hyperprior update cadence |
Epoch cadence for precision updates |
|
7) Which likelihood is active in each stage
Phase 1 (MAP warmup/outlier-screening stage) uses the
locate_maplikelihood block.In this stage, robust independent residual families (Laplace, Huber, or Student-\(t\)) are used to reduce sensitivity to outliers and help identify problematic residuals.
Phases 2-4 use the
samplelikelihood block and are intended to run with correlated Gaussian structure (correlated_gaussian) and shared-event whitening.If shared-event correlation is disabled in the sampling block, phases 2-4 fall back to the independent residual form.