Statistical Mechanics and Thermodynamics
The foundation of the GLAM framework is the adaptation of rigorous formalism from statistical physics and thermodynamics to quantify the biological organization of a tumor. By treating voxels of differing intensities as interacting particles, we can quantify the effective “interactions” shaping tumor architecture.
Radial Distribution Function (RDF)
The core descriptor is the pair radial distribution function, \(g_{\alpha\beta}(r)\), which measures the spatial correlation between voxels [40]. It quantifies the relative likelihood of finding a voxel with gray level \(\beta\) at a distance \(r\) from a reference voxel with gray level \(\alpha\) [41].
Positive Correlation (Attraction): \(g_{\alpha\beta}(r) > 1\) [42].
Negative Correlation (Repulsion): \(g_{\alpha\beta}(r) < 1\) [42].
Randomness: \(g_{\alpha\beta}(r) = 1\) [42].
Figure: RDF as a function of the distance \(r\) (from 0 to 100 mm). Each colored line represents a specific RDF curve, \(g_{\alpha\beta}(r)\), between two different gray levels, \(\alpha\) and \(\beta\).
Interpretation: This metric asks, “How much more or less likely am I to find tissue type B at a specific distance from tissue type A, compared to a completely random distribution?”
Advantage: It provides a fundamental, distance-dependent map of spatial relationships rather than just a single global average, revealing exact distances where tissues cluster or repel.
RDF Shape Statistics
To capture the extensive information embedded in the full RDF curves without incurring the “curse of dimensionality,” we transform key statistical properties of each \(g_{\alpha\beta}(r)\) curve into a set of primary GLAM matrices. For every gray-level pair, we compute the Peak Position, Peak Height, Median, Variance, Skewness, and Kurtosis.
Figure: RDF Median matrices derived from four co-registered MRI sequences: pre-contrast T1-weighted (T1), post-contrast T1-weighted (T1c), T2-weighted (T2), and Fluid-Attenuated Inversion Recovery (FLAIR).
Key Thermodynamic Metrics
Second Virial Coefficient (\(B_2\))
The second virial coefficient distills the complex, distance-dependent information of the entire RDF curve into a single value representing net affinity[cite: 66].
Negative value: Indicates net attraction or affinity between gray levels[cite: 66].
Positive value: Suggests net repulsion[cite: 66].
Figure: B2 matrices derived from four co-registered MRI sequences: pre-contrast T1-weighted (T1), post-contrast T1-weighted (T1c), T2-weighted (T2), and Fluid-Attenuated Inversion Recovery (FLAIR).
Interpretation: This metric asks, “Overall, do these two tissue types prefer to clump together (attraction) or push each other apart (repulsion) across the entire tumor?”
Advantage: It condenses a complex multi-distance curve into a single, highly interpretable summary statistic of spatial affinity.
Potential of Mean Force (PMF)
To evaluate the energetic stability of texture organization, GLAM computes the PMF based on the Boltzmann distribution[cite: 67, 68].
The total PMF energy is obtained by integrating this potential weighted by the RDF[cite: 72]:
To allow size-independent comparison across ROIs, the UPMF is normalized by volume, producing the Energy Density, which represents the average interaction energy per voxel.
Figure: Energy Density matrices derived from four co-registered MRI sequences: pre-contrast T1-weighted (T1), post-contrast T1-weighted (T1c), T2-weighted (T2), and Fluid-Attenuated Inversion Recovery (FLAIR).
Interpretation: This metric asks, “How much ‘energy’ would it take to maintain this specific spatial arrangement of tissues against natural thermodynamic mixing?”
Advantage: It translates geometric clustering into a physical energy landscape, allowing the identification of highly stable (deep energy wells) versus unstable, transient tissue architectures.
Isothermal Compressibility (\(\kappa_T\))
In statistical thermodynamics, the integral of the total correlation function is directly related to the system’s density fluctuations and its isothermal compressibility[cite: 628].
In the GLAM framework, this metric quantifies the “sponginess” or susceptibility of the texture to local density variations[cite: 630]. High compressibility implies large-scale density fluctuations and a tendency for the voxel population to cluster loosely, whereas low compressibility indicates a rigid, hyper-uniform distribution typical of highly packed cellular structures[cite: 631].
Interpretation: This metric asks, “How ‘spongy’ or susceptible to density fluctuations is the tissue architecture?”
Advantage: It captures large-scale structural heterogeneity. Low compressibility indicates rigid, tightly packed regions (like dense stroma), while high compressibility flags loose, highly variable clustering.
Pressure Virial
Derived from the virial equation of state, this quantifies the mean internal force between voxel populations, capturing the mechanical response within the texture[cite: 92, 93].
Figure: Pressure Virial matrices derived from four co-registered MRI sequences: pre-contrast T1-weighted (T1), post-contrast T1-weighted (T1c), T2-weighted (T2), and Fluid-Attenuated Inversion Recovery (FLAIR).
Interpretation: This metric asks, “What is the net mechanical ‘push or pull’ (internal stress) exerted between different voxel populations due to their spatial packing?”
Advantage: It links spatial statistics directly to mechanical properties, offering a non-invasive computational proxy for the internal mechanical stresses within the tumor microenvironment.
Configurational Disorder Index (\(CDI\))
To complement metrics of energetic stability, the Configurational Disorder Index (\(CDI\)) characterizes textural disorder. Whereas PMF energy assesses how stable the structure is, the \(CDI\) quantifies how dynamically disordered it is. The \(CDI\) is derived by comparing the measured RDF of the actual image, \(g_{structured}(r)\), to that of its randomized counterpart, \(g_{randomized}(r)\).
Drawing from the Boltzmann relation, which links the probability of a configuration to its energy and relative disorder, deviations from randomness reflect an underlying “ordering potential.” Because \(\ln(g(r))\) is proportional to the negative potential divided by this disorder parameter, the observed structure, \(\ln(g_{structured}(r))\), represents this potential modulated by the tissue’s inherent architectural chaos. The ratio between the structured and randomized forms yields a distance-dependent disorder profile:
Averaging \(CDI(r)\) within the first coordination shell yields a stable, physically interpretable estimate of the local structural disorder, \(CDI(\alpha,\beta)\). A high \(CDI(\alpha,\beta)\) indicates greater structural “noise” or heterogeneity, while a low \(CDI\) indicates rigid or highly structured ordering. Comparing diagonal versus off-diagonal \(CDI(\alpha,\beta)\) elements distinguishes self-organized tissue components from dynamically disordered interfaces.
Interpretation: This metric asks, “How much structural ‘noise’ or configurational disorder exists in this tissue compared to a perfectly ordered state?”
Advantage: It provides a clear indicator of architectural chaos; highly disordered tissues appear random and heterogeneous, while low-disorder tissues are rigidly structured. This thermodynamic-inspired approach provides a robust, physically interpretable measure of local architectural complexity in medical images.
1-Wasserstein Distance (Assembly Cost)
The 1-Wasserstein Distance, often called the Earth Mover’s Distance, measures the “Biological Work” or “Assembly Cost” of the tumor’s spatial architecture. It quantifies the total effort required to transform a completely randomized spatial distribution of voxels into the highly ordered, structured state actually observed in the tumor.
First, the Cumulative Coordination Number, \(N(R)\), is calculated. This represents the total number of neighbors accumulated up to radius \(R\):
The Wasserstein distance is then defined as the absolute area between the cumulative curves of the structured and randomized states:
High Value: Indicates a highly complex, non-random architecture with a high energetic “cost” of assembly, typical of highly organized distinct sub-regions or rigid boundaries.
Low Value: Indicates the tissue architecture is very close to a completely random distribution of cells or voxels.
Jensen-Shannon (JS) Divergence
While the standard RDF quantifies the distance-dependent interaction between two specific gray levels, comparing reciprocal RDF curves allows us to evaluate the directional symmetry—or structural anisotropy—of this relationship. In a perfectly isotropic (directionless) texture, the probability of finding gray level \(\beta\) at distance \(r\) from gray level \(\alpha\) should be identical to finding gray level \(\alpha\) at distance \(r\) from gray level \(\beta\). Therefore,
JS Divergence quantifies the disagreement between these two reciprocal RDF curves. Because standard Kullback-Leibler (KL) Divergence is asymmetric and struggles with zeros, JS Divergence creates a symmetric, bounded comparison by measuring how far both curves deviate from their shared average.
First, the raw RDF curves are L1-normalized into true probability distributions, \(P\) and \(Q\):
We define a midpoint distribution \(M\):
The JS Divergence is then calculated as the average KL Divergence of \(P\) and \(Q\) from \(M\):
Perfect Symmetry (\(JSD = 0\)): The relationship between tissue A and tissue B is structurally identical in reverse.
Structural Anisotropy (\(JSD > 0\)): Tissue B tends to cluster around tissue A differently than tissue A clusters around tissue B (e.g., A forms a core while B forms a shell).
Interpretation: This metric asks, “Is the local neighborhood of tissue A around tissue B an exact mirror image of tissue B around tissue A?”
Advantage: It offers a bounded (0 to 1) and perfectly symmetric measure of local directional bias, avoiding the mathematical instabilities of traditional Kullback-Leibler divergence when encountering empty tissue regions.
Cumulative JS Divergence
Raw RDF curves (\(g(r)\)) can be noisy, especially in small tumors or sparse gray levels, causing artificial spikes in standard JS Divergence.
Cumulative JS Divergence solves this by borrowing intuition from the Wasserstein metric (Earth Mover’s Distance). Instead of comparing local densities shell-by-shell, it compares the Cumulative Coordination Profiles—the total amount of tissue accumulated up to distance \(R\). This acts as a low-pass filter, ignoring high-frequency quantization noise and focusing on global spatial imbalances.
First, we multiply the RDF by the spherical volume element \(r^2\) and integrate to create a cumulative profile \(N(R)\), representing the total interaction mass up to radius \(R\):
These cumulative profiles are then L1-normalized to create monotonically increasing distributions \(P_{cumul}\) and \(Q_{cumul}\):
Finally, standard JS Divergence is applied to these smoothed, cumulative curves:
Interpretation: This metric asks, “If we grow a sphere outward, does the total accumulated mass of tissue B around tissue A grow at the same rate as tissue A around tissue B?”
Advantage: It is highly robust to minor spatial jitter, imaging noise, and voxelation artifacts, providing a much more stable measure of macroscopic architectural bias.