MTH-002 Methodology
Published
v2.0 January 18, 2026

Semantic Association Metrics

Measurement framework for semantic spread and fidelity

Abstract

A methodology for measuring semantic properties of word associations using embedding-based metrics. Implements the Divergent Association Task (DAT) framework for spread scoring and a novel fidelity metric for task validity. Provides the scoring foundation for the INS-001 Semantic Cartography instrument family, enabling measurement of how participants navigate conceptual space.

Studies in This Family
MTH-002.1
Spread and Fidelity Scoring
Core metrics for INS-001 semantic association assessment
MTH-002.2
Communicability Metrics
Measuring whether meaning survives transmission
MTH-002.3
Surprisal Scoring
Measuring unexpected semantic transitions
MTH-002.4
Population Normalization
Converting raw scores to population-relative measures

Core Constructs

This methodology operationalizes four constructs from the INS-001 Semantic Cartography instrument family:

ConstructDefinitionOperationalization
SpreadHow much conceptual territory responses coverMean pairwise semantic distance (DAT methodology)
FidelityHow well clues jointly identify the bridging taskCoverage × efficiency of foil elimination
CommunicabilityWhether meaning survives transmission to another agentReconstruction accuracy by partner or LLM
SurprisalHow unexpected a semantic transition isNegative log-probability against reference network

Theoretical Foundation

The scoring framework draws on two established research traditions:

Divergent Association Task (DAT) Olson et al. (2021) demonstrated that mean pairwise semantic distance between unrelated words correlates r ≈ 0.40 with composite creativity measures. This provides empirical validation for using semantic spread as a proxy for divergent thinking capacity.

Distributional Semantics Cosine similarity in embedding space serves as a standard measure of semantic relatedness. Hill, Reichart & Korhonen (2015) distinguish between similarity (categorical membership) and relatedness (associative connection)—embeddings capture the latter more reliably.

For caveats regarding embedding validity and cultural bias, see LIB-002: Digital Validity.

Metric Architecture

Both INS-001.1 (Signal) and INS-001.2 (Common Ground) use exactly two primary metrics:

  1. Spread — How much semantic territory do the clues cover?
  2. Fidelity — How well do clues jointly identify the bridging task?

These metrics are orthogonal (r = 0.055):

PatternSpreadFidelityInterpretation
Creative & preciseHighHighWide-ranging associations that triangulate the target
Divergent but off-taskHighLowScattered responses that don’t constrain the solution
Conventional but effectiveLowHighClustered but accurate identification
Constrained & off-taskLowLowLimited exploration, poor task engagement

Studies in This Family

StudyTitleFocusStatus
MTH-002.1Spread and Fidelity ScoringCore scoring algorithms and thresholdsPublished
MTH-002.2Communicability MetricsReconstruction accuracy measurementCalibrating
MTH-002.3Surprisal ScoringTransition probability against reference networksCalibrating
MTH-002.4Population NormalizationBootstrap null distributions and percentile rankingCalibrating

Implementation

The scoring algorithms are implemented in:

Limitations

LimitationImpactMitigation
Embedding model biasCultural and frequency effects in word representationsDocument reference model; note Western/English bias
Threshold calibrationInterpretation bands require empirical validationDerive from observed distributions; update with data
Single embedding modelResults depend on specific model choiceUse standardized model (text-embedding-3-small); note in methods

Changelog

VersionDateChanges
2.02026-01-18Updated terminology: “relevance” → “fidelity”, “divergence” → “spread”; updated metric architecture to reflect orthogonal metrics
1.02026-01-15Initial publication