| 1. Domain | 1.1 Scope of the Domain | Boundaries | The range of phenomena the science includes and excludes. | Examines the physical, cognitive, and linguistic systems governing the production, perception, and patterned organization of speech sounds. Includes articulatory/ acoustic phonetics, prosody, syllable structure, phonological features, tone, stress, phonotactics, assimilation, lenition/fortition, phonological alternations. Excludes morphological or syntactic structure except where phonological processes interface with them. |
| | Scale | The spatial, temporal, or organizational level at which the science operates (e.g., quantum, cellular, social, cosmic). | Operates at sub-lexical levels: articulatory gestures (milliseconds), acoustic waveforms, perceptual processing windows, segmental and suprasegmental structures, prosodic domains from syllables to intonational phrases. |
| 1.2 Ontological Commitments | Entities | The kinds of things assumed to exist within the domain (particles, organisms, agents, fields, etc.). | Speech sounds (segments), articulatory gestures, acoustic features (formants, pitch, duration), phonemes, allophones, features (+voice, +nasal, etc.), syllables, prosodic units, stress domains, tone units, phonological rules/constraints. |
| | Properties | The fundamental attributes these entities possess (mass, charge, genotype, preference, etc.). | Voicing, nasality, place/manner of articulation, duration, pitch (F0), intensity, phonotactic legality, feature composition, syllable weight, stress prominence, tone category, prosodic boundary strength. |
| | Categories | The basic ontological types used to classify domain elements (substances, processes, relations, structures). | Consonants, vowels, phonemes, features, syllables (light/heavy), phonological processes (assimilation, deletion, insertion), tone and stress systems, prosodic constituents, distinctive feature sets. |
| 1.3 State-Variables | Variables | The measurable or definable properties that describe system conditions. | Articulatory position, vocal-fold vibration state, airflow patterns, acoustic frequency values, duration metrics, amplitude, tonal target, stress level, phonotactic probability, feature activation states. |
| | Parameterization | How variables encode and represent the system’s state. | Encoded through articulatory coordinates, formant frequencies, pitch contours, waveform amplitude envelopes, phonological-feature matrices, constraint weightings (OT), rule parameters, syllable-structure representations. |
| 1.4 Admissible Idealizations | Simplifications | Conceptual reductions used to make the domain tractable (point masses, rational agents, perfect gases). | Treating speech segments as discrete; ignoring coarticulation; modeling phonological rules as categorical; assuming idealized speaker/listener; treating prosody as uniform; using simplified acoustic models; assuming stable phoneme inventories. |
| | Validity Conditions | The limits and contexts in which idealizations hold or break down. | Break downs in casual speech, high-coarticulation languages, tonal crowding, prosodic irregularity, pathological speech, multilingual phonological interaction, or when acoustic environments distort formant structure. |
| 1.5 Domain Assumptions | Structural Assumptions | Background ontological stances such as determinism, continuity, randomness, discreteness. | Assumes speech can be segmented into analyzable units; features combine systematically; phonological rules or constraints account for alternations; perceptual mapping corresponds to articulatory/acoustic structure; prosody organizes suprasegmental patterns. |
| | Implicit Commitments | Unstated but necessary assumptions that shape the field’s conceptual structure. | Assumes phonological categories are psychologically real; acoustic cues reliably map to phonological features; phonotactic structures reflect underlying mental representations; speakers aim for efficient, interpretable signaling. |
| 1.6 Internal Coherence Requirements | Consistency | The demand that domain concepts do not contradict one another. | Feature systems must map coherently onto articulatory/acoustic cues; phonological rules must produce consistent outputs; syllable structure must align with stress/tone patterns; prosodic domains must integrate into a unified hierarchical system. |
| | Compatibility | The requirement that entities, variables, and assumptions fit together into a unified descriptive framework. | Requires alignment among articulatory, acoustic, perceptual, and phonological representations; rule-based and constraint-based models must be interpretable within the same structural framework; suprasegmental and segmental systems must cohere. |
| 2. Evidence Layer | 2.1 Observable Phenomena | Observables | The aspects of the domain that can produce detectable signals accessible to measurement. | Articulatory movements, airflow patterns, vocal-fold vibration, formant frequencies, pitch contours, amplitude envelopes, spectral shape, duration contrasts, syllable boundaries, tone/intonation patterns, assimilation and coarticulation effects. |
| | Detection Limits | The boundaries of what can be resolved or sensed by current instruments or methods. | Rapid articulatory events below camera/sensor resolution; overlapping acoustic cues; coarticulation blurring segment boundaries; noise masking subtle contrasts; perceptual ambiguity; speaker variability; limitations in capturing prosodic nuance. |
| 2.2 Measurement Systems | Units | Standardized quantifications (meters, seconds, volts, decibels, dollars, etc.) necessary for consistent comparison. | Milliseconds (duration), Hz (pitch, formants), dB (intensity), airflow volume, articulatory displacement units, phonotactic probability scores, tonal step levels, stress prominence ratings. |
| | Instruments | Devices and tools (microscopes, spectrometers, sensors, surveys, detectors) used to produce measurements. | Spectrographs, wave-analyzers, PRAAT, ultrasound tongue imaging, electropalatography (EPG), electromagnetic articulography (EMA), airflow masks, EEG/MEG (auditory perception), acoustic microphones, perceptual rating protocols. |
| 2.3 Operational Definitions | Definitions | Terms defined by specific measurement procedures, ensuring empirical clarity. | Definitions of segment boundaries, phoneme vs allophone, voicing onset time (VOT), syllable nucleus, stress prominence, tonal target, feature specifications (+nasal, +voice), prosodic boundaries, phonotactic legality. |
| | Procedures | The explicit steps required to perform a measurement in a reproducible way. | Recording speech tokens; segmenting acoustic signals; measuring formants and VOT; coding articulatory gestures; eliciting minimal pairs; collecting perceptual judgments; computing phonotactic distributions; analyzing prosodic contours. |
| 2.4 Data Acquisition | Protocols | Formal processes for gathering data under controlled or standardized conditions. | Controlled elicitation tasks; reading passages; repetition tasks; minimal-pair production; spontaneous-speech sampling; cross-speaker and cross-dialect sampling; multi-condition acoustic collection (quiet/noisy environments). |
| | Sampling | Rules determining which subset of the domain is measured and how representative it is. | Sampling across speakers, dialects, age groups, contexts, phonotactic environments, syllable positions, prosodic domains, and speech rates; sampling tokens repeatedly for reliability. |
| 2.5 Data Character & Format | Data Types | The form raw evidence takes (time series, spectra, images, counts, qualitative records). | Spectrograms, waveform files, articulatory-trajectory files, formant tables, pitch-tracking sequences, duration tables, phonological-feature matrices, perceptual rating datasets, phonotactic frequency tables. |
| | Resolution | The granularity or precision with which data is captured. | Determined by sampling rate (kHz), temporal resolution of articulatory imaging, frequency resolution of spectral analysis, perceptual rating granularity, and accuracy of pitch/duration extraction algorithms. |
| 2.6 Reliability & Calibration | Calibration | Adjustment procedures ensuring instruments produce accurate results. | Calibrating microphones and articulatory sensors; validating formant-tracking algorithms; standardizing perceptual-rating procedures; verifying consistent VOT measurements; checking inter-rater agreement for segmentation. |
| | Error Characterization | Identification and quantification of noise, uncertainty, bias, and measurement error. | Acoustic noise; sensor drift; segmentation inconsistencies; mismeasured formants; speaker variability; perceptual bias; coarticulation complicating boundaries; algorithmic tracking errors; insufficient sampling. |
| 3. Structural Layer | 3.1 Patterns & Regularities | Laws / Relations | Stable, repeatable patterns governing how observables behave across conditions. | Coarticulation patterns; assimilation rules; vowel reduction; systematic alternations (lenition, fortition); syllable-weight effects; tone–stress interactions; phonotactic constraints; feature-combination laws. |
| | Invariants | Quantities or properties that remain constant under transformations (symmetries, conservation laws). | Stable phoneme inventories; universal feature distinctions (voice, place, manner); consistent syllable templates; recurrent prosodic hierarchies; cross-linguistic tendencies in stress, tone, and assimilation. |
| 3.2 Causal Architecture | Mechanisms | Underlying processes or structures that produce the observed regularities. | Articulatory mechanisms producing acoustic distinctions; perceptual categorization mechanisms; phonological-rule mechanisms; constraint-based optimization (OT); prosodic-organization mechanisms; gestural-coordination mechanisms. |
| | Pathways | Organized sequences of interactions forming a causal chain or network. | Articulatory gesture → acoustic signal → auditory perception → phonological categorization; underlying form → phonological rules/constraints → surface form; prosodic domain → stress/tone assignment → segmental alternation. |
| 3.3 Theoretical Vocabulary | Concepts | Core terms that encode the domain’s structure (force, gene, equilibrium, field). | Phoneme, allophone, distinctive feature, syllable, onset, nucleus, coda, mora, stress, tone, intonation, phonotactic constraint, coarticulation, assimilation, prosodic hierarchy, underlying vs surface representation. |
| | Classifications | Taxonomies, categories, or typologies that organize entities and relations. | Segment classes (vowels, consonants); feature categories (+voice, +nasal, etc.); syllable types (CV, CVC, etc.); tone inventories; stress systems; prosodic constituents (foot, phrase); rule types (assimilation, deletion, insertion). |
| 3.4 Formal Representations | Equations | Mathematical constructs expressing laws, relations, or mechanisms. | Feature-matrix representations; rule formalizations (A → B / X__Y); Optimality Theory constraint rankings; syllable-weight functions; tone-target interpolation formulas; gestural coordination timing equations. |
| | Models | Structured representations—mathematical, computational, or conceptual—used to predict and explain phenomena. | Rule-based phonology; feature-geometry models; metrical phonology; autosegmental phonology; OT models; gestural (articulatory) phonology; prosodic-hierarchy models; acoustic-targets models. |
| 3.5 Idealized Structures | Simplified Models | Purposeful abstractions that capture essential dynamics while omitting irrelevant detail. | Discrete, non-overlapping segments; categorical rules; fully stable phoneme inventories; simplified prosodic structure; uniform speakers; absence of coarticulation noise; perfectly aligned feature specifications. |
| | Limit Conditions | Regimes where specific models or approximations hold (classical vs. quantum, linear vs. nonlinear). | Casual speech undermines segment discreteness; heavy coarticulation breaks rule boundaries; tonal/intonational crowding; dialect mixing; speech disorders; extreme speaking rates; noisy acoustics reducing cue reliability. |
| 3.6 Integrative Frameworks | Unifying Theories | Higher-order structures that connect disparate laws or mechanisms under a coherent whole. | Hierarchical prosodic theory; autosegmental-metrical integration; feature-geometry unification; OT/phonetic grounding frameworks; perception–production loop theories; exemplar and usage-based phonology; cognitive–phonetic integration models. |
| | Interdisciplinary Links | Points where the theory connects to adjacent sciences or larger explanatory systems. | Links to cognitive psychology (categorization), neuroscience (speech perception circuits), computer science (speech recognition), acoustics (signal processing), anthropology (sound systems across cultures), and speech pathology (phonological disorders). |
| 4. Method Layer | 4.1 Inquiry Design | Experimental Design | Structured plans for manipulating variables to test causal claims. | Manipulating phonetic context, speaking rate, prosodic prominence, syllable position, or coarticulation environment; altering tone/stress cues; introducing noise; varying articulatory constraints to test causal effects on speech sound realization and perception. |
| | Observational Design | Systematic approaches for gathering non-manipulated data (surveys, field studies, natural experiments). | Observing natural speech in spontaneous or conversational contexts; tracking dialectal variation; documenting phonological alternations; recording prosodic patterns; collecting perception judgments without intervention. |
| 4.2 Testing & Validation | Hypothesis Testing | Procedures for evaluating whether evidence supports or contradicts specific claims. | Testing predictions from feature systems; validating phonological rules or OT constraints; measuring contrast perception; confirming coarticulation effects; evaluating tone/stress assignment models; testing phonotactic predictions. |
| | Replication | The requirement that results be independently reproducible under similar conditions. | Re-recording tokens across sessions; repeating perceptual experiments; replicating acoustic measurements with new speakers; verifying phonotactic results across corpora; validating articulatory trajectories with alternative instruments. |
| 4.3 Inference & Evaluation | Statistical Inference | Rules for drawing conclusions from noisy or incomplete data. | Analyzing formant distributions; testing duration/pitch contrasts; modeling assimilation/lenition environments; computing phonotactic probabilities; fitting prosodic-contour models; analyzing perception-accuracy curves. |
| | Model Comparison | Criteria (fit, simplicity, predictive accuracy, robustness) used to evaluate competing models. | Comparing rule-based vs OT analyses; testing feature-geometry models vs gestural models; comparing exemplar vs symbolic models; evaluating competing tone or stress models; contrasting acoustic-phonetic vs phonological representations. |
| 4.4 Error Management | Error Analysis | Identification and quantification of random and systematic errors. | Identifying segmentation errors; correcting formant-tracking failures; controlling microphone/sensor drift; detecting perceptual-judgment bias; eliminating noise-induced spectral distortion; identifying speaker variability as confound. |
| | Bias Control | Methods for minimizing subjective, instrumental, or procedural biases. | Counterbalancing stimuli; randomizing item order; using blinded perceptual tests; standardizing speaking conditions; matching participants by linguistic background; avoiding experimenter cueing; ensuring culturally neutral stimuli. |
| 4.5 Adjudication & Revision | Peer Scrutiny | Collective evaluation of claims through critique, review, and debate. | Independent re-analysis of acoustic data; replication by other laboratories; critique of phonological representations; evaluation of OT constraint rankings; reviewing perceptual-experiment coding; inspecting articulatory-data interpretations. |
| | Theory Revision | Procedures for modifying, replacing, or discarding models based on new evidence. | Updating feature systems; revising constraint rankings; adjusting phonological rules; reanalyzing prosodic hierarchies; integrating new acoustic/perceptual evidence; modifying gestural-coordination models. |
| 4.6 Integrity Conditions | Transparency | Requirements to disclose methods, data, assumptions, and limitations. | Full disclosure of recording parameters, segmentation criteria, formant-tracking settings, acoustic filters, perceptual-test instructions, model assumptions, and phonological rule/constraint formulations. |
| | Ethical Standards | Norms ensuring responsible conduct in experimentation, data handling, and publication. | Protecting participant privacy; securing speech recordings; avoiding coercive elicitation; respecting dialectal differences; reporting variability honestly; preventing biased interpretations of linguistic differences. |