| 1. Domain | 1.1 Scope of the Domain | Boundaries | The range of phenomena the science includes and excludes. | Protein Biology examines the structure, folding, dynamics, interactions, functions, and regulatory roles of proteins. It includes primary-to-quaternary structure, post-translational modifications, enzyme kinetics, and protein–protein/protein–ligand interactions. It excludes DNA/RNA-level information and cellular/organism-level processes except when directly mediated by proteins. |
| | Scale | The spatial, temporal, or organizational level at which the science operates (e.g., quantum, cellular, social, cosmic). | Operates at atomic, molecular, supramolecular, and cellular scales: amino acids, polypeptide chains, folded domains, complexes, molecular machines, membranes, and cellular protein networks across nanoseconds to functional timescales. |
| 1.2 Ontological Commitments | Entities | The kinds of things assumed to exist within the domain (particles, organisms, agents, fields, etc.). | Amino acids, polypeptide chains, protein domains, folded structures, protein complexes, enzymes, molecular motors, chaperones, post-translational modifications, cofactors, ligands, substrates, and interaction networks. |
| | Properties | The fundamental attributes these entities possess (mass, charge, genotype, preference, etc.). | Sequence identity, charge distribution, hydrophobicity, folding stability, catalytic activity, binding affinity, conformational flexibility, allosteric behavior, dynamic transitions, and PTM states. |
| | Categories | The basic ontological types used to classify domain elements (substances, processes, relations, structures). | Structural classes (α, β, mixed), functional classes (enzymes, receptors, transporters, scaffolds), folding states (native, misfolded, aggregated), interaction types (protein–protein, protein–ligand), and PTM categories (phosphorylation, acetylation, ubiquitination). |
| 1.3 State-Variables | Variables | The measurable or definable properties that describe system conditions. | Sequence composition, folding state, structural conformation, modification status, binding occupancy, catalytic rate, interaction strength, oligomeric state, and dynamic conformational transitions. |
| | Parameterization | How variables encode and represent the system’s state. | State represented through sequence data, structural coordinates, folding-energy landscapes, kinetic rate constants, binding-isotherm curves, PTM maps, and interaction-network measurements. |
| 1.4 Admissible Idealizations | Simplifications | Conceptual reductions used to make the domain tractable (point masses, rational agents, perfect gases). | Treating proteins as rigid bodies, assuming two-state folding, ignoring rare conformational states, modeling interactions with simplified potentials, coarse-graining amino acid behavior, or collapsing PTMs into binary categories. |
| | Validity Conditions | The limits and contexts in which idealizations hold or break down. | Idealizations fail when proteins exhibit complex folding pathways, multiple conformers, allosteric heterogeneity, crowding-dependent behavior, or when PTMs influence structure/function combinatorially. |
| 1.5 Domain Assumptions | Structural Assumptions | Background ontological stances such as determinism, continuity, randomness, discreteness. | Assumes that sequence determines structure, structure influences function, biochemical reactions obey consistent kinetic principles, and protein folding follows thermodynamic/kinetic rules. |
| | Implicit Commitments | Unstated but necessary assumptions that shape the field’s conceptual structure. | Assumes proteins have stable functional states, biochemical environments are reproducible, PTMs carry interpretable signals, and binding interactions follow predictable physical chemistry. |
| 1.6 Internal Coherence Requirements | Consistency | The demand that domain concepts do not contradict one another. | Sequence–structure–function relationships, folding thermodynamics, dynamic transitions, and interaction behaviors must align without contradicting biochemical or structural principles. |
| | Compatibility | The requirement that entities, variables, and assumptions fit together into a unified descriptive framework. | Entities (proteins, ligands, modifications), variables (structure, kinetics, stability), and assumptions (sequence–structure mapping, chemical consistency) must fit into a unified mechanistic framework. |
| 2. Evidence Layer | 2.1 Observable Phenomena | Observables | The aspects of the domain that can produce detectable signals accessible to measurement. | Detectable protein properties such as folding state, secondary/tertiary structure, enzymatic activity, ligand binding, PTM status, oligomerization, interaction networks, and conformational changes. |
| | Detection Limits | The boundaries of what can be resolved or sensed by current instruments or methods. | Lowest detectable protein concentrations, minimal resolvable structural features (Å resolution limits), smallest observable kinetic changes, fluorescence-intensity thresholds, and mass-spectrometry sensitivity limits for peptide identification or PTM detection. |
| 2.2 Measurement Systems | Units | Standardized quantifications (meters, seconds, volts, decibels, dollars, etc.) necessary for consistent comparison. | Mass (kDa), concentration (nM–µM), catalytic rates (kcat, Km), binding affinity (Kd), fluorescence intensity, circular-dichroism ellipticity, structural-resolution units (Å), and spectral absorbance. |
| | Instruments | Devices and tools (microscopes, spectrometers, sensors, surveys, detectors) used to produce measurements. | Mass spectrometers, HPLC systems, NMR spectrometers, X-ray crystallography, cryo-EM, fluorescence microscopes, spectrophotometers, plate readers, CD spectrometers, calorimeters (ITC/DSC), Western blot systems, and protein microarrays. |
| 2.3 Operational Definitions | Definitions | Terms defined by specific measurement procedures, ensuring empirical clarity. | Operational definitions of folded/unfolded states, catalytic activity, binding affinity, PTM presence, oligomeric state, and conformational transitions based on assay-specific thresholds and readouts. |
| | Procedures | The explicit steps required to perform a measurement in a reproducible way. | Standardized workflows such as protein purification, SDS-PAGE, Western blotting, enzyme-activity assays, co-immunoprecipitation, structural-determination pipelines, proteomics sample prep, and calorimetric binding measurements. |
| 2.4 Data Acquisition | Protocols | Formal processes for gathering data under controlled or standardized conditions. | Controlled protein-expression procedures, purification steps, kinetic-measurement cycles, crystallization trials, cryo-EM sample-prep protocols, mass-spec runs, and standardized replicates for quantitative assays. |
| | Sampling | Rules determining which subset of the domain is measured and how representative it is. | Rules for selecting protein variants, domains, conditions (temperature, pH), time points, interaction partners, or structural states to obtain representative biological protein behavior. |
| 2.5 Data Character & Format | Data Types | The form raw evidence takes (time series, spectra, images, counts, qualitative records). | Spectra (MS, NMR), diffraction patterns, EM density maps, kinetic time series, binding curves, electrophoretic band patterns, fluorescence traces, CD spectra, and peptide-sequence tables. |
| | Resolution | The granularity or precision with which data is captured. | Structural resolution (Å for EM/x-ray), time resolution for kinetic assays (ms–s), mass-spec resolution for peptide identification, and sensitivity limits for detecting rare conformational or PTM states. |
| 2.6 Reliability & Calibration | Calibration | Adjustment procedures ensuring instruments produce accurate results. | Calibration of spectrometers, plate readers, detectors, EM magnification, mass-spec mass accuracy, enzyme-activity standards, fluorescent probes, and baseline corrections in calorimetry and spectroscopy. |
| | Error Characterization | Identification and quantification of noise, uncertainty, bias, and measurement error. | Quantifying noise from detector drift, peptide-misidentification rates, sample degradation, purification contaminants, spectral overlap, electron-beam artifacts, kinetic measurement noise, and aggregation-induced variability. |
| 3. Structural Layer | 3.1 Patterns & Regularities | Laws / Relations | Stable, repeatable patterns governing how observables behave across conditions. | Sequence–structure–function relationships, folding–stability correlations, cooperative binding laws, catalytic rate rules, conserved structural motifs, and predictable thermodynamic behavior of protein folding. |
| | Invariants | Quantities or properties that remain constant under transformations (symmetries, conservation laws). | Conserved folds across species, stable domain architectures, catalytic triads, hydrophobic-core formation, invariant interaction motifs, and preservation of binding interfaces under evolutionary pressure. |
| 3.2 Causal Architecture | Mechanisms | Underlying processes or structures that produce the observed regularities. | Mechanisms include folding pathways, catalytic cycles, allosteric transitions, induced fit vs conformational selection, chaperone-assisted folding, PTM-driven activity shifts, and protein–protein interaction assembly. |
| | Pathways | Organized sequences of interactions forming a causal chain or network. | Ordered sequences such as synthesis → folding → modification → complex formation → functional action; or substrate binding → catalytic transition state → product release → enzyme reset; or misfold recognition → chaperone engagement → refolding or degradation. |
| 3.3 Theoretical Vocabulary | Concepts | Core terms that encode the domain’s structure (force, gene, equilibrium, field). | Core terms include primary/secondary/tertiary/quaternary structure, folding energy landscape, active site, allostery, catalytic efficiency, domains, motifs, PTMs, conformers, oligomerization, and binding affinity. |
| | Classifications | Taxonomies, categories, or typologies that organize entities and relations. | Structural classes (α-helix, β-sheet, mixed), functional protein families (enzymes, receptors, motors), fold families, domain architectures, PTM categories, complex types, and interaction-network classifications. |
| 3.4 Formal Representations | Equations | Mathematical constructs expressing laws, relations, or mechanisms. | Kinetic equations (Michaelis–Menten), thermodynamic folding equations (ΔG, ΔH, ΔS), binding-isotherm equations, rate laws for enzymatic cycles, statistical–mechanical models of conformational ensembles. |
| | Models | Structured representations—mathematical, computational, or conceptual—used to predict and explain phenomena. | Computational folding models, homology models, molecular dynamics simulations, coarse-grained interaction models, kinetic pathway models, docking models, and energetic landscape frameworks. |
| 3.5 Idealized Structures | Simplified Models | Purposeful abstractions that capture essential dynamics while omitting irrelevant detail. | Two-state folding models, rigid-body approximations, simplified interaction potentials, minimal catalytic mechanisms, coarse-grained residue models, and reduced PTM categories. |
| | Limit Conditions | Regimes where specific models or approximations hold (classical vs. quantum, linear vs. nonlinear). | Valid under moderate temperature, typical solvent conditions, and stable concentrations; break down under crowding, extreme pH, high temperature, denaturants, strong allosteric coupling, or multi-pathway folding behavior. |
| 3.6 Integrative Frameworks | Unifying Theories | Higher-order structures that connect disparate laws or mechanisms under a coherent whole. | Sequence→structure→function paradigm, thermodynamic folding framework, enzyme catalysis theory, allosteric regulation models, and protein-network integration across cellular pathways. |
| | Interdisciplinary Links | Points where the theory connects to adjacent sciences or larger explanatory systems. | Links to structural biology, biochemistry, molecular biology, biophysics, cell biology, pharmacology, and systems biology through shared principles of structure, kinetics, and interaction networks. |
| 4. Method Layer | 4.1 Inquiry Design | Experimental Design | Structured plans for manipulating variables to test causal claims. | Manipulating variables such as protein sequence, concentration, folding environment, ligand availability, PTM status, or binding partners through mutagenesis, controlled folding conditions, ligand titrations, or enzymatic modification. |
| | Observational Design | Systematic approaches for gathering non-manipulated data (surveys, field studies, natural experiments). | Non-manipulative approaches including measuring native protein structure, monitoring natural expression levels, mapping interaction networks, profiling PTMs, and observing spontaneous folding or aggregation behavior. |
| 4.2 Testing & Validation | Hypothesis Testing | Procedures for evaluating whether evidence supports or contradicts specific claims. | Testing claims about protein function, structural determinants, catalytic mechanisms, or interaction specificity through targeted assays, perturbation experiments, site-directed mutagenesis, or competitive binding studies. |
| | Replication | The requirement that results be independently reproducible under similar conditions. | Reproducing assays such as enzymatic kinetics, structural determinations (cryo-EM/x-ray/NMR), binding measurements, proteomics runs, and fluorescence-based conformational studies across independent replicates. |
| 4.3 Inference & Evaluation | Statistical Inference | Rules for drawing conclusions from noisy or incomplete data. | Inferring protein behavior from noisy spectral, structural, or kinetic data using statistical models, error propagation, confidence intervals, ensemble averaging, and Bayesian structural or interaction-inference methods. |
| | Model Comparison | Criteria (fit, simplicity, predictive accuracy, robustness) used to evaluate competing models. | Evaluating alternative folding models, kinetic models, interaction hypotheses, and structural predictions based on fit to empirical data, predictive success, stability under parameter variation, and cross-method consistency. |
| 4.4 Error Management | Error Analysis | Identification and quantification of random and systematic errors. | Quantifying noise from detector drift, sample degradation, mass-spec misidentification, spectral overlap, crystallographic noise, kinetic-measurement variability, and misfold-related artifacts. |
| | Bias Control | Methods for minimizing subjective, instrumental, or procedural biases. | Reducing bias through standardized protein-purification workflows, balanced sample preparation, validated antibodies and reagents, randomized measurement order, and calibrated structural/imaging conditions. |
| 4.5 Adjudication & Revision | Peer Scrutiny | Collective evaluation of claims through critique, review, and debate. | Independent evaluation of structural claims, functional interpretations, kinetic models, interaction networks, and proteomics analyses through peer review, cross-validation, and collaborative replication. |
| | Theory Revision | Procedures for modifying, replacing, or discarding models based on new evidence. | Updating models of protein folding, enzyme mechanisms, interactions, allostery, or PTM effects when new structural data, kinetic results, or functional observations contradict existing frameworks. |
| 4.6 Integrity Conditions | Transparency | Requirements to disclose methods, data, assumptions, and limitations. | Full reporting of purification methods, sample conditions, structural-determination parameters, kinetic protocols, calibration steps, modeling assumptions, and data-processing pipelines. |
| | Ethical Standards | Norms ensuring responsible conduct in experimentation, data handling, and publication. | Responsible conduct in protein experimentation, accurate reporting of results, avoidance of selective data presentation, proper handling of recombinant or engineered proteins, and adherence to biosafety and research-integrity standards. |