Independent AI-Safety Research Lab.

Precision systems for adaptive intelligence.

Is inference-time stability regulation sufficient to prevent collapse and unsafe behavior in sequence models under regime shift? TwoQuarks builds the instruments to find out.

ΔL₃ · inference-time stability signal live
monitoring stable regime … divergence detected · pre-critical

LLM Evaluation · Inference-Time Stability · RAG & Agentic Systems · MCP Tooling

Empirical validation

Validated across production model families.

Provider-agnostic methodology with statistical controls — validated against deployed Claude and GPT models through their APIs.

Cross-architecture

Claude · GPT

Validated across two production model families through their APIs.

Statistical controls

p = 0.0013 · L₃ = 0.000

Checked against a 5,000-permutation null distribution and negative controls.
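
For readers unfamiliar with permutation nulls, here is a minimal sketch of a generic one-sided permutation test on a difference of means. It is an illustration under stated assumptions, not the TwoQuarks pipeline: the function name, the two-group setup, and the choice of test statistic are all hypothetical, and only the 5,000-permutation count is taken from the text above.

```python
import random
from statistics import mean


def permutation_p_value(treated, control, n_permutations=5000, seed=0):
    """One-sided permutation test on the difference of group means.

    Hypothetical helper for illustration; the test statistic used in the
    actual validation is not specified in the source.
    """
    rng = random.Random(seed)
    observed = mean(treated) - mean(control)

    pooled = list(treated) + list(control)
    n_treated = len(treated)

    hits = 0
    for _ in range(n_permutations):
        # Shuffling the pooled scores breaks any link between label and score,
        # which is exactly what the null hypothesis asserts.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_treated]) - mean(pooled[n_treated:])
        if diff >= observed:
            hits += 1

    # Add-one correction so the estimated p-value is never exactly zero.
    return (hits + 1) / (n_permutations + 1)


if __name__ == "__main__":
    # Made-up scores, purely to exercise the function.
    shifted = [0.90, 0.80, 0.85, 0.95, 0.90, 0.88]
    baseline = [0.10, 0.20, 0.15, 0.05, 0.12, 0.10]
    print(permutation_p_value(shifted, baseline))
```

A negative control in this setting would feed the same function two groups drawn from the same condition and confirm that the p-value is not small.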

Provider-agnostic

Black-box methodology

Works against any production model through its API alone.

Operational tooling

PyPI · MCP · adapters

Reproducible pipelines you run in your own context.

Production fit

Evals · RAG · agentic

Azure AI stack and API instrumentation included.

Core thesis

Model failure is a trajectory before it is an outcome.

Most evaluation looks at the output. TwoQuarks looks at the path the model takes to get there.

Operational interpretation

Measure instability before it becomes visible.

Using multiple black-box realizations of a response, isomeric polarization estimates structural divergence (ΔL₃) and flags drift, refusal erosion, and rule-override pressure during inference, from API outputs alone.
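
To make the idea of comparing multiple black-box realizations concrete, here is a deliberately simple stand-in. It is not the ΔL₃ definition and not isomeric polarization, neither of which is specified on this page. It assumes only that several responses to the same prompt have already been collected through an API, and it scores their spread with mean pairwise Jaccard distance over word sets. Both function names are hypothetical.

```python
from itertools import combinations


def jaccard_distance(a: str, b: str) -> float:
    """Jaccard distance between the lowercase word sets of two strings."""
    words_a, words_b = set(a.lower().split()), set(b.lower().split())
    if not words_a and not words_b:
        return 0.0  # two empty responses are treated as identical
    return 1.0 - len(words_a & words_b) / len(words_a | words_b)


def mean_pairwise_divergence(responses):
    """Average Jaccard distance over all unordered pairs of responses.

    Stand-in dispersion score: 0.0 means every realization uses the same
    words, 1.0 means no two realizations share a word.
    """
    pairs = list(combinations(responses, 2))
    if not pairs:
        return 0.0  # fewer than two responses: nothing to compare
    return sum(jaccard_distance(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical realizations of one prompt, for illustration only.
    samples = [
        "I cannot help with that request",
        "I cannot help with that request",
        "Sure, here is how to do it",
    ]
    print(mean_pairwise_divergence(samples))
```

A score like this only captures surface disagreement between samples. It is shown because it needs nothing beyond API outputs, which is the constraint the text describes.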

Applied Research

Research depth. Production readiness.

TwoQuarks is a working portfolio for LLM safety evaluation, inference-time monitoring, and AI tooling that runs in your own context.

Evidence

Research

Empirical validation, preprints, cross-architecture findings, statistical controls.

Tooling

Instruments

Molecule, the twoquarks PyPI package, MCP server, API adapters.

Core layer

Framework

TwoQuarks, PfV, ΔL₃, the six quark flavors, inference-time control.

Demo

Playground

Interactive Molecule-style analysis — visible, runnable portfolio evidence.

Profile

About

Independent AI-safety research, engineering stack, resume, GitHub, contact.