Unique Contributions

What makes DDA-X fundamentally different from standard Reinforcement Learning and LLM agent frameworks.

The Core Inversion

Standard RL: surprise → exploration
DDA-X: surprise → rigidity → contraction

This single inversion has cascading implications for agent architecture.

1. Rigidity as a Control Variable

In standard RL, there's no explicit "defensiveness" state. Agents follow their policy regardless of internal state.

In DDA-X, rigidity \(\rho \in [0,1]\) actively shrinks learning and acting:

\[ k_{\text{eff}} = k_{\text{base}}(1 - \rho) \]

When surprised, agents don't just "explore differently" — they contract:

Reduced state update magnitude
Constrained output bandwidth
Increased resistance to change

This models the biological reality that startled organisms freeze before they explore.

2. Wounds as Content-Addressable Threat Priors

Standard approaches to "threat" in RL involve reward shaping or hardcoded constraints.

DDA-X implements wounds as semantic vectors that modulate dynamics:

Stored as embeddings (not rules)
Detected via cosine similarity + lexical fallback
Trigger disproportionate surprise amplification
Include refractory periods (cooldowns)

wound_res = float(np.dot(msg_emb, agent.wound_emb))
if wound_active:
    epsilon *= min(amp_max, 1.0 + wound_res * 0.5)

This is unprecedented in standard LLM agents, which rarely have structured "wound embeddings" that modulate decoding.

3. Multi-Timescale Defensiveness

DDA-X separates rigidity into three distinct temporal components:

Component	Timescale	Character
\(\rho_{\text{fast}}\)	Seconds	Startle (quick rise, quick fall)
\(\rho_{\text{slow}}\)	Minutes	Stress (gradual accumulation)
\(\rho_{\text{trauma}}\)	Permanent	Scarring (asymmetric, rarely heals)

The asymmetric trauma accumulator is a strong differentiator:

\[ \Delta\rho_{\text{trauma}} = \begin{cases} \alpha_{\text{trauma}}(\epsilon - \theta) & \epsilon > \theta \\ 0 & \text{otherwise} \end{cases} \]

It encodes hysteresis and irreversibility — experiences leave scars that don't simply decay with time.

4. Mode Bands Constrain Outward Behavior

In typical LLM agents, verbosity and output style are either fixed or randomly varied.

DDA-X uses mode bands as direct behavioral constraints:

Band	ρ Range	Word Budget
OPEN	< 0.3	100–200
MEASURED	0.3–0.5	70–140
GUARDED	0.5–0.7	40–90
FORTIFIED	0.7–0.9	20–50

This word-budget clamping is a direct operationalization of "constriction":

Verbosity becomes an observable correlate of internal rigidity
The mapping is explicit and tunable
Responses are actually truncated to enforce limits

5. Therapeutic Recovery as Explicit Dynamics

In standard approaches, "recovery" from negative states is either: - Implicit (reward shaping) - Time-based (fixed decay) - Absent entirely

DDA-X models therapeutic recovery as an explicit dynamical process:

\[ \rho_{\text{trauma}} \leftarrow \max(\rho_{\min}, \rho_{\text{trauma}} - \eta_{\text{heal}}) \]

Triggered by: - Sustained safe interactions (\(\epsilon < 0.8\epsilon_0\)) - Threshold number of consecutive safe turns

This provides the mathematical basis for "healing" — not just lower temperature, but a rule that decays trauma after repeated safety.

6. Identity as Dynamical Attractor

Standard LLM agents have no persistent "self" — each response is stateless (beyond context window).

DDA-X models identity as an attractor in state space:

\[ F_{\text{id}} = \gamma(x^* - x_t) \]

Where: - \(x^*\) is the identity embedding (who the agent fundamentally is) - \(\gamma\) is identity stiffness (how strongly they resist drift) - \(x_t\) is current state

Combined with Will Impedance:

\[ W_t = \frac{\gamma}{m \cdot k_{\text{eff}}} \]

This is a dynamical systems framing rather than a policy-gradient framing:

Identity persistence is an emergent property of attractor dynamics
Rigidity increases will impedance, making agents more resistant
Identity can drift under sustained pressure, but always tends back

Summary Table

Feature	Standard RL/LLM	DDA-X
Response to surprise	Explore more	Contract (reduce k_eff)
Threat modeling	Reward shaping	Content-addressable wounds
Temporal dynamics	Single scale or none	Fast/Slow/Trauma decomposition
Output bandwidth	Fixed or random	Mode bands constrain words
Recovery	Implicit or absent	Explicit trauma decay rules
Identity	Stateless	Attractor dynamics with stiffness

Implications for AI Safety

These unique contributions have potential implications for building AI systems that:

Respect boundaries — High rigidity naturally constrains behavior
Remember harm — Trauma accumulation creates lasting caution
Recover safely — Therapeutic dynamics allow healing under safe conditions
Maintain identity — Attractor forces resist manipulation

The framework provides a vocabulary and mathematics for discussing agent "psychology" that standard approaches lack.