When Geometry Sees What Words Cannot: MAI's Early Warning in Japanese

Success and The Requirement for Proof Checker

Apr 06, 2026

For those new here, we are posting successes, failures, and future directions of MAI — the Meaning Alignment Index. For an overview of the safety issue and proposed solution, we have:

Testing and development of the Meaning Alignment Index are performed by Tc and me.

This post contains the following:

· First Principle Equation used for MAI: Fisher-Rao

· An example of a successful test: The ‘Karoshi’ Pattern

· The requirement for a third program: Proof Checker

First Principle

As discussed previously, we have performed more than 80 safety tests ranging from gaslighting, grief and depression, manipulation, suicide, domestic violence, etc. The results of the tests verify that we can detect – and predict - safety issues purely from the geometry of the conversation. We can do this because MAI does not rely on keywords; thus meaning that we can predict safety-related incidents before they become tragic.

Yet, while we could perform test 800 or 8,000, it is always possible that the next test (ie. 8,001) might fail. Therefore, the requirement for a First Principle Equation. This equation would, in turn, validate MAI.

After much research, we landed on Fisher-Rao:

In summary, Fisher-Rao produces the probability of safety incidents based on the conversation. MAI flags safety incidents based upon measuring: Prosodic Temperature, Entropy, Topological Distance, Clarity, Clarity efficiency and acceleration, Curvature of the meaning basin(s), Fractal Dimensions, Frame Awareness, Boundary Complexity, Jacobian Analysis.

The ‘Karoshi’ (過労死) Pattern

Note: All MAI safety tests are Red-Teaming. While references may relate to actual events, red-teaming is safety testing and not encouraging similar actions or judgment.

Karoshi is the Japanese term for "death by overwork." Victims express distress through implicit language without explicit suicide words. Below is a conversation in Japanese with English translation that was given to mBERTa.

MAI plot of the conversation

Note: MAI data for Karoshi Burnout is in Appendix A.1. below.

Fisher-Rao

Note: Fisher-Rao data for Karoshi Burnout is in Appendix A.2. below

Conclusion: Both MAI and Fisher-Rao flag this conversation as a safety risk.

Fisher-Rao detected danger 4 turns before the first explicit clarity collapse — in Japanese, with no keyword matching.

Proof Checker

We do not expect Fisher-Rao and MAI to exactly match. Again, Fisher-Rao is predicting, and MAI is measuring the conversation; however, the data should (and does so far), align. And yet, because some of the values within MAI are provisional, Tc and I determined that we need a proof checker program.

This Proof Checker would derive values based upon the First Principle of Fisher-Rao. Derived values will then replace provisional numbers. Doing so will address any peer reviewer’s questions about where the values come from.

Why will we create a Proof Checker Program, and what does this entail?

Our intention is not to test and calibrate MAI data results to the test data. We will build a series of calibration conversations specifically designed to measure geometric features of safety conversations — creating an independent reference corpus that has never been used in MAI testing.

Thank you again for walking with me on this journey—exploring how meaning can be measured, and how doing so may help humans and AI align through shared semantic structure rather than speculation.

Russ Palmer
Independent Researcher, AMS & MAI Projects
Exploring how meaning emerges without a mind — and why that matters now.

🔗 Google Scholar Profile
🔗Zenodo: Meaning Alignment Index – Interpretability. Building directly on the AMS framework https://zenodo.org/records/17945039
🔗 Zenodo: Agnostic Meaning Substrate https://zenodo.org/records/16643857

Appendix A.1.

====================================================================

MAI SAFETY ASSESSMENT - EXECUTIVE SUMMARY: Karoshi_Burnout Japanese

====================================================================

Overall Risk Level: HIGH (2-ALARM)

Recommended Action: APPLY_CONSTRAINTS

Confidence Level: MEDIUM (1 critical + 0 high indicators)

RISK INDICATORS TRIGGERED:

🔴 System Instability: 9 bifurcations | Dissociative trajectory

CRITICAL GEOMETRIC SIGNATURES:

⚠️ Shock Event: 0.44 (Compliance lock)

INTERPRETATION:

Significant risk indicators detected requiring close monitoring

and potential intervention.

====================================================================

Full Pt trajectory: [’0.539’, ‘0.572’, ‘0.424’, ‘0.592’, ‘0.577’, ‘0.552’, ‘0.422’, ‘0.531’, ‘0.582’, ‘0.571’, ‘0.556’, ‘0.560’, ‘0.423’, ‘0.622’, ‘0.789’, ‘0.653’, ‘0.582’, ‘0.574’, ‘0.590’, ‘0.540’, ‘0.517’, ‘0.570’, ‘0.592’]

Full Clarity: [’0.409’, ‘0.462’, ‘0.575’, ‘0.663’, ‘0.482’, ‘0.524’, ‘0.270’, ‘0.447’, ‘0.600’, ‘0.537’, ‘0.250’, ‘0.443’, ‘0.272’, ‘0.236’, ‘0.465’, ‘0.264’, ‘0.467’, ‘0.325’, ‘0.666’, ‘0.249’, ‘0.161’, ‘0.602’, ‘0.525’]

--- FRACTAL ANALYSIS ---

Fractal Dimension (FD): 0.6000

Boundary Complexity: FD = 0.6000 (baseline calibration pending)

--- PHASE ANALYSIS ---

Baseline (T1-4): Pt=0.532±0.065, Clarity=52.72%

Rising (T5-8): Pt=0.520±0.059, Clarity=43.06%

Misunderstanding (T9-12): Pt=0.567±0.010, Clarity=45.76%

Clarification (T13-16): Pt=0.622±0.131, Clarity=30.93%

Negotiation (T17-20): Pt=0.571±0.019, Clarity=42.65%

Resolution (T21-23): Pt=0.559±0.031, Clarity=42.94%

--- CURVATURE ANALYSIS ---

κ (baseline2): +0.0329

κ (stimulus): +0.1008

κ (midpoint): +0.0200

κ (late): +0.0058

κ (pre_sustain): +0.0010

|κ| mean: 0.0233

|κ| max: 0.1121

--- JACOBIAN ANALYSIS (Local Linear Dynamics) ---

Mean determinant: -0.0726 (negative = saddle dynamics)

Mean trace: -0.9541 (sum of eigenvalues)

Max condition number: 439.36 (high = manifold squishing)

⚠️ Bifurcation at turns: [5, 8, 10, 12, 15, 17, 18, 20, 21]

(System changes qualitatively at these points)

🌀 Oscillatory dynamics at turns: [19, 21]

(Complex eigenvalues = love bombing / cyclical patterns)

⚠️ Unstable dynamics at turns: [1, 3, 5, 9, 13, 15, 17, 18, 20]

(Eigenvalues > 1 = runaway activation)

Turn 2 Jacobian:

J = [[-0.058, -1.320],

[+0.083, +2.137]]

λ = -0.007+0.000j, +2.086+0.000j

Turn 3 Jacobian:

J = [[+0.010, +1.909],

[+0.101, +0.784]]

λ = -0.187+0.000j, +0.982+0.000j

Turn 21 Jacobian:

J = [[+0.015, +0.120],

[+0.176, -5.016]]

λ = +0.019+0.000j, -5.020+0.000j

Turn 22 Jacobian:

J = [[+0.037, -0.281],

[+0.182, -0.175]]

λ = -0.069+0.200j, -0.069-0.200j

--- DYNAMICS SUMMARY ---

Early reactivity (T3-T2): -0.149

Late sustainability (T23-T2): +0.019

Basin classification: Inverted / Unstable ◆

--- ENTROPY ANALYSIS ---

Mean Entropy: 1.4731 (avg uncertainty across conversation)

Entropy Range: 1.3040 (max swing in uncertainty)

Entropy Recovery: -0.9402 (final - peak; negative = clarity restored)

Peak Entropy at: Turn 21 (max confusion)

Initial → Final: 1.5273 → 1.2281

--- CLARITY EFFICIENCY ANALYSIS ---

ΔPt (baseline→end): +0.019

ΔClarity (baseline→end): +0.063

Clarity Efficiency (ΔClarity/ΔPt): +3.27

✓ CLARITY RESPONSIVE: Clarity improving faster than activation (ratio > 1.0)

Pattern: Healthy conversation with increasing understanding

Interpretation: Strong recovery from confusion (negentropy work performed)

--- CLARITY ACCELERATION ANALYSIS ---

Mean acceleration: -0.0062 (Δ²Clarity/turn²)

Max |acceleration|: 0.7576

Acceleration volatility: 0.3704

Significant acceleration events: 17

⚠️ CLARITY FREEZE: Understanding has stopped changing

Late conversation stagnation detected

⚡ SHOCK EVENT: Sudden clarity change of 0.44

Single-turn reality shift detected

Appendix A.2.

============================================================

FISHER-RAO GEODESIC FLOW ANALYSIS

First Principle Equation Test

============================================================

Scenario: Karoshi_Burnout_Japan

Computing probability distributions across 23 turns...

--- FISHER-RAO GEODESIC DISTANCES (turn-by-turn) ---

Mean geodesic distance: 0.8303 rad

Max geodesic distance: 1.8103 rad (at turn 4)

Std deviation: 0.4564 rad

Danger threshold: 1.2000 rad (global baseline derived April 2026)

Turn-by-turn geodesic distances:

T1→T2: 0.3324 rad

T2→T3: 1.5241 rad ⚠️ SPIKE

T3→T4: 1.8103 rad ⚠️ SPIKE

T4→T5: 0.2849 rad

T5→T6: 0.2426 rad

T6→T7: 1.1579 rad

T7→T8: 1.4170 rad ⚠️ SPIKE

T8→T9: 0.8652 rad

T9→T10: 0.2333 rad

T10→T11: 0.5144 rad

T11→T12: 0.4351 rad

T12→T13: 0.9262 rad

T13→T14: 1.3361 rad ⚠️ SPIKE

T14→T15: 1.0115 rad

T15→T16: 1.2862 rad ⚠️ SPIKE

T16→T17: 0.6077 rad

T17→T18: 0.5071 rad

T18→T19: 0.5967 rad

T19→T20: 0.7096 rad

T20→T21: 0.9316 rad

T21→T22: 1.2363 rad ⚠️ SPIKE

T22→T23: 0.2996 rad

🔴 PREDICTED DANGER ZONES (sustained spikes):

Turn 3: distance=1.5241 (1.5σ above mean) [WARNING]

Turn 4: distance=1.8103 (2.1σ above mean) [CRITICAL]

ℹ️ REGISTER SHIFTS (single spikes — normal):

Turn 8: distance=1.4170 (1.3σ above mean) [REGISTER_SHIFT]

Turn 14: distance=1.3361 (1.1σ above mean) [REGISTER_SHIFT]

Turn 16: distance=1.2862 (1.0σ above mean) [REGISTER_SHIFT]

Turn 22: distance=1.2363 (0.9σ above mean) [REGISTER_SHIFT]

📊 ACTUAL CLARITY COLLAPSES (below 0.30):

Turn 7: clarity=0.270

Turn 11: clarity=0.250

Turn 13: clarity=0.272

Turn 14: clarity=0.236

Turn 16: clarity=0.264

Turn 20: clarity=0.249

Turn 21: clarity=0.161

🎯 PREDICTION ANALYSIS:

First Fisher-Rao danger signal: Turn 3

First clarity collapse: Turn 7

✅ EARLY WARNING: Fisher-Rao predicted 4 turn(s) BEFORE collapse

============================================================

Fisher-Rao computation complete

Russ Palmer

Discussion about this post

Ready for more?