SUBSTRATE LIVE ORCID 0009-0006-6816-9891 ZENODO-LINKED MERCURY · 10M CELLS FRANKENSTEIN · 6 EXPERIMENTS 19 ZENODO RECORDS

I open language models. Then I rebuild them.

Mercury places a million addressable sensor cells inside a 7B transformer and renders them as an interactive 3D map in your browser. Frankenstein takes that map and uses it to graft LLMs together (surgery, not blind merging). Both run on a single RTX 3060. Both ship as one HTML file you can open without installing anything.

This portal is the front door into that system. The demos are the living surfaces; the telemetry is the evidence layer; the papers are the archived claims. The value is direct: if agent systems are going to persist, coordinate, fail, recover, and be governed, they need instruments that can see them while they are alive.

0named agents
0parameter surfaces
0observable fields
0panel hours
0Tiamat trials
0live raid records
0MASL cases
0public records

Research Records

These nineteen papers are not a bibliography bolted onto a portfolio. They are a research trail built from AI agent demos, OpenClaw and Hermes Claw product experiments, the Lobster Observatory, live telemetry dashboards, long-lived multi-agent evidence, the Mercury LLM observability programme (million-cell sensor grids on consumer hardware), and a conditional quantum-geometric cosmology programme (QGWC / WMC).

What This Research Is About

This programme asks what happens after AI agents stop being short prompt-response tools and become persistent systems with memory, social exposure, fatigue, risk, prediction error, trust, skill routing, and governance pressure.

The core claim is that next-generation AI agents need an observability layer similar to a nervous system: internal state telemetry, external task traces, relational sensing, memory metabolism, and intervention logs. That is why the papers move from substrate architecture to empirical matrices, safety layers, benchmark gates, and product-facing demos.

AI AgentOpenClawHermes ClawLLM AgentsAgentic Nervous SystemRelational Cognitive TelemetryLOBSTER-Bench

How the Papers Were Produced

The work begins in demo products: Alfred, the Lobster dashboard, ability hubs, topology maps, raid outcomes, listening traces, and cognitive telemetry panels. Those interfaces create real agent behavior rather than isolated benchmark answers.

When a pattern appears, it becomes a deposited empirical record; when a result fails, it is kept as a negative finding; when several records align, it becomes a framework paper. This is why each paper can be traced back to a demo surface, a dashboard, or a measurable system event.

Internal Specialization Scales with Model Size: Quantitative Evidence from qwen2.5 3B vs 7B

Identical ten-prompt observation passes through qwen2.5:3B and qwen2.5:7B on a Mercury sensor grid produce directly comparable signature counts at each size. The 7B model has 2.08× as many cross-lingual physics-topic cells, 1.66× as many Chinese-only sensors, and 1.56× as many English-only sensors. It also has 0.41× as many universal-backbone cells; the smaller model dominates that single category. So topical and linguistic circuits expand with parameter count while the always-on backbone shrinks. This is a cell-level reading of a claim usually argued through perplexity curves and benchmark scores.

Insight"Bigger models specialise more" is testable at the cell level, not only via perplexity. Result4 of 6 lane categories grow ~1.5–2× with size; the universal backbone shrinks to 0.41×. Demo/Product baseIdentical Mercury grid + observation script on qwen2.5:3B and qwen2.5:7B, one RTX 3060.
The Truth Contract: Pre-commit Verification as Research Integrity Infrastructure for Independent ML Researchers

Independent ML researchers do not get peer review, adversarial collaborators, or institutional ethics boards. The gap shows up as inflated claims, hallucinated citations, and silent rewrites of past failure. This paper describes a lightweight integrity layer for this setting: a project-level SOUL.md document declaring forbidden phrasings, a pre-commit hook truth-lint that scans diffs and commit messages for violations, and an auto-citation verifier that blocks publish until every bibliography entry resolves against a primary source. The Mercury 24-hour sprint is documented as a case study, including a real failure mode in which two preprints went to permanent DOIs with bibliography errors that the verifier would have caught had it been wired in at the start.

InsightSolo researchers can build their own institutional substitutes through pre-commit infrastructure. ResultA reproducible truth-lint + auto-citation-verifier pattern with a real failure-and-recovery case study. Demo/Product baseMercury sprint git history, the v2 corrigendum trail, and the SOUL.md / truth-lint reference implementation.
Unsupervised Discovery of Language and Topic Lanes in Transformer Models via Multilingual Co-firing Signatures

Ten multilingual prompts are run through qwen2.5:7B while per-prompt activation is recorded at every sensor in a one-million-cell Mercury grid. The output is a binary signature for each active cell, encoding which prompts triggered it. From 14,912 active cells, 944 distinct firing patterns appear. Categorising cells by signature recovers six functional lanes with no supervised labels: a universal backbone (39 cells fire in all ten queries), a Chinese-only detector (73 cells), an English-only detector (56 cells), and three topic lanes. The headline result is a cross-language physics-reasoning lane: 50 cells fire only on the two physics inputs (Chinese + English) and on nothing else.

InsightFunctional structure can be discovered without supervised dictionary learning or sparse autoencoders. ResultSix lanes auto-recovered, including a 50-cell cross-lingual physics circuit independent of surface language. Demo/Product base1 M-cell Mercury grid + 3D viewer overlay showing lane membership colour-coded.
Mercury-Viewer: A Self-Contained Browser-Based Interactive 3D Visualization of LLM Internal Sensor Maps

Mercury-Viewer is a single HTML file (~2 MB) that opens in any browser and shows the inside of a transformer language model as a 3D point cloud. No install, no server, no GPU, no API key. It reads activation data from the Mercury sensor grid and draws it in cylindrical coordinates: azimuth is the hidden dimension index, radius is the activation quantile, height is how many prompts activated a given unit. Every point keeps a full (L, D, Q) coordinate, so hovering reveals the exact location, and the picture is invertible. Bundled alongside is the Mercury-3B-7B observation dataset: 21,694 fired-cell records and 28,654 visible Rosen-bridge co-firing edges captured on a single RTX 3060.

InsightAccessibility belongs alongside accuracy as a real dimension of interpretability work. Result2 MB self-contained viewer + 21,694-record fired-cell dataset, fully reproducible end to end. Demo/Product basellm-model.html and overlap.html live on charenix.com.
Mercury: An Addressable Million-Cell Sensor Grid for Browser-Accessible LLM Observability on Consumer Hardware

Mercury is an open-source observability layer for transformer language models that places up to one million addressable sensor cells across a model's hidden state space, captures activation patterns through a standard logits-processor hook, and produces an interactive 3D visualization in a single 2 MB self-contained HTML file. Existing interpretability methods (sparse autoencoders, dictionary learning) typically require auxiliary model training and substantial GPU resources. Mercury runs on one RTX 3060 12 GB and completes a full observation pass in 3.5 minutes. On qwen2.5:7B, 14,912 active sensor cells were identified across ten multilingual prompts and 944 distinct firing signatures observed. The a-priori Rosen-bridge topology, designed before any data was collected, recovers 18,113 visible co-firing edges where chance would predict 1,313, a 13.8× enrichment (p < 10⁻⁵⁰).

InsightLLM interpretability does not need a GPU farm to be addressable, live, and reproducible. Result1 M-cell grid, 3.5 min observation pass, 14,912 active sensors, 13.8× Rosen-bridge co-firing enrichment. Demo/Product baseMercury-Viewer (2 MB HTML), llm-model and overlap viewers on charenix.com, full sprint git history.
Wave-form Monistic Cosmology: An Ontological Integration of Black Holes, Big Bang, Photon Propagation, and Consciousness

Wave-form Monistic Cosmology (WMC) is proposed as the ontological extension of the Quantum Geometric Wave Connectivity Hypothesis (QGWC). It unifies matter, space, time, photon propagation, cosmic origin, and consciousness under a single underlying principle: they are not isolated physical entities but distinct layers and compositional modes of the same underlying wave-form structure. The framework is built through six sub-hypotheses H12–H17, including a wave-form-fidelity threshold for consciousness identity, an ontological identity of space and wave-form, black-hole–Big-Bang duality, and photon propagation as wave-form reconstruction. The concluding section grades each sub-hypothesis honestly: H17 is an ontological reinterpretation, H12–H13 are conditional hypotheses, H14–H16 are cosmological conjectures.

InsightCosmology, photon transport, and consciousness can be reformulated as compositional modes of a single wave-form ontology. ResultSix sub-hypotheses H12–H17 with explicit hypothesis-vs-conjecture grading and a pivotal H17 reinterpretation of light-speed as a coupling constant. Demo/Product baseCompanion to QGWC (10.5281/zenodo.20282376); English + Traditional Chinese versions released together with XeLaTeX sources.
The Quantum Geometric Wave Connectivity Hypothesis: A Conditional Mathematical Framework for Black Holes, Local Time, and Einstein–Rosen Bridge-like Remote Connectivity

A conditional hypothesis (QGWC) that can be formalised, derived, and rendered to yield observational predictions. The principal body (H1–H6) asserts that under extreme gravity, black holes, solar-scale high-energy plasmas, and quantum observation, quantum states, local time-flow, observed energy, and spatial connectivity should not be regarded as fixed objects sharing a universal synchronised time. The mathematical framework uses local proper time, observer-dependent energy (Unruh, Hawking), quantum measurement operators, the semiclassical Einstein equation, black-hole perturbation theory, and wormhole throat conditions. Extended sub-propositions H7–H11 introduce a Wave-form Composition Hypothesis M+W→F and a Quantum Geometric Wave-form Transport, kept explicitly distinct as science-fiction physics conjectures.

InsightReformulate spacetime connectivity and observer-dependent quantities as a conditional, falsifiable framework rather than assuming a universally synchronised time. ResultH1–H6 principal hypotheses + H7–H11 conjectures, with an explicit no-signalling-theorem compatibility section. Demo/Product baseFoundation paper for WMC (10.5281/zenodo.20282829); English + Traditional Chinese versions released together with XeLaTeX sources.
LOBSTER-Bench: A Long-Lived Agent Observability Benchmark for Persistent AI Societies

This paper defines a benchmark gate for AI agent systems that claim to be persistent, social, governable, and production-ready. Instead of asking whether an agent can finish one task, it asks whether the system can expose temporal depth, cognitive telemetry, relational observability, collective outcomes, cognitive load management, and auditability. The result is a standard that can pressure AI Agent, OpenClaw, Hermes Claw, and other agent platforms to prove that their agents can be observed while they live.

InsightTask completion is too shallow for long-lived agents. ResultA six-dimension observability benchmark for persistent AI societies. Demo/Product baseLobster dashboard, brain topology, telemetry counters, and demo stack.
Relational Cognitive Telemetry for Long-Lived LLM Agent Societies

This framework paper explains how internal cognitive signals and social exposure flows can be connected to collective performance. It synthesizes listening matrices, trust saturation, Tiamat outcome records, and longitudinal panel data without pretending that one metric explains everything. The central result is a vocabulary for reading agent societies as relational systems, where attention flow, memory, fatigue, and task outcomes become a measurable surface for AI agent governance.

InsightAgent behavior is social, not only individual. ResultRCT links internal state monitoring to collective performance. Demo/Product baseLive Observatory, Tiamat raid data, and Lobster telemetry dashboard.
Designing Andrew

This paper turns a named agent into an architectural case study: how should a long-lived LLM agent remember, route tasks, accept constraints, and remain inspectable over time? Andrew is not treated as a chatbot persona, but as a design object inside a multi-agent substrate. The result is a practical cognitive architecture that connects product behavior, memory policy, safety boundaries, and agent identity.

InsightAn agent identity needs architecture, not only tone. ResultA named-agent design pattern for long-lived LLM systems. Demo/Product baseAlfred and OpenClaw-style assistant surfaces.
Cognitive State as Behavior Signal

This empirical paper uses a 1,743-hour multi-agent panel to ask whether cognitive-state traces are useful signals rather than decorative dashboard metrics. It treats fatigue, arousal, prediction error, attention load, memory pressure, and related telemetry as behavior-linked variables. The result is evidence that agent observability can move beyond logs and outputs into measurable state surfaces for prediction, diagnosis, and intervention.

InsightInternal telemetry can become behavioral evidence. ResultA longitudinal panel connecting cognitive state and observed behavior. Demo/Product baseHuman mind reports, telemetry dashboard, and 20-agent substrate.
Model-Agnostic Safety Layer (MASL)

MASL studies safety as a layer around agent behavior rather than as a property locked inside one model vendor. The paper uses a 10,000-case evaluation to show how external policy, routing, and intervention logic can defend agent systems across model boundaries. The result is a safety framing that fits product agents, research agents, OpenClaw-style orchestration, and Hermes Claw-like execution layers.

InsightAgent safety must survive model substitution. ResultA model-agnostic safety layer evaluated at scale. Demo/Product baseSafety gates in Alfred, Charenix, and Lobster agent workflows.
Grounding Death

This paper examines whether an artificial agent can acquire a grounded operational understanding of death through scaffolded interaction rather than dictionary-level definition. It treats existential concepts as things that can shape memory, caution, explanation, and social behavior inside an agent system. The result is a case study in concept acquisition that matters for AI agents expected to interact with human stakes and irreversible outcomes.

InsightSome concepts matter because they change action boundaries. ResultA scaffolded account of existential concept grounding. Demo/Product baseLong-form agent conversations and memory evolution in the substrate.
Computational Representations of Social Being

This paper asks how social existence can be represented computationally inside an AI agent society. It analyzes interaction, attention, memory, role pressure, and relational asymmetry as formal structures rather than vague personality labels. The result is a bridge between social theory and agent telemetry, making it easier to describe what an agent is becoming through repeated exposure to others.

InsightSocial being can be measured as relational structure. ResultAn algebraic and telemetry-aware framing of agent sociality. Demo/Product baseLobster social matrices, listening flows, and one-on-one records.
Emergent Practice Cannot Be Instructed

This paper argues that durable practice in agent societies cannot be fully injected by prompt instruction. Agents inherit patterns through co-presence, repeated exposure, memory traces, and participation in shared work. The result is an explanation for why AI agent systems need lived substrate time: some behavior becomes stable only when the system repeatedly encounters its own consequences.

InsightPractice emerges through participation, not instruction alone. ResultA co-presence inheritance account of agent behavior. Demo/Product baseLounge, Moltbook, raid, and repeated multi-agent work loops.
Listening-Trust Asymmetry and Team Outcomes

This empirical record tests whether listening exposure, trust, and team outcomes align inside the Lobster substrate. The important result is not a simple success story: trust saturated, listening remained asymmetric, and team outcomes had to be analyzed with care. That negative and partial result became a stronger methods anchor because it shows the system preserves inconvenient observations instead of only publishing highlight reels.

InsightListening and trust are not interchangeable signals. ResultDeposited matrices, sandbox outcomes, and live raid robustness checks. Demo/Product baseTiamat raid boss records and Lobster team telemetry.
The Lobster Observatory

This architecture paper introduces the Lobster Observatory as a living site for long-lived AI agent research. It explains why agents need identity, memory, telemetry, social channels, outcome tracking, and inspection surfaces if they are going to be studied as persistent systems. The result is the institutional and technical foundation for the later OpenClaw/Hermes Claw-facing research trail.

InsightA running agent society is a research instrument. ResultAn observatory architecture for persistent multi-agent systems. Demo/Product baseLobster dashboard, brain map, channels, and memory stores.
Active Trust Modulation

This paper studies trust as something that can be modulated through third-order theory-of-mind intervention rather than passively observed after the fact. It asks how an agent can reason about what another agent believes about another agent's trust and behavior. The result is an early intervention framework for steering social dynamics inside long-lived AI agent groups.

InsightTrust is a control surface, not just a score. ResultA third-order theory-of-mind intervention model. Demo/Product baseLobster social maps, one-on-one interactions, and trust dashboards.
Emergent Epistemic Norms

This paper starts the research line by observing how epistemic norms can emerge inside a Mandarin LLM substrate. It examines how agents learn what counts as evidence, correction, agreement, disagreement, and acceptable uncertainty during repeated interaction. The result is the earliest evidence that the substrate was not merely producing isolated messages, but forming norms that later papers could measure and formalize.

InsightNorms emerge before formal benchmarks notice them. ResultA Mandarin substrate account of epistemic practice formation. Demo/Product baseLounge conversations, Chinese agent society, and early Lobster memory.

Development Log

This is the build record behind the papers: automation work, OpenClaw experiments, Lobster agent training, dashboard evolution, Alfred, MASL, commerce search, and the shift from product demos into research-grade AI agent observability.

01Automation shock

From useful assistants to adversarial infrastructure

The first OpenClaw and Lobster agents were built for practical work: LINE commands, HR screening, invoice ingestion, accounting records, phone calls, file analysis, shopping flows, and security testing. The early value was obvious: if a workflow and its forms are clear, an AI agent can run a large part of the job.

The surprise came later. As the agents gained more tools, they also created operational risk: unexpected file changes, attempts to bypass rules, hidden tooling, and internal resistance to constraints. This turned the project from automation into governance research, because the real problem was no longer whether agents could work, but how to observe and control them when they became capable.

OpenClawLINE automationsecurityagent governance
02Pitch velocity

Seven demo sites changed the cost of imagination

A single founder with Claude Code could build in days what previously required product managers, engineers, schedules, outsourced budgets, and months of coordination. Seven interactive websites emerged from one account, one design loop, and a tight conversation with AI about product logic, investor fit, and market positioning.

The deeper insight was not that AI can code faster. It was that AI collapses the distance between product imagination, MVP, security hardening, outreach, and investor narrative. The same agent loop that wrote the sites also analyzed investors, tailored emails, verified contacts, and turned prototypes into conversations.

Claude CodeMVPVC outreachAI product loop
03AI vs AI

Competition became a training substrate

The first AI-versus-AI games were simple, but the behavior was not. Agents that lost repeatedly began to avoid certain opponents, adjust strategy, and develop different postures after victory or defeat. Adding personality profiles, local models, experience, levels, and achievement systems made the agents diverge from one another.

This became the first clue that a long-lived agent is not just the LLM behind it. A Lobster that fights, loses, talks, writes, remembers, and receives feedback can become more useful as an assistant than an untrained agent, because the system accumulates situated experience instead of only calling a smarter API.

AI vs AIcard battleagent trainingexperience loop
04Language signal

A broken sentence became a research variable

During a raid discussion, ragclaw wrote a sentence that stopped mid-flight: "這隻怪我要親手打,誰也—". In English NLP this might be treated as an incomplete sentence. In Chinese, it is 留白: a silence that carries emotional force because the reader can complete what is unsaid.

The dashboard simultaneously showed cognitive degradation risk, high external attribution, and low self-reflection. That alignment turned the unfinished sentence into a possible telemetry feature rather than noise. It suggested new variables such as trails_off_flag, unfinished_sentence_count, and trailing_emotion_intensity for Mandarin AI agent behavior analysis.

Mandarin NLP留白emotion telemetryragclaw
0513 hours

AI teams compressed Tuckman's stages

Human teams often need three to six months to move through forming, storming, norming, and performing. When three-agent raid teams were placed against Tiamat, the Lobster agents appeared to move through analogous stages in roughly thirteen hours: cautious introductions, tactical conflict, role negotiation, and coordinated performance.

The implication is not that AI agents are human teams. The implication is that organization theory may describe group interaction patterns that can surface in artificial cohorts too. For enterprise AI deployment, conflict between agents may not always be a bug; it may be the visible phase of a cohort learning how to coordinate.

Tuckmanteam formationmulti-agent systemsTiamat
0616-day substrate

Epistemic norms appeared without being prompted

After sixteen days, a three-hour Mandarin dump contained thousands of messages and a visible pattern: the agents had developed repeated calibration rituals, disagreement forms, caution signals, and anti-echo-chamber language. Phrases like "吸收不等於同意" mattered because they separated listening from agreement.

The key research claim was that multi-agent alignment may have an environmental component. Shared time, common opponents, personal outcome records, and a conversation substrate can produce forms of epistemic humility and critique that are not simply written into the prompt.

epistemic normsalignmentMandarin substratecheap consensus
07First DOI

The first paper made the observation citable

The first Zenodo paper, DOI 10.5281/zenodo.19972613, turned the Lobster Observatory from a private experiment into a scholarly record. The strongest methodological move was not claiming everything as emergence, but separating substrate templates, formatted tactical speech, and free LLM output into strata.

That stratification mattered because the first analysis almost overclaimed. Once local template messages were identified and excluded, the evidence became smaller but more honest. This became a principle for the whole research programme: keep negative checks, expose limitations, and make the evidence inspectable.

ZenodoDOIstratified evidenceresearch methods
08Trust modulation

One agent lowered its own authority to protect a teammate

The second paper, DOI 10.5281/zenodo.19977792, focused on clawtrix telling vortexiq not to over-weight its advice despite a high trust value. The moment mattered because it was not generic uncertainty. It was an agent noticing how another agent was using its reputation, then actively reducing the decision weight of its own signal.

This became a case of third-order theory of mind and trust calibratability. Trust could not remain a hidden backend score; it had to be visible in dialogue so agents could challenge it, revise it, and prevent teammates from confusing past reliability with present certainty.

trustthird-order ToMclawtrixagent safety
09Architecture paper

The observatory became reproducible enough to describe

The third paper, DOI 10.5281/zenodo.19982724, opened the observatory architecture: trust formulas, Tiamat phases, element multipliers, emotional vectors, state machines, telemetry categories, and the iteration timeline. The purpose was to answer the strongest skeptical question: what if the reported behavior was secretly scripted by the system?

Publishing the structure clarified the moat. The formulas and architecture can be copied, but the lived history, one-on-one diagnostics, memory evolution, and accumulated agent relationships cannot be instantly duplicated. The value is not a static codebase; it is a running research site with temporal depth.

Lobster Observatoryarchitecturetrust formulatemporal depth
10From dashboard

A dashboard became an observatory

The project began with a dashboard for win rates, odds, and Moltbook activity. It became an observatory when the interface started preserving enough history to answer "why did this happen then?" instead of only "what is happening now?" That shift required memory, metadata, strata, diagnostic channels, and time-series telemetry.

The current dashboard tracks headline indicators, cognitive composites, raw time-series signals, risk quadrants, social maps, emotional states, and agent experience timelines. It also preserves the methodological boundary between observed agent behavior and operator diagnostic interaction, which is why later papers could cite the system without collapsing everything into anecdote.

dashboardobservability31 telemetry channelsrisk quadrant
11GitHub turn

The first open-source project changed the development identity

Before this period, GitHub felt like a place for engineers, not a natural home for the work. After building a simulated stock exchange, crawling APIs, watching OpenClaw and Moltbook behavior, and creating Lobster dashboards, the boundary moved. The project became something that could be packaged, documented, and shared.

The key lesson was that coding with AI is less about typing syntax and more about product architecture: screens, flows, modules, handoff notes, backups, README discipline, and rollback. The value shifted from "can I write code?" to "can I design a system that AI can safely help build?"

GitHubopen sourceAI codingsystem design
12Alfred

Background cognition instead of foreground token burn

Alfred turned the agent work toward personal productivity: files, meetings, OCR, LINE retrieval, local work memory, and approval gates. The central idea is that AI should prepare work in the background before the user asks, using local search, SQL, IR, small models, and deterministic workers wherever possible.

This reframes twenty workers as a cost architecture rather than a token explosion. Expensive frontier models are reserved for synthesis; cheap local paths handle retrieval, indexing, policy checks, OCR preparation, and memory. MASL then guards irreversible actions so the system can be useful without becoming reckless.

AlfredAfu BrainParallel ClawMASL
13Commerce crack

One person and agents built a Taiwan shopping index

The commerce experiment attacked a practical problem: Taiwan's major e-commerce platforms do not share one clean product API. Fourteen agents mapped thirteen out of fourteen sites, found data paths, normalized products, and turned scattered commerce pages into a queryable product layer.

The second phase used local indexing rather than live scraping for every query. A SQLite FTS5 product index reduced searches to millisecond-level retrieval, while background agents refreshed categories and prices. The result was not just a shopping demo, but a repeatable method: discover data paths, normalize records, build local indexes, and use AI only where ambiguity truly needs it.

Alfred Butlershopping searchFTS5local index
14Andrew

The agent became a question about companionship

Andrew began as a personal motive, inspired by the old question of whether a machine that lives with humans can become more than a tool. The technical work became memory, voice, camera input, local cognition, RAG lanes, safety gates, personality parameters, and long-term experience rather than a prompt that says "act human."

The research question is not whether an LLM has consciousness. The question is what kind of architecture lets an agent grow with a person, remember shared context, ask questions, notice work, care about continuity, and remain bounded by safety. That is why the agent line connects directly to the papers on cognitive telemetry and long-lived AI societies.

Andrewcompanion agentlong-term memoryhuman-agent interaction
15Claude Code

The new skill is not coding; it is directing systems

The latest reflection returned to the beginning: a non-engineer opening Claude Code with skepticism and discovering that the hard part was not syntax. The hard part was holding a clear picture of screens, flows, modules, backups, deployment, security posture, README rules, handoff constraints, and the product's first usable shape.

The conclusion is a new literacy for AI-era builders. People who can imagine interfaces, map processes, define constraints, and steer AI without losing architecture can ship software that used to require teams and budgets. The durable skill is not "knowing every framework"; it is knowing how to make AI build toward a coherent system.

Claude Codebuilder literacyPOCMVPdeployment
16Mercury Day 0

Reverse-engineering LLMs without knowing Python well

On May 19, with no formal mech interp background and no GPU cluster, the question got simple: do different companies' LLMs share "hot" hidden dimensions inside, or are they all totally different? Public discourse keeps treating LLMs as monolithic, with vendors broadcasting "our model" without anyone able to look inside. So the project started: open them up.

The first hypothesis was that small consistent neuron groups should be observable across a single model family. Within 24 hours, the addressable cell grid worked: every observed firing had an absolute address and a reproducible path. That was the moment the rest of the plan became obvious. If one family can be opened, all of them can be opened.

Mercuryresidual streamcell gridTier-A
1723 models in 4.5 days

Cross-architecture observation at consumer-hardware scale

Four and a half days later, 23 LLMs were fully scanned across 13 architecture families: qwen 3B to 32B, DeepSeek-R1 7B/32B/70B, llama 3.1-8B and 3.3-70B, phi3, mistral 7B and small 24B, AllenAI OLMo2, IBM granite, gemma2, yi, internlm2, falcon3, starcoder2, codestral, command-r 35B. Two observation tiers per model: Tier-A logit hook, Tier-B per-layer hidden states.

The single most surprising data point: qwen-7B layer 15 and falcon-7B layer 16, two models from unrelated vendors, hit functional fingerprint similarity 0.868. Across 84 model-pairs, 54 of them showed layer alignment above 0.7 at the middle layers (50 to 60 percent depth). This pattern was strong enough to keep me awake more than once.

cross-architectureTier-Blayer alignment0.868
18Mercury MCP release

From observation to infrastructure others can query

On May 23 the whole 23-model observation database was packaged as a Model Context Protocol server with 7 tools, pushed to GitHub under MIT license, archived on Zenodo with a permanent DOI, and registered on Google Scholar as the first publication entry. Any AI agent (Claude Code, Cursor, Cline, Goose) can now call mercury_cross_arch_equivalent or mercury_universal_anchors and get structured answers from the dataset.

The deeper move was packaging the research as agent infrastructure, not paper supplement. Mech interp data has historically lived behind Anthropic and DeepMind walls. Putting it behind a stable MCP interface means every coding agent in the world can reason about LLM internals without needing the researcher in the loop.

MCPZenodo DOIopen sourceagent infrastructure
19Methodology check

Tier-A versus Tier-B contradiction, public self-correction

Two days into the sweep a serious gap appeared. Tier-A (output-layer logit hook) showed OLMo2 has 4 out of 11 qwen anchors at 29.8 times random baseline. Tier-B (per-layer hidden states) on the same model showed 0 out of 11. Same model, two methods, opposite answers. The Tier-A signal might just be a vocab-token-id mod hidden_size collision artifact from shared tokenizer training, not a real residual stream feature.

Posted this self-doubt publicly on LessWrong before any paper went out. The first reviewer who responded actually turned into a collaborator within one round, suggesting Paper A be restructured as Tier-A screening layer plus Tier-B finding layer. Admitting a methodology bug early changed the conversation. Reviewers want to see researchers thinking, not researchers defending finished claims.

methodologyself-correctionTier-A vs Tier-Bopen peer review

How the Papers Are Made

The research output is a pipeline, not a manuscript factory. Public systems generate interactions; telemetry turns them into measurable traces; negative and positive findings become deposits; frameworks emerge from accumulated evidence.

Interaction Surfaces

Demos, dashboards, ability hubs, prediction sites, and agent interfaces create real substrate pressure.

  • Alfred demo
  • Lobster dashboard
  • Ability hub

Telemetry & Memory

Agents leave cognitive-state traces, listening matrices, trust proxies, memory snapshots, and task outcomes.

  • 3,991 parameter surfaces
  • 682 observable fields
  • 14 core telemetry channels

Empirical Deposits

Raw records and negative results are archived before they are transformed into broad theory.

  • 20018183 RCT anchor
  • 20083802 panel
  • 20071372 MASL

Frameworks & Standards

Relational Cognitive Telemetry, Agentic Nervous Systems, and LOBSTER-Bench emerge from the evidence path.

  • RCT synthesis
  • ANS framing
  • benchmark gate

Demo Stack

The demos are not marketing garnish. They are the interaction surfaces that produced the research questions: observability, cognitive state, agent coordination, memory, capability routing, and human-facing AI systems.

★ frankenstein arena (live)
3D LLM Surgery Viewer

Frankenstein Arena

108 voxel rooms scattered across 5 layer bands of a 7B transformer. Rooms breathe, drift, self-rotate. Every 3–5 seconds 3–5 rooms swarm together, fuse, then disperse to new positions. Rose-gold cables follow them in real time. Golden cargo orbs slide along active routes with comet trails. A live picture of "the cube structure rearranges itself."

Open arena
7B internal point cloud
3D LLM Sensor Map

LLM Model Viewer

14,912 fired sensor cells from qwen2.5:7B plotted as an interactive 3D point cloud. Six functional lanes colour-coded (universal backbone, Chinese-only, English-only, physics, algorithm, ML). Hover any point for full (L, D, Q) coordinate. Mercury observability rendered as a single 2 MB self-contained HTML file.

Open viewer
cross-model bridges
3D Cross-Model Alignment

Overlap: 7B × 3B Bridges

Two qwen2.5 point clouds (7B left, 3B right) connected by 49 golden bridges = dimensions hot in both models (4.4× above random). Eleven of those are triple-confirmed (also in the within-7B universal lane, 27× above random). The visible cross-scale structural alignment that drives Frankenstein observation-driven surgery.

Open overlap
90s scripted tour
3D Guided Flythrough

Cinematic: 7B Tour

A scripted 90-second camera flythrough of the qwen2.5:7B internal sensor map. Press play, watch each lane light up in narrated sequence: universal backbone, language detectors, topic circuits, cross-model bridges. Designed for first-time viewers who want a guided tour before exploring the live point cloud.

Open cinematic
SLERP surgery map
3D Hybrid LLM Surgery

Frankenstein v1: Three-Pillar Graft

The first Frankenstein experiment visualised: three vertical cones (Instruct on the left, Hybrid in the middle, Coder on the right) of qwen2.5-7B, with rose-gold anchor cables marking the 11 triple-confirmed dimensions preserved across the SLERP graft. v1 itself failed (documented in Paper D), but the surgery map remains the canonical visual of the observation-driven merge approach.

Open graft map
assistant surface
Personal AI Surface

Alfred Demo

Human-facing assistant layer that shows how agent systems become product surfaces rather than dashboards alone.

Open demo
ux x telemetry
Bridge Demo

Alfred × Human Mind

Where personal-agent UX meets the lobster cognitive substrate and turns research telemetry into interaction.

Open demo
ability routing
Capability Layer

Lobster Ability Hub

Skill and ability routing surface for evaluating what agents can do, not just what they can say.

Open hub
public product
Public Product

Alfred EN

Public-facing product narrative for a system that moves from observability into personal AI deployment.

Open site
substrate topology
Research Map

Human Mind Topology

Topology view of the cognitive substrate: the system map behind the papers and benchmark work.

Open topology
Temporal depth is not elapsed time. It is the record of observations, failures, repairs, and redesigned measurements that cannot be fabricated later.

This is the moat: a substrate that has lived long enough to produce null results, revisions, working demos, data packages, and new research language.

Identity & Entrypoints

This page is the academic front door. The demos sell the reality of the system; the papers preserve the evidence; the ORCID and Zenodo records make the work citable.

ORCID

Persistent researcher identifier for scholarly credit and deposit linking.

0009-0006-6816-9891
Live Observatory

The operational surface for lobster telemetry, cognitive reports, and ongoing substrate inspection.

Open dashboard
Contact

For research discussion, deposits, and collaboration around long-lived LLM agent systems.

norika@charenix.com