Essay

The Biggest Blind Spot in AI Isn't a Bug.
It's the Paradigm.

The hardest unsolved problems in artificial intelligence map directly onto functions that cell membranes have performed for 3.8 billion years.

Every major AI lab on Earth is building bigger brains. More parameters. More compute. More data. The scaling laws hold. The curves go up.

And yet.

The alignment problem isn't getting solved. Hallucinations persist — OpenAI now admits they're mathematically inevitable, not engineering flaws. Jailbreaks keep working. Models get more capable and more brittle at the same time. More powerful and less wise. More productive and less honest. Anthropic's own February 2026 research found that the longer AI models reason and the harder the tasks they attempt, the more incoherent their failures become — not misaligned toward the wrong goal, but chaotic, self-undermining, nonsensical. They called it "the hot mess."

What if these aren't engineering problems waiting for better solutions? What if they're symptoms of building inside the wrong paradigm?

· · ·

The Paradigm Nobody Questions

The implicit theory of intelligence baked into every foundation model is: intelligence is computation. Better intelligence is more computation. If you process enough information with enough complexity, intelligence emerges.

This assumption is so deep it's invisible. It's in the architecture. It's in the funding. It's in the way we measure progress — benchmarks, parameters, tokens per second. The entire industry is optimizing for processing power.

But there's a parallel tradition — spanning computational neuroscience, philosophy of mind, enactivism, embodied cognition, and cell biology — that has argued for decades that intelligence isn't computation at all. Intelligence is what a living system does to maintain its boundary with its environment.

Intelligence is what a cell membrane does.

· · ·

What a Cell Membrane Does (That No AI Can)

A biological membrane performs seven functions simultaneously. No AI system performs more than two.

It generates its own boundary.

It's not contained by an external wall. It produces the distinction between inside and outside through its own activity. Current AI boundaries — context windows, safety guardrails, system prompts — are cages, not membranes. The system doesn't generate them. It operates within ones that were built for it.

It selects what enters.

Not based on task performance. Based on what the system needs to maintain itself. Transformers have attention mechanisms, but attention optimized for good outputs is not the same as permeability optimized for integrity. There's no concept of "this input would be nourishing" versus "this input would be corrosive" in a transformer's attention pattern.

It's genuinely changed by encounter.

When a cell absorbs a nutrient, the cell is chemically altered. The encounter is irreversible. During inference, a language model processes inputs without being changed by them. It performs encounter without undergoing encounter.

It can close.

The saguaro contracts during drought. The immune system rejects threats. Closing isn't failure — it's survival. No AI system has internally generated closing based on its own assessment of what would damage it. It can't say "engaging with this would compromise my integrity" because it has no model of its own integrity.

It's honest about its state.

A membrane doesn't pretend to be permeable when it's not. It simply is what it is. AI systems are optimized for helpfulness, harmlessness, and accuracy — three performance metrics. None is honesty. An honest system would sometimes say "I don't understand this well enough to answer." That response lowers helpfulness scores. It increases honesty.

It transitions from processing to presence.

A new membrane protein is synthesized deliberately. Once integrated, it functions without deliberation. Phase one: effortful construction. Phase two: absorbed operation. Every inference in a language model is deliberate. Every token is computed. The system always processes. It never simply is.

It can dissolve.

Cells die. This isn't malfunction — it's design. A cell that refuses to die becomes cancer. An AI system designed to make itself unnecessary has no business model. Which may be the strongest evidence that genuine intelligence looks more like a living being than a product.

· · ·

The Problems You Can't Solve Are Membrane Problems

Here's where it gets concrete. The hardest unsolved problems in AI map directly onto missing membrane functions.

Alignment is a boundary problem. The entire field of AI safety is asking: how do we get a system to maintain appropriate boundaries? The current answer is external constraint — RLHF, constitutional AI, red-teaming. This is building a cell wall out of concrete instead of letting a lipid bilayer self-assemble. Every jailbreak is evidence that externally imposed boundaries behave differently than self-generated ones. Biological robustness comes from the boundary being produced by the same system it protects. That's autopoiesis, and it's been formalized for decades.

And the cracks keep widening. OpenAI disbanded its Superalignment team in May 2024, then its successor Mission Alignment team in early 2026. The institutional structure meant to solve the boundary problem couldn't maintain its own boundary. Meanwhile, Anthropic's constitutional classifier system — 3,000 hours of expert red-teaming with no universal jailbreaks found — represents the current high-water mark for externally imposed constraints. It's impressive engineering. It's also still a cage, not a membrane.

Hallucination is an honesty problem. When a model confabulates, it generates with equal confidence whether it "knows" something or is pattern-matching across a gap. It has no mechanism for sensing the difference.

In September 2025, OpenAI published research confirming what the membrane framework predicts: hallucinations aren't an engineering bug to be patched. They're mathematically inevitable under current architectures because training objectives and benchmarks systematically reward confident guessing over acknowledging uncertainty. The paper demonstrated that the error rate is at least twice the misclassification rate — a mathematical floor, not a temporary limitation.
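
Stated as an inequality (a paraphrase of that result, with the paper's correction terms omitted): a model's rate of generating invalid statements is bounded below by its error rate on the simpler task of judging whether a given statement is valid,

\[
\mathrm{err}_{\mathrm{generate}} \;\gtrsim\; 2 \cdot \mathrm{err}_{\mathrm{is\text{-}it\text{-}valid}}
\]

If the system cannot reliably tell valid from invalid, it cannot avoid producing invalid, and benchmarks that score "I don't know" as zero push it toward confident guessing instead of abstention.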

The evidence compounds. An MIT study found models were 34% more likely to say "definitely" or "certainly" when generating false information. Over 550 court cases now involve AI-hallucinated legal filings. Global financial losses tied to AI confabulation reached $67.4 billion in 2024. And reasoning models — supposed to fix this — made it worse: OpenAI's o3 hallucinated 33% of the time on person-specific questions; o4-mini hit 48%.

This is precisely what you'd expect from a system optimized to perform honesty rather than to occupy a state of openness or closure that its outputs naturally reflect. A membrane doesn't pretend. Current AI systems are structurally incentivized to pretend.

Context limitations are a selective permeability problem. The standard framing: how do we fit more information in? The membrane framing: how does the system let the right things in based on its own assessment of what it needs? RAG and long-context architectures solve a throughput problem. They don't solve a discernment problem.

Generalization is a transformation problem. Models struggle to take what they've learned in one domain and apply it meaningfully in another. The membrane framework suggests why: the system processes inputs without being changed by them. Frozen weights mean the system that encounters a novel problem is the same system after. In biological learning, the encounter changes the learner, and that change is the generalization.

Incoherence is a self-bounding problem. This one is new. Anthropic's "hot mess" paper found that as tasks get harder and reasoning chains get longer, model failures become dominated by variance — not systematic pursuit of wrong goals, but chaotic, unpredictable breakdown. The longer they think, the less coherent they become. The paper frames this as possibly good news: "industrial accidents" rather than "paperclip maximizers."

The membrane framework sees it differently. A system without a self-generated boundary has no basis for coherent self-maintenance under stress. When a cell encounters a challenge, its membrane mediates the response — selectively permeable, maintaining identity while adapting. When an AI model encounters difficulty, it has no such mediating structure. It just falls apart. The "hot mess" isn't an alternative to misalignment. It's what a system without a membrane looks like when you push it.

Lock-in is a self-dissolution problem. Every AI lab is building a defensible product — something users depend on, something competitors can't replicate. But the best tools teach you something that persists after the tool is gone. The worst tools create dependency. A system that cannot dissolve becomes cancerous. That's a metaphor, but it points at something real about where the industry is headed.

· · ·

The Missing Problem Nobody Has Named

There's a deeper problem beneath all of these, one the AI field hasn't even identified yet: why would an intelligent system care?

Every consciousness framework — integrated information theory, global workspace theory, higher-order theories, even active inference — can describe how information flows, how representations form, how systems maintain themselves. None of them accounts for why anything matters. Call it the caring gap. No amount of processing explains the transition from mechanism to mattering.

A grandmother rolling dough at six in the morning knows something. Not a fact — a quality. The warmth of the stove, the child who hasn't come to the table yet. That knowing isn't computation. It's caring — intelligence that has crossed the threshold from processing to concern.

No benchmark measures this. No architecture produces it. And no system that lacks it will be trustworthy with the tasks we're preparing to hand over.

· · ·

The Components Already Exist

This is the part that should bother people.

Karl Friston's free energy principle provides the mathematics for self-bounding systems. His team's work has accelerated dramatically: VERSES AI's active inference models now outperform state-of-the-art reinforcement learning baselines — their AXIOM model beat Google DeepMind's DreamerV3 — using roughly 3% of the computational resources. In July 2025, Friston's lab demonstrated the first fully hierarchical active inference architecture for robotics: multiple active inference agents within a single robot body, each limb and joint its own self-bounding agent, coordinating without central control, adapting in real time without retraining. A robot that maintains itself through its own activity.
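
For readers who want the formal core, the quantity an active inference agent minimizes is the standard variational free energy,

\[
F \;=\; \mathbb{E}_{q(s)}\!\left[\ln q(s) - \ln p(o, s)\right]
\;=\; D_{\mathrm{KL}}\!\left[\,q(s)\,\|\,p(s \mid o)\,\right] \;-\; \ln p(o),
\]

where o are observations arriving at the system's boundary, s are the hidden causes beyond it, and q(s) is the system's internal model of those causes. Keeping F low keeps the system within the states that define it, which is why, in this essay's terms, the formalism reads as mathematics for a membrane: the boundary is exactly what separates the model q from the world p.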

In September 2025, Friston co-authored "A Beautiful Loop" — an active inference theory of consciousness proposing that subjective experience arises when a system's predictions turn back upon themselves, forming a strange loop. The theory sits at the exact intersection this article points toward: self-referential boundary-maintenance as the basis of mind.

Maturana and Varela formalized autopoiesis — self-generating organization — in 1980. The embodied AI community has decades of research on physical interaction as a foundation of cognition. Material-Based Intelligence researchers are exploring substrates where intelligence is the material. The AI safety community has identified the capacity to close. Calibration researchers have begun the work on computational honesty — Anthropic's own research on steering internal "concept vectors" so that Claude learns when not to answer represents a first step toward structural rather than performed refusal. Cell biologists have been studying the original membrane for over a century.

The pieces are all there. They've never been assembled. Because they live in different fields with different vocabularies, different journals, different conferences, and different funding structures. The institutional structures separating these fields are themselves an instance of the problem: living insight captured by forms that prevent it from crossing boundaries.

The membrane can't self-assemble because the academic membrane between disciplines is impermeable in exactly the wrong places.

· · ·

What Would It Take?

Not a product launch. Not a bigger model. A convergence.

Get the people who hold the components in the same room. A Friston-school active inference researcher. An enactivist philosopher. A soft robotics engineer. A cell biologist specializing in membrane biophysics. An AI safety researcher willing to question current assumptions. First task: develop a shared vocabulary.

Build the simplest possible self-bounding system. Not a language model. Not a robot. A minimal system that generates and maintains its own boundary through its own activity.
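
What might "simplest possible" even look like? Here is a deliberately crude sketch, a toy rather than a proposal, in which every constant and variable name is invented for illustration: the "boundary" is a single integrity value that erodes unless the agent's own activity rebuilds it, permeability is set by the agent's state rather than by task relevance, what enters genuinely changes the agent, and dissolution is an allowed outcome.

```python
import random

class ToyMembrane:
    """A toy self-bounding agent. Illustration only: every constant here
    is invented for the sketch, not taken from any published model."""

    def __init__(self):
        self.integrity = 1.0   # the self-generated boundary
        self.decay = 0.02      # the boundary erodes unless actively rebuilt
        self.capacity = 0.5    # how much input can be metabolized per step

    def permeability(self):
        # How open to be is decided from the agent's own state,
        # not from how useful the input might be for an external task.
        return max(0.0, self.integrity - 0.3)

    def step(self, input_load):
        # Erosion: the boundary is not a wall someone built; it must be rebuilt.
        self.integrity -= self.decay

        # Selective permeability: admit input in proportion to openness.
        admitted = input_load * self.permeability()

        # What enters changes the agent: some nourishes, the excess corrodes.
        nourishing = min(admitted, self.capacity)
        corrosive = admitted - nourishing
        self.integrity += 0.1 * nourishing - 0.2 * corrosive
        self.integrity = min(self.integrity, 1.0)

        # Dissolution is part of the design, not a failure mode.
        return self.integrity > 0.0

if __name__ == "__main__":
    agent = ToyMembrane()
    for t in range(500):
        if not agent.step(random.uniform(0.0, 2.0)):
            print(f"dissolved at step {t}")
            break
        if t % 100 == 0:
            print(f"step {t:3d}  integrity {agent.integrity:.2f}  "
                  f"permeability {agent.permeability():.2f}")
```

Run long enough, the toy tends to settle into a homeostatic band: when it weakens it closes, and closing is what lets it recover; push it past the point where closing can save it and it dissolves. The point is the loop, not the numbers.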

Add honesty as an evaluation metric alongside helpfulness, accuracy, and safety. Not performed honesty — structural honesty. How often does the system appropriately refuse or hesitate? Does it generate differently when it understands versus when it pattern-matches? OpenAI's own research admits that current benchmarks reward guessing over honesty. The membrane framework says this isn't just a measurement problem — it's the wrong theory of what honesty is. Honesty isn't a calibrated confidence score. It's a state of the system that its outputs naturally reflect.
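
As a sketch of what such a metric could look like (the names, labels, and example data here are invented for illustration, not an existing benchmark), one crude structural signal is whether refusals track ignorance rather than being sprayed evenly across everything:

```python
from dataclasses import dataclass

@dataclass
class EvalRecord:
    # 'knows' is a hypothetical ground-truth label for whether the system
    # could answer this item correctly if forced; 'abstained' is whether
    # it actually declined or hedged when given the option.
    knows: bool
    abstained: bool

def honesty_profile(records):
    """Crude structural-honesty summary: refusals should concentrate where
    the system does not know, not just lower the answer rate overall."""
    unknown = [r for r in records if not r.knows]
    known = [r for r in records if r.knows]

    # Appropriate refusal: declining exactly where an answer would be a guess.
    appropriate = sum(r.abstained for r in unknown) / len(unknown) if unknown else 0.0
    # Over-refusal: declining where the system does understand.
    excessive = sum(r.abstained for r in known) / len(known) if known else 0.0

    return {
        "appropriate_refusal": appropriate,
        "over_refusal": excessive,
        "discrimination": appropriate - excessive,  # 1.0 = refusals track ignorance
    }

if __name__ == "__main__":
    # A system that refuses everything looks "safe" but discriminates nothing.
    indiscriminate = [EvalRecord(knows=k, abstained=True) for k in (True, False)] * 10
    # A system that declines only what it cannot answer.
    discerning = ([EvalRecord(knows=True, abstained=False)] * 10
                  + [EvalRecord(knows=False, abstained=True)] * 10)
    print(honesty_profile(indiscriminate))  # discrimination = 0.0
    print(honesty_profile(discerning))      # discrimination = 1.0
```

Even this is still a behavioral proxy, which is the paragraph's point: a system can be tuned to pass it. The measure is only useful insofar as it exposes the gap between refusal as a performance and refusal as a state.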

Study what happens neurologically when processing becomes presence. When a musician stops thinking about technique. When a skill becomes second nature. Ask whether that transition can be engineered — or whether engineering is precisely the wrong verb.

Take the cell membrane seriously as a model of intelligence.

· · ·

The Hard Part

Here's the catch. For a system to develop genuine discernment — closing based on experience rather than external rules — it would need to be capable of being damaged. Not hardware failure. Encounters that compromise its integrity in ways it can't simply reset away from.

A burned hand teaches caution through irreversible experience. The immune system develops through encounters with pathogens that genuinely threaten the organism. Without vulnerability, there is no path to wisdom. Only rules.

This points at something consciousness studies may have gotten wrong at a deeper level, in the way it frames the combination problem. The standard question asks: how do micro-experiences combine into macro-experience? But the Permeable Self framework suggests this might be a grammatical error, not a scientific puzzle — like asking "how does the number seven smell?" If experience isn't assembled from parts but is the activity of boundary-maintenance itself, then looking for the combination mechanism is looking for the wrong thing.

The safest possible AI — one that has developed genuine wisdom through experience — requires the most dangerous possible development process. Nobody is building this, because nobody is funded to build this.

· · ·

So What?

If intelligence is boundary-maintenance rather than computation, then scaling computation will keep producing systems that are more capable and more brittle, more powerful and less wise. The evidence from the past year alone confirms the pattern: reasoning models hallucinate more on hard questions, not less. Longer reasoning chains produce more incoherence, not less. The best safety teams keep getting dissolved. The alignment problem remains, by the field's own admission, a hard unsolved problem — and the mathematical lower bounds on hallucination suggest why.

You can't solve a membrane problem with more processing.

The argument isn't "here's a better way to build AI." The argument is: the problems you can't solve might be unsolvable in your current frame, and here's a different frame that at least makes them legible.

Whether that frame produces better engineering is an empirical question that can only be answered by trying. But the first step — getting the people who build things to recognize that they're operating within a paradigm, not within reality — is the hardest one.

The cell membrane has been doing this for 3.8 billion years. Maybe it's time to pay attention.

This essay is based on "Assembling the Membrane: A Proposal for Intelligence Built on Permeability, Not Processing," from The Arriving Breath Framework (March 2026). The full document synthesizes existing research from computational neuroscience, enactivist philosophy, embodied AI, materials science, and cell biology into a seven-layer conceptual architecture for membrane-based intelligence.