Skip to content
BeClaude
Research2026-06-30

From Detecting Agency to Doing Work: Self-Caused Credit Builds a Durable Behavioral Self in a Minimal Spiking Agent

Originally published byArxiv CS.AI

arXiv:2606.30191v1 Announce Type: new Abstract: How does an agent that can tell self from world come to be durably shaped by that distinction? Recent work shows that a predictive system can detect its own agency (Ye, 2026), but detecting agency does not explain durable, self-shaped behavior. We...

What Happened

This preprint from arXiv (2606.30191) tackles a fundamental question in artificial intelligence and cognitive science: how does a minimal neural agent transition from merely detecting its own agency to being durably shaped by that self-distinction? The authors build on prior work showing that predictive systems can recognize when they are the cause of sensory changes (agency detection), but they identify a critical gap—detection alone does not produce lasting behavioral change. Their proposed mechanism, "self-caused credit," allows a spiking neural network to assign learning signals specifically to actions the agent itself generated, creating a persistent self-reinforcing loop. Over time, this produces a "durable behavioral self"—a stable pattern of self-directed action that persists even when external cues change.

Why It Matters

This work addresses a deep puzzle in AI: how can an artificial system develop a stable sense of self without explicit programming or massive datasets? Current approaches to agency in AI either rely on hand-crafted reward functions or require extensive training on self-generated data. The self-caused credit mechanism offers a third path—an intrinsic, unsupervised way for agents to build self-referential behavioral patterns from minimal sensory-motor loops.

For the broader field, this touches on the "grounding problem" in AI: how symbols (like "self" and "world") get meaning from an agent's own experience. If a minimal spiking agent can develop durable self-shaped behavior, it suggests that selfhood might emerge naturally from any sufficiently predictive system interacting with its environment. This has implications for developmental robotics, where agents must learn to distinguish their own actions from external events, and for reinforcement learning, where credit assignment remains a central challenge.

Implications for AI Practitioners

For reinforcement learning researchers: The self-caused credit mechanism offers a potential alternative to traditional reward shaping. Instead of manually designing rewards that incentivize self-directed behavior, practitioners could build architectures that intrinsically assign credit to self-generated actions. This could reduce the need for extensive reward engineering in complex environments. For robotics and embodied AI: This work suggests that durable self-models might emerge from simple predictive architectures, not just from complex deep networks. Practitioners building low-power or neuromorphic systems could leverage spiking neural networks with self-caused credit to create agents that maintain coherent behavioral identities without constant supervision. For safety and alignment: A system with a durable behavioral self could be more predictable and interpretable than one that merely detects agency. If an agent consistently attributes outcomes to its own actions, its behavior becomes more stable and less susceptible to spurious correlations—a desirable property for deployed AI systems. Caveats: This is a minimal model with spiking neurons in a constrained environment. Scaling to complex tasks and real-world sensory streams remains unproven. Practitioners should view this as a promising proof-of-concept rather than a production-ready method.

Key Takeaways

  • Self-caused credit enables minimal spiking agents to develop durable behavioral selves, going beyond mere agency detection
  • This mechanism offers an intrinsic, unsupervised path to self-referential learning without hand-crafted rewards
  • Implications for reinforcement learning, robotics, and AI safety include reduced reward engineering and more predictable agent behavior
  • The approach is currently limited to minimal environments; scaling to complex tasks requires further validation
arxivpapersagents