Brain-Inspired Architecture for Deep Reasoning | 27M Parameters | Solves Complex Puzzles with 1000 Examples
Token Embedding + Puzzle Embedding + Position Encoding
Fast, Detailed Computations
Slow, Abstract Strategy
Halt/Continue Decision
All intermediate states are detached (no BPTT)
This enables O(1) memory complexity
L-module converges locally, H-module provides global context
Fast L-module (every step) vs Slow H-module (every L_cycles)
One-step gradient approximation, O(1) memory
Adaptive computation via Q-learning
Learning signal at every segment
ARC-AGI-1
(beats o3-mini)
Sudoku Extreme
accuracy
Maze 30x30
optimal paths
Training
examples only!
H-module PR=89.95 vs L-module PR=30.22
Ratio matches mouse cortical hierarchy
Inspired by theta-gamma coupling
Biologically plausible gradient
# Core HRM Forward Pass
for segment in range(max_segments):
# Reset carry for halted sequences
carry = reset_carry(halted, carry)
# Hierarchical computation
for h_step in range(H_cycles):
for l_step in range(L_cycles):
z_L = L_level(z_L, z_H + input)
z_H = H_level(z_H, z_L)
# ACT decision
q_halt, q_continue = q_head(z_H)
if q_halt > q_continue:
break