In this paper, we propose a new model that brings a novel perspective to Krotov’s hierarchical associative memory. Under this perspective, the entire Transformer (MetaFormer) block, comprising not only the token-mixing module but also the channel-mixing module, layer normalization, and skip connection, corresponds exactly to a single Hopfield network.
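
For orientation, the four components named above can be sketched as a conventional pre-norm MetaFormer block. This is only an illustrative sketch of the standard block structure, not the Hopfield formulation proposed in this paper. The function names (`layer_norm`, `token_mixing`, `channel_mixing`, `metaformer_block`), the use of self-attention with identity query/key/value projections as the token mixer, and the omission of learned normalization parameters are all assumptions made for brevity.

```python
import math

def layer_norm(x, eps=1e-5):
    # Normalize one token vector to zero mean and unit variance
    # (assumption: no learned scale or shift).
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def softmax(z):
    # Numerically stable softmax over a list of scores.
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def token_mixing(tokens):
    # Token mixer: self-attention across tokens
    # (assumption: identity query/key/value projections).
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        w = softmax(scores)
        out.append([sum(w[j] * tokens[j][i] for j in range(len(tokens)))
                    for i in range(d)])
    return out

def channel_mixing(tokens, w1, w2):
    # Channel mixer: the same two-layer ReLU MLP applied to every token.
    # w1 has shape (hidden, d); w2 has shape (d, hidden).
    out = []
    for x in tokens:
        h = [max(0.0, sum(row[i] * x[i] for i in range(len(x)))) for row in w1]
        out.append([sum(row[j] * h[j] for j in range(len(h))) for row in w2])
    return out

def metaformer_block(tokens, w1, w2):
    # Sub-block 1: skip connection around (layer norm -> token mixing).
    mixed = token_mixing([layer_norm(x) for x in tokens])
    tokens = [[a + b for a, b in zip(x, m)] for x, m in zip(tokens, mixed)]
    # Sub-block 2: skip connection around (layer norm -> channel mixing).
    mixed = channel_mixing([layer_norm(x) for x in tokens], w1, w2)
    return [[a + b for a, b in zip(x, m)] for x, m in zip(tokens, mixed)]
```

Calling `metaformer_block(tokens, w1, w2)` on a list of token vectors returns a list of the same shape. In this conventional form the block is a composition of four separate operations, which is the structure the proposed model aims to express as a single Hopfield network.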