NN Arch - a lzhbrian Collection

lzhbrian 's Collections

NN Arch

NN Arch Components

Loop

Linear Attention

TTT

NN Arch

updated 28 days ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 64
Recursive Language Models

Paper • 2512.24601 • Published Dec 31, 2025 • 81
Nested Learning: The Illusion of Deep Learning Architectures

Paper • 2512.24695 • Published Dec 31, 2025 • 42
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 255
Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 122
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 325
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16, 2025 • 167