Researchers propose a hardware-aware algorithm for selective state space models.
Selective state space models (SSMs) aim to improve efficiency and performance by incorporating selective mechanisms.
The algorithm leverages parallel associative scan and utilizes kernel fusion and recomputation for fast and memory-efficient implementation on modern hardware.
The proposed hardware-aware algorithm for selective SSMs demonstrates significant speedup and memory optimization compared to standard implementations.