SLEEPYLAND is an open-source sleep staging evaluation framework designed to address challenges in model evaluation, generalization, bias, and human annotations.
It includes over 22,000 hours of in-domain (ID) sleep recordings and 84,000 hours of out-of-domain (OOD) sleep recordings.
SOMNUS, an ensemble combining models across architectures and channel setups, achieves robust performance across twenty-four different datasets, outperforming individual models in 94.9% of cases.
In evaluations on multi-annotated datasets, SOMNUS exceeds the best human scorer, better reproducing the scorer consensus than any individual expert.