LightOn and Answer.ai have released ModernBERT, an open family of encoder-only models that is a pareto improvement over BERT in terms of speed and accuracy.
ModernBERT extends the context length to 8,192 tokens, allowing it to handle long-context tasks effectively.
The model incorporates architectural enhancements such as Flash Attention 2 and rotary positional embeddings (RoPE) to improve computational efficiency and positional understanding.
ModernBERT outperforms models like RoBERTa and DeBERTa in various benchmarks and is more resource-efficient for inference on GPUs.