NVIDIA released NVILA, a family of open visual language models focusing on accuracy and efficiency.NVILA outperforms GPT-4o Mini in video benchmark and shows competitive performance against other open models.NVIDIA plans to release code and models for NVILA to enable reproducibility.NVILA uses the 'scale then compress' technique to balance accuracy and efficiency in visual language models.