NV-Embed is an embedding model open-sourced by NVIDIA, designed for retrieval tasks.It is finetuned on top of Mistral 7B with innovations in architecture, training, and data curation.The model introduces a latent attention layer for obtaining sequence-level embeddings.Strategies like multi-head attention and two-stage training contribute to better performance.