Bengaluru-based AI Lab LossFunk introduces IPO, a novel approach to aligning LLMs without external feedback.IPO performed comparably better to those utilizing SOTA reward models.The new technique offers a more efficient and scalable method for aligning LLMs with human preferences.LossFunk's mission is to build a state-of-the-art foundational reasoning model from India.