<ul data-eligibleForWebStory="true">Maximum Likelihood Estimation (MLE) estimates model parameters from observed data likelihood.Gradient Descent optimizes functions by adjusting parameters in the direction of gradients.Normal Distribution models symmetrically clustered data values around the mean in statistics.Sigmoid function maps inputs to values between 0 and 1 in neural networks.Cosine Similarity measures similarity between vectors in text analysis and recommendations.