A new study introduces a causal representation learning framework for evaluating language model capabilities.
The framework models benchmark performance as a linear transformation of a few latent capability factors.
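As a rough illustration of what such a linear latent-factor model looks like (this is a sketch, not the study's actual code; the factor count matches the reported structure, but all data here is simulated), one could fit it roughly as follows:

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis

rng = np.random.default_rng(0)
n_models, n_benchmarks, n_factors = 1500, 6, 3

# Simulate benchmark scores as a linear transformation of latent
# capability factors plus noise: S = Z @ A.T + eps.
Z = rng.normal(size=(n_models, n_factors))       # latent capabilities
A = rng.normal(size=(n_benchmarks, n_factors))   # loading matrix
scores = Z @ A.T + 0.1 * rng.normal(size=(n_models, n_benchmarks))

# Recover a low-dimensional factor representation from observed scores.
fa = FactorAnalysis(n_components=n_factors, random_state=0)
latent = fa.fit_transform(scores)                # estimated factors per model
print(latent.shape)                              # (1500, 3)
```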
These latent factors turn out to be causally interrelated once the base model is controlled for as a common confounder.
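One standard way to control for a shared confounder such as the base model is to residualize factor scores within base-model groups before any causal analysis. A minimal sketch of that idea (the base-model and column names below are hypothetical, chosen only for illustration):

```python
import pandas as pd

# Hypothetical table: one row per fine-tuned model, with its base model
# and estimated latent factor scores (all values are illustrative).
df = pd.DataFrame({
    "base_model": ["llama-3-8b", "llama-3-8b", "mistral-7b", "mistral-7b"],
    "problem_solving": [0.62, 0.58, 0.45, 0.49],
    "instruction_following": [0.71, 0.66, 0.52, 0.55],
    "math_reasoning": [0.40, 0.37, 0.28, 0.31],
})

factor_cols = ["problem_solving", "instruction_following", "math_reasoning"]

# Subtract each base model's group mean, removing base-model-level
# variation so it cannot masquerade as a causal link between factors.
residuals = df[factor_cols] - df.groupby("base_model")[factor_cols].transform("mean")
print(residuals)
```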
The study analyzed over 1,500 models across six benchmarks and identified a concise three-node linear causal structure explaining the variation in their performance.
The recovered structure traces a causal pathway from general problem-solving capability, through instruction-following proficiency, to mathematical reasoning ability.
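A chain like this can be written as a two-equation linear structural model, and its signature is that the first and last factors become (approximately) independent once the middle factor is conditioned on. A hedged sketch of that check, with coefficients invented purely for illustration rather than taken from the study:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1500

# Hypothetical linear causal chain among the three latent factors:
# problem_solving -> instruction_following -> math_reasoning.
# Coefficients and noise scales are illustrative, not estimates.
problem_solving = rng.normal(size=n)
instruction_following = 0.8 * problem_solving + 0.3 * rng.normal(size=n)
math_reasoning = 0.7 * instruction_following + 0.3 * rng.normal(size=n)

def residual(y, x):
    """Residual of y after OLS regression on x (with intercept)."""
    X = np.column_stack([np.ones_like(x), x])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return y - X @ beta

# Partial correlation of the chain's endpoints given the middle node;
# it should be near zero if the chain structure holds.
r1 = residual(math_reasoning, instruction_following)
r2 = residual(problem_solving, instruction_following)
partial_corr = np.corrcoef(r1, r2)[0, 1]
print(f"partial correlation: {partial_corr:.3f}")
```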
The results underscore the importance of controlling for base-model variation during evaluation in order to accurately uncover causal relationships among latent model capabilities.