Language models are being analyzed using Shapley Taylor interaction indices (STII) to understand how they represent internal structure.
Shapley interactions help measure how inputs in language and speech models work together to impact outputs beyond their independent influences.
The study looks into the relationship between models and underlying linguistic structures like syntactic structure, non-compositional semantics, and phonetic coarticulation.
Results indicate that autoregressive text models show interactions correlating with the syntactic proximity of inputs.
Both autoregressive and masked models encode nonlinear interactions in idiomatic phrases with non-compositional semantics.
In terms of speech results, models show the phonetic interaction necessary for extracting discrete phonemic representations.