This paper introduces an adaptive framework for edge inference using a transformer-powered deep joint source channel coding architecture.
The approach allows efficient transmission of essential features for object detection in resource-constrained edge devices under varying network conditions.
Input data is tokenized into high-level semantic representations, refined by a transformer, and transmitted over noisy wireless channels.
The system incorporates a semantic token selection mechanism and a resource allocation algorithm to improve robustness and performance in dynamic network scenarios.