A novel GEMM architecture called tuGEMM is proposed for low-precision edge AI.tuGEMM is based on temporal-coding and performs exact computation.Two variants of tuGEMM, serial and parallel, are introduced with distinct area/power-latency trade-offs.The designs show significant advantages in area-power efficiency compared to state-of-the-art stochastic unary systems, especially at low precisions.