Researchers analyzed the effectiveness of VQVAE in suppressing adversarial attacks on high-SNR radio-frequency data-points by targeting amplitude modulations from specific digitally modulated waveform classes.
Adversarial attacks were created to preserve the phase between the in-phase and quadrature components with adversarially changed values, and compared with attacks where the phase was not preserved.
The classification accuracy of adversarial examples was tested on a classifier trained to achieve 100% accuracy on the original data.
The study evaluated the ability of VQVAE to mitigate the strength of the attack by assessing the classifier accuracy on VQVAE reconstructions of the adversarial datapoints.
It was found that VQVAE significantly reduces the effectiveness of the attack.
Comparison was made among the I/Q plane diagram of attacked data, their reconstructions, and the original data.
Different methods and metrics were utilized to compare the probability distribution of the VQVAE latent space with and without attack.
By varying the attack strength, interesting properties of the discrete space were observed which could aid in detecting attacks.