The AlphaFold Protein Structure Database (AFDB) is known for its high structural coverage and near-experimental accuracy, making it valuable for protein design.
However, using AFDB directly in training deep models for tasks like inverse folding reveals a systematic geometric bias in the database's structural features.
To address this bias, a Debiasing Structure AutoEncoder (DeSAE) has been introduced to improve the reconstruction of native-like conformations from corrupted backbone geometries.
The application of DeSAE to AFDB structures has shown significant improvements in inverse folding performance, emphasizing the importance of debiasing in structure-based learning tasks.