Three factors - model architecture, dataset size, and class balance - influence test-time performance of machine-learning classifiers.
Evidence suggests dataset quality as an additional factor affecting classifier performance.
Study indicates quality is a dataset-intrinsic property, independent of model architecture, dataset size, and class balance.
Dataset quality is found to be an emergent property of the quality of datasets' constituent classes, providing a new target for optimization in machine-learning-based classification.