Researchers have introduced ESCHER, a visual concept library that aims to improve visual recognition.ESCHER utilizes a vision-language model as a critic to iteratively refine the concept library.The approach considers interactions between concepts and their impact on downstream classifiers.ESCHER does not require human annotations and demonstrates effectiveness in various visual classification tasks.