Data engineering pipelines are crucial components of predictive analytics frameworks and require significant time and expertise.
ADEPT is introduced as an automated data engineering pipeline using text embeddings, aiming to simplify the process.
ADEPT leverages text embeddings for representing diverse data sources and employs an information bottleneck criteria to handle entropy variance.
Experiments demonstrate that ADEPT surpasses existing benchmarks across various datasets, showing potential for efficient and scalable automation in data science.