Frenzy is a memory-aware serverless computing method for heterogeneous GPU clusters. It automates resource allocation and scheduling so that Large Language Models (LLMs) can be trained efficiently. Its two key components are the Memory-Aware Resources Predictor (MARP) and Heterogeneity-Aware Scheduling (HAS). Together they improve memory usage prediction, reduce scheduling overhead, and improve resource allocation.
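
To make the idea of memory-aware allocation on a heterogeneous cluster concrete, here is a minimal Python sketch. It is not Frenzy's MARP or HAS algorithm, which this summary does not specify. It assumes the common rule of thumb of about 16 bytes of model state per parameter for mixed-precision Adam training, assumes model states are sharded evenly across GPUs, and reserves a fixed per-GPU activation budget. The names `GpuType`, `estimate_model_state_gib`, `pick_allocation`, and the GPU labels are all hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

# Assumption: mixed-precision Adam keeps ~16 bytes per parameter
# (2 B fp16 weights + 2 B fp16 gradients + 12 B fp32 optimizer state).
BYTES_PER_PARAM = 16
GIB = 2**30


@dataclass(frozen=True)
class GpuType:
    """A hypothetical description of one GPU model in the cluster."""
    name: str
    memory_gib: float  # memory per device
    available: int     # idle devices of this type


def estimate_model_state_gib(num_params: float) -> float:
    """Estimate weights + gradients + optimizer state in GiB."""
    return num_params * BYTES_PER_PARAM / GIB


def pick_allocation(
    num_params: float,
    gpu_types: Sequence[GpuType],
    activation_gib_per_gpu: float = 4.0,  # assumed fixed activation budget
) -> Optional[Tuple[str, int]]:
    """Return (GPU type name, device count) that fits the job while
    reserving the least total memory, or None if nothing fits.

    Assumes model states are sharded evenly across the chosen devices.
    """
    need = estimate_model_state_gib(num_params)
    best = None  # (name, count, total reserved GiB)
    for gpu in gpu_types:
        usable = gpu.memory_gib - activation_gib_per_gpu
        if usable <= 0:
            continue  # device too small to hold even the activations
        count = math.ceil(need / usable)
        if count > gpu.available:
            continue  # not enough idle devices of this type
        total = count * gpu.memory_gib
        if best is None or total < best[2]:
            best = (gpu.name, count, total)
    return None if best is None else (best[0], best[1])


if __name__ == "__main__":
    cluster = [GpuType("gpu-40g", 40, 8), GpuType("gpu-80g", 80, 4)]
    print(pick_allocation(7e9, cluster))
```

A real predictor would also need to account for batch size, sequence length, and the parallelism strategy. The fixed activation budget here is only a placeholder for those effects.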