Leveraging collective compute power of edge devices for distributed learning is gaining interest in wireless networks.Critical challenges exist in optimizing learning at the network edge, including the trade-off between predictive accuracy and interpretability.Integrating inherently explainable models like decision trees in distributed learning is difficult due to their non-differentiable structure.Combining continual learning strategies with federated learning supports adaptive, lifelong learning in resource-limited environments.