Data-driven techniques have emerged as a promising alternative to traditional numerical methods for solving PDEs.In this work, the benefits of using memory for modeling time-dependent PDEs are investigated.The Memory Neural Operator (MemNO) architecture effectively models memory in PDEs.Empirical demonstrations show that MemNO outperforms baselines without memory, with up to 6x reduction in test error.