Pandera is a flexible, simple, and extensible data-testing framework that lets you validate not only your data but also the functions that produce it. You can avoid the initial effort of writing a schema by hand: infer one from clean data, then refine it over time. Identify the critical points in your pipeline and validate the data entering and leaving them. Test the functions that generate your data by automatically creating test cases for them. Choose from a wide range of built-in checks or write your own rules to validate your data.