Union Pandera

The open-source framework for precision data testing for data scientists and ML engineers. Pandera provides a simple, flexible, and extensible data-testing framework for validating not only your data but also the functions that produce them. A simple, zero-configuration data testing framework for data scientists and ML engineers seeking correctness. Access a comprehensive suite of built-in tests, or easily create your own validation rules for your specific use cases. Validate the functions that produce your data by automatically generating test cases for them. Integrate seamlessly with the Python ecosystem. Overcome the initial hurdle of defining a schema by inferring one from clean data, then refine it over time. Identify the critical points in your data pipeline, and validate data going in and out of them. Build confidence in the quality of your data by defining schemas for complex data objects.

Features

Write complex schemas with speed and ease
Validate critical points of your pipeline
Quickly bootstrap schemas with trusted data
Easily create custom validation checks
Synthesize fake data to validate pipelines
Integrate seamlessly with the Python ecosystem

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Union Pandera

Union Pandera Web Site

Other Useful Business Software

Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now

Rate This Project

User Reviews

Be the first to post a review of Union Pandera!

Additional Project Details

Programming Language

Python

Related Categories

Python Libraries, Python Data Pipeline Tool

Registered

2023-05-25

Similar Business Software

StreamScape

Make use of Reactive Programming on the back-end without the need for specialized languages or cumbersome frameworks. Triggers, Actors and Event Collections make it easy to build data pipelines and work with data streams using simple SQL-like syntax, shielding users from the complexities of...

See Software
Ask On Data

Ask On Data is a chat based AI powered open source Data Engineering/ ETL tool. With agentic capabilities and pioneering next gen data stack, Ask On Data can help in creating data pipelines via a very simple chat interface. It can be used for tasks like Data Migration, Data Loading, Data...

See Software
Auth.js

Auth.js is an open-source authentication library designed to integrate seamlessly with modern JavaScript frameworks, providing a flexible and secure authentication experience. It supports various authentication methods, including OAuth (e.g., Google, GitHub), credentials, and WebAuthn, allowing...

See Software

Report inappropriate content