Add the Pointblank library to the Data Validation section #2712
What is this Python project?
Pointblank validates DataFrames and database tables. It supports schema checks, many types of value checks, and checks for duplicate and incomplete records. It produces HTML reporting tables that can be sent to data stakeholders. The library is useful for validating data during analysis in notebooks, and it also scales to data validation in pipeline processes.
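To make the four kinds of checks concrete, here is a minimal standard-library sketch of what each one tests. It is not Pointblank's API: the sample rows, the `expected_schema` mapping, and every function name below are hypothetical and exist only for illustration.

```python
# Hypothetical sample table: a list of row dicts standing in for a DataFrame.
rows = [
    {"id": 1, "amount": 120.0, "region": "north"},
    {"id": 2, "amount": 80.0, "region": "south"},
    {"id": 2, "amount": 80.0, "region": "south"},  # duplicate of the previous row
    {"id": 3, "amount": None, "region": "east"},   # incomplete record
]

# Hypothetical expected schema: column name -> Python type.
expected_schema = {"id": int, "amount": float, "region": str}


def check_schema(rows, schema):
    """Schema check: each row has exactly the expected columns and typed values."""
    return all(
        row.keys() == schema.keys()
        and all(v is None or isinstance(v, schema[k]) for k, v in row.items())
        for row in rows
    )


def failing_values(rows, column, predicate):
    """Value check: indices of rows whose non-null value fails the predicate."""
    return [
        i for i, row in enumerate(rows)
        if row[column] is not None and not predicate(row[column])
    ]


def duplicate_rows(rows):
    """Duplicate check: indices of rows that repeat an earlier row."""
    seen, duplicates = set(), []
    for i, row in enumerate(rows):
        key = tuple(sorted(row.items()))
        if key in seen:
            duplicates.append(i)
        seen.add(key)
    return duplicates


def incomplete_rows(rows):
    """Completeness check: indices of rows containing at least one null."""
    return [i for i, row in enumerate(rows) if any(v is None for v in row.values())]


schema_ok = check_schema(rows, expected_schema)
low_amounts = failing_values(rows, "amount", lambda v: v >= 100)
duplicates = duplicate_rows(rows)
incomplete = incomplete_rows(rows)
```

Each helper returns the positions of failing rows rather than a single pass/fail flag, which is the kind of detail a per-check report would need.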
What's the difference between this Python project and similar ones?
This library focuses on tables rather than other data structures. It produces reporting artifacts that are useful for communicating data quality issues. It is compatible with a wide range of table types because it uses Narwhals and Ibis as compatibility layers.
--
Anyone who agrees with this pull request could submit an Approve review to it.