
Sebastian Schelter
Articles
-
Aug 16, 2024 |
amazon.science | Sebastian Schelter |Stefan Grafberger |Philipp Schmidt |Tammo Rukat
Modern companies and institutions rely on data to guide every single decision. Missing or incorrect information seriously compromises any decision process. In previous work, we presented Deequ, a Spark-based library for automating the verification of data quality at scale. Deequ provides a declarative API, which combines common quality constraints with user-defined validation code, and thereby enables unit tests for data.
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →