rigour

Data cleaning and validation functions for processing various types of text emanating and describing the business world. This applies to human and company names, language, territory and country codes, corporate and tax identifiers, etc.

The underlying idea is that handling these sorts of descriptors is easy on first glance, but reveals a dizzying set of complexity when carried into production. This is why rigour consolidates implementations that have already met some edge cases and are well-tested.

Installing `rigour`

You can grab the latest release from PyPI:

pip install -U rigour

Usage & documentation

See: https://opensanctions.github.io/rigour/

Acknowledgements

The address formatting database contained in rigour/data/addresses/formats.yml is derived from worldwide.yml in the OpenCageData address-formatting repository. It is used to format addresses according to customs in the country that is been encoded.

rigour consolidates and includes a set of older Python libraries into a single codebase: languagecodes, pantomime, fingerprints. The development of these libraries was funded by OCCRP as part of the Aleph software project.

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 419 Commits
.github		.github
contrib		contrib
docs		docs
genscripts		genscripts
resources		resources
rigour		rigour
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

rigour

Installing `rigour`

Usage & documentation

Acknowledgements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 8

Uh oh!

Languages

License

opensanctions/rigour

Folders and files

Latest commit

History

Repository files navigation

rigour

Installing rigour

Usage & documentation

Acknowledgements

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 8

Uh oh!

Languages

Installing `rigour`

Packages