TaxonoPy (taxon-o-pie) is a command-line tool for harmonizing large biodiversity datasets into a consistent taxonomy ready for AI applications. Built on the Global Names Verifier (GNVerifier), it provides complete provenance tracking, flexible resolution strategies, and batch processing of 100M+ records, addressing the reproducibility and scale challenges of aligning taxonomies across many large sources.
See https://imageomics.github.io/TaxonoPy for documentation on installation, usage, and more.
See the Wiki Development Page for development instructions.