PolyFuzz
PolyFuzz is a Python library for fuzzy string matching, grouping, and evaluation. It brings together various techniques like edit distance, TF-IDF, and transformer embeddings within a single framework.
About
PolyFuzz is a Python library for fuzzy string matching, string grouping, and evaluation of the resulting matches. It consolidates several fuzzy matching techniques into a single framework: edit distance measures, character-level n-gram TF-IDF, and embedding-based methods such as FastText, GloVe, and Hugging Face transformer embeddings. Matching can run between two lists or within a single list, and similar matches can then be grouped. For production use, the library provides `fit`, `transform`, and `fit_transform` functions, and fitted models can be saved and loaded. Users can visualize model performance with precision-recall curves and implement custom matching models by extending the `BaseMatcher` class.
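To make the character n-gram TF-IDF idea concrete, here is a minimal standard-library sketch of matching one list against another by cosine similarity. It is not PolyFuzz's implementation or API: the function names (`char_ngrams`, `tfidf_vectors`, `match`), the trigram size, the space padding, and the smoothed IDF formula are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Return the character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def tfidf_vectors(strings, n=3):
    """Build L2-normalised TF-IDF vectors (as dicts) over character n-grams."""
    counts = [Counter(char_ngrams(s, n)) for s in strings]
    # Document frequency: in how many strings does each n-gram occur?
    doc_freq = Counter(gram for c in counts for gram in c)
    total = len(strings)
    vectors = []
    for c in counts:
        # Smoothed IDF (an assumption of this sketch, not taken from the source).
        vec = {
            g: tf * (math.log((1 + total) / (1 + doc_freq[g])) + 1)
            for g, tf in c.items()
        }
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({g: w / norm for g, w in vec.items()})
    return vectors


def match(from_list, to_list, n=3):
    """For each string in from_list, return (from, best_to, cosine similarity)."""
    vectors = tfidf_vectors(from_list + to_list, n)
    from_vecs = vectors[:len(from_list)]
    to_vecs = vectors[len(from_list):]
    results = []
    for source, fv in zip(from_list, from_vecs):
        # Vectors are unit length, so the dot product is the cosine similarity.
        scores = [sum(w * tv.get(g, 0.0) for g, w in fv.items()) for tv in to_vecs]
        best = max(range(len(to_list)), key=scores.__getitem__)
        results.append((source, to_list[best], round(scores[best], 3)))
    return results


from_list = ["apple", "apples", "appl", "mouse"]
to_list = ["apple", "house", "window"]
matches = match(from_list, to_list)
for row in matches:
    print(row)
```

Because the vectors are built from overlapping character trigrams, near-duplicates such as "apples" and "appl" still share most of their n-grams with "apple", which is why this family of methods tolerates typos and small spelling variations.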
Pricing & Plans
Open Source
Free