Jellyfish is a Python library for doing approximate and phonetic matching of strings.
Jellyfish is a project of Sunlight Labs (c) 2014. All code is released under a BSD-style license; see LICENSE for details.
Written by James Turk <jturk@sunlightfoundation.com> and Michael Stephens.
See https://github.com/sunlightlabs/jellyfish/graphs/contributors for contributors.
Source is available at http://github.com/sunlightlabs/jellyfish.
Included Algorithms
String comparison:
Levenshtein Distance
Damerau-Levenshtein Distance
Jaro Distance
Jaro-Winkler Distance
Match Rating Approach Comparison
Hamming Distance
Phonetic encoding:
American Soundex
Metaphone
NYSIIS (New York State Identification and Intelligence System)
Match Rating Codex
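To give a feel for what one of the string comparison measures listed above computes, here is a textbook-style sketch of Jaro similarity in pure Python. It is not jellyfish's own implementation; the function name jaro_sketch and the choice of matching window are assumptions made for illustration. Note that the score is a similarity: 1.0 means identical strings and 0.0 means no matching characters.

```python
def jaro_sketch(s1, s2):
    """Illustrative Jaro similarity (hypothetical helper, not part of jellyfish)."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters count as matching only if they are equal and no farther
    # apart than this window (the usual textbook choice).
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0

    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break

    if matches == 0:
        return 0.0

    # Matched characters that appear in a different order are counted
    # pairwise; each out-of-order pair is one transposition.
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) / 2.0

    m = float(matches)
    return (m / len1 + m / len2 + (m - transpositions) / m) / 3.0


print(jaro_sketch('jellyfish', 'smellyfish'))  # approximately 0.8963
```

For real workloads the library's own functions are the better choice; the sketch only shows the shape of the calculation.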
Example Usage
>>> import jellyfish
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
2
>>> jellyfish.jaro_distance('jellyfish', 'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
1
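The Levenshtein result of 2 for 'jellyfish' and 'smellyfish' can be reproduced with a short dynamic-programming sketch. This is a minimal illustration under the standard definition (unit cost for insertion, deletion, and substitution), not the code jellyfish ships; levenshtein_sketch is a hypothetical name.

```python
def levenshtein_sketch(a, b):
    """Illustrative Levenshtein distance (hypothetical helper, not part of jellyfish)."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein_sketch('jellyfish', 'smellyfish'))  # 2
```

The two edits here are substituting 'j' with 'm' and inserting a leading 's'. Keeping only two rows of the table keeps memory proportional to the length of the second string.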
>>> jellyfish.metaphone('Jellyfish')
'JLFX'
>>> jellyfish.soundex('Jellyfish')
'J412'
>>> jellyfish.nysiis('Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex('Jellyfish')
'JLLFSH'
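To show how a phonetic code such as 'J412' comes about, the following is a minimal sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent identical codes, and pad to four characters. It is an assumption-laden illustration rather than jellyfish's implementation, and soundex_sketch is a hypothetical name; edge cases may be handled differently by the library.

```python
def soundex_sketch(word):
    """Illustrative American Soundex (hypothetical helper, not part of jellyfish)."""
    # Build the consonant-to-digit table used by American Soundex.
    codes = {}
    for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in group:
            codes[ch] = digit

    # Assumption: only ASCII letters are considered; everything else is dropped.
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""

    result = letters[0]
    prev = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # vowels reset prev, so a repeated code after a vowel counts
        if len(result) == 4:
            break
    return (result + "000")[:4]


print(soundex_sketch('Jellyfish'))  # J412
```

Names that sound alike tend to share a code under these rules, for example 'Robert' and 'Rupert' both map to 'R163'.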