fuzzymatch_go

package module
v0.6.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 13, 2020 License: LGPL-3.0 Imports: 0 Imported by: 0

README

Fuzzy Match

pkg.go.dev reference

This repository contains a Go language implementation of approximate string matching algorithms.

Preset functions

Inside the package github.com/abhabongse/fuzzymatch-go/factory/preset, there are three examples of preset string similarity score functions with different customizations:

All of these functions have output values between 0 and 1 (inclusive), where 1 indicates that both input strings are identical under some criteria and 0 indicates that both strings are totally distinct. The signature of these functions is

func(string, string) float64
Customization

New string similarity score functions may be constructed via various higher-order functions provided in this module. Look at the construction of the preset functions above for some ideas of how to introduce addition functionalities to your string similarity score functions.

Notes

All source code for this project is released under the GNU Lesser General Public License v3.0.

Documentation

Overview

Please look at subpackages for more documentation.

Directories

Path Synopsis
Package candidate provides functions to generate a sequence of all possible variants of a given input string.
Package candidate provides functions to generate a sequence of all possible variants of a given input string.
diacritics
Package diacritics is the subpackage of package candidate which will attempt to remove diacritical marks from extended latin letters based on one of two different strategies.
Package diacritics is the subpackage of package candidate which will attempt to remove diacritical marks from extended latin letters based on one of two different strategies.
nametitle
Package nametitle is the subpackage of package candidate which generates names with titles stripped away based on pre-defined regular expression patterns.
Package nametitle is the subpackage of package candidate which generates names with titles stripped away based on pre-defined regular expression patterns.
Package editdist provides a set of string comparison functions to compute the distance between a pair of strings under various distance metrics in string space.
Package editdist provides a set of string comparison functions to compute the distance between a pair of strings under various distance metrics in string space.
thai
Package thai is a subpackage that provides additional editdist-related functions that are customized for strings containing Thai characters, etc.
Package thai is a subpackage that provides additional editdist-related functions that are customized for strings containing Thai characters, etc.
Package factory provides the basic building block structure which governs how different components of this fuzzymatch-go module can be combined together to construct a function determining the string similarity between any two given input strings.
Package factory provides the basic building block structure which governs how different components of this fuzzymatch-go module can be combined together to construct a function determining the string similarity between any two given input strings.
preset
Package preset provides a collection of pre-built string similarity scoring functions generated by the higher-ordered function in the factory parent
Package preset provides a collection of pre-built string similarity scoring functions generated by the higher-ordered function in the factory parent
Package runedata provides additional database regarding Unicode characters which are not part of the built-in Go golang.org/x/text package, especially those involving nuances in foreign scripts such as Thai.
Package runedata provides additional database regarding Unicode characters which are not part of the built-in Go golang.org/x/text package, especially those involving nuances in foreign scripts such as Thai.
thai
Package thai is the subpackage containing additional information regarding Thai Unicode character block.
Package thai is the subpackage containing additional information regarding Thai Unicode character block.
Package transform provides various string manipulation and sanitization functions.
Package transform provides various string manipulation and sanitization functions.
thai
Package thai is a subpackage that provides additional string sanitization functions that are customized for Thai characters and scripts.
Package thai is a subpackage that provides additional string sanitization functions that are customized for Thai characters and scripts.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL