preset

package
v0.6.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 13, 2020 License: LGPL-3.0 Imports: 6 Imported by: 0

Documentation

Overview

Package preset provides a collection of pre-built string similarity scoring functions generated by the higher-ordered function in the factory parent

Index

Constants

This section is empty.

Variables

PlainSimilarityScore computes the similarity score between two input strings but each input string will be sanitized before they are compared to each other.

SimpleSimilarityScore computes the similarity score between two input strings. Two input strings will be directly compared under optimal alignment distance metric without any pre-processing, and the resulting distance will be re-normalized to a similarity score between 0 and 1 (inclusive).

ThaiNameSimilarityScore computes the similarity score between two input strings with the following functionalities:

  1. Each input string will be sanitized via sanitaryThai.Sanitize function (e.g. removing diacritics from latin scripts, removing repeated Thai tonal marks, etc.)
  2. Each input string will be used to generate bare names (i.e. attempting to remove English and Thai titles such as Mrs. or dek-chai)
  3. For optimal alignment distance metric over string space, the specialized substitution/transposition penalty functions are used instead.

Functions

This section is empty.

Types

This section is empty.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL