tokenize

package
v0.0.0-...-aa82ca8
Published: Aug 17, 2021 License: MIT Imports: 10 Imported by: 0

Documentation

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type PipeTokenizeProcessor

type PipeTokenizeProcessor struct {
	// contains filtered or unexported fields
}

PipeTokenizeProcessor is a text tokenizer.

func NewPipeTokenizeProcessor

func NewPipeTokenizeProcessor(storage storage.Persister, language common.LanguageType) *PipeTokenizeProcessor

NewPipeTokenizeProcessor creates a new text tokenizer.

func (*PipeTokenizeProcessor) InfoTokenize

func (p *PipeTokenizeProcessor) InfoTokenize(pGroup *sync.WaitGroup, input common.PacketChannel, output common.ConcordanceChannel)

InfoTokenize tokenizes Chinese and English text.
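
The signature suggests a pipeline stage: it takes a *sync.WaitGroup, an input channel, and an output channel. The following is a minimal sketch of that pattern, not the package's implementation. The channel element types (plain strings in, token-count maps out), the names tokenizeStage and runStage, the whitespace splitting, and the choice to close the output and call Done when the input is drained are all assumptions made for illustration; the real common.PacketChannel and common.ConcordanceChannel types are not documented here.

```go
package main

import (
	"fmt"
	"strings"
	"sync"
)

// tokenizeStage is a hypothetical pipeline stage (not the package's code).
// It reads texts from input until that channel is closed, sends one
// token-count map per text to output, then closes output and signals wg.
func tokenizeStage(wg *sync.WaitGroup, input <-chan string, output chan<- map[string]uint64) {
	defer wg.Done()     // runs last: tells the caller the stage has finished
	defer close(output) // runs first: lets the consumer's range loop end
	for text := range input {
		counts := make(map[string]uint64)
		// Assumption: lowercase and split on whitespace. Chinese text
		// would need a real word segmenter instead.
		for _, tok := range strings.Fields(strings.ToLower(text)) {
			counts[tok]++
		}
		output <- counts
	}
}

// runStage feeds texts through tokenizeStage and collects results in order.
func runStage(texts []string) []map[string]uint64 {
	input := make(chan string)
	output := make(chan map[string]uint64)
	var wg sync.WaitGroup
	wg.Add(1)
	go tokenizeStage(&wg, input, output)

	// Producer: send every text, then close input so the stage can stop.
	go func() {
		for _, t := range texts {
			input <- t
		}
		close(input)
	}()

	var results []map[string]uint64
	for counts := range output {
		results = append(results, counts)
	}
	wg.Wait()
	return results
}

func main() {
	for _, counts := range runStage([]string{"to be or not to be", "hello world"}) {
		fmt.Println(counts)
	}
}
```

Passing the WaitGroup in lets a caller start several stages and wait for all of them with a single Wait call.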

func (*PipeTokenizeProcessor) QueryTokenize

func (p *PipeTokenizeProcessor) QueryTokenize(query string, language common.LanguageType, concordance map[string]uint64)

QueryTokenize tokenizes a query string.
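
QueryTokenize takes a concordance map[string]uint64 and returns nothing, which suggests it writes its results into the caller's map. The sketch below illustrates that calling convention under stated assumptions: countTokens is a hypothetical stand-in, the map is assumed to hold per-token occurrence counts, and splitting on non-letter, non-digit runes is a simplification that would not segment Chinese text the way a dictionary-based tokenizer does.

```go
package main

import (
	"fmt"
	"strings"
	"unicode"
)

// countTokens is a hypothetical stand-in for QueryTokenize (not the
// package's code). It splits query on any rune that is neither a letter
// nor a digit, lowercases each token, and adds its count to concordance.
func countTokens(query string, concordance map[string]uint64) {
	tokens := strings.FieldsFunc(query, func(r rune) bool {
		return !unicode.IsLetter(r) && !unicode.IsDigit(r)
	})
	for _, tok := range tokens {
		concordance[strings.ToLower(tok)]++
	}
}

func main() {
	// The caller owns the map; the function only fills it in.
	concordance := make(map[string]uint64)
	countTokens("Go search, go FAST", concordance)
	fmt.Println(concordance["go"], concordance["search"], concordance["fast"]) // prints "2 1 1"
}
```

Because the map is supplied by the caller, the same map can accumulate counts across several calls.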
