Documentation
Overview
Package chunk implements functions for finding useful chunks in text that has previously been tagged with parts of speech.
Example
txt := "Go is a open source programming language created at Google."
words := tokenize.TextToWords(txt)
tagger := tag.NewPerceptronTagger()
fmt.Println(Chunk(tagger.Tag(words), TreebankNamedEntities))
Output: [Go Google]
Constants
This section is empty.
Variables
var TreebankNamedEntities = regexp.MustCompile(
`((CD__)*(NNP.)+(CD__|NNP.)*)+` +
`((IN__)*(CD__)*(NNP.)+(CD__|NNP.)*)*`)
TreebankNamedEntities matches proper names, excluding any preceding adjectives. A match may include numbers, and may link proper nouns with a preposition or subordinating conjunction (for example "Bank of England").
Functions
Types
This section is empty.