Documentation ¶
Index ¶
Constants ¶
View Source
const Newline rune = '\n'
Variables ¶
This section is empty.
Functions ¶
func UnicodeFor ¶
UnicodeFor Returns a list of unicodes for each input rune
Types ¶
type Info ¶
type Info struct { // A string representation of the input rune (for special newline and tab, the string representation is empty in this implementation) String string // The unicode number Unicode string // The character name CharName string // The codeblock CodeBlock string }
Info holds a set of unicode-related information for a rune
type Processor ¶
Processor
func (*Processor) UnicodeInfo ¶
Info Creates a list with unicode information for each input rune
type Tokenizer ¶
Tokenizer is a simple unicode tokenizer, that groups characters by code block A sequence of characters is treated as one token, as long as they belong to the same unicode code block Numerals, spacing and punctuation are treated as separates code blocks
Click to show internal directories.
Click to hide internal directories.