package
module
Version:
v0.0.0-...-77c60e4
Opens a new window with list of versions in this module.
Published: Nov 5, 2015
License: ISC
Opens a new window with license information.
Imports: 6
Opens a new window with list of imports.
Imported by: 0
Opens a new window with list of known importers.
README
¶
NihonGo
NihonGo is an utility of Japanese text for Go language.
go get github.com/dogenzaka/nihongo
Features
- Converting Katakana / Hiragana
- Unicode normalization
- Detecting Katakana / Hiragana strings in text
- Simple Japanese tokenizer ported TinySegmenter
Examples
import (
"fmt"
"github.com/dogenzaka/nihongo"
)
func TestNormalize() {
normalized := nihongo.Normalize("テストテスト+=")
fmt.Println(normalized) // テストテスト+=
}
func TestToHiragana() {
hira := nihongo.ToHiragana("テストてすと")
fmt.Println(hira) // てすとてすと
}
func TestToKatakana() {
kana := nihongo.ToKatakana("テストてすと")
fmt.Println(kana) // テストテスト
}
func TestTokenize() {
words := nihongo.Tokenize("私は人間です")
fmt.Println(words) // ["私" "は" "人間" "です"]
}
func TestContainsHiragana() {
nihongo.ContainsHiragana("ひらがな") // true
nihongo.ContiansHiragana("日本語") // false
}
func TestContainsKatakana() {
nihongo.ContainsKatakana("カタカナ") // true
nihongo.ContiansKatakana("日本語") // false
}
License
ISC
Documentation
¶
ContainsHiragana returns true when text contains hiragana
ContainsKatakana returns true when text contains katakana
Normalize japanese text which will convert with NFKC normalization.
Hankaku-Kana -> Zenkaku-Kana
Zenkaku special chars -> Hankaku special chars
ToHiragana converts all katakana text to hiragana.
You should normalize text before converting.
ToKatakana converts all hiragana text to katakana.
You should normalize text before converting.
Tokenize splits sentence to word array in Japanese
Source Files
¶
Click to show internal directories.
Click to hide internal directories.