nltk-go
Natural Language Toolkit (NLTK) for Go, inspired by nltk.
This is a work in progress (the very beginning of the project, to be precise), so everything may change quickly and backwards compatibility is not guaranteed.
Our current focus is making "Floresta Sintática" available and easy to use from Go; after that, we will work on other languages.
We started with a SQL dump of "Floresta Sintática", which we got working on a Docker Postgres instance (see Bosque_CP_8.0_Postgres for more details). However, there were some issues with the SQL insert statements, so we are now parsing an XML (Tiger-XML) version and inserting it into the database.
Next Steps
- Finish the database integration/interface
- Read the Tiger-XML
- Convert it to UTF-8
- Insert it in a Postgres Database
- Convert it to a SQLite database (it's easier to have the whole database in a single file)
- Support other languages
Contributing
- Fork it
- Write some good code
- Open a pull request
- That's it :D
Licence
GPLv3