8000 GitHub - ChristopherLucas/txtorg: Software for preprocessing textual data in multiple languages for textual analysis.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

ChristopherLucas/txtorg

Repository files navigation

txtorg

txtorg is a Python-based utility that leverages Apache Lucene to facilitate text preprocessing and management. It outputs processed text in a variety of formats for use in a wide array of analytical software, including (but not limited to) the structural topic model. It scales to large corpora and has a graphical user interface that anyone can use. With Lucene, txtorg can support a wide range of languages.

For more information, including installation instructions, see http://txtorg.org/.

About

Software for preprocessing textual data in multiple languages for textual analysis.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0