Metadata-Version: 2.4 Name: corpustools Version: 2.2.0 Summary: Manage giellalt corpora License: GPLv3+ License-File: LICENSE.txt Author: Børre Gaup Author-email: albbas@gmail.com Requires-Python: >=3.10,<4.0 Classifier: License :: Other/Proprietary License Classifier: Programming Language :: Python :: 3 Classifier: Programming Language :: Python :: 3.10 Classifier: Programming Language :: Python :: 3.11 Classifier: Programming Language :: Python :: 3.12 Classifier: Programming Language :: Python :: 3.13 Classifier: Programming Language :: Python :: 3.14 Requires-Dist: GitPython (>=3.1.44,<4.0.0) Requires-Dist: epub (>=0.5.2,<0.6.0) Requires-Dist: feedparser (>=6.0.10,<7.0.0) Requires-Dist: gitdb (>=4.0.11,<5.0.0) Requires-Dist: html5lib (>=1.1,<2.0) Requires-Dist: lxml (>=5.4.0,<6.0.0) Requires-Dist: lxml-html-clean (>=0.4.2,<0.5.0) Requires-Dist: prompt-toolkit (>=3.0.51,<4.0.0) Requires-Dist: pytesseract (>=0.3.10,<0.4.0) Requires-Dist: python-dateutil (>=2.8.2,<3.0.0) Requires-Dist: python_tca2 @ git+https://github.com/divvun/python_tca2.git@main Requires-Dist: regex (>=2024.11.6,<2025.0.0) Requires-Dist: requests (>=2.31.0,<3.0.0) Requires-Dist: unidecode (>=1.4.0,<2.0.0) Project-URL: Homepage, https://github.com/giellalt/CorpusTools Description-Content-Type: text/markdown # Corpus Tools Tools to manipulate a giellalt corpus in different ways. ## Install and update from Apertium nightly These tools are a part of [Apertium nightly packages](https://wiki.apertium.org/wiki/Installation/Developers). For Mac users, running the `install-nightly.sh` suffices, Linux users will have to run ` install `, as well (search for corpustools in the package system). To update the tools on Mac, run `install-nightly.sh`. On Linux, update packages using the package manager. ## Install and update using pipx - [Install pipx](https://pypa.github.io/pipx/installation/) - Run `pipx install --force git+https://github.com/divvun/CorpusTools.git` [API and manual pages](https://divvun.github.io/CorpusTools/)