Plain text convertor

This document explains how to convert plain text files to Tsakorpus JSON. See general information about source convertors and their configuration files here.

Convertor: /src_convertors/

This is the simplest possible convertor. It processes unannotated plain text from .txt files. Files should be encoded in UTF-8 without BOM. If you want to add morphological analysis at the time of conversion, you have to prepare a pre-analyzed word list.
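
Since the convertor expects UTF-8 without BOM, it can be useful to check the source files before running it. The following is a minimal sketch of such a check, not part of Tsakorpus itself: the function names (check_utf8_no_bom, strip_bom, report) are hypothetical and chosen only for illustration, and the folder you pass to report is whichever directory holds your .txt files.

```python
import codecs
from pathlib import Path


def check_utf8_no_bom(data: bytes) -> str:
    """Classify raw file contents as 'ok', 'bom' or 'not-utf8'.

    Hypothetical helper: the BOM test runs first, so a file that
    starts with a BOM is reported as 'bom' without further decoding.
    """
    if data.startswith(codecs.BOM_UTF8):
        return "bom"
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return "not-utf8"
    return "ok"


def strip_bom(data: bytes) -> bytes:
    """Return the contents without a leading UTF-8 BOM, if there is one."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def report(folder: str) -> dict:
    """Map every .txt file under `folder` (searched recursively) to its status."""
    return {
        str(path): check_utf8_no_bom(path.read_bytes())
        for path in sorted(Path(folder).rglob("*.txt"))
    }
```

Files reported as 'bom' can be rewritten with the output of strip_bom. Files reported as 'not-utf8' need to be re-encoded from their actual encoding, which this sketch does not try to guess.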