CORPUS OF WRITTEN TATAR
Statistics

Corpus size ~356 mln words.
Sentences amount ~40 mln.
Word forms amount ~4,5 млн.

Words:

Corpus size ~116 mln words.
Sentences amount ~10 mln.
Word forms amount ~1,5 млн.

Lemmas: Words: Letters: Letters (at the beginnig of a word): Letters (at the end of a word): Phonemes (within a rhythmic group): Phonemes (within a word): Miscellaneous:

You can also get various statistical data on the Leipzig Corpora Collection website.