HughP/dnj-corpus

Eastern Dan [dnj] corpus and text input analysis

Sections

Corpora in Languages

  • A medium-sized corpus in Eastern Dan [dnj]
  • The Eastern Dan corpus consists of issues of a local newspaper (˗Pamɛbhamɛ) and chapters of medical advice from While waiting for a medical doctor, translated into Eastern Dan. A detailed description of the corpus is available here.
  • A parallel corpus (the New Testament book of James) in the following languages: Spanish [spa], French [fra], English [eng], Eastern Dan [dnj], Me'phaa [tcf], Mandarin [cmn], and two Nigerian languages, [bkv] and [eza].

Writing System Descriptions

/Writing-System-Descriptions: The focus here is on the Eastern Dan writing system in general, though some description also exists for the other languages in the parallel corpus. In describing Eastern Dan's text input options, this corpus description also attempts a modest application of the principles set out by Martin Hosken for Writing System Descriptions. A specific analysis of the text input options for Eastern Dan is in the Text Input Systems and their Descriptions section.

Text Input Systems and their Descriptions

/Text-Input-Systems-and-Descriptions: This corpus includes information about several keyboard layouts.

Tools

A variety of tools were used in this project. I try to keep a list of them, and a copy of each, in this repository.

Tools for Text

The tools in /Tools/Tools-for-Text include:

  • Measuring diacritic density and tone melodies
  • Text character counting
  • Converting or Transforming Text
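
As a rough illustration of what character counting and diacritic density measure, here is a minimal Python sketch. It is not the code kept in /Tools/Tools-for-Text: the function names `char_counts` and `diacritic_density` are hypothetical, and "diacritic density" is assumed here to mean Unicode combining marks per base letter after NFD normalization. Spacing marks that are not Unicode combining characters are ignored by the density measure, so an orthography that relies on them would need a different definition.

```python
import unicodedata
from collections import Counter


def char_counts(text):
    """Count each non-whitespace character in the NFD form of text."""
    decomposed = unicodedata.normalize("NFD", text)
    return Counter(ch for ch in decomposed if not ch.isspace())


def diacritic_density(text):
    """Return combining marks per letter in the NFD form of text.

    Assumption: a "diacritic" is any character with a non-zero
    canonical combining class; a "letter" is any character whose
    general category starts with "L".
    """
    decomposed = unicodedata.normalize("NFD", text)
    marks = sum(1 for ch in decomposed if unicodedata.combining(ch))
    letters = sum(
        1 for ch in decomposed if unicodedata.category(ch).startswith("L")
    )
    return marks / letters if letters else 0.0


if __name__ == "__main__":
    sample = "caf\u00e9 na\u00efve"  # illustrative text, not from the corpus
    print(char_counts(sample).most_common(5))
    print(diacritic_density(sample))
```

Normalizing to NFD first means a precomposed letter and its decomposed equivalent are counted the same way, which matters when the corpus mixes input methods.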

Tools for Keyboard Layout analysis

  • Typing
  • KLA

Tools for Descriptions

  • KLE
