Skip to content

Latest commit

 

History

History
24 lines (18 loc) · 666 Bytes

README-Natural-TODO.md

File metadata and controls

24 lines (18 loc) · 666 Bytes

Natural Language TODO List

This is a short list of projects that are "ready to go" but have not been started yet.

Morphology

Most European languages have conjugated verbs, meaning that there is a verb stem, and a varying suffix indicating tense and number. Effectively all syntactic structure is carried by the suffix, whereas fundamental semantic contents is in the stem.

To deal with morophology, words need to

Chinese

Chinese segmentation can be learned, in the sense of "set phrases".

Translation/parallel tests

This too should work.

Infrastructure dev is needed for parallel texts.