Welcome to LDC-IL - Data Distribution Portal
This is a data distribution portal of Linguistic
Data Consortium for Indian Languages (LDC-IL), a scheme of Department of Higher
Education, Ministry of Human Resource Development, Government of India
implemented by Central Institute of Indian Languages, Mysore.
Here you can find the linguistic resources
required for various types of language technology development works in Indian
languages. The resources include text and speech corpora with several types of
annotations such as parts of speech, chunking, sentence and word level
annotation, etc.
The goal is to make rich and varied types of
linguistic resources available to developers and research community so that
work in Indian languages gets promoted.