Welcome to LDC-IL - Data Distribution Portal
This is a data distribution portal of Linguistic Data Consortium for Indian Languages (LDC-IL), a scheme of Department of Higher Education, Ministry of Human Resource Development, Government of India implemented by Central Institute of Indian Languages, Mysore.
Here you can find the linguistic resources required for various types of language technology development works in Indian languages. The resources include text and speech corpora with several types of annotations such as parts of speech, chunking, sentence and word level annotation, etc.
The goal is to make rich and varied types of linguistic resources available to developers and research community so that work in Indian languages gets promoted.