Irula/Irular Mozhi Parallel Text Corpus: Linguistic Features and Structures
OverView
Total Words: 4,404,845 | Irula/Irular Mozhi Words: 22,909 | 5,332 sentences/phrases in each mother tonguesIndia has 270 mother tongues as per 2011 census. Fol...Your request cart is empty!
Dataset Description
Total Words: 4,404,845 | Irula/Irular Mozhi Words: 22,909 | 5,332 sentences/phrases in each mother tongues
India has 270 mother tongues as per 2011 census. Following the requirements of the NEP-2020, LDC-IL developed parallel corpus in Indian mother tongues. The Irula/Irular Mozhi parallel text corpus connected with English and 146 mother tongues of India. It contains 5,332 sentences/phrases systematically structured based on 159 grammatical categories. The Irula/Irular Mozhi section includes 22,909 words and 164,665 characters. Overall, the corpus comprises 4,404,845 words (over 4.4 million tokens) and 23,374,289 characters (approximately 23.3 million).
The price indicated corresponds to a single language component. The total payment will be determined based on the number of language components requested by the seeker.
For any research-based citations, please use the following citations:
1. Dr. Amudha R., Dr. Rejitha K. S., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2026. Irula/Irular Mozhi Parallel Text Corpus: Linguistic Features and Structures. Central Institute of Indian Languages, Mysore. 978-81-69099-11-0
2. Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.
Item specifics
- Authors Dr. Amudha R., Dr. Rejitha K. S., Dr. Narayan Choudhary, Prof. Shailendra Mohan
- Catalogue Number 1644
- ISBN 978-81-69099-11-0
- Data Source Descriptive Grammar
- Character Count 23374289
- Word Count 4404845
- Terms and Conditions General instructions for use of the resources provided by LDC-IL.
