Malayalam Parallel Text Corpus: Linguistic Features and Structures

Catalogue Number: 1514
Stock: In Stock

Dataset Description

Total words: 4,404,845 | Malayalam words: 20,955 | 5,332 sentences/phrases in each mother tongue

India has 270 mother tongues as per the 2011 census. Following the requirements of NEP-2020, LDC-IL developed a parallel corpus in Indian mother tongues. The Malayalam parallel text corpus is linked with English and 146 mother tongues of India. It contains 5,332 sentences/phrases, systematically structured around 159 grammatical categories. The Malayalam section includes 20,955 words and 168,051 characters. Overall, the corpus comprises 4,404,845 words (over 4.4 million tokens) and 23,374,289 characters (approximately 23.4 million).
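As a rough illustration of what these figures imply, the sketch below derives a few averages from the counts quoted above. The constant names and the `ratio` helper are hypothetical and not part of any LDC-IL tooling, and the description does not say how words and characters were counted (for example, whether spaces are included), so the results should be read as indicative only.

```python
# Counts copied from the dataset description; nothing here reads the corpus itself.
MALAYALAM_WORDS = 20_955
MALAYALAM_CHARS = 168_051
SENTENCES = 5_332
TOTAL_WORDS = 4_404_845


def ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, rejecting a zero denominator."""
    if denominator == 0:
        raise ValueError("denominator must be non-zero")
    return numerator / denominator


# Average length of a Malayalam sentence/phrase, in words.
words_per_sentence = ratio(MALAYALAM_WORDS, SENTENCES)

# Average length of a Malayalam word, in characters (counting method assumed).
chars_per_word = ratio(MALAYALAM_CHARS, MALAYALAM_WORDS)

# Malayalam's share of all words in the parallel corpus.
malayalam_share = ratio(MALAYALAM_WORDS, TOTAL_WORDS)

print(f"Words per Malayalam sentence/phrase: {words_per_sentence:.2f}")
print(f"Characters per Malayalam word: {chars_per_word:.2f}")
print(f"Malayalam share of total words: {malayalam_share:.2%}")
```

On the stated counts this works out to a little under four words per sentence/phrase and about eight characters per word, with the Malayalam section accounting for roughly half a percent of all words in the corpus.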

For research citations, please cite the following:

  1. Dr. Rejitha K. S., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2026. Malayalam Parallel Text Corpus: Linguistic Features and Structures. Central Institute of Indian Languages, Mysore. XXX-XX-XXXXX-XX-X. 
  2. Rejitha K. S. and Narayan Kumar Choudhary (eds.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.

Item specifics

  • Authors: Dr. Rejitha K. S., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan
  • Corpus Type: Parallel Text Corpus
  • ISBN: 978-93-48633-45-3
  • Data Source: On Field
  • Release Date: 06-03-2026
