Kashmiri Parts of Speech Annotated Corpus

Kashmiri Parts of Speech Annotated Corpus

0 reviews requests (0)
Catalogue Number: ‎1701‎
Stock In Stock

OverView

‎103488‎ Tags| ‎92746‎ Words | ‎5331‎ SentencesThe Linguistic Data Consortium for Indian Languages (LDC-IL) is developed Parts-of-Speech annotated corpus for ...
Please Login to see the price

Dataset Description

‎103488‎ Tags| ‎92746‎ Words | ‎5331‎ Sentences

The Linguistic Data Consortium for Indian Languages (LDC-IL) is developed Parts-of-Speech annotated corpus for Scheduled Indian languages. The corpus is annotated with Part-of-Speech (PoS) tags based on the Bureau of Indian Standards (BIS) PoS Tagset. This data is a significant resource for natural language processing and linguistic research. LDC-IL developed annotated text corpora for Kashmiri. The Kashmiri PoS annotated corpus is automatically tagged and then verified by linguistic experts to ensure accuracy and consistency.
Kashmiri PoS annotated Corpus contains ‎103488 Part-of-Speech tags.

For any research-based citations, please use the following citations:

1. Dr. Zargar Adil Ahmad, Dr. Narayan Choudhary, ‎Rajesha N., Manasa G. 2026. Kashmiri Parts of Speech Annotated Corpus. Central Institute of Indian Languages, Mysore. ‎978-81-69175-77-7. ‎

2. Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2026. LDC-IL Parts of Speech Annotated Corpus Based on BIS Framework. Central Institute of Indian Languages, Mysore. 978-81-69175-60-9.
‎103488‎ Tags| ‎92746‎ Words | ‎5331‎ Sentences


The Linguistic Data Consortium for Indian Languages (LDC-IL) is developed Parts-of-Speech annotated corpus for Scheduled Indian languages. The corpus is annotated with Part-of-Speech (PoS) tags based on the Bureau of Indian Standards (BIS) PoS Tagset. This data is a significant resource for natural language processing and linguistic research. LDC-IL developed annotated text corpora for Kashmiri. The Kashmiri PoS annotated corpus is automatically tagged and then verified by linguistic experts to ensure accuracy and consistency.
Kashmiri PoS annotated Corpus contains ‎103488 Part-of-Speech tags.

For any research-based citations, please use the following citations:

1. Dr. Zargar Adil Ahmad, Dr. Narayan Choudhary, ‎Rajesha N., Manasa G. 2026. Kashmiri Parts of Speech Annotated Corpus. Central Institute of Indian Languages, Mysore. ‎978-81-69175-77-7. ‎

2. Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2026. LDC-IL Parts of Speech Annotated Corpus Based on BIS Framework. Central Institute of Indian Languages, Mysore. 978-81-69175-60-9.

Item specifics

  • Authors Dr. Zargar Adil Ahmad, Dr. Narayan Choudhary, ‎Rajesha N., Manasa G.‎
  • Corpus Type Parts of Speech Annotated Text Corpus
  • Catalogue Number ‎1701‎
  • ISBN ‎978-81-69175-77-7‎
  • Data Source Annotated
  • Word Count 92746
  • Release Date 3/23/2026
  • Terms and Conditions General instructions for use of the resources provided by LDC-IL.
  • Tag Count 103488
Commercial User
Non-Commercial User
LDC-IL Raw Text Corpora: An Overview
LDC-IL Raw Speech Corpora: An Overview

Write a review

Please login or register to review