Dogri Sentence Aligned Speech Corpus
Catalogue Number: 1345
Stock In Stock
OverView
08:32:54 hours | 5.6 GB | 5,039 Audio Segments | 61 SpeakersThe LDC-IL Dogri Sentence Aligned Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer containing orthographically normalized...Please Login to see the price
Share This
Categories
Cart
Account
Search
Recent View
Go to Top
All Categories
×
Request Cart
×
Your request cart is empty!
Search
×
Recent View Datasets
×
Dataset Description
08:32:54 hours | 5.6 GB | 5,039 Audio Segments | 61 Speakers
The LDC-IL Dogri Sentence Aligned Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer containing orthographically normalized annotation in Devanagari script. This dataset spans a duration of 08:32:54 (hh:mm:ss) , consisting of read speech with continuous text, representative sentences, and date formats. The data is derived from 30 female and 31 male native Dogri speakers, encompassing diverse age groups and regions. A comprehensive explanation of dataset can be found in the Dogri Sentence Aligned Speech Documentation.
For any research-based citations, please use the following citations:
- Rajesha N., Manasa G., Dr. Rejitha K. S., Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Dogri Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-88-0
- Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.
Item specifics
- Authors Rajesha N., Manasa G., Dr. Rejitha K. S., Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan
- Corpus Type Sentence Aligned Speech Corpus
- Catalogue Number 1502
- ISBN 978-93-48633-88-0
- Data Source Data Source
- Duration 08:32:54
- # of Audio Segments 5039
- Release Date 20-03-2025
- Terms and Conditions General instructions for use of the resources provided by LDC-IL.
Commercial User
Non-Commercial User
LDC-IL Raw Text Corpora: An Overview
LDC-IL Raw Speech Corpora: An Overview