Punjabi Sentence Aligned Speech Corpus
OverView
52:24:51 hours | 34:8 GB | 31,338 Audio Segments | 449 SpeakersThe LDC-IL Punjabi Sentence Aligned Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer containing phonetically normalize...
Categories
Cart
Account
Search
Recent View
Go to Top
All Categories
×
Request Cart
×
Your request cart is empty!
Search
×
Recent View Datasets
×
Dataset Description
52:24:51 hours | 34:8 GB | 31,338 Audio Segments | 449 Speakers
The LDC-IL Punjabi Sentence Aligned Speech dataset comprises audio files in wav format, accompanied by a corresponding textual layer containing phonetically normalized and orthographically normalized annotations in Gurmukhi script. This dataset spans a duration of 52:24:51 (hh:mm:ss) , consisting of read speech with continuous text, representative sentences, and date formats. A comprehensive explanation of dataset can be found in the Punjabi Sentence Aligned Speech Documentation.
For any research-based citations, please use the following citations:
- Dr. Shalinder Singh, Rajesha N., Manasa G., Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Punjabi Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. 978-93-48633-69-9.
- Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. 978-93-48633-33-0.
Item specifics
- Authors Dr. Shalinder Singh, Rajesha N., Manasa G., Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan
- Corpus Type Sentence Aligned Speech Corpus
- Catalogue Number 1505
- ISBN 978-93-48633-33-0.
- Data Source On Field
- Duration 52:24:51 hours
- # of Audio Segments 31338
- Release Date 20-Mar-25
- Terms and Conditions General instructions for use of the resources provided by LDC-IL.
Commercial User
Non-Commercial User
LDC-IL Raw Text Corpora: An Overview
LDC-IL Raw Speech Corpora: An Overview