Home   >   CSC-OpenAccess Library   >    Manuscript Information
The Arabic Speech Database: PADAS
Mohamed Khalil Krichi, Adnan Cherif
Pages - 10 - 19     |    Revised - 10-05-2014     |    Published - 15-06-2014
Volume - 8   Issue - 2    |    Publication Date - June 2014  Table of Contents
HTK, MAUS, Phonetic Database, Automatic Segmentation.
This work describes a construction of PADAS “Phonetics Arabic Database Automatically segmented” based on a data-driven Markov process. The use of a segmentation database is necessary in speech synthesis and recognizing speech. Manual segmentation is precise but inconsistent, since it is often produced by more than one label and require time and money. The MAUS segmentation and labeling exist for German speech and other languages but not in Arabic. It is necessary to modify MAUS for establish a segmental database for Arab. The speech corpus contains a total of 600 sentences recorded by 3 (2 male and 1 female) Arabic native speakers from Tunisia, 200 sentences for each.
1 Google Scholar 
2 CiteSeerX 
3 refSeek 
4 Scribd 
5 SlideShare 
6 PdfSR 
A. Black and K. Tokuda, “The Blizzard Challenge Evaluating Corpus-Based Speech Synthesis on Common Datasets,” in Proceeding of Interspeech, Portugal, pp. 77-80, 2005.
A. Omar “Dirasat Al–Swat Al–Lugawi”.Cairo: Alam Al– Kutub 1985.
F. Schiel, and J. Harrington: “Phonemic Segmentation and Labelling using the MAUS Technique”. Workshop 'New Tools and Methods for Very-Large-Scale Phonetics Research',University of Pennsylvania, January 28-31, 2011.
F. Schiel, C. Heinrich, and S. Barf¨ußer “Alcohol Language Corpus”. Language Resources,2011.
F. Schiel, “MAUS Goes Iterative”. Proc. of the IV. International Conference on Language Resources and Evaluation, Lisbon, Portugal, pp. 1015-1018. 2004.
G.Droua-Hamdani “Algerian Arabic speech database (algasd)”. December 2010.
J. Garofolo, L. Lamel , W. Fisher, J. Fiscus, D. Pallett, N. Dahlgren, and V. Zue, “TIMIT Acoustic-Phonetic Continuous Speech Corpus,” Technical Document, Trustees of the University of Pennsylvania, Philadelphia, 1993.
J.L. Fleiss “Measuring nominal scale agreement among many raters”. Psychological Bulletin,Vol. 76, No. 5 pp. 378-382. 1971.
K. Tokuda, H. Zen, and A.W. Black, “An HMM-based speech synthesis system applied to English”, in IEEE Speech Synthesis Workshop, 2002.
L. Hadjileontiadis and S. Panas. “Separation of discontinuous adventitious sounds from vesicular sounds using a wavelet based filter”, IEEE Trans. Biomed. Eng., vol. 44, n° 7, pp.876-886, 1997.
L. Pineda, G.mez M., D. Vaufreydaz and J. Serignat “Experiments on the Construction of a Phonetically Balanced Corpus from the Web,” in Proceedings of 5th International Conference on Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science, Korea, pp. 416-419, 2004.
M. Alghamdi, A. Alhamid, and M. Aldasuqi, “Database of Arabic Sounds: Sentences,”Technical Report, Saudi Arabia, 2003.
M.A. Mansour “Kacst arabic phonetics database”. Riyadh, Kingdom of Saudi Arabia. 2004.
R. Gordon, “Ethnologue: Languages of the World, Texas: Dallas”, SIL International, 2005.
S. Burger, K. Weilhammer, F. Schiel, H. G. Tillmann, “Verbmobil Data Collection and Annotation”. Foundations of Speech-to-Speech Translation (Ed.Wahlster W), Springer,Berlin, Heidelberg. 2000.
S. D’Arcy and M. Russell, “Experiments with the ABI (Accents of the British Isles) Speech Corpus,” in Proceedings of Interspeech 08, Australia, pp. 293-296, 2008.
S. Mallat. “A wavelet tour of signal processing”. Academic Press, 1999.
Mr. Mohamed Khalil Krichi
Faculty of Sciences of Tunis/ Laboratory of Signal Processing/ Physics Department University of Tunis-Manar TUNIS, 1060, - Tunisia
Professor Adnan Cherif
Faculty of Sciences of Tunis/ Laboratory of Signal Processing/ Physics Department University of Tunis-Manar TUNIS, 1060, - Tunisia