EXPLORE PUBLICATIONS BY COUNTRIES


	EUROPE

	MIDDLE EAST

	ASIA

	AFRICA
.............................

	United States of America

	United Kingdom

	Canada

	Australia

	Italy

	France

	Brazil

	Germany

	Malaysia

	Turkey

	China

	Taiwan

	Japan

	Saudi Arabia

	Jordan

	Egypt

	United Arab Emirates

	India

	Nigeria

Text to Speech Synthesis with Prosody Feature: Implementation of Emotion in Speech Output using Forward Parsing

MANOJ B. CHANDAK, R.V.Dharaskar, V.M.Thakre

Pages - 352 - 360 | Revised - 30-06-2010 | Published - 10-08-2010

Published in International Journal of Computer Science and Security (IJCSS)

Volume - 4 Issue - 3 | Publication Date - July 2010 Table of Contents

MORE INFORMATION

References | Cited By (4) | Abstracting & Indexing

KEYWORDS

Text to Speech Synthesis, Forward Parsing, Emotion Generator, Prosody Feature

ABSTRACT

One of the key components of Text to Speech Synthesizer is prosody generator. There are basically two types of Text to Speech Synthesizer, (i) single tone synthesizer and (ii) multi tone synthesizer. The basic difference between two approaches is the prosody feature. If the output of the synthesizer is required in normal form just like human conversation, then it should be added with prosody feature. The prosody feature allows the synthesizer to vary the pitch of the voice so as to generate the output in the same form as if it is actually spoken or generated by people in conversation. The paper describes various aspects of the design and implementation of speech synthesizer, which is capable of generating variable pitch output for the text. The concept of forward parsing is used to find out the emotion in the text and generate the output accordingly.

CITED BY (4)

1	Cunningham, T. (2012). Understanding Synthetic Speech and Language Processing of Students With and Without a Reading Disability (Doctoral dissertation).

2	Anil, M. C., & Shirbahadurkar, S. D. (2014, February). Speech modification for prosody conversion in expressive Marathi text-to-speech synthesis. In Signal Processing and Integrated Networks (SPIN), 2014 International Conference on (pp. 56-58). IEEE.

3	Anil, M. C., & Shirbahadurkar, S. D. Expressive Speech Synthesis using Prosodic Modification for Marathi Language.

4	Roy, A. J., & Student, F. Y. U. Emotional Text to Speech Synthesis in Indian Language.

ABSTRACTING & INDEXING

1	Google Scholar

2	Academic Journals Database

3	Academic Index

4	CiteSeerX

5	refSeek

6	iSEEK

7	Socol@r

8	ResearchGATE

9	Libsearch

10	Bielefeld Academic Search Engine (BASE)

11	Scribd

12	WorldCat

13	SlideShare

14	PDFCAST

15	PdfSR

REFERENCES

Allen, J., M.S. Hunnicutt, and D.H. Klatt, From Text to Speech: the MITalk System, 2007, Cambridge, UK, University Press.

Andrea Esuli and Fabrizio Sebastiani. 2007. PageRanking wordnet synsets: An application to opinion mining.In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 424–431, Prague, Czech Republic, June

B. Pang and L. Lee. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In (ACL-04), pages 271–278, Barcelona, ES. Association for Computational Linguistics

Bender, O., S. Hasan, D. Vilar, R. Zens, and H. Ney. 2005. Comparison of generation strategies for interactive machine translation. In Proceedings of the 10th Annual Conference of the European Association for Machine Translation (EAMT05), pages 33–40, Budapest

Casacuberta, F. and E. Vidal. 2007. Learning finite-state models for machine translation. Machine Learning, 66(1):69–91.

D. Jurafsky and J. H. Martin. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice Hall PTR, Upper Saddle River, NJ, USA, 2000

F. Casacuberta et al. Some approaches to statistical and finite-state speech-to-speech translation. Computer Speech and Language,18:25–47, 2004.

Fangzhong Su and Katja Markert. 2008. From word to sense: a case study of subjectivity recognition. In Proceedings of the 22nd International Conference on Computational Linguistics, Manchester

Hong Yu and Vasileios Hatzivassiloglou. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Conference on Empirical Methods in Natural Language Processing , pages 129–136, Sapporo,Japan.

I. Titov and R. McDonald. 2008. A Joint Model of Text and Aspect Ratings for Sentiment Summarization. ACL-2008

J. Wiebe, and T. Wilson. 2002. Learning to Disambiguate Potentially Subjective Expressions. CoNLL-2002.

J. Yuan, J. Brenier, and D. Jurafsky, “Pitch accent prediction: Effects of genre and speaker,” in Proc. Interspeech 2005, Lisbon, Portugal, 2005

Laxmi-India, Gr.Noiida, March 2010. Development of Expert Search Engine for Web Environment. In International Journal for Computer Science and Security, pages 130-135, Vol 4. Issue 1, CSC Journals, Malaysia.

Tom´as, J. and F. Casacuberta. 2006. Statistical phrase-based models for interactive computer-assisted translation. In Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics and 21th International Conference on Computational Linguistics (COLING/ACL 06), pages 835–841, Sydney.

V. Strom, R. Clark, and S. King, “Expressive prosody for unit-selection speech synthesis,” in Proc. Interspeech, Pittsburgh, 2006.

MANUSCRIPT AUTHORS

Associate Professor MANOJ B. CHANDAK

S.R.K.N.E.C, NAGPUR - India

chandakmb@gmail.com

Dr. R.V.Dharaskar

- India

Dr. V.M.Thakre

- India

CREATE AUTHOR ACCOUNT

LAUNCH YOUR SPECIAL ISSUE

View all special issues >>

PUBLICATION VIDEOS