Design and implementation of a Luganda speech database for unit selection speech synthesis using phonetic transcription
Abstract
Speech synthesis systems have greatly simplified natural language interaction between humans and computers. A number of such systems exist with several languages for example Indian, English and Arabic, incorporated in them. However, there has been limited implementation in African languages like Luganda. The few that exist produce unnatural and unintelligent speech since they do not take into account the “co-articulation” effects of producing phonemes, phoneme duration or desired pitch variation across the utterance in continuous speech. There is therefore a need to develop a Luganda speech database that uses phonetic transcriptions as a method for unit selection synthesis that can be applied in Text to Speech systems, phonetic analysis, speech analysis and Digital Signal Processing. It reflects on a single speaker speech database with carefully recorded audios under studio conditions consisting of 1578 phonetically balanced Luganda utterances. We predict a high usability for semi-literate and visually impaired Luganda speakers when applied to speech synthesis systems but also for further learning. More improvements can be through focusing on the speaker traits such as gender, age and origin of the voice talent person. The same approach can also be extended to other indigenous African languages other than Luganda to build systems that can help the semi-literate and visually impaired people.
Collections
Related items
Showing items related by title, author, creator and subject.
-
Striking a balance: the free speech versus hate speech debate a comprehensive and comparative analysis of international and domestic law
Mugabi, Samuel (Makerere University, 2022-08)Within contemporary public debate (internationally and in Uganda), there is a difference of opinion between the understanding of the fundamentality, constitutionality and the scope of the right to freedom of speech and ... -
Speech recognition and voice command control system for Luganda language
Muwebwa, Solomon Eltonjim; Olwe, Samuel (2019-06-14)A Speech Recognition and Command Control Interface is software that allows the user control over specified computer functions by voice. Our implementation of this system is unique in that it responds to commands made in ... -
Design and Implementation of a Luganda Text Normalization Module for a Speech Software Program
Kagumire, Sulaiman (2019-06-14)This report investigates the problem of text normalization; specifically, the normalization of non-standard words (NSWs) in Luganda. Non-standard words can be defined as those word tokens which do not have a dictionary ...