Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence

dc.contributor.authorLau, Chee Yong
dc.date.accessioned2023-08-21T06:33:59Z
dc.date.available2023-08-21T06:33:59Z
dc.date.issued2015
dc.descriptionThesis (PhD. (Biomedical Engineering))
dc.description.abstractSpeech synthesis is important nowadays and could be a great aid in various applications. So it is important to build a simple, reliable, light-weight, ease of use speech synthesizer. However, conventional speech synthesizers require tedious human efforts to prepare high quality recorded database, and the intelligibility of synthetic speech may decrease due to the appearance of polyphone (character with more than 1 pronunciation) because the speech synthesizer may not contain the definition of the polyphones. Moreover, the ready speech synthesizers in market are mostly built in Unit Selection method, which is large in database size and relying on Malay linguist knowledge. In this study, statistical parametric speech synthesis method has been adopted using lab speech and free speech data harvested online. The intelligibility improvement has been achieved using Active Learning and Feedforward Neural Network with Back-Propagation. The amount of training data used remained the same throughout this study. The result was evaluated using perception test. The listening test showed that the intelligibility of synthetic speech has been improved about 20%- 30% using the artificial intelligence technique. Volunteers were invited to take part in Active Learning experiment. The result showed no controversy between the result done by volunteers and the correct answer. In conclusion, a light-weight Malay speech synthesizer has been created without relying on Malay linguist knowledge. Using free source as training data can ease the human effort in preparing training database and using artificial intelligence technique can improve the intelligibility of synthetic speech under the same amount of training data used
dc.description.sponsorshipFaculty of Biosciences and Medical Engineering
dc.identifier.citationNA
dc.identifier.issnNA
dc.identifier.urihttp://openscience.utm.my/handle/123456789/629
dc.language.isoen
dc.publisherUniversiti Teknologi Malaysia
dc.relation.ispartofseriesNA; NA
dc.subjectSpeech synthesis
dc.subjectSpeech processing systems
dc.titleMalay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence
dc.typeThesis
dc.typeDataset
Files
Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
LauCheeYongPFBME2015_A.pdf
Size:
58.09 KB
Format:
Adobe Portable Document Format
Description:
FEEDFORWARD NEURAL NETWORK WITH BACK-PROPAGATION MODULE
Loading...
Thumbnail Image
Name:
LauCheeYongPFBME2015_B.pdf
Size:
88.07 KB
Format:
Adobe Portable Document Format
Description:
ACTIVE LEARNING MODULE
Loading...
Thumbnail Image
Name:
LauCheeYongPFBME2015_C.pdf
Size:
43.85 KB
Format:
Adobe Portable Document Format
Description:
DECISION TREE QUESTIONS
Loading...
Thumbnail Image
Name:
LauCheeYongPFBME2015_D.pdf
Size:
68.52 KB
Format:
Adobe Portable Document Format
Description:
EXAMPLE OF CONTEXT DEPENDENT LABEL
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: