Development of language identification system using MFCC and vector quantization
This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and...
| Main Authors: | , , |
|---|---|
| Format: | Proceeding Paper |
| Language: | English |
| Published: |
2017
|
| Subjects: | |
| Online Access: | http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf |
| _version_ | 1848785427500105728 |
|---|---|
| author | Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira |
| author_facet | Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira |
| author_sort | Gunawan, Teddy Surya |
| building | IIUM Repository |
| collection | Online Access |
| description | This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate. |
| first_indexed | 2025-11-14T16:52:58Z |
| format | Proceeding Paper |
| id | iium-60070 |
| institution | International Islamic University Malaysia |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-14T16:52:58Z |
| publishDate | 2017 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | iium-600702017-12-14T06:43:49Z http://irep.iium.edu.my/60070/ Development of language identification system using MFCC and vector quantization Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira TK Electrical engineering. Electronics Nuclear engineering This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate. 2017 Proceeding Paper NonPeerReviewed application/pdf en http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf Gunawan, Teddy Surya and Husain, Rashida and Kartiwi, Mira (2017) Development of language identification system using MFCC and vector quantization. In: 4th IEEE International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA) 2017, 28th-30th November 2017, Putrajaya. (Unpublished) http://icsima.ieeemy-ims.org/17/ |
| spellingShingle | TK Electrical engineering. Electronics Nuclear engineering Gunawan, Teddy Surya Husain, Rashida Kartiwi, Mira Development of language identification system using MFCC and vector quantization |
| title | Development of language identification system using MFCC and vector quantization |
| title_full | Development of language identification system using MFCC and vector quantization |
| title_fullStr | Development of language identification system using MFCC and vector quantization |
| title_full_unstemmed | Development of language identification system using MFCC and vector quantization |
| title_short | Development of language identification system using MFCC and vector quantization |
| title_sort | development of language identification system using mfcc and vector quantization |
| topic | TK Electrical engineering. Electronics Nuclear engineering |
| url | http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/ http://irep.iium.edu.my/60070/13/60070-Development%20of%20Language%20Identification.pdf |