Development of a Text-Independent Speaker Identification System

Access control to confidential information and facilities is conventionally through the use of passwords, smart cards or keys that can be stolen or forgotten. In this work, a software-based access control system that identifies users by the unique features in their voices, which cannot be easily bre...

Descrición completa

Gardado en:
Detalles Bibliográficos
Autor Principal: Fisusi, Abimbola Adeola
Formato: Thesis
Idioma:inglés
Publicado: 2015
Subjects:
Acceso en liña:http://localhost:8080/xmlui/handle/123456789/2154
Tags: Engadir etiqueta
Sen Etiquetas, Sexa o primeiro en etiquetar este rexistro!
_version_ 1810764577125695488
author Fisusi, Abimbola Adeola
author_facet Fisusi, Abimbola Adeola
author_sort Fisusi, Abimbola Adeola
collection DSpace
description Access control to confidential information and facilities is conventionally through the use of passwords, smart cards or keys that can be stolen or forgotten. In this work, a software-based access control system that identifies users by the unique features in their voices, which cannot be easily breached, was developed with a view to overcome the limitations of conventional access control methods. The implementation of the system involved two phases - the training and testing phases. During the training phase, speech samples were collected from seven male and five female speakers by recording their voices with a microphone connected to a computer system. Speaker's unique features were extracted from the speech samples in form of Mel-Frequency Cepstrum Coefficients (MFCCs) which estimated the unique shape of the spectral envelope of each user. The extracted MFCCs features were used to build speaker models in form of codebooks for the speakers using the Vector Quantization (VQ) approach. The speaker models created were stored in a database. During the testing phase, another set of speech samples were collected from the same set of speakers of the training phase. MFCC features were extracted from each of the testing speech samples and compared with the codebooks created during the training phase. For each testing speech sample, the speaker whose codebook gave the lowest average distortion was identified as the true speaker. Codebooks of different sizes ranging from 16 to 256 were used to perform the identification task. In the same vein, the performance of the system as a text-dependent system was also evaluated using the same words for both training and testing phases. The performance of the text-independent speaker identification system was evaluated by comparing the speakers' testing phase MFCCs with their training phase codebooks. The results showed that the identification rate of the text-independent speaker identification system increased with increase in codebook size. "The identification rate of the system was 57.14% when 16-vector codebooks were used as speaker models. It was 71.43% and 85.71% for 32-vector and 64-vector codebooks respectively. The speaker identification system was able to achieve 100% identification rate at codebook sizes of 128 and 256. The average distortions of speakers from testing speech samples were found to decrease with increase in codebook size. The 128-vector codebooks are preferred over 256-vector codebooks because the time required to perform the identification task is shorter for 128-vector codebooks although both give 100% identification rate. 1n conclusion, the developed text-independent speaker identification system could distinguish between speakers correctly and provide security for confidential information and facilities better than conventional methods if the codebook size used for the identification task is large enough.
format Thesis
id oai:ir.oauife.edu.ng:123456789-2154
institution My University
language English
publishDate 2015
record_format dspace
spelling oai:ir.oauife.edu.ng:123456789-21542023-05-13T11:12:52Z Development of a Text-Independent Speaker Identification System Fisusi, Abimbola Adeola Access control system Access control methods Text-independent speaker identification system Speaker identification system Access control to confidential information and facilities is conventionally through the use of passwords, smart cards or keys that can be stolen or forgotten. In this work, a software-based access control system that identifies users by the unique features in their voices, which cannot be easily breached, was developed with a view to overcome the limitations of conventional access control methods. The implementation of the system involved two phases - the training and testing phases. During the training phase, speech samples were collected from seven male and five female speakers by recording their voices with a microphone connected to a computer system. Speaker's unique features were extracted from the speech samples in form of Mel-Frequency Cepstrum Coefficients (MFCCs) which estimated the unique shape of the spectral envelope of each user. The extracted MFCCs features were used to build speaker models in form of codebooks for the speakers using the Vector Quantization (VQ) approach. The speaker models created were stored in a database. During the testing phase, another set of speech samples were collected from the same set of speakers of the training phase. MFCC features were extracted from each of the testing speech samples and compared with the codebooks created during the training phase. For each testing speech sample, the speaker whose codebook gave the lowest average distortion was identified as the true speaker. Codebooks of different sizes ranging from 16 to 256 were used to perform the identification task. In the same vein, the performance of the system as a text-dependent system was also evaluated using the same words for both training and testing phases. The performance of the text-independent speaker identification system was evaluated by comparing the speakers' testing phase MFCCs with their training phase codebooks. The results showed that the identification rate of the text-independent speaker identification system increased with increase in codebook size. "The identification rate of the system was 57.14% when 16-vector codebooks were used as speaker models. It was 71.43% and 85.71% for 32-vector and 64-vector codebooks respectively. The speaker identification system was able to achieve 100% identification rate at codebook sizes of 128 and 256. The average distortions of speakers from testing speech samples were found to decrease with increase in codebook size. The 128-vector codebooks are preferred over 256-vector codebooks because the time required to perform the identification task is shorter for 128-vector codebooks although both give 100% identification rate. 1n conclusion, the developed text-independent speaker identification system could distinguish between speakers correctly and provide security for confidential information and facilities better than conventional methods if the codebook size used for the identification task is large enough. 2015-04-22T12:12:21Z 2018-10-27T12:26:01Z 2015-04-22T12:12:21Z 2018-10-27T12:26:01Z 2015-04-22 Thesis http://localhost:8080/xmlui/handle/123456789/2154 en PDF application/pdf Nigeria
spellingShingle Access control system
Access control methods
Text-independent speaker identification system
Speaker identification system
Fisusi, Abimbola Adeola
Development of a Text-Independent Speaker Identification System
title Development of a Text-Independent Speaker Identification System
title_full Development of a Text-Independent Speaker Identification System
title_fullStr Development of a Text-Independent Speaker Identification System
title_full_unstemmed Development of a Text-Independent Speaker Identification System
title_short Development of a Text-Independent Speaker Identification System
title_sort development of a text independent speaker identification system
topic Access control system
Access control methods
Text-independent speaker identification system
Speaker identification system
url http://localhost:8080/xmlui/handle/123456789/2154
work_keys_str_mv AT fisusiabimbolaadeola developmentofatextindependentspeakeridentificationsystem