Development of a Text-Independent Speaker Identification System

Access control to confidential information and facilities is conventionally through the use of passwords, smart cards or keys that can be stolen or forgotten. In this work, a software-based access control system that identifies users by the unique features in their voices, which cannot be easily bre...

Descrición completa

Gardado en:

Detalles Bibliográficos
Autor Principal:	Fisusi, Abimbola Adeola
Formato:	Thesis
Idioma:	inglés
Publicado:	2015
Subjects:	Access control system Access control methods Text-independent speaker identification system Speaker identification system
Acceso en liña:	http://localhost:8080/xmlui/handle/123456789/2154
Tags:	Engadir etiqueta Sen Etiquetas, Sexa o primeiro en etiquetar este rexistro!

_version_	1810764577125695488
author	Fisusi, Abimbola Adeola
author_facet	Fisusi, Abimbola Adeola
author_sort	Fisusi, Abimbola Adeola
collection	DSpace
description	Access control to confidential information and facilities is conventionally through the use of passwords, smart cards or keys that can be stolen or forgotten. In this work, a software-based access control system that identifies users by the unique features in their voices, which cannot be easily breached, was developed with a view to overcome the limitations of conventional access control methods. The implementation of the system involved two phases - the training and testing phases. During the training phase, speech samples were collected from seven male and five female speakers by recording their voices with a microphone connected to a computer system. Speaker's unique features were extracted from the speech samples in form of Mel-Frequency Cepstrum Coefficients (MFCCs) which estimated the unique shape of the spectral envelope of each user. The extracted MFCCs features were used to build speaker models in form of codebooks for the speakers using the Vector Quantization (VQ) approach. The speaker models created were stored in a database. During the testing phase, another set of speech samples were collected from the same set of speakers of the training phase. MFCC features were extracted from each of the testing speech samples and compared with the codebooks created during the training phase. For each testing speech sample, the speaker whose codebook gave the lowest average distortion was identified as the true speaker. Codebooks of different sizes ranging from 16 to 256 were used to perform the identification task. In the same vein, the performance of the system as a text-dependent system was also evaluated using the same words for both training and testing phases. The performance of the text-independent speaker identification system was evaluated by comparing the speakers' testing phase MFCCs with their training phase codebooks. The results showed that the identification rate of the text-independent speaker identification system increased with increase in codebook size. "The identification rate of the system was 57.14% when 16-vector codebooks were used as speaker models. It was 71.43% and 85.71% for 32-vector and 64-vector codebooks respectively. The speaker identification system was able to achieve 100% identification rate at codebook sizes of 128 and 256. The average distortions of speakers from testing speech samples were found to decrease with increase in codebook size. The 128-vector codebooks are preferred over 256-vector codebooks because the time required to perform the identification task is shorter for 128-vector codebooks although both give 100% identification rate. 1n conclusion, the developed text-independent speaker identification system could distinguish between speakers correctly and provide security for confidential information and facilities better than conventional methods if the codebook size used for the identification task is large enough.
format	Thesis
id	oai:ir.oauife.edu.ng:123456789-2154
institution	My University
language	English
publishDate	2015
record_format	dspace
spelling	oai:ir.oauife.edu.ng:123456789-21542023-05-13T11:12:52Z Development of a Text-Independent Speaker Identification System Fisusi, Abimbola Adeola Access control system Access control methods Text-independent speaker identification system Speaker identification system Access control to confidential information and facilities is conventionally through the use of passwords, smart cards or keys that can be stolen or forgotten. In this work, a software-based access control system that identifies users by the unique features in their voices, which cannot be easily breached, was developed with a view to overcome the limitations of conventional access control methods. The implementation of the system involved two phases - the training and testing phases. During the training phase, speech samples were collected from seven male and five female speakers by recording their voices with a microphone connected to a computer system. Speaker's unique features were extracted from the speech samples in form of Mel-Frequency Cepstrum Coefficients (MFCCs) which estimated the unique shape of the spectral envelope of each user. The extracted MFCCs features were used to build speaker models in form of codebooks for the speakers using the Vector Quantization (VQ) approach. The speaker models created were stored in a database. During the testing phase, another set of speech samples were collected from the same set of speakers of the training phase. MFCC features were extracted from each of the testing speech samples and compared with the codebooks created during the training phase. For each testing speech sample, the speaker whose codebook gave the lowest average distortion was identified as the true speaker. Codebooks of different sizes ranging from 16 to 256 were used to perform the identification task. In the same vein, the performance of the system as a text-dependent system was also evaluated using the same words for both training and testing phases. The performance of the text-independent speaker identification system was evaluated by comparing the speakers' testing phase MFCCs with their training phase codebooks. The results showed that the identification rate of the text-independent speaker identification system increased with increase in codebook size. "The identification rate of the system was 57.14% when 16-vector codebooks were used as speaker models. It was 71.43% and 85.71% for 32-vector and 64-vector codebooks respectively. The speaker identification system was able to achieve 100% identification rate at codebook sizes of 128 and 256. The average distortions of speakers from testing speech samples were found to decrease with increase in codebook size. The 128-vector codebooks are preferred over 256-vector codebooks because the time required to perform the identification task is shorter for 128-vector codebooks although both give 100% identification rate. 1n conclusion, the developed text-independent speaker identification system could distinguish between speakers correctly and provide security for confidential information and facilities better than conventional methods if the codebook size used for the identification task is large enough. 2015-04-22T12:12:21Z 2018-10-27T12:26:01Z 2015-04-22T12:12:21Z 2018-10-27T12:26:01Z 2015-04-22 Thesis http://localhost:8080/xmlui/handle/123456789/2154 en PDF application/pdf Nigeria
spellingShingle	Access control system Access control methods Text-independent speaker identification system Speaker identification system Fisusi, Abimbola Adeola Development of a Text-Independent Speaker Identification System
title	Development of a Text-Independent Speaker Identification System
title_full	Development of a Text-Independent Speaker Identification System
title_fullStr	Development of a Text-Independent Speaker Identification System
title_full_unstemmed	Development of a Text-Independent Speaker Identification System
title_short	Development of a Text-Independent Speaker Identification System
title_sort	development of a text independent speaker identification system
topic	Access control system Access control methods Text-independent speaker identification system Speaker identification system
url	http://localhost:8080/xmlui/handle/123456789/2154
work_keys_str_mv	AT fisusiabimbolaadeola developmentofatextindependentspeakeridentificationsystem

Development of a Text-Independent Speaker Identification System

Títulos similares