The Seybold Report

Original Research

DIALECT RECOGNITION SYSTEM FOR BAGRI RAJASTHANI LANGUAGE USING OPTIMIZED FEATURED SWARM CONVOLUTIONAL NEURAL NETWORK (OFSCNN) MODEL

POONAM KUKANA 1, POOJA SHARMA 2, PUNEET SAPRA 3, and NEERU BHARDWAJ 4.

Vol 18, No 10 ( 2023 ) | DOI: 10.5281/zenodo.10005831 | Author Affiliation: Department of Computer Science and Engineering, University School of Engineering & Technology, Rayat-Bahra University, Mohali, Punjab, India 1,2,3; Department of Computer Science and Engineering, Chandigarh University, Mohali, Punjab, India 4. | Licensing: CC 4.0 | Pg no: 299-318 | Published on: 04-10-2023

Abstract

The dialects of a language hold a significant place in speech processing (SP) applications. The objective of dialect identification is to categorize speech sample data into a specific dialect of a speaker's spoken language. A dialect recognition system must effectively distinguish between different dialects of a standard language, as they tend to possess many similarities. The dialect of a language is not a distinct characteristic, as it can be impacted by the utterer's birthplace. Researchers in the domain of automatic speech recognition (ASR) face difficulties in identifying the speech patterns unique to each dialect or language. The proposed work recognizes the dialects of the Bagri Rajasthani language from undefined expressions of speech. Rajasthani Language is one of the eldest and most famous languages in the Indo-Aryan languages. It comprises the different dialects and for recognizing the dialects, it used dissimilar phases of acoustic and spectral characteristics of the speech signal (SS). The spectral and acoustic features of SSs are measured to design the system. As there is no specific speech dataset for Bagri dialects, the database is built, to verify the Bagri dialects of the Rajasthani language. To improve the accuracy rate, and error rate in recognizing the Bagri dialects, the acoustic and spectral characteristics of speech expressions are joined. To verify several Bagri dialects of the Rajasthani language, different simulations for classification and investigation are carried out i.e., OFSCNN model, GA-NN, etc. The outcomes are important and the accuracy of 96.95 % for the OFSCNN model, 80.63 % for GA-NN, and 93.45% for the Multiclass SVM method is an achievement.

Keywords: Speech Dialect Recognition, Dialects, OFSCNN (Optimized Featured Swarm Convolutional Neural Network), MFCC (Mel Frequency Co-efficient Cepstral), PSO (Particle Swarm Optimization), CNN (Convolutional Neural Network.

Keywords

Speech Dialect Recognition, Dialects, OFSCNN (Optimized Featured Swarm Convolutional Neural Network), MFCC (Mel Frequency Co-efficient Cepstral), PSO (Particle Swarm Optimization), CNN (Convolutional Neural Network.

Download PDF →

| Home

Overview

Original Research

POONAM KUKANA 1, POOJA SHARMA 2, PUNEET SAPRA 3, and NEERU BHARDWAJ 4.

Abstract

Keywords