| Home

Overview


Original Research

ENHANCED BINARY BAT ALGORITHM BASED FEATURE SELECTION MODEL FOR BIG DATA INFORMATION EXTRACTION

MANJUNATHA SWAMY C 1, Dr. S. MEENAKSHI SUNDARAM 2, and Dr. LOKESH M R 3.

Vol 18, No 02 ( 2023 )   |  DOI: 10.17605/OSF.IO/Z3R9N   |   Author Affiliation: Research Scholar, Department of CSE, GSSS Institute of Engineering & Technology for Women. Mysuru, Karnataka, India 1; Professor & Head, Department of CSE, GSSS Institute of Engineering & Technology for Women Mysuru, Karnataka, India 2; Professor Department of CSE, Vivekananda College of Engineering & Technology, Puttur, Karnataka, India 3.   |   Licensing: CC 4.0   |   Pg no: 1512-1524   |   Published on: 27-02-2023

Abstract

Purpose: Information extraction from big data is improved by either reducing the number of features in a data set or selecting features using intelligent data analysis. Generally, big data sets are complex to process using traditional approaches. Feature selection is highly essential in big data information extraction because it chooses the subset of features that influence the final classification. Reducing the number of selected features in the data leads to enhanced accuracy and efficiency of data extraction with other attributes used in the mathematical model. This work aims to improve the performance of the classifier using an enhanced binary bat algorithm-based effective feature selection model. Design: The purpose of this paper is to provide, an effective feature selection model for big data information extraction. An enhanced binary bat algorithm has been proposed to improve attribute selection using local optimization and global optimization methods. Findings: All the experiments were carried out with different datasets on the number of iterations and fitness of the attributes to validate the effective performance of the proposed algorithm. Experimental results and graphs show that the proposed methodology improves the overall performance of optimization. Originality: A feature selection model based on the binary bat algorithm has been the focus of this paper. Feature subset selection and feature ranking are the two important methods used in this approach. Experiments were conducted on datasets to analyze the patterns in the number of iterations and fitness of the attributes over selection. The improvement in feature selection leads to better classification accuracy of the proposed model compared to other nature inspired techniques.


Keywords

Extraction of information, Selection of features, EBBA Algorithm, Bio-inspired concept, Classification, Response time, Precision, and Recall