A Comparative Study on Data Collection Methods: Investigating Optimal Datasets for Data Mining Analysis

Jatnika, Hendra and Waluyo, Ari and Aziz, Abdul (2024) A Comparative Study on Data Collection Methods: Investigating Optimal Datasets for Data Mining Analysis. Journal of Applied Data Sciences, 5 (1). pp. 16-23. ISSN 27236471

Full text not available from this repository. (Request a copy)

Abstract

This study is dedicated to evaluating the efficiency of diverse data collection methods in obtaining optimal data for computational data mining. The investigation meticulously compares the questionnaire and web mining methodologies within the framework of SVM and NBC algorithms to discern the flexibility inherent in each data type. The outcomes of this comprehensive analysis demonstrate that questionnaires showcase remarkable flexibility, exhibiting accuracy rates surpassing 80% in both algorithms, along with AUC values exceeding 0.9 when contrasted with data acquired through web mining techniques. These results underscore the paramount importance of the dataset collection method in the realm of computational data mining. The study contributes compelling evidence that advocates for the superiority of the questionnaire data collection method over web mining in the specific context of computational data mining. The questionnaire method not only outperforms in terms of flexibility but also achieves high accuracy, making it a more reliable choice for acquiring data in this domain. Beyond its practical implications, the research highlights a critical aspect of methodology in data collection by emphasizing the necessity of exploring and assessing methods that may have been overlooked in previous research endeavors. This underscores the continuous evolution of research methodologies and the need for ongoing exploration to enhance the robustness and effectiveness of data collection in computational data mining studies.

Item Type: Article
Additional Information: This research assesses the efficacy of two distinct data collection approaches, specifically employing questionnaires and web mining in the domain of data mining computing. Preceding the model evaluation, the data utilized in this investigation undergoes a preprocessing phase. Subsequent to subjecting the model to NBC and SVM algorithms, the assessment and validation outcomes demonstrate that the questionnaire-based data collection method exhibits notable advantages, showcasing high flexibility and superior accuracy in comparison to the web mining data collection technique. Moreover, the AUC value derived from the questionnaire method is notably high, registering at 0.9, signifying its considerable flexibility. These findings substantiate the superiority of the questionnaire data collection method, particularly within the realm of data mining computing.
Uncontrolled Keywords: Comparative Modeling, Support Vector Machine, Naive Bayes Classifier
Subjects: Bidang Keilmuan > Algoritma
Bidang Keilmuan > Data Mining
Bidang Keilmuan > Data Science
Bidang Keilmuan > Deep learning
Jurnal
Bidang Keilmuan > Komputer dan Telekomunikasi
Bidang Keilmuan > Teknik Informatika
Depositing User: Yudha Formanto
Date Deposited: 17 Nov 2025 06:56
Last Modified: 17 Nov 2025 06:56
URI: https://repository.itpln.ac.id/id/eprint/3916

Actions (login required)

View Item
View Item