We found a match
Your institution may have rights to this item. Sign in to continue.
- Title
A TEDE Algorithm Studies the Effect of Dataset Grouping on Supervised Learning Accuracy.
- Authors
Wang, Xufei; Wang, Penghui; Song, Jeongyoung; Hao, Taotao; Duan, Xinlu
- Abstract
Datasets are the basis for research on deep learning methods in computer vision. The impact of the percentage of training sets in a dataset on the performance of neural network models needs to be further explored. In this paper, a twice equal difference enumeration (TEDE) algorithm is proposed to investigate the effect of different training set percentages in the dataset on the performance of the network model, and the optimal training set percentage is determined. By selecting the Pascal VOC dataset and dividing it into six different datasets from largest to smallest, and then dividing each dataset into the datasets to be analyzed according to five different training set percentages, the YOLOv5 convolutional neural network is used to train and test the 30 datasets to determine the optimal neural network model corresponding to the training set percentages. Finally, tests were conducted using the Udacity Self-Driving dataset with a self-made Tire Tread Defects (TTD) dataset. The results show that the network model performance is superior when the training set accounts for between 85% and 90% of the overall dataset. The results of dataset partitioning obtained by the TEDE algorithm can provide a reference for deep learning research.
- Subjects
CONVOLUTIONAL neural networks; ARTIFICIAL neural networks; COMPUTER vision; ALGORITHMS; SUPERVISED learning; DEEP learning
- Publication
Electronics (2079-9292), 2023, Vol 12, Issue 11, p2546
- ISSN
2079-9292
- Publication type
Article
- DOI
10.3390/electronics12112546