Data Sets and Software Libraries Used for Deep Learning
Özkan İnik1*, Erkan Ülker2
1Gaziosmanpaşa University, Tokat, Turkey
2Selçuk University, Konya, Turkey
Presented at the International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT2017), Tokat, Turkey, Dec 02, 2017
SETSCI Conference Proceedings, 2017, 1, Page(s): 72-77
Published Date: 08 December 2017
The purpose of this study is to examine the software libraries and data sets used for Deep Learning architectures. Deep Learning brings a different perspective to the field of artificial intelligence, and in recent years it has come to be used across a remarkably wide range of fields. In computer vision, Deep Learning models process high-resolution photos directly. Unlike traditional machine learning methods, they require no pre-processing phase, such as cropping or feature extraction, to identify objects in a photo. Similarly, while older networks could distinguish only two types of objects (or, in some cases, the absence or presence of a single object), modern networks can describe many different categories of objects. There are two main reasons why Deep Learning has emerged, especially in recent years. The first is the large amount of training data available today. The second is the availability of hardware capable of processing this data. In this context, many software libraries have been developed and many data sets have been created, each according to its purpose. A total of 10 data sets and 6 software libraries were examined in this study. The data sets are, respectively, the MNIST data set, CIFAR-10 data set, CIFAR-100 data set, STL-10 data set, Street View House Numbers (SVHN) data set, Large Scale Visual Recognition Challenge (LSVRC) data set, Caltech 101 data set, Caltech 256 data set, Labeled Faces in the Wild data set and Pascal VOC data set. The number of images, the number of classes and other properties of each data set are explained in detail. The software libraries are, respectively, Theano, Caffe, Torch, TensorFlow, Keras and MatConvNet. The advantages and disadvantages of these software libraries are explained, and their supported platforms and performance values are given in detail. The necessary information on Deep Learning libraries and data sets is presented especially for researchers who want to work in the field of Deep Learning.
Keywords - Caffe, Classification, CIFAR10, CNN, Deep Learning, Keras, MNIST, Pascal VOC, TensorFlow, Theano, Torch
