Research Area:  Machine Learning
In recent research, deep learning-based methods have achieved state-of-the-art performance in various computer vision tasks. However, these methods are commonly supervised, and require huge amounts of annotated data to train. Acquisition of data demands an additional costly effort, particularly for the tasks where it becomes challenging to obtain large amounts of data considering the time constraints and the requirement of professional human diligence. In this paper, we present a data level synthetic sampling solution to learn from small and imbalanced data sets using Generative Adversarial Networks (GANs). The reason for using GANs are the challenges posed in various fields to manage with the small datasets and fluctuating amounts of samples per class. As a result, we present an approach that can improve learning with respect to data distributions, reducing the partiality introduced by class imbalance and hence shifting the classification decision boundary towards more accurate results. Our novel method is demonstrated on a small dataset of 2789 tomato plant disease images, highly corrupted with class imbalance in 9 disease categories. Moreover, we evaluate our results in terms of different metrics and compare the quality of these results for distinct classes.
Keywords:  
Image-To-Image Translation
Generative Adversarial Networks
Data Augmentation
Plant Disease Datasets
Machine Learning
Deep Learning
Author(s) Name:  Nazki, Haseeb, Yoon, Sook, Park, Dong Sun
Journal name:  Smart Media Journal
Conferrence name:  
Publisher name:  Korea Institute of Science and Technology Information
DOI:  10.30693/SMJ.2019.8.2.46
Volume Information:  Volume 8,Issue 2, Pages.46-57
Paper Link:   https://www.koreascience.or.kr/article/JAKO201918961949570.page