Big data time series forecasting based on nearest neighbours distributed computing with Spark - 2018

Research Area:  Big Data

Abstract:

A new approach for big data forecasting based on the k-weighted nearest neighbours algorithm is introduced in this work. Such an algorithm has been developed for distributed computing under the Apache Spark framework. Every phase of the algorithm is explained in this work, along with how the optimal values of the input parameters required for the algorithm are obtained. In order to test the developed algorithm, a Spanish energy consumption big data time series has been used. The accuracy of the prediction has been assessed showing remarkable results. Additionally, the optimal configuration of a Spark cluster has been discussed. Finally, a scalability analysis of the algorithm has been conducted leading to the conclusion that the proposed algorithm is highly suitable for big data environments.
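The forecasting idea named in the abstract can be illustrated with a minimal single-machine sketch. It is not the authors' Spark implementation: the function name `knn_forecast`, the parameters `w` (window length), `h` (forecast horizon) and `k` (number of neighbours), the Euclidean distance, and the inverse squared distance weighting are all assumptions made for illustration, since the abstract does not specify them. The sketch takes the last `w` values as the query pattern, finds the `k` most similar past windows, and averages the `h` values that followed each of them, giving closer neighbours more weight.

```python
import math


def knn_forecast(series, w, h, k):
    """Forecast the next h values of `series` from its k nearest past windows.

    Hypothetical helper for illustration only; the weighting scheme
    (inverse squared Euclidean distance) is an assumption.
    """
    query = series[-w:]  # most recent pattern of length w
    candidates = []
    # Slide over every past window that is followed by h known values.
    for start in range(len(series) - w - h + 1):
        window = series[start:start + w]
        future = series[start + w:start + w + h]
        candidates.append((math.dist(window, query), future))
    if not candidates:
        raise ValueError("series is too short for the given w and h")

    # Keep the k windows closest to the query pattern.
    neighbours = sorted(candidates, key=lambda c: c[0])[:k]

    # Closer neighbours get larger weights; eps avoids division by zero
    # when a past window matches the query exactly.
    eps = 1e-12
    weights = [1.0 / (dist * dist + eps) for dist, _ in neighbours]
    total = sum(weights)

    # Weighted average of the values that followed each neighbour.
    return [
        sum(wt * fut[j] for wt, (_, fut) in zip(weights, neighbours)) / total
        for j in range(h)
    ]


if __name__ == "__main__":
    # A perfectly periodic toy series: the pattern 1, 2, 3, 4 repeated.
    toy = [1, 2, 3, 4] * 5
    print(knn_forecast(toy, w=4, h=2, k=2))
```

In a distributed setting such as the one the abstract describes, the distance computation over past windows is the natural step to spread across cluster partitions; the loop above performs it sequentially only to keep the example self-contained.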

Keywords:  

Author(s) Name:  R. Talavera-Llames, R. Pérez-Chacón, A. Troncoso and F. Martínez-Álvarez

Journal name:  Knowledge-Based Systems

Conference name:  

Publisher name:  Elsevier

DOI:  10.1016/j.knosys.2018.07.026

Volume Information:  Volume 161, 1 December 2018, Pages 12-25