Published by the Foundation for Open Access Statistics
Editors-in-chief: Bettina Grün, Torsten Hothorn, Rebecca Killick, Edzer Pebesma, Achim Zeileis
ISSN 1548-7660; CODEN JSSOBK
Authors: Simone Villa, Marco Rossetti
Title: Learning Continuous Time Bayesian Network Classifiers Using MapReduce
Abstract: Parameter and structural learning of continuous time Bayesian network classifiers are challenging tasks when dealing with big data. This paper describes an efficient, scalable parallel algorithm for parameter and structural learning in the case of complete data using the MapReduce framework. Two popular instances of classifiers are analyzed, namely the continuous time naive Bayes and the continuous time tree augmented naive Bayes. Details of the proposed algorithm are presented using Hadoop, an open-source implementation of a distributed file system and the MapReduce framework for distributed data processing. Performance evaluation of the designed algorithm shows robust parallel scaling.

Page views: 2932. Submitted: 2013-03-01. Published: 2014-12-25.
Paper: Learning Continuous Time Bayesian Network Classifiers Using MapReduce (PDF; downloads: 3253)
Supplements: Java source code, binary, and replication materials (24 MB; downloads: 280)

DOI: 10.18637/jss.v062.i03

This work is licensed under the following licenses:
Paper: Creative Commons Attribution 3.0 Unported License
Code: GNU General Public License (at least one of version 2 or version 3) or a GPL-compatible license.