Published by the Foundation for Open Access Statistics Editors-in-chief: Bettina Grün, Torsten Hothorn, Rebecca Killick, Edzer Pebesma, Achim Zeileis    ISSN 1548-7660; CODEN JSSOBK
Authors: Dirk Eddelbuettel, Murray Stokely, Jeroen Ooms
Title: RProtoBuf: Efficient Cross-Language Data Serialization in R
Abstract: Modern data collection and analysis pipelines often involve a sophisticated mix of applications written in general purpose and specialized programming languages. Many formats commonly used to import and export data between different programs or systems, such as CSV or JSON, are verbose, inefficient, not type-safe, or tied to a specific programming language. Protocol Buffers are a popular method of serializing structured data between applications - while remaining independent of programming languages or operating systems. They offer a unique combination of features, performance, and maturity that seems particularly well suited for data-driven applications and numerical computing. The RProtoBuf package provides a complete interface to Protocol Buffers from the R environment for statistical computing. This paper outlines the general class of data serialization requirements for statistical computing, describes the implementation of the RProtoBuf package, and illustrates its use with example applications in large-scale data collection pipelines and web services.

Page views:: 5126. Submitted: 2014-02-05. Published: 2016-07-11.
Paper: RProtoBuf: Efficient Cross-Language Data Serialization in R     Download PDF (Downloads: 2069)
RProtoBuf_0.4.4.tar.gz: R source package Download (Downloads: 142; 1MB) Replication materials Download (Downloads: 116; 4KB)

DOI: 10.18637/jss.v071.i02

This work is licensed under the licenses
Paper: Creative Commons Attribution 3.0 Unported License
Code: GNU General Public License (at least one of version 2 or version 3) or a GPL-compatible license.