Published by the Foundation for Open Access Statistics Editors-in-chief: Bettina Grün, Torsten Hothorn, Rebecca Killick, Edzer Pebesma, Achim Zeileis    ISSN 1548-7660; CODEN JSSOBK
Authors: Stephan J Ritter, Nicholas P Jewell, Alan E Hubbard
Title: R Package multiPIM: A Causal Inference Approach to Variable Importance Analysis
Abstract: We describe the R package multiPIM, including statistical background, functionality and user options. The package is for variable importance analysis, and is meant primarily for analyzing data from exploratory epidemiological studies, though it could certainly be applied in other areas as well. The approach taken to variable importance comes from the causal inference field, and is different from approaches taken in other R packages. By default, multiPIM uses a double robust targeted maximum likelihood estimator (TMLE) of a parameter akin to the attributable risk. Several regression methods/machine learning algorithms are available for estimating the nuisance parameters of the models, including super learner, a meta-learner which combines several different algorithms into one. We describe a simulation in which the double robust TMLE is compared to the graphical computation estimator. We also provide example analyses using two data sets which are included with the package.

Page views:: 5058. Submitted: 2012-01-30. Published: 2014-04-22.
Paper: R Package multiPIM: A Causal Inference Approach to Variable Importance Analysis     Download PDF (Downloads: 6129)
multiPIM_1.4-1.tar.gz: R source package Download (Downloads: 420; 81KB)
v57i08.R: R example code from the paper Download (Downloads: 488; 11KB) R data files with results from analyses Download (Downloads: 1469; 8MB)

DOI: 10.18637/jss.v057.i08

This work is licensed under the licenses
Paper: Creative Commons Attribution 3.0 Unported License
Code: GNU General Public License (at least one of version 2 or version 3) or a GPL-compatible license.