Presenting Author:

Luke Rasmussen

Principal Investigator:

Leah Welty, Ph.D.

Department:

Preventive Medicine

Keywords:

reproducible research, statistics

Location:

Ryan Family Atrium, Robert H. Lurie Medical Research Center

C117 - Clinical

StatTag: A Tool for Connecting Statistical Output to Microsoft Word

Introduction: Within the research community, there is a lack of broadly accessible tools to integrate document preparation in Microsoft Word with statistical code, results, and data in a way that easily facilitates reproducible research. Manual approaches pose the risk of error when transcribing results into the manuscript. Tools such as kintR use LaTeX (a technical typesetting program), and plain text editors for document preparation. Despite the merits of these programs, Word remains the mainstay, and sometimes singular option, for manuscript preparation in many fields, especially academic medicine. Although statistical packages may generate output compatible with Microsoft Word (e.g., R Markdown, SAS ODS), they are one-directional in editing capabilities. This is not compatible with the practice of reproducible research in the collaborative medicine environment. We present StatTag – a plug-in for Microsoft Word, which addresses these gaps. Methods: StatTag was developed as a plug-in for Microsoft Word 2010 or higher using the C# programming language, and is currently compatible with Windows 7 or higher, and Stata 14, SAS 9.4 and R 3.3. Future versions of StatTag will allow use on computers operating Mac OS 10.11 and above. It is integrated into the Word toolbar, allowing users to associate ("link") code files with a document, annotate ("tag") the portion of the code file that contains relevant output, and insert that output into a Word document. Within the document, a user may double-click on the output, allowing them to see the exact code used to generate that value. It communicates with R, SAS and Stata via local application programming interfaces (APIs). Results: StatTag is a free plug-in (available from stattag.org) for conducting reproducible research and creating dynamic documents using Microsoft Word, and supports the R, SAS and Stata statistical packages. StatTag allows users to embed statistical output (estimates, tables, and figures) within Word and provides an interface to edit statistical code directly from Word. This output can then be individually or collectively updated in one-click with a behind-the-scenes call to R, SAS and/or Stata. As of March 1, 2017, over 290 individuals have registered to download the software, and it has been presented within the Northwestern community as well as externally through technical and broad statistical conferences Discussion: With StatTag, modification of a dataset or analysis no longer entails transcribing or re-copying results in to a manuscript or table. In addition, it allows collaborators to modify manuscript text using Word track changes, and that marked-up document to be used without manual transcription of edits. This software has the potential to change the way statisticians and analysts collaborate on a daily basis.