Practical Data Science for Stats: A PeerJ Collection
Nick Horton, Amherst College
Jenny Bryan of the University of British Columbia and RStudio and Hadley Wickham of RStudio co-edited the recently published collection of papers, Practical Data Science for Stats, which are available from PeerJ.
These preprints focus on the practical side of data science workflows and statistical analysis, particularly the many aspects of day-to-day analytical work that are almost absent from the conventional statistics literature and curriculum. And yet these activities account for a considerable share of the time and effort of data analysts and applied statisticians.
The goal of the collection is to increase the visibility and adoption of modern data analytical workflows and facilitate the transfer of tools and frameworks between industry and academia, between software engineering and statistics/computer science, and across different domains. While these preprints have not been reviewed by PeerJ, they have been reviewed for content by the editors listed above and peers. Versions of these articles are also under review for a special issue of The American Statistician.
A sampling of the papers in the collection include the following:
- “Data Organization in Spreadsheets” by Karl W. Broman and Kara H. Woo
- “Forecasting at Scale” by Sean J. Taylor and Benjamin Letham
- “The Democratization of Data Science Education” by Sean Kross, Roger D. Peng, Brian S. Caffo, Ira Gooding, and Jeffrey T. Leek
The full list of papers is given below:
September 27, 2017 preprint
Forecasting at scale
884 downloads 3,046 views
Sean J Taylor, Benjamin Letham
https://doi.org/10.7287/peerj.preprints.3190v2
September 1, 2017 preprint
How to share data for collaboration
612 downloads 3,364 views
Shannon E Ellis, Jeffrey T Leek
https://doi.org/10.7287/peerj.preprints.3139v5
August 31, 2017 preprint
Opinionated analysis development
769 downloads 4,168 views
Hilary Parker
https://doi.org/10.7287/peerj.preprints.3210v1
August 30, 2017 preprint
Wrangling categorical data in R
428 downloads 1,784 views
Amelia McNamara, Nicholas J Horton
https://doi.org/10.7287/peerj.preprints.3163v2
August 30, 2017 preprint
Lessons from between the white lines for isolated data scientists
280 downloads 1,286 views
Benjamin S Baumer
https://doi.org/10.7287/peerj.preprints.3160v2
August 29, 2017 preprint
Teaching stats for data science
443 downloads 1,923 views
Daniel T Kaplan
https://doi.org/10.7287/peerj.preprints.3205v1
August 29, 2017 preprint
Documenting and evaluating Data Science contributions in academic promotion in Departments of Statistics and Biostatistics
118 downloads 742 views
Lance A Waller
https://doi.org/10.7287/peerj.preprints.3204v1
August 28, 2017 preprint
Modeling offensive player movement in professional basketball
538 downloads 717 views
Steven Wu, Luke Bornn
https://doi.org/10.7287/peerj.preprints.3201v1
August 28, 2017 preprint
Excuse me, do you have a moment to talk about version control?
921 downloads 4,280 views
Jennifer Bryan
https://doi.org/10.7287/peerj.preprints.3159v2
August 27, 2017 preprint
The democratization of data science education
880 downloads 2,591 views
Sean Kross, Roger D Peng, Brian S Caffo, Ira Gooding, Jeffrey T Leek
https://doi.org/10.7287/peerj.preprints.3195v1
August 26, 2017 preprint
Packaging data analytical work reproducibly using R (and friends)
390 downloads 1,832 views
Ben Marwick, Carl Boettiger, Lincoln Mullen
https://doi.org/10.7287/peerj.preprints.3192v1
August 25, 2017 preprint
Extending R with C++: A Brief Introduction to Rcpp
325 downloads 1,332 views
Dirk Eddelbuettel, James Joseph Balamuta
https://doi.org/10.7287/peerj.preprints.3188v1
August 24, 2017 preprint
How R helps Airbnb make the most of its data
253 downloads 1,034 views
Ricardo Bion, Robert Chang, Jason Goodman
https://doi.org/10.7287/peerj.preprints.3182v1
August 24, 2017 preprint
Data organization in spreadsheets
707 downloads 3,437 views
Karl W Broman, Kara H. Woo
https://doi.org/10.7287/peerj.preprints.3183v1
August 24, 2017 preprint
Infrastructure and tools for teaching computing throughout the statistical curriculum
152 downloads 630 views
Mine Cetinkaya-Rundel, Colin W Rundel
https://doi.org/10.7287/peerj.preprints.3181v1
August 23, 2017 preprint
Declutter your R workflow with tidy tools
789 downloads 3,027 views
Zev Ross, Hadley Wickham, David Robinson
https://doi.org/10.7287/peerj.preprints.3180v1
[…] Practical Data Science – Collection of Papers […]
Welcome!
Amstat News is the monthly membership magazine of the American Statistical Association, bringing you news and notices of the ASA, its chapters, its sections, and its members. Other departments in the magazine include announcements and news of upcoming meetings, continuing education courses, and statistics awards.
ASA HOME
Departments
Archives
ADVERTISERS
PROFESSIONAL OPPORTUNITIES
FDA
US Census Bureau
Software
STATA
QUOTABLE
“ My ASA friendships and partnerships are some of my most treasured, especially because the ASA has enabled me to work across many institutional boundaries and
with colleagues from many types of organizations.”
— Mark Daniel Ward
Editorial Staff
Managing Editor
Megan Murphy
Graphic Designers / Production Coordinators
Olivia Brown
Meg Ruyle
Communications Strategist
Val Nirala
Advertising Manager
Christina Bonner
Contributing Staff Members
Kim Gilliam