Last Updated on November 23, 2023
21. An Introduction To R: Notes on R: A Programming Environment for Data Analysis and Graphics by William N Venables, David M Smith, and the R Core Team
This tutorial manual provides a comprehensive introduction to R, a software package for statistical computing and graphics.
This 105 introductory text offers a well-rounded primer to the world of R.
R supports a wide range of statistical techniques and is easily extensible via user-defined functions. One of R’s strengths is the ease with which publication-quality plots can be produced in a wide variety of formats.
22. Exploratory Data Analysis with R by Roger D. Peng
Exploratory Data Analysis with R teaches you to use R to effectively visualize and explore complex datasets. Exploratory data analysis is a key part of the data science process because it allows you to sharpen your question and refine your modeling strategies.
The author is a professor of Biostatistics at Johns Hopkins Bloomberg School of Public Health in Baltimore, Maryland.
The book has a suggested price of $15 but can be downloaded for free.
23. Introduction to Empirical Bayes: Examples from Baseball Statistics by David Robinson
Introduction to Empirical Bayes teaches you how to use empirical Bayesian methods for estimating binomial proportions, through a series of intuitive examples drawn from baseball statistics.
You’ll learn both the theory and the practice behind empirical Bayes, including computing credible intervals, performing Bayesian A/B testing, and fitting mixture models. Each example is accompanied with visualizations to demonstrate the mathematical concepts, as well as R code that can be adapted to analyze your own data.
This book is adapted from a series of ten posts on the author’s blog with additional material added.
Like some of the other books in this roundup, the book is available for a “pay-what-you-want”. This includes free.
24. Mastering Software Development in R by Roger D Peng, Sean Kross & Brooke Anderson
Mastering Software Development in R covers R software development for building data science tools.
This book offers a rigorous introduction to R, before moving on to advanced R programming instruction. It then proceeds to teach you how to build R packages that will satisfy the criteria for submission to CRAN. The book finishes with instruction on how to build data visualization tools such as creating data graphics using ggplot2, building simple and dynamic maps, as well as creating gpplot2 themes.
25. The caret Package by Max Kuhn
The caret (Classification And Regression Training) R package contains various functions for training and plotting classification and regression model.
The book looks at visualizations, pre-processing, data splitting, model training and tuning.
The book then moves on to available models, a basic list of model types or relevant characteristics. There’s also chapters on parallel processing, random hyperparameter search, subsampling for class imbalances, using recipes with train, and using your own model in train. The book concludes with chapters on adaptive resampling, variable importance, measuring functions, and more.
Next page: Page 6 – Modern Statistics for Modern Biology and more books
Pages in this article:
Page 1 – R for Data Science and more books
Page 2 – R Graphics Cookbook and more books
Page 3 – Fundamentals of Data Visualization and more books
Page 4 – Data Analysis for the Life Sciences and more books
Page 5 – An Introduction To R and more books
Page 6 – Modern Statistics for Modern Biology and more books
Page 7 – A Little Book of R for Biomedical Statistics and more books
All books in this series:
Free Programming Books | |
---|---|
Ada | ALGOL-like programming language, extended from Pascal and other languages |
Agda | Dependently typed functional language based on intuitionistic Type Theory |
Arduino | Inexpensive, flexible, open source microcontroller platform |
Assembly | As close to writing machine code without writing in pure hexadecimal |
Awk | Versatile language designed for pattern scanning and processing language |
Bash | Shell and command language; popular both as a shell and a scripting language |
BASIC | Beginner’s All-purpose Symbolic Instruction Code |
C | General-purpose, procedural, portable, high-level language |
C++ | General-purpose, portable, free-form, multi-paradigm language |
C# | Combines the power and flexibility of C++ with the simplicity of Visual Basic |
Clojure | Dialect of the Lisp programming language |
ClojureScript | Compiler for Clojure that targets JavaScript |
COBOL | Common Business-Oriented Language |
CoffeeScript | Transcompiles into JavaScript inspired by Ruby, Python and Haskell |
Coq | Dependently typed language similar to Agda, Idris, F* and others |
Crystal | General-purpose, concurrent, multi-paradigm, object-oriented language |
CSS | CSS (Cascading Style Sheets) specifies a web page’s appearance |
D | General-purpose systems programming language with a C-like syntax |
Dart | Client-optimized language for fast apps on multiple platforms |
Dylan | Multi-paradigm language supporting functional and object-oriented coding |
ECMAScript | Best known as the language embedded in web browsers |
Eiffel | Object-oriented language designed by Bertrand Meyer |
Elixir | Relatively new functional language running on the Erlang virtual machine |
Erlang | General-purpose, concurrent, declarative, functional language |
F# | Uses functional, imperative, and object-oriented programming methods |
Factor | Dynamic stack-based programming language |
Forth | Imperative stack-based programming language |
Fortran | The first high-level language, using the first compiler |
Go | Compiled, statically typed programming language |
Groovy | Powerful, optionally typed and dynamic language |
Haskell | Standardized, general-purpose, polymorphically, statically typed language |
HTML | HyperText Markup Language |
Icon | Wide variety of features for processing and presenting symbolic data |
J | Array programming language based primarily on APL |
Java | General-purpose, concurrent, class-based, object-oriented, high-level language |
JavaScript | Interpreted, prototype-based, scripting language |
Julia | High-level, high-performance language for technical computing |
Kotlin | More modern version of Java |
LabVIEW | Designed to enable domain experts to build power systems quickly |
LaTeX | Professional document preparation system and document markup language |
Lisp | Unique features - excellent to study programming constructs |
Logo | Dialect of Lisp that features interactivity, modularity, extensibility |
Lua | Designed as an embeddable scripting language |
Markdown | Plain text formatting syntax designed to be easy-to-read and easy-to-write |
Objective-C | Object-oriented language that adds Smalltalk-style messaging to C |
OCaml | The main implementation of the Caml language |
Pascal | Imperative and procedural language designed in the late 1960s |
Perl | High-level, general-purpose, interpreted, scripting, dynamic language |
PHP | PHP has been at the helm of the web for many years |
PostScript | Interpreted, stack-based and Turing complete language |
Prolog | A general purpose, declarative, logic programming language |
PureScript | Small strongly, statically typed language compiling to JavaScript |
Python | General-purpose, structured, powerful language |
QML | Hierarchical declarative language for user interface layout - JSON-like syntax |
R | De facto standard among statisticians and data analysts |
Racket | General-purpose, object-oriented, multi-paradigm, functional language |
Raku | Member of the Perl family of programming languages |
Ruby | General purpose, scripting, structured, flexible, fully object-oriented language |
Rust | Ideal for systems, embedded, and other performance critical code |
Scala | Modern, object-functional, multi-paradigm, Java-based language |
Scheme | A general-purpose, functional language descended from Lisp and Algol |
Scratch | Visual programming language designed for 8-16 year-old children |
SQL | Access and manipulate data held in a relational database management system |
Standard ML | General-purpose functional language characterized as "Lisp with types" |
Swift | Powerful and intuitive general-purpose programming language |
Tcl | Dynamic language based on concepts of Lisp, C, and Unix shells |
TeX | Markup and programming language - create professional quality typeset text |
TypeScript | Strict syntactical superset of JavaScript adding optional static typing |
Vala | Object-oriented language, syntactically similar to C# |
VHDL | Hardware description language used in electronic design automation |
VimL | Powerful scripting language of the Vim editor |
XML | Rules for defining semantic tags describing structure ad meaning |