Tools
Software Tools
This course will use R statistical analysis software. We will use several specialized Bayesian analysis packages written for R
Menu
Installing Tools
Installing
- Download R version 4.2.1 from https://cran.rstudio.com/
- Windows:
- Download and install the “base distribution” of R 4.0.3 .
- MacOS:
- Download and install R version 4.0.3
- Linux:
- You should be able to install R from your Linux distribution’s package manager:
sudo apt-get install r-base r-base-dev
for Debian or Ubuntusudo yum install R
orsudo dnf install git
for Fedora, Red Hat, and related distributions.- If you are using Ubuntu, there are advanced instructions for configuring the package management system to use the very latest versions of R and its associated packages at https://cran.rstudio.com/bin/linux/ubuntu/fullREADME.html . This is optional and the default version of R should be fine for this class, as long as it is version 4.2 or higher.
- You should be able to install R from your Linux distribution’s package manager:
- Windows:
- If you have a version of R older than 4.2, you should update it to 4.2 or higher.
Installing
RStudio is a free editing and software development environment that makes it much easier to work with R. I strongly recommend that you install RStudio for this course.
-
Go to the download page for the free desktop edition of RStudio at https://www.rstudio.com/products/rstudio/download/#download and download the installer for your operating system. Windows, MacOS, and the Debian, Ubuntu, Fedora, RedHat, and openSUSE editions of Linux are all supported.
There are other versions of RStudio (an expensive professional edition and a server edition). You want the free desktop edition. As of August 2022, the latest version is RStudio Desktop version 2022.07.1+554.
-
Run the installer.
-
After the installer finishes running, run RStudio .
-
When RStudio starts up, the lower left part of the screen should have a window that displays the R version, saying something like this:
R version 4.2.1 (2022-06-23) -- "Funny-Looking Kid" Copyright (C) 2022 The R Foundation for Statistical Computing Platform: x86_64-w64-mingw32/x64 (64-bit) R is free software and comes with ABSOLUTELY NO WARRANTY. You are welcome to redistribute it under certain conditions. Type 'license()' or 'licence()' for distribution details. R is a collaborative project with many contributors. Type 'contributors()' for more information and 'citation()' on how to cite R or R packages in publications. Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R.
The details will be different depending on your operating system, but if you see something like this, RStudio correctly found R on your computer.
-
Configuring RStudio for additional optional software. There are a few other pieces of software that you may want to use this semester, but these are not required.
If have installed the Git revision control system or the LaTeX typesetting software, you should configure RStudio to use them:
-
If will use Git for software revision control with R, you should open the “Tools” menu, and click on the “Global Options” choice.
- Go to the “Git/SVN” tab and click “enable version control interface for RStudio projects”. If RStudio can find the git program on your computer, it will appear in the “git executable” field. If RStudio can’t find it, you can help it by browsing to the git program.
-
If you have installed LaTeX on your computer, click on the SWeave tab, and select “knitr” for weaving
.Rnw
files, and choosepdfLaTeX
for typesetting LaTeX files into PDF.
-
R Packages:
After you have R and RStudio installed, you will need to install a number of software packges that extend R for Bayesian data analysis.
-
RStan
This is the trickiest package to install. I recommend that you follow the detailed installation instructions on the RStan web site. There are instructions for Mac, Windows, and Linux.
Before you can install RStan, you will need to install a C++ compiler and related tools before you can use RStan. (Don’t worry, you won’t need to do any programming in C++. Stan writes the C++ code for you, based on your description of the statistical problem.) Follow the instructions at the rstan web page for Mac or Windows or Linux .
If you’re having trouble installing the C++ toolchain on MacOS, try the instructions on installing the R Compiler Tools for RCpp from the coatless professor
After you’ve done this, you can go to the main RStan installation instructions and continue from there.
If you run into difficulty, let me know and I will help you.
-
cmdstanr
This is another package you will need to install to run therethinking
package.The first step is to start a fresh R session or in RStudio open the “Session” menu and choose “Restart R”. Then type this in the console window:
install.packages("cmdstanr", repos = c("https://mc-stan.org/r-packages/", getOption("repos")))
Next, type this:
library(cmdstanr) check_cmdstan_toolchain(fix = TRUE, quiet = TRUE)
If this works without error, you should finally type this:
install_cmdstan(cores = 2)
If you are having difficulty installing
cmdstanr
, check the documentation at the <code>cmdstanr</code> web site . -
Other R packages
This will be a lot easier. Open RStudio, go to the “Console” window in RStudio, and type
install.packages(c("devtools", "pacman")) library(pacman) for (p in c("tidyverse", "ape", "bayesplot", "brms", "broom", "coda", "dagitty", "flextable", "ggdag", "ggformula", "ggmcmc", "ggrepel", "ggthemes", "knitr", "loo", "mvtnorm", "patchwork", "posterior", "psych", "rcartocolor", "Rcpp", "rmarkdown", "rprojroot", "sf", "shiny", "shinystan", "statebins", "tidybayes", "viridis", "viridisLite", "wesanderson")) { p_install(p, character.only = TRUE, force = FALSE) } remotes::install_github("rmcelreath/rethinking") remotes::install_github("mjskay/tidybayes.rethinking")
Now you should be good to go.
Update: installing cmdstanr
and additional tools
If you have already installed R and RStudio and rstan
, following the
instructions in the previous section at the beginning of the semester,
you may still need to install some additional tools. I’ve added this section
to guide you through the additional tools.
Just go to the previous section where it says “cmdstanr” and follow the instructions there and at “Other R Packages”.
If you haven’t installed
Optional Tools:
Installing the tinytex package
It is optional to install the tinytex
package.
You will be able to do all the work for the labs without it, but if you do
install it, it will give you the option to produce nicely formatted PDF
output from your RMarkdown files (for lab reports, presentations, etc.).
The R tinytex
package installs a sophisticated typesetting system
called LaTeX
on your computer. RMarkdown uses this system to
generate PDF output.
The tinytex
package needs to download a lot of files from the internet,
and it can take 10 minutes or more to do so, even on a fast connection.
So it’s a good idea to wait until you can let your computer run for a while,
and until you’re connected to a good fast internet connection, preferably
one that doesn’t charge you for data.
To install tinytex
, go to the RStudio
console, and
you type the following:
install.packages('tinytex')
tinytex::install_tinytex()
If you want to uninstall tinytex
later, you can just type this
command at the RStudio
console:
tinytex::uninstall_tinytex()`
Installing
Git is software for managing revisions as you develop software. You will not need it for this course, but it may be useful if you plan to develop complicated scripts for data analysis in the future.
If you have a Mac or Linux you may already have git installed. Test it by
opening up a terminal window and typing which git
. If you get a response
like /usr/bin/git
then it’s installed. If there is no response, then you
need to install git.
- Windows:
- Download and install git from https://git-scm.com
- Choose the default options for the installer.
- Optionally, you might want to also install Tortoise Git, which integrates git into the Windows explorer, so you can execute git commands from the context menu when you right-click on files or directories in the explorer. You can download Tortoise Git from https://tortoisegit.org/
- Download and install git from https://git-scm.com
- MacOS:
- If git is not already installed on your computer, you can download and install it from https://git-scm.com
- Linux:
- If git is not already installed, you can install it from your
distribution’s package manager:
sudo apt-get install git
for Debian or Ubuntusudo yum install git
orsudo dnf install git
for Fedora, Red Hat, and related distributions.
- If git is not already installed, you can install it from your
distribution’s package manager:
Introducing Youself to
Whatever operating system you’re using, after you install git you will need to introduce yourself to git (you only need to do this once). It is important for git to knows your name and email address so it can keep track of who is editing files when you are working collaboratively and so it gives you credit for the files you have authored and edited.
-
Open a terminal prompt:
- On Windows, open a “git bash” window (git will give you the option to do this when it finishes installing) or you can do so from the Windows Start menu, under “Git”.
- On MacOS or Linux, open a regular terminal window (on MacOS, you can use Finder to find the terminal application in the “applications” folder).
-
Type the following at the terminal prompt:
git config --global user.name "Your Name" git config --global user.email your.name@vanderbilt.edu
using your real name and email.
You only need to introduce yourself to git one time after you install it. Then it will remember who you are every time you use it.
Getting an account on GitHUB
If you will be using Git, you may want to create a free account on the GitHub website. This will allow you to use GitHub to store your work in the cloud, so you can get access it from other computers and have a backup in case your computer dies.
- Go to https://github.com and register for a free account
- After you have set up your account, go to GitHub Education and register your account for the free extras you can get as a student.
Resources for Learning More
and Resources
- Our principal resource will be the book, R for Data Science . You can buy a printed copy or use the free online edition , which is identical to the latest printed edition.
- RStudio
also has very useful “Cheat Sheets” that you can access from the
help menu. These are two-page PDF files that explain the basics of things
you may want to do with R:
- Manipulating tibbles and data frames with <code>dplyr</code>
- Visualizing data (making graphs and charts) with <code>ggplot2</code>
- Reading data from files and organizing it into tidy tibbles using <code>readr</code> and <code>tidyr</code>
- Manipulating lists and vectors with <code>purrr</code>
- Using RMarkdown
- There is also a cheatsheet for the RStudio IDE (Integrated Development Environment), which explains how to do things with RStudio , with a list of keyboard shortcuts for many common tasks.
- There are several additional cheatsheets that aren’t listed on the Help menu, but you can see them if you click on “Browse Cheatsheets… ” at the bottom of the Cheatsheet menu or visit https://www.rstudio.com/resources/cheatsheets/
- The team also has a large selection of free video tutorials and webinars about using R and RStudio . These range from basics of R and RStudio for beginners and simple introductions to the basics of data science with R to very advanced topics about specialized topics.
and GitHUB Resources
-
There is a lot of free documentation about git at the git-scm website, including a full Git reference manual and a free online book, Pro Git
-
Professor Jenny Bryan at the University of British Columbia, has written a lot of helpful tutorial material specifically about using git and GitHub with R and RStudio at Happy Git and GitHub for the useR .
Professor Bryan has also posted a detailed video tutorial at the RStudio Webinars and Videos page . This tutorial walks you through all the steps of setting up git with RStudio
and how to use it to keep track of your edits and revisions, and synchronize your work with GitHub (this serves three functions: backing up your data to the cloud, sharing your data with other people, and collaborating on writing code or documents with other people).