RStudio¶
RStudio is available for use graphically in your browser via OOD. As with other standalone programs, you'll need to select the resources required using the job creation form. You'll also need to select both the version of RStudio you wish to use, and the version of R you wish to use. To adjust the environment, please use the Environment Setup field to load modules besides R and RStudio as seen below. All other modules and paths should be loaded here as it is difficult to load and consistently use modules once RStudio starts.
Important
Unless an older version of R is absolutely necessary, it is highly suggested to always use the newest version of R and RStudio for both updated functionality within those software as well as updated compilers for package installation. Using the newest version of R solves most known package installation errors.
RStudio and Python¶
If you have a workflow that uses both R and Python, it is strongly recommended to use the reticulate package along with Anaconda environments. Reticulate allows researchers to load Python packages into a native R session as objects. For instance, if someone prefer some functionality of the pandas
package but has other code already written in R, they can import pandas
to R and use both simultaneously.
This also allows researchers to download precompiled command line binaries into an Anaconda environment and easliy use them in their R scripts.
For setup, use the following steps:
-
In a terminal on a compute node, either in an HPC Desktop job or by clicking the blue Host button on any job card:
- Load the
Anaconda3
module - Create an Anaconda environment. More information about how to create Anaconda environments can be found in our documentation.
- Activate your environment and install your requuired python packages using either
pip install
orconda install
depending on the package source.
Note
The preceding steps should only need to be run once. If other Python packages need to be installed in the same environment, repeat steps 1 and 3. You will not need to recreate your environment.
- Load the
-
In RStudio:
- Add the command
module load Anaconda3
to the Environment Setup window when requesting the RStudio job. - If not already installed, install the
reticulate
package using eitherinstall.packages
or the renv package. - Use
reticulate::use_condaenv('env_name')
to load your conda environment. - From here, you will be able to interact with all of the python packages and non-python precompiled binaries in your Anaconda environment using R and RStudio. Please read more about how to do that in reticulate's documentation.
- Add the command
For cases where your R code only needs access to precompiled binaries or libraries and does not need to import any Python libraries, you can instead create your Anaconda environment and add the following lines into the Environment Setup window:
This will add those binaries and libraries to your environment $PATH
which RStudio will inherit.
Important
If you're wanting to directly use any Python package in R, DO NOT include the conda activate
command in the Environment Setup. Use reticulate
instead as described above.
RStudio Projects and renv¶
The most recent versions of RStudio installed on Cheaha support R Projects as well as package management through the renv
package. Please read more about improving analysis reproducibility using both of these tools in our workflow solutions
Using Pandoc and knitr
within RStudio¶
If you want to use RMarkdown to create reports in RStudio, R modules using version 4.2.0 and later include knitr
compatibility. Please use the latest versions of both R and Rstudio for fully integrated knitr
functionality.
Starting With a Clean Session to Avoid Errors¶
By default, RStudio loads the most recently opened project at startup and restores the .RData
file into the workspace. If you only work on a single project, this may be helpful. If you frequently change projects then these default settings can create difficult-to-diagnose errors, or you may inadvertently alter a project by adding incorrect packages, for example.
To reduce the risk of these kinds of errors, uncheck the highlighted fields below in the RStudio Options menu under the "General" selection.
- Restore most recently opened project at startup
- Restore .RData into workspace at startup