Skip to main content

What I did to use git with Rstudio on Ubuntu 16.04 and Elementary OS Loki.


This article was updated on July 7 2018.

Before installing anything I signed up on https://github.com/ and created my R-ANALYTICS repository:


Now the installation and setup.

1. I installed git on Ubuntu, from a terminal window:

$ sudo apt-get update
$ sudo apt-get install git

2.- I configured git:

I did this through the git config command.
I provided my name and email address because git embeds this information into each commit.
Open a terminal and run:

$ git config --global user.name "Your Name" --> This is part of your link: https://github.com/LaranIkal, you see, my name is LaranIkal
$ git config --global user.email "youremail@domain.com"

To see all of the configuration items just type:

$ git config --list

As a note, it is saved to a file called ~/.gitconfig in your home folder.

Steps In Rstudio:


1.- Checking/Setting Rstudio right configuration to use Git/SVN: Menu→Tools→Global Options
Select GIT/SVN tab at the end of the below screen shot is the sample.

Now check the values:

My git executable is in:
laranikal@Analytics:/usr/bin$ ls -ltr git
-rwxr-xr-x 1 root root 1866896 oct 4 13:22 git

svn is not there but, Rstudio ise setup like that and since I am using git, I do not care about svn.
laranikal@Analytics:/usr/bin$ ls -ltr svn
ls: cannot access 'svn': No such file or directory



On Elementary OS Loki, I got the window below, so everything was done without any additional configuration, the thing is that rstudio detected git automatically:





3.- Now I created my project:

Note. To import a created project is the same process, it is just that the project files are already created and they are imported into your project folder.


On Rstudio go to File→ New Project

The next screen is showed, select Version Control:


Select git on the next screen:


Switch to the browser to get the https address of my project:


Clicking on the small icon
to the right will copy the address to the clipboard.


Now I pasted it to my new project repository URL:
The project directory name was setup automatically after pasting the URL.
The last step was to browse to the folder I wanted to clone my git repository into and clicked Create Project button.

Now I created new files:


Then I committed them: Menu → Tools → Version Control → Commit 

Select the document to be committed, enter a commit message and click the Commit button to the right:
This will commit to the local git repository on the hard drive.




Click close button on the right up side. And close the commit window.





Now it is needed to push the changes to git site.
Open a terminal window in the project local folder, type git push command and press enter:

After pressing enter git asks for user and password and, commit changes:


After this I can see my changes in my git repository on the web browser:



As additional information, I just found this site that may be helpful for the R users:

http://happygitwithr.com



Enjoy it!!!.

Carlos Kassab




Popular posts from this blog

UPDATED: Using R and H2O to identify product anomalies during the manufacturing process.

Note.  This is an update to article:  http://laranikalranalytics.blogspot.com/2019/03/using-r-and-h2o-to-identify-product.html - It has some updates but also code optimization from  Yana Kane-Esrig(  https://www.linkedin.com/in/ykaneesrig/ ) , as she mentioned in a message: The code you posted has two nested for() {} loops. It took a very long time to run. I used just one for() loop. It was much faster   Here her original code: num_rows=nrow(allData) for(i in 1:ncol(allData)) {   temp = allData [,i]   cat( "Processing column:", i, ", number missing:", sum( is.na(temp)), "\n" )    temp_mising =is.na( allData[, i])    temp_values = allData[,i][! temp_mising]    temp_random = sample(temp_values, size = num_rows, replace = TRUE)      temp_imputed = temp   temp_imputed[temp_mising]= temp_random [temp_mising]   # describe(temp_imputed)   allData [,i] = temp_imputed      cat( "Processed column:", i, ", number missing:", sum( is.na(allData [,i

Installing our R development environment on Ubuntu 20.04

  Step 1: Install R,  Here the link with instructions:  How to instal R on Ubuntu 20.04 Adding the steps I followed because sometimes the links become unavailable: Add GPG key: $ sudo apt-key adv --keyserver keyserver.ubuntu.com --recv-keys E298A3A825C0D65DFD57CBB651716619E084DAB9 Output: Executing: /tmp/apt-key-gpghome.NtZgt0Un4R/gpg.1.sh --keyserver keyserver.ubuntu.com --recv-keys E298A3A825C0D65DFD57CBB651716619E084DAB9 gpg: key 51716619E084DAB9: public key "Michael Rutter " imported gpg: Total number processed: 1 gpg: imported: 1 Add repository: $ sudo add-apt-repository 'deb https://cloud.r-project.org/bin/linux/ubuntu focal-cran40/' Output: Hit:1 https://deb.opera.com/opera-stable stable InRelease Hit:2 http://us.archive.ubuntu.com/ubuntu focal InRelease Hit:3 http://archive.canonical.com/ubuntu focal InRelease

Using R and H2O Isolation Forest anomaly detection for data quality, further analysis.

Introduction: This is the second article on data quality, for the first part, please go to:  http://laranikalranalytics.blogspot.com/2019/11/using-r-and-h2o-isolation-forest-for.html Since Isolation Forest is building an ensemble of isolation trees, and these trees are created randomly, there is a lot of randomness in the isolation forest training, so, to have a more robust result, 3 isolation forest models will be trained for a better anomaly detection. I will also use Apache Spark for data handling. For a full example, testing data will be used after training the 3 IF(Isolation Forest) models. This way of using Isolation Forest is kind of a general usage also for maintenance prediction. I am working with data from file: https://www.kaggle.com/bradklassen/pga-tour-20102018-data NOTE: There was a problem with the data from the link above, so I created some synthetic data that can be downloaded from this link:  Golf Tour Synthetic Data # Set Java parameters, enough memo