Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
notes:r_cheat_sheet [2014/10/05 03:24]
smthng [Vectors]
notes:r_cheat_sheet [2015/01/10 15:25] (current)
smthng [File Handling]
Line 6: Line 6:
  
   * ''R'' is the command-line interpreter   * ''R'' is the command-line interpreter
 +  * ''install.pa­ckages("gg­plot2") '' to install additional packages
 +
   * Expressions are evaluated and displayed e.g. 1, 1+1, "Hello World"   * Expressions are evaluated and displayed e.g. 1, 1+1, "Hello World"
   * Booleans are e.g. ''1=1'' , ''3>4'' , ''TRUE'', T, ''FALSE'', F   * Booleans are e.g. ''1=1'' , ''3>4'' , ''TRUE'', T, ''FALSE'', F
   * For variable assignment ''x=1'' or ''x<-1''   * For variable assignment ''x=1'' or ''x<-1''
-  * For help on a function use ''help(sum)'' or ''example(sqrt)''+  * For help on a function use ''help(sum)'' ,  ''help(package='ggplot2')''  or ''example(sqrt)''
   * Operations are ''+ - * / = <-''   * Operations are ''+ - * / = <-''
   * ''NA'' is used to express a missing or unknown data  value. Expressions on NA return NA.   * ''NA'' is used to express a missing or unknown data  value. Expressions on NA return NA.
- 
 ===== Vectors ===== ===== Vectors =====
   * To create a vector, use the combine command ''c(4,7,9)''   * To create a vector, use the combine command ''c(4,7,9)''
Line 25: Line 26:
   * ''myseq + 1'' adds one to all elements of the myseq vector.   * ''myseq + 1'' adds one to all elements of the myseq vector.
   * Scalar operations or functions on vectors typically produce other vectors e.g. + - == sin(myseq)   * Scalar operations or functions on vectors typically produce other vectors e.g. + - == sin(myseq)
 +  * ''head(myvec)'' , ''tail(myvec)'' to show start or end of vector
  
-===== Data Sets ===== 
-  * ''factor'' is a collection type for categorized values - ''myfac=factor(myvec)'' 
-  * ''factor''s group unique string values as ''level''s e.g. levels(myfac) shows unique levels.  
-  * ''as.integer(myfac)'' shows levels as integers, can be used to set plot type 
-  * ''legend("to­pright", level­s(types), pch=1­:length(le­vels(types­)))'' 
-  * A data frame collects sets of related values (i.e. sets of columns with values in the same order) e.g. ''mydf=data.frame(weights,prices,types)'' 
-  * To extract a column, use double-square brackets with the column index or name e.g. ''mydf%%[['weights']]%%'' or just a dollar sign e.g. ''treasure$prices'' 
 ===== Plotting ===== ===== Plotting =====
  
Line 43: Line 38:
   * ''persp(mymat)'' plots a contour map in perspective.   * ''persp(mymat)'' plots a contour map in perspective.
   * ''image(volcano)'' generates a heat map of the matrix.   * ''image(volcano)'' generates a heat map of the matrix.
 +  * ''qplot(weights, prices, color=types)'' - more attractive plotting using ggplot2 package.
 ===== Matrices ===== ===== Matrices =====
  
Line 51: Line 46:
   * ''mymatrix[3,4]'' returns an element of the matrix (row,column).   * ''mymatrix[3,4]'' returns an element of the matrix (row,column).
   * ''mymatrix[,2]'' returns entire second column.   * ''mymatrix[,2]'' returns entire second column.
 +
 +===== Data Sets =====
 +  * ''factor'' is a collection type for categorized values - ''myfac=factor(myvec)''
 +  * ''factor''s group unique string values as ''level''s e.g. levels(myfac) shows unique levels. 
 +  * ''as.integer(myfac)'' shows levels as integers, can be used to set plot type
 +  * ''legend("to­pright", level­s(types), pch=1­:length(le­vels(types­)))''
 +  * A data frame collects sets of related values (i.e. sets of columns with values in the same order) e.g. ''mydf=data.frame(weights,prices,types)''
 +  * To extract a column, use double-square brackets with the column index or name e.g. ''mydf%%[['weights']]%%'' or just a dollar sign e.g. ''treasure$prices''
 +  * ''merge'' merges data sets by joining on shared column names
  
 ===== Statistics ===== ===== Statistics =====
  
   * ''mean(myvec)  median(myvec)   sd(myvec)''   * ''mean(myvec)  median(myvec)   sd(myvec)''
 +  * ''cor.test'' tests for correlation (Pearson's product-moment)
 +  * ''line = lm(cola ~ colb)'' calculates a linear model between cola and colb that can be plotted with ''abline(line)''
 +  * 
 ===== File Handling ===== ===== File Handling =====
  
   * ''list.files()'' to list files in furrent directory   * ''list.files()'' to list files in furrent directory
   * ''source("file.R")'' to load file of code   * ''source("file.R")'' to load file of code
 +  * ''read.csv('mydat.csv')'' to load a csv file
 +  * ''read.table'' to read text data with other separators
 +  * ''con<-url("http://google.com","r")'' to read a webpage
 +  * ''x<-readLines(con)'' to convert to a vector of lines
Recent changes RSS feed Creative Commons License Donate Minima Template by Wikidesign Driven by DokuWiki