R Ecosystem
Outline
R Is a Powerful System for Data Analysis
R Does Data Mining, Visualization, and Traditional Statistics
R Generates Reports and Web Content
R Interoperates Widely
R Is Growing Rapidly
R Has a Strong Support Community
You Can Learn from the R Community
You Can Learn R on Your Own
Figure: pairwise plots of the variables mpg, cyl, disp, hp, drat, wt, qsec, vs, am, gear, and carb, with Density panels and pairwise correlations marked for significance: 0.89 ***, 0.87 ***, 0.85 ***, 0.79 ***, 0.78 ***, 0.71 ***, 0.71 ***, 0.68 ***, 0.66 ***, 0.45 **.
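The variable names in the figure above (mpg, disp, hp, drat, wt, and so on) match the columns of R's built-in mtcars data set, so a comparable correlation summary can be sketched in base R. This is only an illustration under that assumption; the slide does not say which function or package drew the original figure.

```r
# Assumption: the figure's variables come from the built-in mtcars data set.
vars <- c("mpg", "disp", "hp", "drat", "wt")

# Pairwise Pearson correlations, rounded to two decimals.
cors <- round(cor(mtcars[, vars]), 2)
print(cors)

# Significance test for one pair; plots like the one above mark such
# results with stars.
test_mpg_wt <- cor.test(mtcars$mpg, mtcars$wt)
print(test_mpg_wt$p.value)

# Base-graphics scatterplot matrix of the same variables.
pairs(mtcars[, vars])
```

The signs of the correlations are visible in the printed matrix; the figure residue above preserves only magnitudes.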
K-centroids clustering
https://www.rmetrics.org/node/34
http://rgm2.lab.nig.ac.jp/RGM2/func.php?rd_id=flexclust:kcca
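The second link above points to the kcca function of the flexclust package. As a rough stand-in that needs no extra packages, the sketch below runs ordinary k-means from base R on the built-in iris data. The data set, the number of clusters, and the seed are assumptions chosen for illustration, and kmeans is a swapped-in technique, not the kcca call the slide refers to.

```r
# Stand-in example: base R k-means, not flexclust::kcca.
set.seed(42)                      # k-means starts from random centers

x <- scale(iris[, 1:4])           # standardize the four numeric columns
fit <- kmeans(x, centers = 3, nstart = 20)

# Cluster sizes and a cross-tabulation against the known species.
print(fit$size)
print(table(cluster = fit$cluster, species = iris$Species))
```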
Figure: association rules plotted as LHS groups against RHS items (size: support, color: lift). One LHS group label is legible: 77 (butter +24). Items shown: {hamburger meat}, {salty snack}, {sugar}, {cream cheese}, {white bread}, {beef}, {curd}, {butter}, {bottled beer}, {domestic eggs}, {fruit/vegetable juice}, {pip fruit}, {whipped/sour cream}, {citrus fruit}, {sausage}, {pastry}, {shopping bags}, {tropical fruit}, {root vegetables}, {bottled water}, {yogurt}, {other vegetables}, {soda}, {rolls/buns}, {whole milk}.
Affiliation network: school activity clubs
Figure: network graph with node labels including Choir, concert; Wrestling, 8th; Teachers of Tomorrow; Orchestra, Full Concert; Football, 8th; Track, girls V; Softball, V; Softball, JV (10th); Tennis girls V; Basketball, girls 9th; Volleyball, 9th; Cheerleaders, 9th.
http://sna.stanford.edu/sna_R_labs/output/lab_5/5.7_magact_stdnt_actvts_1996_clubs.pdf
http://blog.revolutionanalytics.com/2012/07/making-beautiful-maps-in-r-with-ggmap.html
https://www.rmetrics.org/blog/RiskSurfaces
http://www.quantmod.com/examples/intro/
Figure: scatterplot "S&P versus jobless claims" (x-axis: S&P 500, 800 to 1400; y-axis ticks 3e+05 to 6e+05; Rsquared = 0.88).

            Df Sum Sq Mean Sq  Pr(>F)
block        5  343.3    68.7 0.00822 **
N            1  189.3   189.3 0.00247 **
K            1   95.2    95.2 0.02114 *
N:K          1   33.1    33.1 0.14959
Residuals   15  215.5    14.4
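The analysis-of-variance output on this slide (terms block, N, K, N:K, and Residuals with 15 degrees of freedom) is consistent with a model fitted to R's built-in npk data set. The sketch below refits such a model. The data set and the formula are assumptions inferred from the term names and degrees of freedom, not something the slide states.

```r
# Assumption: the table comes from the built-in npk data (pea yield by
# block and fertilizer treatments N, P, K), with P left out of the model.
fit <- aov(yield ~ block + N * K, data = npk)

# summary() returns a list; its first element is the ANOVA table with
# columns Df, Sum Sq, Mean Sq, F value, and Pr(>F).
tab <- summary(fit)[[1]]
print(tab)
```

Comparing the printed Df and Sum Sq columns with the slide is a quick check on whether the assumed model matches.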
library(ggplot2)
Yihui Xie
April 6, 2012
Figure: chart with axis ticks from 0 to 50000 and categories Fair, Good, Very Good, Premium, Ideal.
[1] http://bit.ly/knitr-graphics-src (Rnw source)
[2] http://bit.ly/knitr-main-pdf
Graphical Devices
The knitr package comes with more than 20 built-in graphical devices,
and you can specify them through the dev option. This document uses
the global option dev=tikz, i.e., the plots are recorded by the tikz
device by default, but we can change the device locally. Since tikz is
used extensively throughout this manual and plenty of tikz graphics
appear later, we first show a few other devices.
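The dev option described above can be set globally from R code and overridden per chunk. The following is a minimal sketch, assuming the knitr package is installed (the guard skips the demonstration otherwise). The chunk label other-device and the choice of png as the local device are hypothetical, picked only for illustration.

```r
# Assumption: knitr is installed; nothing is changed if it is not.
current_dev <- NULL
if (requireNamespace("knitr", quietly = TRUE)) {
  old <- knitr::opts_chunk$get("dev")       # remember the current setting
  knitr::opts_chunk$set(dev = "tikz")       # global device, as in the manual
  current_dev <- knitr::opts_chunk$get("dev")
  print(current_dev)
  knitr::opts_chunk$set(dev = old)          # restore the previous setting
}

# Inside an Rnw document, a single chunk can switch device locally
# (hypothetical chunk label and device):
# <<other-device, dev='png'>>=
# plot(cars)
# @
```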
Figure: plots with axes labeled cut, count, and Volume.
Before reading this specific manual [1], you must have finished the main
manual [2].
Figure: plot with an axis labeled Height (60 to 90) and a clock-face graphic with Roman numerals.
Plot Recording
As mentioned in the main manual, knitr uses the evaluate package to
record plots. There are two sources of plots: first, whenever plot.new()
or grid.newpage() is called, evaluate will try to save a snapshot of the
We can also set the alignment of plots easily with the fig.align option; this document uses fig.align=center as a global option, and
we can also set plots to be left/right-aligned. Figure 8 is an example
http://yihui.name/knitr/demo/graphics/
Relational databases:
NoSQL databases:
Web:
Data Analysts Captivated by R's Power
http://r4stats.com/articles/popularity/
http://www.revolutionanalytics.com
R Usage
http://blog.kaggle.com
http://www.kdnuggets.com
http://www.rexeranalytics.com
Software   # Blogs
R          365
SAS        40
Stata      8
Others     0-3
Aggregated at http://r-bloggers.com/
For Q&A:
- http://stackoverflow.com/questions/tagged/r
- https://stat.ethz.ch/mailman/listinfo/r-help
http://r4stats.com/articles/popularity/
Figure: screenshot of The R Journal (ISSN: 2073-4859) table of contents, with entries including Editorial; R/Finance 2012: Applied Finance with R; Changes on CRAN; R Foundation News.
R-Project: http://www.r-project.org/
http://cran.r-project.org/mirrors.html
Quick-R: http://www.statmethods.net/
R Books
Many R textbooks are available. The choice depends on:
- statistics emphasis
- data mining emphasis
- software emphasis
- intended application
Some good free R books to get started:
http://cran.r-project.org/doc/manuals/R-intro.pdf
http://cran.r-project.org/doc/contrib/Paradis-rdebuts_en.pdf
http://cran.r-project.org/doc/contrib/usingR.pdf
http://cran.r-project.org/doc/contrib/Verzani-SimpleR.pdf
http://cran.r-project.org/doc/contrib/Owen-TheRGuide.pdf
http://cran.r-project.org/doc/contrib/Lam-IntroductionToR_LHL.pdf
http://ipsur.org/index.html
http://cran.r-project.org/other-docs.html
http://www.oup.com/uk/orc/bin/9780199299881/01student/companions/R_Companion.pdf
R package
http://cran.stat.ucla.edu/web/packages/e1071/vignettes/svmdoc.pdf
http://rgm2.lab.nig.ac.jp/RGM2/
http://gallery.r-enthusiasts.com/thumbs.php
Next steps:
Download R
Try a data mining interface like Rattle
Try a statistics interface like Deducer or R Commander
Try a programmer interface like RStudio