分析時有時會用到樞紐分析,
R也有類似的功能,
就是plyr。
如下面例子,
產生一個data frame,
其中包含組別、性別與年齡三個col,
我們可以使用ddply,
依照組別與年齡作為分組標準,
計算各組的平均數與標準差。
另外也常遇到計算各組樣本出現次數,
那就可以用nrow來計算。
如果分組標準只有一個,
則可以使用最簡單的table。
下方有小範例。
In Excel, there is something called Pivot Table.
In R, you can use ddply to summarize data.
All you have to do is to give it a data frame,
set the columns which you want to operate,
and set what you want to do (such as sum, mean, counting frequency...),
and it will give you the result.
Apart from this,
if you want to count the frequency of the data,
you can just use table.
The relating example is down below.
Popular Posts
Blog Archive
Categories
R
(28)
data.table
(4)
Python
(3)
Rstudio
(3)
dplyr
(3)
rvest
(3)
網路爬蟲
(3)
Error
(2)
Web Crawler
(2)
grepl
(2)
jupyter
(2)
plyr
(2)
ubuntu
(2)
教學
(2)
.Last.value
(1)
Big Data
(1)
Console
(1)
IEEE程式語言排行
(1)
PuTTY
(1)
Rprofile.site
(1)
Rselenium
(1)
XLConnect
(1)
assign
(1)
bar chart
(1)
cat
(1)
conflict
(1)
coord_flip
(1)
data.frame
(1)
dcast
(1)
download.file
(1)
evalWithTimeout
(1)
excel_sheets
(1)
factor
(1)
file.rename
(1)
fread
(1)
ggplot2
(1)
global variable
(1)
group_by
(1)
gsub
(1)
invalid multibyte character
(1)
jiebaR
(1)
join
(1)
jupyter_contrib_nbextensions
(1)
jupyterthemes
(1)
loading
(1)
melt
(1)
merge
(1)
mutate
(1)
numeric
(1)
print
(1)
rbind
(1)
read.csv
(1)
read_csv
(1)
read_excel
(1)
readr
(1)
readxl
(1)
scientific notation
(1)
scipen
(1)
separate_rows
(1)
setDF
(1)
setDT
(1)
sqldf
(1)
static IP address
(1)
str_count
(1)
stringr
(1)
table
(1)
tidyr
(1)
timeout
(1)
trim
(1)
txtProgressBar
(1)
unique
(1)
zip
(1)
人力銀行
(1)
參考資源
(1)
技能
(1)
文字探勘
(1)
橫條圖
(1)
玩玩小數據
(1)
結巴分詞
(1)
能力
(1)
資料分析
(1)
資料分析師
(1)
長條圖
(1)
沒有留言:
張貼留言