代码之家 › 专栏 › 技术社区 › Max

在R中按列取消数据帧的堆叠

Max · 技术社区 · 5 年前

我想将R中的数据帧按两列展开,即从

id   segment  count  freq
1    a        x1a    f1a
1    b        x1b    f1b
1    c        x1c    f1c
2    a        x2a    f2a
2    b        x2b    f2b
2    c        x2c    f2c

我想得到:

id   count_a  count_b count_c freq_a freq_b freq_c
1    x1a      x1b     x1c     f1a    f1b    f1c
2    x2a      x2b     x2c     f2a    f2b    f2c

基本上,这相当于通过前两列id和segment来取消数据帧的堆栈。但是,我不知道如何使用R中的unstack()函数来实现这一点。我可以使用一种非常简单的方法(嵌套for循环、连接列名等,然后绑定)来实现这一点,但是必须有一种更直接、更有效的方法。

1 回复 | 直到 5 年前

akrun 5 年前

pivot_wider

library(dplyr)
library(tidyr)
df1 %>%       
   pivot_wider(names_from = c(segment), values_from = c(count, freq))
# A tibble: 2 x 7
#     id count_a count_b count_c freq_a freq_b freq_c
#  <int> <chr>   <chr>   <chr>   <chr>  <chr>  <chr> 
#1     1 x1a     x1b     x1c     f1a    f1b    f1c   
#2     2 x2a     x2b     x2c     f2a    f2b    f2c

或与 dcast

library(data.table)
dcast(setDT(df1), id ~ segment, value.var = c('count', 'freq'))
#   id count_a count_b count_c freq_a freq_b freq_c
#1:  1     x1a     x1b     x1c    f1a    f1b    f1c
#2:  2     x2a     x2b     x2c    f2a    f2b    f2c

更新

如果存在重复项,则创建序列列

df1 %>%
   mutate(rn = rowid(segment)) %>%
    pivot_wider(names_from = c(segment), values_from = c(count, freq)) %>%
   select(-rn)

data.table

dcast(setDT(df1), id + rowid(segment) ~ segment, 
       alue.var = c('count', 'freq'))[, segment := NULL][]

数据

df1 <- structure(list(id = c(1L, 1L, 1L, 2L, 2L, 2L), segment = c("a", 
"b", "c", "a", "b", "c"), count = c("x1a", "x1b", "x1c", "x2a", 
"x2b", "x2c"), freq = c("f1a", "f1b", "f1c", "f2a", "f2b", "f2c"
)), class = "data.frame", row.names = c(NA, -6L))

推荐文章

Amp · 使用R ggplot2删除geom_radial中axis.line和panel.border之间的空格

4 月前

Hard_Course · 用另一列中的值替换行的最后一个非NA条目

4 月前

Mark R · 使用geom_sf()删除地球仪上不需要的网格线

4 月前

Joe · 根据对工作日和本周早些时候的日期的了解,找到一个日期

4 月前

Ben · 统计向量中的单词在字符串中出现的频率

4 月前

TheCodeNovice · R中符号格式的尾随零和其他问题[重复]

4 月前

katefull06 · 在R中使用terra修改范围时,会为单独的SpatRaster重写范围

4 月前

dez93_2000 · 在R管道子功能中引用管道对象的当前状态

4 月前

accibio · 在ggplot2中为同一变量创建两个连续的颜色渐变比例

4 月前

Mankka · 如何在Ggplot2中绘制均匀的径向图

4 月前