如何继续使用缺少一个级别(已被删除)的因子?

问题描述 投票:0回答:2

我正在使用我已下载的一组Fitbit数据,它有一个工作日列表,我正在尝试正确订购。现在,当前的数据集没有“星期五”,但我希望这些因素无论如何都要包含它。

即使数据集中只有6个工作日,我如何继续将工作日视为1-7?

file<-choose.files()
   slp<-data.frame(read.csv(file))

   wkdaylevels<-c("Sunday","Monday","Tuesday","Wednesday","Thursday","Friday","Saturday")

    slp$FellAsleepAt<-strptime(slp$FellAsleepAt, format="%B %e, %Y at %I:%M%p")
    slp$AwokeAt<-strptime(slp$AwokeAt,format="%B %e, %Y at %I:%M%p")
    slp$TotalTimeSlept<-gsub("h ",":",slp$TotalTimeSlept)
    slp$TotalTimeSlept<-gsub("m","",slp$TotalTimeSlept)
    slp$TimeAsleep<-as.numeric(difftime(slp$AwokeAt,slp$FellAsleepAt))
    slp$Date<-as.Date(slp$FellAsleepAt, format="%M/%D/%Y")
    slp$DayofWeek<-as.factor(weekdays(slp$Date),levels=wkdaylevels)



    ggplot(slp,aes(x=DayofWeek,y=TimeAsleep))+
        geom_point()

数据来源:https://docs.google.com/spreadsheets/d/1Vdgmtwx0vNKDKEZFMEGAWQ58H66ia-xjI0evR7idfkc/edit?usp=sharing

r r-factor
2个回答
4
投票

答:使用factor而不是as.factor

函数as将对象强制转换为类。在你的情况下,as.<type>胁迫一种类型(因素为你)。函数factor用于将对象编码为因子。关键的区别是as.factor不允许水平参数,而factor则允许。

如果检查每个函数的源代码,您将看到as.factor通过使用对象的唯一级别作为其级别来执行强制。如果未指定factor参数,levels=会执行此操作,但允许输入级别。

例如:

x <- 1:6
x2 <- factor(x, levels= 1:7)
levels(x2)
[1] "1" "2" "3" "4" "5" "6" "7"
x2 <- as.factor(x, levels= 1:7) # in this case, levels won't be evaluated due to lazy evaluation
Error in as.factor(x, levels = 1:7) : unused argument (levels = 1:7)

TBH,我不知道为什么你的R会话没有给你这个错误。你在使用R 3.2.3吗?


2
投票

使用levels<-?levels寻求帮助):

wdays <- as.factor(c("Sunday", "Monday"))
wkdaylevels<-c("Sunday","Monday","Tuesday","Wednesday","Thursday","Friday","Saturday")
levels(wdays) <- wkdaylevels
wdays
# [1] Monday Sunday
# Levels: Sunday Monday Tuesday Wednesday Thursday Friday Saturday

如果你想删除未使用的级别,你可以使用

droplevels(wdays)
# [1] Monday Sunday
# Levels: Sunday Monday

要么

factor(wdays)
# [1] Monday Sunday
# Levels: Sunday Monday
© www.soinside.com 2019 - 2024. All rights reserved.