我需要先说我是使用 R/Posit 产品的初学者,所以如果我看起来很笨,我很抱歉。
我正在进行文本分析,我正在清理包含特殊字符 [有时与单词相关] 的数据,例如:pdf\ \、__________、_city 等。
我想删除那些特殊字符并尝试了一些 sub 和 gsub 语法,但我在网上找到的每条建议都出错了。我真的很沮丧,因为我似乎找不到任何进展。我真的需要很多帮助,如果我能得到解释,我将不胜感激。我想了解这个程序,只需要帮助开始。谢谢!
ISISPosts$paragComb<-sub("pdf\ "," ", ISISPosts$paragComb) Warning messages: 1: In stringi::stri_info() : Your current locale is not in the list of available locales. Some functions may not work properly. Refer to stri_locale_list() for more details on known locale specifiers. 2: In stringi::stri_info() : Your current locale is not in the list of available locales. Some functions may not work properly. Refer to stri_locale_list() for more details on known locale specifiers.
ISISPosts$paragComb<-sub(r"(pdf\ )"," ", ISISPosts$paragComb) Error in sub("pdf\ \", " ", ISISPosts$paragComb) : invalid regular expression 'pdf\ ', reason 'Trailing backslash' In addition: Warning message: In sub("pdf\ \", " ", ISISPosts$paragComb) : TRE pattern compilation error 'Trailing backslash'
ISISPosts$paragComb<-sub(" - ")," ", ISISPosts$paragComb) Error: unexpected ',' in "ISISPosts$paragComb<-sub(" - "),"
ISISPosts$paragComb<-sub("...")," ", ISISPosts$paragComb) Error: unexpected ',' in "ISISPosts$paragComb<-sub("..."),"
ISISPosts$paragComb<-gsub(""," ",ISISPosts$paragComb)
ISISPosts$paragComb<-gsub("\"," ",ISISPosts$paragComb) Error in gsub("\", " ", ISISPosts$paragComb) : invalid regular expression '', reason 'Trailing backslash' In addition: Warning message: In gsub("\", " ", ISISPosts$paragComb) : TRE pattern compilation error 'Trailing backslash'
ISISPosts$paragComb<-gsub("\\"," ","pdf\ ")
s=“下
my_str<-'pdf\ '