初学者如何消除特殊字符

问题描述 投票:0回答:0

我需要先说我是使用 R/Posit 产品的初学者,所以如果我看起来很笨,我很抱歉。

我正在进行文本分析,我正在清理包含特殊字符 [有时与单词相关] 的数据,例如:pdf\ \、__________、_city 等。

我想删除那些特殊字符并尝试了一些 sub 和 gsub 语法,但我在网上找到的每条建议都出错了。我真的很沮丧,因为我似乎找不到任何进展。我真的需要很多帮助,如果我能得到解释,我将不胜感激。我想了解这个程序,只需要帮助开始。谢谢!

ISISPosts$paragComb<-sub("pdf\ "," ", ISISPosts$paragComb) Warning messages: 1: In stringi::stri_info() : Your current locale is not in the list of available locales. Some functions may not work properly. Refer to stri_locale_list() for more details on known locale specifiers. 2: In stringi::stri_info() : Your current locale is not in the list of available locales. Some functions may not work properly. Refer to stri_locale_list() for more details on known locale specifiers.

ISISPosts$paragComb<-sub(r"(pdf\ )"," ", ISISPosts$paragComb) Error in sub("pdf\ \", " ", ISISPosts$paragComb) : invalid regular expression 'pdf\ ', reason 'Trailing backslash' In addition: Warning message: In sub("pdf\ \", " ", ISISPosts$paragComb) : TRE pattern compilation error 'Trailing backslash'

ISISPosts$paragComb<-sub(" - ")," ", ISISPosts$paragComb) Error: unexpected ',' in "ISISPosts$paragComb<-sub(" - "),"

ISISPosts$paragComb<-sub("...")," ", ISISPosts$paragComb) Error: unexpected ',' in "ISISPosts$paragComb<-sub("..."),"

ISISPosts$paragComb<-gsub(""," ",ISISPosts$paragComb)

  • [1] “” 错误:意外的字符串常量在: “ISISPosts$paragComb<-gsub(""," ",ISISPosts$paragComb) [1] ""

ISISPosts$paragComb<-gsub("\"," ",ISISPosts$paragComb) Error in gsub("\", " ", ISISPosts$paragComb) : invalid regular expression '', reason 'Trailing backslash' In addition: Warning message: In gsub("\", " ", ISISPosts$paragComb) : TRE pattern compilation error 'Trailing backslash'

ISISPosts$paragComb<-gsub("\\"," ","pdf\ ")

  • “pdf\” 错误:意外的符号在: “ISISPosts$paragComb<-gsub("\\"," ","pdf\ ") "pdf"

s=“下

  • ISISPosts$paragComb<-gsub(""", " ", s) Error: unexpected '\' in: "s= " under ISISPosts$paragComb<-gsub(""

my_str<-'pdf\ '

  • my_str<-gsub(pattern = ('\\'), replacement = '', x=my_str) Error: unexpected '\' in: "my_str<-'pdf\ ' my_str<-gsub(pattern = ('"
error-handling rstudio special-characters text-mining
© www.soinside.com 2019 - 2024. All rights reserved.