Remove HTML tags in Android

Question (votes: 0, answers: 9)

I have the following XML feed:

<Description>
  <p>Touch, tap, flip, slide! You don&#39;t just read Books, you experience it.</p>
</Description>

Here I need to display the description like this:

Touch, tap, flip, slide! You don't just read Books, you experience it.

This is how I handle it in the parser:

    public static String removeHTML(String htmlString)
    {
        // Remove HTML tag from java String
        String noHTMLString = htmlString.replaceAll("\\<.*?\\>", "");

        // Remove Carriage return from java String
        noHTMLString = noHTMLString.replaceAll("\r", "<br/>");
        noHTMLString = noHTMLString.replaceAll("<([bip])>.*?</\1>", "");
        // Remove New line from java string and replace html break
        noHTMLString = noHTMLString.replaceAll("\n", " ");
        noHTMLString = noHTMLString.replaceAll("\"", "&quot;");
        noHTMLString = noHTMLString.replaceAll("<(.*?)\\>"," ");//Removes all items in brackets
        noHTMLString = noHTMLString.replaceAll("<(.*?)\\\n"," ");//Must be underneath
        noHTMLString = noHTMLString.replaceFirst("(.*?)\\>", " ");
        noHTMLString = noHTMLString.replaceAll("&nbsp;"," ");
        noHTMLString = noHTMLString.replaceAll("&amp;"," ");
        return noHTMLString;
    }

In endElement:

    public void endElement(String uri, String localName, String qName) throws SAXException {
        currentElement = false;
        if (localName.equalsIgnoreCase("Description")) {
            sitesList.setDescription(currentValue);
            String Sub_arry = n + currentValue;
            Appscontent.Sub_arraylistdes.add(Sub_arry);
            String stringWithoutHTML = removeHTML(currentValue);
            System.out.println("description value----->" + n + att_ID + "------>>" + stringWithoutHTML);
        }
    }

Now when I run the app, the HTML tags are displayed along with my description. How can I remove the HTML tags here?

I want to display the description without the HTML tags.

Edit:

    if (localName.equalsIgnoreCase("Description")) {
        sitesList.setDescription(currentValue);
        String Sub_arry = n + currentValue;
        StringBuffer sb = new StringBuffer();
        sb.append(Sub_arry);
        String newString = sb.toString();
        Appscontent.Sub_arraylistdes.add(newString);
        System.out.println("description value----->" + n + att_ID + "------>>" + newString);
    }

Edit:

    public static String html2text(String html) {
        return Jsoup.parse(html).text();
    }

endElement:

    if (localName.equalsIgnoreCase("Description")) {
        sitesList.setDescription(currentValue);
        String Sub_arry = n + currentValue;
        Appscontent.Sub_arraylistdes.add(Sub_arry);
        String stringWithoutHTML = html2text(currentValue);
        System.out.println("description value----->" + n + att_ID + "------>>" + stringWithoutHTML);
    }

But I still don't get the expected output. How can I remove the HTML tags from these descriptions?

java android html xml-parsing stringbuffer
9 Answers
17
votes

You can easily remove any HTML tags in Android with the built-in Html class. Import

android.text.Html;

Then, assuming "data" is the String variable that contains the HTML, call

Html.fromHtml(data).toString()

to get the string without any HTML tags.


8
votes

A simple way to remove HTML. This returns the text without the HTML markup:

 private String removeHtml(String html) {
    html = html.replaceAll("<(.*?)\\>"," ");
    html = html.replaceAll("<(.*?)\\\n"," ");
    html = html.replaceFirst("(.*?)\\>", " ");
    html = html.replaceAll("&nbsp;"," ");
    html = html.replaceAll("&amp;"," ");
    return html;
}

Alternatively, let Android interpret the HTML tags and then drop them:

Html.fromHtml(data).toString();
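
The regex version above replaces &amp; with a space instead of "&", and it leaves numeric references such as the &#39; in the question's feed untouched. For code that cannot use android.text.Html, a rough plain-Java sketch that strips tags and then decodes a handful of entities could look like the following. The class and method names (HtmlStripper, stripHtml) are made up for illustration, only five named entities are handled, and a regex is not a real HTML parser, so comments, scripts, or a ">" inside an attribute value will still trip it up.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Android or of any answer above.
final class HtmlStripper {
    // Matches anything that looks like a tag, e.g. <p>, </p>, <br/>.
    private static final Pattern TAG = Pattern.compile("<[^>]*>");
    // Matches numeric character references such as &#39; or &#x27;.
    private static final Pattern NUMERIC_ENTITY = Pattern.compile("&#(x?)([0-9a-fA-F]+);");

    static String stripHtml(String html) {
        if (html == null || html.isEmpty()) {
            return html;
        }
        String text = TAG.matcher(html).replaceAll("");
        text = decodeNumericEntities(text);
        // Only a few named entities are handled. &amp; goes last so that
        // "&amp;lt;" ends up as the literal text "&lt;" and not "<".
        text = text.replace("&nbsp;", " ")
                   .replace("&quot;", "\"")
                   .replace("&lt;", "<")
                   .replace("&gt;", ">")
                   .replace("&amp;", "&");
        return text.trim();
    }

    private static String decodeNumericEntities(String text) {
        Matcher m = NUMERIC_ENTITY.matcher(text);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            int radix = m.group(1).isEmpty() ? 10 : 16;
            String replacement;
            try {
                int codePoint = Integer.parseInt(m.group(2), radix);
                replacement = new String(Character.toChars(codePoint));
            } catch (IllegalArgumentException e) {
                // Malformed or out-of-range reference: leave it as it was.
                replacement = m.group();
            }
            m.appendReplacement(out, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(out);
        return out.toString();
    }
}
```

In the question's endElement, this would presumably be called as HtmlStripper.stripHtml(currentValue), with the cleaned string (rather than the raw currentValue) being the one passed on for display.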

3
votes

One option is to add the JSoup library, import it, and use it like this:

public static String html2text(String html) {
    return Jsoup.parse(html).text();
}

3
votes
String plain = Html.fromHtml("your_html_string").toString();

1
votes
    private int iMobileVersion = Build.VERSION.SDK_INT;
    String strResonseTemplate = data.getStringExtra("template"); // HTML data as a string

    if (iMobileVersion >= 24) {
        // Works only on API 24 and above: removes the HTML tags but keeps the HTML styling in the EditText.
        mEtReply.setText(Html.fromHtml(strResonseTemplate, Html.FROM_HTML_MODE_COMPACT));
    } else {
        // Below API level 24 only the tags are removed, so the result is shown as plain text.
        mEtReply.setText(Html.fromHtml(strResonseTemplate).toString());
    }

Hope this helps :)


0
votes

As far as I know, you can get the data through the Spanned interface.

Try using this:

Spanned spannable = Html.fromHtml(arrayList.get(position).getBusinessDescription());
System.out.println("description value----->"+n+att_ID+"------>>"+spannable);

See the following links for more details:

http://developer.android.com/reference/android/text/Spanned.html
http://developer.android.com/reference/android/text/Spannable.html


0
votes

Just add these few lines of code and you're done.

String html = jsonObject1.getString("originaltext");
html = html.replaceAll("<(.*?)\\>"," ");
html = html.replaceAll("<(.*?)\\\n"," ");
html = html.replaceFirst("(.*?)\\>", " ");
html = html.replaceAll("&nbsp;"," ");
html = html.replaceAll("&amp;"," ");
newsModel.setNews(html);
Log.d("originaltext: ", html);

0
votes
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Pattern to detect elements enclosed in "<>"
private static final Pattern REMOVE_TAGS = Pattern.compile("<.+?>");

// Removes the HTML tags contained in a String
public static String removeTags(String string)
{
  // Nothing to do for a null or empty string
  if (string == null || string.length() == 0)
  {
     return string;
  }
  // Match the pattern against the input string
  Matcher m = REMOVE_TAGS.matcher(string);
  // Replace every <...> element with ""
  return m.replaceAll("");
}

// Using the method to strip HTML tags and put the result in a text control
this.itemView.setText(
 Html.fromHtml(
  new String(removeTags("<h1>My Title here</h1>").getBytes("ISO-8859-1"),"UTF-8")
 )
);
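
One caveat about the getBytes("ISO-8859-1") / new String(..., "UTF-8") step in the call above: it only helps when the input is mojibake, meaning UTF-8 bytes that were earlier decoded as ISO-8859-1. The small plain-Java sketch below illustrates the difference; CharsetRoundTrip and reinterpret are hypothetical names used only for this demonstration.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class: isolates the re-encoding step from the answer above.
final class CharsetRoundTrip {
    // Takes the string's ISO-8859-1 bytes and decodes them again as UTF-8.
    static String reinterpret(String s) {
        return new String(s.getBytes(StandardCharsets.ISO_8859_1), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Plain ASCII comes back unchanged.
        System.out.println(reinterpret("My Title here"));

        // Mojibake case: the UTF-8 bytes of "é" (0xC3 0xA9) were wrongly read
        // as two ISO-8859-1 characters. The round trip repairs it: prints true.
        System.out.println(reinterpret("\u00C3\u00A9").equals("\u00E9"));

        // Correctly decoded "é": the single byte 0xE9 is not valid UTF-8,
        // so it is replaced and the text is damaged: prints false.
        System.out.println(reinterpret("\u00E9").equals("\u00E9"));
    }
}
```

If the string was already decoded correctly, for example by the XML or JSON parser, it is probably safer to skip the re-encoding and pass the result of removeTags on directly.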

0
votes

In Kotlin, you can do this with a simple extension function:

fun String.removeHtml() = replace("\\<.*?\\>".toRegex(), "")

You can call this function like this:

val cleanString = yourHtmlString.removeHtml()