[javascript学习指南]java用正则表达式删除html标签几个例子

更新时间：2019-08-21 来源：正则表达式手机版 字体：大中小

【www.bbyears.com--正则表达式】

例子1

新闻内容或者博客文章，如果显示摘要，需要去除内容的html格式标签，找到一个正则表达式，实现了：

代码如下

/**
     * 删除input字符串中的html格式
     *
     * @param input
     * @param length
     * @return
     */
    public static String splitAndFilterString(String input) {
        if (input == null || input.trim().equals("")) {
            return "";
        }
        // 去掉所有html元素,
        String str = input.replaceAll("＼＼&[a-zA-Z]{1,10};", "").replaceAll(
                "<[^>]*>", "").replaceAll("[(/>)<]", "");
        return str;
    }

过滤掉所有script脚本的正则：
content.replaceAll("<script[^>]*?>[＼＼s＼＼S]*?<＼＼/script>", "")
过滤掉所有style的正则：
content.replaceAll("<[＼＼s]*?style[^>]*?>[＼＼s＼＼S]*?<[＼＼s]*?＼＼/[＼＼s]*?style[＼＼s]*?>", "");
过滤掉所有html标签，保留p和br标签。
content.replaceAll("]*>", "");
过滤掉所有html标签，保留p标签。
content.replaceAll("]*>", "");

例子2

代码如下

import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HtmlUtil {
    private static final String regEx_script = "<script[^>]*?>[＼＼s＼＼S]*?<＼＼/script>"; // 定义script的正则表达式
    private static final String regEx_style = "]*?>[＼＼s＼＼S]*?<＼＼/style>"; // 定义style的正则表达式
    private static final String regEx_html = "<[^>]+>"; // 定义HTML标签的正则表达式
    private static final String regEx_space = "＼＼s*|＼t|＼r|＼n";//定义空格回车换行符

    /**
     * @param htmlStr
     * @return
     * 删除Html标签
     */
    public static String delHTMLTag(String htmlStr) {
        Pattern p_script = Pattern.compile(regEx_script, Pattern.CASE_INSENSITIVE);
        Matcher m_script = p_script.matcher(htmlStr);
        htmlStr = m_script.replaceAll(""); // 过滤script标签

        Pattern p_style = Pattern.compile(regEx_style, Pattern.CASE_INSENSITIVE);
        Matcher m_style = p_style.matcher(htmlStr);
        htmlStr = m_style.replaceAll(""); // 过滤style标签

        Pattern p_html = Pattern.compile(regEx_html, Pattern.CASE_INSENSITIVE);
        Matcher m_html = p_html.matcher(htmlStr);
        htmlStr = m_html.replaceAll(""); // 过滤html标签

        Pattern p_space = Pattern.compile(regEx_space, Pattern.CASE_INSENSITIVE);
        Matcher m_space = p_space.matcher(htmlStr);
        htmlStr = m_space.replaceAll(""); // 过滤空格回车标签
        return htmlStr.trim(); // 返回文本字符串
    }

    public static String getTextFromHtml(String htmlStr){
        htmlStr = delHTMLTag(htmlStr);
        htmlStr = htmlStr.replaceAll(" ", "");
        htmlStr = htmlStr.substring(0, htmlStr.indexOf("。")+1);
        return htmlStr;
    }

    public static void main(String[] args) {
        String str = " 整治“四风”   清弊除垢
公司召开党的群众路线教育实践活动动员大会
";
        System.out.println(getTextFromHtml(str));
    }
}

例子3

代码如下

* 删除Html标签

* @param inputString

* @return

public static String htmlRemoveTag（String inputString） {

if （inputString == null）

return null;

String htmlStr = inputString; // 含html标签的字符串

String textStr = "";

java.util.regex.Pattern p_script;

java.util.regex.Matcher m_script;

java.util.regex.Pattern p_style;

java.util.regex.Matcher m_style;

java.util.regex.Pattern p_html;

java.util.regex.Matcher m_html;

try {

//定义script的正则表达式{或<script[^>]*?>[＼s＼S]*?<＼/script>

String regEx_script = "<[＼s]*?script[^>]*?>[＼s＼S]*?<[＼s]*?＼/[＼s]*?script[＼s]*?>";

//定义style的正则表达式{或]*?>[＼s＼S]*?<＼/style>

String regEx_style = "<[＼s]*?style[^>]*?>[＼s＼S]*?<[＼s]*?＼/[＼s]*?style[＼s]*?>";

String regEx_html = "<[^>]+>"; // 定义HTML标签的正则表达式

p_script = Pattern.compile（regEx_script, Pattern.CASE_INSENSITIVE）；

m_script = p_script.matcher（htmlStr）；

htmlStr = m_script.replaceAll（""）； // 过滤script标签

p_style = Pattern.compile（regEx_style, Pattern.CASE_INSENSITIVE）；

m_style = p_style.matcher（htmlStr）；

htmlStr = m_style.replaceAll（""）； // 过滤style标签

p_html = Pattern.compile（regEx_html, Pattern.CASE_INSENSITIVE）；

m_html = p_html.matcher（htmlStr）；

htmlStr = m_html.replaceAll（""）； // 过滤html标签

textStr = htmlStr;

} catch （Exception e） {

e.printStackTrace（）；

}

return textStr;// 返回文本字符串

}

本文来源：http://www.bbyears.com/aspjiaocheng/63518.html

链接：http://www.bbyears.com/aspjiaocheng/63518.html
[javascript学习指南]java用正则表达式删除html标签几个例子(转载时请注明本文出处及链接)

猜你感兴趣

【全民飞机大战最新战机】全民飞机大战新战机阿波罗属于及升级费用攻略 2019-08-21
【龙腾世纪3】龙腾世纪：审判directx error问题解决的方法 2019-08-21
[说说心情短语人生感悟]心情短语人生感悟 2019-08-21
[好朋友留言短句子暖]给好朋友的留言句子 2019-08-21
通过wordpress数据库文件|通过WordPress数据库操作类wpdb访问数据库 2019-08-21
[天天炫斗烈焰战马]天天炫斗烈风武器锻造和炙炎武器锻造礼包内容及获得方法 2019-08-21
【穿越火线锋角色属性】穿越火线命运三角色属性以及价格的介绍 2019-08-21
生活感悟经典句子微信_给好朋友的感悟生活的经典句子 2019-08-21
【jquery中自定义插件开发教程视频】jquery中自定义插件开发教程 2019-08-21
[zblog php]ZBLOG调用随机文章、热门文章、热评文章的php代码 2019-08-21

本类排行

本类最新

更多>>

[javascript学习指南]java用正则表达式删除html标签几个例子

猜你感兴趣

热门标签

本类排行

本类最新