java正則過濾html_求一個去除html源代碼中的無效代碼( 如注釋

A. 【java作業向】正則表達式過濾HTML標簽

過濾HTML標簽的Java正則表達式 (?s)<.*?/?.*?>

按照你的要求編寫的用正則表達式過濾HTML標簽的Java程序如下

public class AA {

public String tagFilter(String s){

String regex = "(?s)<.*?/?.*?>";

String ss=s.replaceAll(regex,"");

return ss;

}

public static void main(String[] args) {

String s="<div class="guid time online">測試 abc</div><span data-url="games/details/" class="guid done">你好13548</span><a href="games/details/" class="guid">15個字母Abc</a><i class="icon-guid"/>";

String result=new AA().tagFilter(s);

System.out.println(result);

}

B. java正則表達式過濾html p標簽

用JavaScript方法如下，JAVA語言類似：
'你的HTML文本'.replace(/.+>(.+)<.+/,'$1')

C. 如何寫一個java正則表達式，用來判斷給定字元串是否匹配到<html標簽

如果只是匹配<html的話，直接s.contains("<html");就可以。

D. 如何使用java的正則表達式提取html標簽

你的意思是不是用Java訪問一個鏈接，在返回的數據中提取出放在標簽中的數據，例如取出<img src=""/>這些標簽中的數據

E. java中怎麼用正則截取html中的全部<td ... >....</td>

<td( .*?)?>.*?</td>
//看看這個吧，可能沒那麼完善，但應該足以應付大多數情況。

import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TdMatcher {
public static void main(String[] args) {
String html = "";
html += "<table>";
html += "<tr>";
html += "<td>message</td>";
html += "<td colspan='2'>message2</td>";
html += "<td >message3</td>";
html += "</tr>";
html += "</table>";
String[] matcher = matcher(html);
for (int i = 0; i < matcher.length; i++) {
System.out.println(matcher[i]);
}
}

private static String[] matcher(String html) {
Pattern pattern = Pattern.compile("<td( .*?)?>.*?</td>");
Matcher matcher = pattern.matcher(html);
List<String> list = new ArrayList<String>();
while (matcher.find()) {
list.add(matcher.group());
}
return list.toArray(new String[0]);
}
}

F. JAVA正則表達式解析HTML字元串

public class TestString4 {
    public static void main(String[] args) {
        String s = "<R_Data> 0005,實驗室0,0,0|0101,實驗室A-測試點1,200,200|0102,實驗室C-測試點2,80,400|0109,實驗室C-測試點1,80,300|1020,實驗室C-測試點3,80,500|1141,實驗室A-測試點2,400,400|1146,實驗室A-測試點3,300,300|1239,實驗室B-測試點1,50,150|1240,實驗室B-測試點2,80,200|1264,實驗室B-測試點3,220,110| </R_Data>";
        s = s.replace("<R_Data>", "").replace("</R_Data>", "").trim();
        String ss[] = s.split("\|");
        String[][] sss = new String[ss.length][];
        for(int i=0;i<ss.length;i++){
            sss[i] = ss[i].split(",");
        }
    }
}

sss中存放的就是你需要的數據

G. java如何去掉字元串中的 html標簽

1.去除單個HTML標記
String s="asdfasd<script>asdfsfd</script>1234";
System.out.println(s.replaceAll("<script.*?(?<=/script>)",""));
2.去除所有HTML標記
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HTMLSpirit{ ITjob 遠標教育
public static String delHTMLTag(String htmlStr){
String regEx_script="<script[^>]*?>[\\s\\S]*?<\\/script>"; //定義script的正則表達式
String regEx_style="<style[^>]*?>[\\s\\S]*?<\\/style>"; //定義style的正則表達式
String regEx_html="<[^>]+>"; //定義HTML標簽的正則表達式

Pattern p_script=Pattern.compile(regEx_script,Pattern.CASE_INSENSITIVE);
Matcher m_script=p_script.matcher(htmlStr);
htmlStr=m_script.replaceAll(""); //過濾script標簽

Pattern p_style=Pattern.compile(regEx_style,Pattern.CASE_INSENSITIVE);
Matcher m_style=p_style.matcher(htmlStr);
htmlStr=m_style.replaceAll(""); //過濾style標簽

Pattern p_html=Pattern.compile(regEx_html,Pattern.CASE_INSENSITIVE);
Matcher m_html=p_html.matcher(htmlStr);
htmlStr=m_html.replaceAll(""); //過濾html標簽

return htmlStr.trim(); //返迴文本字元串
}
}

H. 求一個去除html源代碼中的無效代碼( 如注釋,空白字元,空白行等)的 java正則表達式~謝謝

注釋的正則：
頁面樣式的正則：<style[^>]*>[^<]*?</style>
HTML標簽的正則：<[^>]*?>

/// <summary>
/// 正則替換
/// </summary>
/// <param name="sOld">原內容</param>
/// <param name="sRegexString">正則表達式</param>
/// <param name="sReplaceString">新字元串</param>
/// <returns></returns>
public static string ReplaceRegxString(string sOld, string sRegexString, string sReplaceString)
{
Regex reg = new Regex(@sRegexString, RegexOptions.Singleline | RegexOptions.IgnoreCase);
return reg.Replace(sOld, sReplaceString);
}

熱點內容

網路設備能用到什麼發布：2025-07-13 14:37:26 瀏覽：64

暴風轉碼如何添加文件夾發布：2025-07-13 14:23:56 瀏覽：515

延安整合網路營銷有哪些發布：2025-07-13 14:18:44 瀏覽：74

查找word打開過的文件在哪裡發布：2025-07-13 14:14:23 瀏覽：137

b樹java代碼發布：2025-07-13 14:07:46 瀏覽：683

電腦文件存儲發布：2025-07-13 14:05:23 瀏覽：657

蘭州中考徵集志願在哪個網站發布：2025-07-13 14:04:37 瀏覽：215

cs文件上傳下載發布：2025-07-13 13:53:11 瀏覽：244

拷貝文件到根目錄下重命名linux 發布：2025-07-13 13:48:20 瀏覽：603

api函數的頭文件發布：2025-07-13 13:47:45 瀏覽：249

華為怎麼綁定迷你編程發布：2025-07-13 13:14:41 瀏覽：215

機構怎麼申請少兒編程考級發布：2025-07-13 13:03:33 瀏覽：495

崑山數控編程哪裡好學發布：2025-07-13 12:54:33 瀏覽：459

jspcfor跳出發布：2025-07-13 12:52:33 瀏覽：65

word怎麼插入羅馬數字i 發布：2025-07-13 12:13:37 瀏覽：315

哪個網站可以找到法人代表發布：2025-07-13 12:08:38 瀏覽：106

蘋果5s日版a1453支持什麼網路發布：2025-07-13 12:08:28 瀏覽：297

微信打開文件如何設置發布：2025-07-13 12:07:05 瀏覽：218

漫畫書app中非可視組件是什麼發布：2025-07-13 11:50:58 瀏覽：3

d盤文件隱藏怎麼恢復發布：2025-07-13 11:50:23 瀏覽：287

導航:首頁 > 編程語言 > java正則過濾html

java正則過濾html

與java正則過濾html相關的資料

友情鏈接