3 回答

TA貢獻2051條經(jīng)驗 獲得超10個贊
Document document = Jsoup.parse("<div id=\"AE9D7F630640426F8457A661607D2B8E-5-2\" style=\"display: none;\" class=\"kbcontent\"> 19040172b-1 <br>SQL Server開發(fā) <br> <font title=\"老師\">鄭尚</font> <br> <font title=\"周次(節(jié)次)\">3-5,7-14(周)</font> <br> <font title=\"教室\">東區(qū)綜合樓D-101</font> <br> </div>");
System.out.println(document.text());
Output:19040172b-1 SQL Server開發(fā) 鄭尚 3-5,7-14(周) 東區(qū)綜合樓D-101
不知道是否滿足樓主的需求?
Document document = Jsoup.parse("<div id=\"AE9D7F630640426F8457A661607D2B8E-5-2\" style=\"display: none;\" class=\"kbcontent\"> 19040172b-1 <br>SQL Server開發(fā) <br> <font title=\"老師\">鄭尚</font> <br> <font title=\"周次(節(jié)次)\">3-5,7-14(周)</font> <br> <font title=\"教室\">東區(qū)綜合樓D-101</font> <br> </div>");
Element div = document.getElementById("AE9D7F630640426F8457A661607D2B8E-5-2");
TextNode n1 = (TextNode) div.childNode(0);
System.out.println(n1.text()); // 19040172b-1
TextNode n2 = (TextNode) div.childNode(2);
System.out.println(n2.text()); // SQL Server開發(fā)
// ...
如果樓主的格式是固定的直接像上面這樣解析HTML會比較好一些,不需要REGEX。

TA貢獻1815條經(jīng)驗 獲得超10個贊
String html = "<div id=\"AE9D7F630640426F8457A661607D2B8E-5-2\" style=\"display: none;\" class=\"kbcontent\"> 19040172b-1 <br>SQL Server Develop <br> <font title=\"teacher\">zheng</font> <br> <font title=\"week\">3-5,7-14</font> <br> <font title=\"classroom\">D-101</font> <br> </div> ";
html = html.replaceAll("<br>", "#~#");
Document doc = Jsoup.parse(html.toString());
String newHtml = doc.text();
String[] ary = newHtml.split("#~#");
for (int i = 0;i < ary.length;i++){
System.out.println(ary[i]);
}
添加回答
舉報