下面將輸出的程序
<DIV>平均響應時間服務器是關鍵的,因爲它的值282 & GT; 0毫秒。 < br/> [閾值詳細信息:如果值爲& gt; 0,警告如果值= 0,則清除值& lt; 0] < /格>
package test;
import java.io.StringReader;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.apache.xerces.dom.DocumentImpl;
import org.cyberneko.html.parsers.DOMFragmentParser;
import org.w3c.dom.Document;
import org.w3c.dom.DocumentFragment;
import org.xml.sax.InputSource;
public class TestHTMLDOMFragment {
private static final String PARSE_TEXT = "<div>Average Response Time server is critical because its value 282 > 0 ms. <br>[Threshold Details : Critical if value > 0, Warning if value = 0, Clear if value < 0]</div>";
public static void main(String[] argv) throws Exception {
DOMFragmentParser parser = new DOMFragmentParser();
// output the elements in lowercase, nekohtml doesn't do this by default
parser.setProperty("http://cyberneko.org/html/properties/names/elems","lower");
// if this is set to true (the default, you dont need to specifiy this)
// then neko html wont and an html,head and body tags to the response.
parser.setFeature("http://cyberneko.org/html/features/document-fragment",true);
Document document = new DocumentImpl();
DocumentFragment fragment = document.createDocumentFragment();
// parse the document into a fragment
parser.parse(new InputSource(new StringReader(PARSE_TEXT)), fragment);
TransformerFactory transformerFactory = TransformerFactory.newInstance();
Transformer transformer = transformerFactory.newTransformer();
// don't display the namespace declaration
transformer.setOutputProperty("omit-xml-declaration", "yes");
DOMSource source = new DOMSource(fragment);
StreamResult result = new StreamResult(System.out);
transformer.transform(source, result);
}
}
在代碼中的註釋顯示以上我使用解析器設置。
我也用了org.cyberneko.html.parsers.DOMFragmentParser你可以也可以解析文本,這只是一個HTML片段
我使用nekohtml 14年9月1日
如果你使用maven,這裏是pom.xml依賴關係部分...
<dependencies>
<dependency>
<groupId>net.sourceforge.nekohtml</groupId>
<artifactId>nekohtml</artifactId>
<version>1.9.14</version>
<type>jar</type>
</dependency>
</dependencies>