2013-03-11 92 views
0

我想解析一個HTML對象使用簡單的HTML DOM解析器使用PHP。我想提取的特定部分沒有正確包裝在任何標籤中。簡單的HTML DOM解析器得到文本沒有標籤

<li class="tags"> 
    Required text: <span itemprop="testCat"><a href="/topics/new-topic/index.html" title="New Topic" onclick="s_objectID=&quot;http://www.example.com/topics/new-topic/index.html_1&quot;;return this.s_oc?this.s_oc(e):true">New Topic</a></span>, <span itemprop="testCat"><a href="/topics/new-topic-2/index.html" title="New Topic" onclick="s_objectID=&quot;http://www.example.com/topics/new-topic-2/index.html_1&quot;;return this.s_oc?this.s_oc(e):true">New Topic</a></span>, <span itemprop="testCat"><a href="/topics/new-topic-3/index.html" title="New Topic 3" onclick="s_objectID=&quot;http://www.example.com/topics/new-topic-3/index.html_1&quot;;return this.s_oc?this.s_oc(e):true">New Topic 3</a></span>, 
    <div class="more"> 
     <a href="javascript: void(0);" class="more-trigger" onclick="s_objectID=&quot;javascript: void(0);_1&quot;;return this.s_oc?this.s_oc(e):true">more</a> 
     <div class="more-tags" style="top: 15px; left: 0px; display: none;"> 
     <div class="hd"></div> 
     <div class="bd"> 
      <ul id="topic-filedin"> 
       <li>Another Required Text : 
        <a href="/topics/new-topic-4/index.html" onclick="s_objectID=&quot;http://www.example.com/topics/new-topic-4/index.html_1&quot;;return this.s_oc?this.s_oc(e):true">New Topic 4</a> 
       </li> 
       <li>Topic Intended For : 
        <a href="/topics/for-kids/index.html" onclick="s_objectID=&quot;http://www.example.com/topics/for-kids/index.html_1&quot;;return this.s_oc?this.s_oc(e):true">For Kids</a> 
       </li> 
      </ul> 
     </div> 
     <div class="ft"></div> 
     </div> 
     <script type="text/javascript"> 
     SNI.Node.ArticleInfo.moreTags(); 
     </script> 
    </div> 
</li> 

我能夠得到使用

$categories = $single_content->find('li[class=tags] span'); 
foreach ($categories as $key) { 
    echo $key->plaintext . '<br>'; 
} 

包裹標籤中的文本,我無法找出Required textAnother Required TextTopic Intended For

回答

0

要獲得 '另一個所需的文本' 和 '主題旨在' 您可以使用此 -

$文字= $ single_content->找到( 「UI [CLASS =話題filedin] /李」);

$ textArray = array();

的foreach($文本$ TA)

{

$textArray[] = $ta->plaintext; 

}

,你會得到一個數組所需要的文字。

相關問題