0
我試圖從抓取的內容中獲取特定信息。隨着nutch將整個網站的文本全部放在一起,我很難獲得特定的內容。我想爲我在彈性搜索中編入索引的抓取文本內容添加分隔符。在nutch爬行內容中添加分隔符
例如,而從http://example.com/抓取數據的獲取elasticsearch索引的數據
Example Domain Example Domain This domain is established to be used for illustrative examples in documents. You may use this domain in examples without prior coordination or asking for permission. More information...
我希望它是格式
Example Domain | Example Domain | This domain is established to be used for illustrative examples in documents. You may use this domain in examples without prior coordination or asking for permission. | More information...
我們能否在Nutch的配置某處定義這個分隔符?
謝謝。很有幫助 – vibhash
很高興幫助! –