2017-10-21 52 views
import net.ruippeixotog.scalascraper.browser.JsoupBrowser
import net.ruippeixotog.scalascraper.dsl.DSL._
import net.ruippeixotog.scalascraper.dsl.DSL.Extract._
import net.ruippeixotog.scalascraper.dsl.DSL.Parse._

object Scraper {
  val browser = JsoupBrowser()

  // Fetch and parse the page (runs when the object is initialised)
  val doc = browser.get("http://camhr.com")

  def main(args: Array[String]): Unit = {
    // Extract the element with id "footer", if it exists
    val items = doc >?> element("#footer")
    println(items)
  }
}

On the website I see English text, but when I run this code the console (IntelliJ IDE) prints Chinese.

Wrong output encoding rendering with scalascraper

How can I get the text rendered in English?

Answer


Try something like this:

import org.jsoup.Jsoup

val url = "http://camhr.com"

// Ask the server for the English version of the page
val doc = Jsoup.connect(url)
  .header("Accept-Language", "en")
  .get()
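Note that the snippet above calls jsoup directly, so the result is a jsoup Document rather than a scalascraper one. If you want to keep using the scalascraper DSL from the question, one possible approach is to fetch the HTML with jsoup and then hand it to scalascraper's parseString. The following is only a minimal sketch: the object name EnglishScraper and the helpers fetchHtml and footerText are made up for illustration, and it assumes the site actually chooses its language from the Accept-Language header.

```scala
import net.ruippeixotog.scalascraper.browser.JsoupBrowser
import net.ruippeixotog.scalascraper.dsl.DSL._
import net.ruippeixotog.scalascraper.dsl.DSL.Extract._
import org.jsoup.Jsoup

// Hypothetical helper object, not part of scalascraper or jsoup.
object EnglishScraper {
  private val browser = JsoupBrowser()

  // Fetch the raw HTML with jsoup, sending an Accept-Language header.
  // Assumption: the server honours this header when picking a language.
  def fetchHtml(url: String, language: String = "en"): String =
    Jsoup.connect(url)
      .header("Accept-Language", language)
      .get()
      .outerHtml()

  // Parse an HTML string with scalascraper and return the text of the
  // element with id "footer", or None if there is no such element.
  def footerText(html: String): Option[String] =
    browser.parseString(html) >?> text("#footer")

  def main(args: Array[String]): Unit =
    println(footerText(fetchHtml("http://camhr.com")))
}
```

Keeping the fetch and the parsing in separate functions means the extraction can be tried on a local HTML string without touching the network.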