0
我試圖使用dryscrape和python爲學習目的而刮掉http://quotes.toscrape.com/。我能夠通過class =「quote」獲得所有div。想用class =「quote」循環div的列表,並使用xpath從這個父元素獲取多個數據。Dryscrape:使用xpath從父節點列表中刮取子節點數據
import dryscrape
from bs4 import BeautifulSoup
session = dryscrape.Session()
url = 'http://quotes.toscrape.com/'
print 'Visiting the URL...'
session.visit(url)
print 'Status: ', session.status_code()
for div in session.xpath("//div[@class='quote']"):
# please help me to scrape author and quote for each div elements