2017-08-01 115 views

Very basic question: how do I return an element of a tag in Beautiful Soup?

How would I return 001, 002 ... 101 from:

<us-applicant sequence="001" app-type="applicant" designation="us-only"> 
... 
<us-applicant sequence="101" app-type="applicant" designation="us-only"> 

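using Beautiful Soup? I know the basics of returning whatever sits between two tags, but I don't know what this element is actually called.

Answers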


You can do something like this with a list comprehension, reading each tag's ['sequence'] attribute:

from bs4 import BeautifulSoup 

data = ''' 
<us-applicant sequence="001" app-type="applicant" designation="us-only"> 
<us-applicant sequence="100" app-type="applicant" designation="us-only"> 
<us-applicant sequence="101" app-type="applicant" designation="us-only"> 
''' 

soup = BeautifulSoup(data, 'html.parser') 
>>> [tag['sequence'] for tag in soup.find_all('us-applicant')]
['001', '100', '101']
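
To name the thing the question is asking about: sequence, app-type and designation are attributes of the tag. If some tags might not carry the attribute, tag['sequence'] raises a KeyError. Below is a minimal sketch of two ways around that. The sample markup is made up for illustration (the second tag deliberately has no sequence), and the variable names are not from the original post.

```python
from bs4 import BeautifulSoup

# Hypothetical sample data: the second tag has no "sequence" attribute.
data = '''
<us-applicant sequence="001" app-type="applicant"></us-applicant>
<us-applicant app-type="applicant"></us-applicant>
<us-applicant sequence="101" app-type="applicant"></us-applicant>
'''

soup = BeautifulSoup(data, 'html.parser')

# tag.get() returns None instead of raising KeyError when the attribute is absent.
with_gaps = [tag.get('sequence') for tag in soup.find_all('us-applicant')]

# Passing sequence=True keeps only the tags that actually have the attribute.
sequences = [tag['sequence'] for tag in soup.find_all('us-applicant', sequence=True)]

# tag.attrs exposes all attributes of a tag as a dictionary.
first_attrs = soup.find('us-applicant').attrs

print(with_gaps)
print(sequences)
print(first_attrs)
```

Use tag.get() when you want to keep one entry per tag, and the sequence=True filter when you only care about tags that have a value.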