我想从HTML标签中使用漂亮汤提取属性。怎么做?
例如:
<div class="search-pagination-top clearfix mtop ">
<div class="row"><div class="col-l-4 mtop pagination-number" tabindex="0"
aria-label="Page 1 of 15 "><div>Page <b>1</b> of <b>15</b> </div></div>
如何从“咏叹号”属性中获取文本?
我试着使用select(),但是没有用。
发布于 2019-01-25 07:00:06
您可以像字典一样提取属性值。使用密钥aria-label
Ex:
from bs4 import BeautifulSoup
html = """<div class="search-pagination-top clearfix mtop ">
<div class="row"><div class="col-l-4 mtop pagination-number" tabindex="0"
aria-label="Page 1 of 15 "><div>Page <b>1</b> of <b>15</b> </div></div>
"""
soup = BeautifulSoup(html, "html.parser")
print( soup.find("div", class_="col-l-4 mtop pagination-number")["aria-label"] )
输出:
Page 1 of 15
https://stackoverflow.com/questions/54360308
复制相似问题