BeautifulSoup .get does not return "href"

1 answer

prevLink = soup2.select('.previous_post') [Previous Post: <a href Then I tried to pull out the link with .get('href'), but it returns None. >>> print(prevLink[0].get('href'))

Views: 14, asked 2018-12-19, votes: 1

Answer accepted
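
A likely cause of the None in the question above is that `.previous_post` matches a container element, while the `href` attribute sits on the `<a>` nested inside it. The following is a minimal sketch under that assumption; the `<span>` tag name, the URL, and the link text are hypothetical and do not come from the question.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: tag name, URL, and link text are assumptions.
html = (
    '<span class="previous_post">Previous Post: '
    '<a href="/posts/41">Older entry</a></span>'
)
soup2 = BeautifulSoup(html, "html.parser")

# select() returns a list of the elements that carry the class.
prevLink = soup2.select(".previous_post")
container = prevLink[0]

# The container has no href attribute of its own, so .get returns None.
container_href = container.get("href")

# Select the nested <a> explicitly and read href from it.
anchor = soup2.select_one(".previous_post a")
anchor_href = anchor.get("href") if anchor is not None else None

print(container_href)
print(anchor_href)
```

If the class is on the `<a>` itself, the original call would already work, so it is worth printing `prevLink[0].name` to see which element was matched.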

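2 answers

BeautifulSoup .link.get("href") only returns None

I am using BeautifulSoup in my web crawler and, for some reason, my links variable returns the block of code I specified, but when I try to get the "href" it only returns "None". from bs4 import BeautifulSoup r = requests.get("https://www.kickstarter.com/discoverpageGrab.find_all("div", {

Views: 1, asked 2017-01-29, votes: 3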

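2 answers

Getting only the first link from a list of URLs with BeautifulSoup

I parsed an entire HTML file and extracted some URLs with the Beautifulsoup module in Python, using code like this: for line in link : I get a series of links in the shell, which can be observed Edit: the web page is: , and the script must return the first short U

Views: 3, asked 2012-10-14, votes: 3

Answer accepted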

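3 answers

BeautifulSoup: how to get the text from an <a> tag

I don't know how to extract the text from this class. I want 7,457, but I can't work it out. I tried, but it only shows me the link. response = requests.get(ur

Views: 1, asked 2020-10-20, votes: 0

Answer accepted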

3 answers

Beautiful Soup: looking for only one pattern

The code is as follows: import codecs fd = codecs.open('input.html', 'r') soup = BeautifulSoup(html, "lxml") link.extract() text = link.get('href

Views: 41, asked 2021-11-01, votes: 0

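1 answer

Beautiful Soup error handling

I would like to know how to handle the case where the href after Text: does not exist. Is there a better way to search for what comes after Contact:?

Views: 2, asked 2011-09-14, votes: 0

Answer accepted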

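1 answer

An example given in the BeautifulSoup documentation does not work

I am trying the examples given in the BeautifulSoup documentation, and one of them does not give the expected result. p> <a href="http://example.com/elsie" class="sister" id="link1">Elsie

Views: 3, asked 2015-01-19, votes: 1

Answer accepted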

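2 answers

After a loop, what is the correct way to return all the iterated elements in a list?

I have the following function, which takes an .html document and extracts some content: tree = etree.fromstring for e in tree.xpath('//b'): # Here, instead of the above line, I would like to get in a single string all the printed ele

Views: 4, asked 2017-04-10, votes: 2

Answer accepted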

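1 answer

Extracting part of a URL stored in a list in a dataframe - Python

In the example below, 25709, I am trying to extract only the numeric part and assign it to a variable, call it athleteID, which I can later add to a dynamic URL to iterate and send search requests: i = id.split('\=') print(id_list) '<a href

Views: 4, asked 2021-11-27, votes: 1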
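
One thing to note about the excerpt above: in Python, `'\='` is a two-character string (a backslash followed by `=`), so `split('\=')` only splits where a literal backslash precedes the equals sign. A regular expression is one way to pull out the digits instead. This is a sketch only; the href format shown and the helper name `extract_athlete_id` are hypothetical, since the original markup is truncated.

```python
import re

def extract_athlete_id(cell):
    """Return the first run of digits that follows '=' in cell, or None."""
    match = re.search(r"=(\d+)", cell)
    return match.group(1) if match else None

# Hypothetical cell values; the real href format is not shown in the question.
id_list = [
    '<a href="/athlete?athleteID=25709">Runner A</a>',
    '<a href="/athlete?athleteID=31002">Runner B</a>',
    "no link here",
]

athlete_ids = [extract_athlete_id(cell) for cell in id_list]
print(athlete_ids)
```

The IDs come back as strings, which is convenient for building URLs; convert with `int()` only if arithmetic is needed.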

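2 answers

Getting links that contain a specific word from a web page using Beautifulsoup and Selenium

I wrote this code to log in to my FB account and get all the group links on the page using Selenium and BeautifulSoup, but the BeautifulSoup part does not work properly. I would like to know how to use Selenium and BeautifulSoup in the same code. pwd = raw_input(

Views: 0, asked 2015-03-19, votes: 0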

3 answers

AttributeError: 'NoneType' object has no attribute 'attrs'

AttributeError: 'NoneType' object has no attribute 'attrs' import urllib2 Request(url, headers={'User-Agent' : "Magic Browser"}) soup = BeautifulSoup(page,'html.parser'

Views: 0, asked 2018-03-29, votes: 0

Answer accepted
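
This error usually means that `find()` matched nothing and returned None, and the code then read `.attrs` on that None. A minimal sketch of a guard follows; the markup, the `id` values, and the helper name `href_of` are hypothetical and are not taken from the question.

```python
from bs4 import BeautifulSoup

# Hypothetical page: it has no <a> with id="next", so find() returns None for it.
html = '<p><a id="prev" href="/page/1">Previous</a></p>'
soup = BeautifulSoup(html, "html.parser")

def href_of(soup, link_id):
    """Return the href of <a id=link_id>, or None if the tag or attribute is missing."""
    tag = soup.find("a", id=link_id)
    if tag is None:
        # Nothing matched; reading tag.attrs here would raise AttributeError.
        return None
    return tag.attrs.get("href")

found = href_of(soup, "prev")
missing = href_of(soup, "next")
print(found, missing)
```

When the tag is expected to exist, printing the fetched HTML is a quick way to check whether the server returned different markup than the browser shows.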

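1 answer

Python: recursive spider loop

I have a simple BeautifulSoup crawler that returns links from a server to depth 2 or deeper, depending on the number of functions added: from bs4 import BeautifulSoup soup = BeautifulSoup(pageText, "html.parser") for link in soup.findAll("a"): href = link.get("

Views: 2, asked 2016-05-17, votes: 1

Answer accepted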

3 answers

Getting an item from a bs4.element.Tag

I have an element of type bs4.element.Tag <a class="nav-link match-link-stats" href="/football/matches/match867851_Kalteng_Putra-Arema-online

Views: 1, asked 2019-08-07, votes: 12

Answer accepted

5 answers

TypeError: must be str, not NoneType

Here is my code: from bs4 import BeautifulSoup page = 1 plain_text = source_code.text href = "

Views: 12, asked 2017-04-23, votes: 1

Answer accepted
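
A TypeError of this kind typically appears when a string is concatenated with the result of `.get('href')` and that result is None because the tag has no `href`. The sketch below shows one way to guard against it using only the standard library; `BASE_URL`, the sample values, and the helper name `build_url` are hypothetical.

```python
from urllib.parse import urljoin

BASE_URL = "https://example.com/shop/"  # hypothetical base URL

def build_url(base, href):
    """Return an absolute URL, or None when the tag had no href."""
    if href is None:
        return None
    return urljoin(base, href)

# Values as link.get("href") might return them; None stands for a missing href.
hrefs = ["item/1", None, "item/2"]
urls = [u for u in (build_url(BASE_URL, h) for h in hrefs) if u is not None]
print(urls)

# Plain concatenation with None is what raises the TypeError.
try:
    BASE_URL + None
    raised = False
except TypeError:
    raised = True
print(raised)
```

`urljoin` also handles absolute and root-relative hrefs, which plain string concatenation does not.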

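1 answer

BeautifulSoup4 .get('href') returns not only the href but also some junk

I am writing a program that searches Google for "jopa olega" and prints the URL of the first result. import requests, webbrowser, bs4 res.raise_for_status() links = soup.select('div#main > div

Views: 0, asked 2019-10-18, votes: 0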

1 answer

Is it possible to access links containing specific text on a website using Python 3?

dls=https://www.sanantonio.gov/DevServ/CrystalReports/BldgActHDMonticelloPrk.xls' resp = requests.get

Views: 0, asked 2019-04-12, votes: 0

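1 answer

Simulating cookies with a Python web crawler

I am trying to build a web crawler with the requests library and BeautifulSoup4, but for it to work I have to visit a link that activates a specific cookie, so that I can then search the content for that query. import requests page = 1 source_code = requests.

Views: 2, asked 2014-09-16, votes: 0

Answer accepted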

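2 answers

Scraping specific tags without a class name in Python

The interesting data on the page is contained in the following structure: <tr style=""> <a hrefclass="hidden-xs"></td></tbody> This is what I tried: page =

Views: 2, asked 2017-05-15, votes: 0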

2 answers

How to find the second occurrence of a string with Beautiful Soup in Python


<td><a href="javascript:__doPostBack('ctl00$cph1$grdRfqSearch','Page$21')">...

Views: 1, asked 2019-08-10, votes: 1

Answer accepted

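2 answers

Searching a list for multiple substrings in Python?

A list of 15 links; I want to find the links that contain 'sen_floor' or 'asm_floor'. This is my code so far (ca_data is the original link): import requests soup = BeautifulSoup(ca.content, 'html.parser') for link in soup.findAll('a', at

Views: 5, asked 2020-01-03, votes: 2

Answer accepted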
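
Once the hrefs have been collected into a plain list, filtering for several substrings can be done with `any()` inside a list comprehension. This is a sketch with hypothetical link values; only the two keywords `sen_floor` and `asm_floor` come from the question.

```python
# Hypothetical hrefs; in the question these would come from link.get("href").
links = [
    "/pub/bill/sen_floor_analysis_01.html",
    "/pub/bill/asm_floor_vote_02.html",
    "/pub/bill/committee_report_03.html",
    None,  # .get("href") returns None for <a> tags without an href
]

keywords = ("sen_floor", "asm_floor")

# Keep a link if it is a string and contains at least one keyword.
floor_links = [
    link
    for link in links
    if link is not None and any(keyword in link for keyword in keywords)
]
print(floor_links)
```

The `link is not None` check matters because `keyword in None` raises a TypeError.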
