python 尝试使用BeautifulSoup获取链接，但它不工作？

r3i60tvu 于 2023-01-16 发布在 Python

关注(0)|答案(1)|浏览(130)

我必须解析这个HTML：

import requests
from bs4 import BeautifulSoup
import re

url = 'https://link.com/'

html = requests.get(url)

soup = BeautifulSoup(html.content, 'html.parser')

link = soup.findAll('a', href=re.compile('https://specificlink/'))

输出：

[<a href="https://specificlink" style="display:inline-block;width:192px;"</a>]

我正在寻找专门的链接。在上面的代码后，我得到了正确的链接和链接后也style="*************"。我怎么才能使它，所以我只得到一个纯链接没有任何进一步的HTML下面？

python

来源：https://stackoverflow.com/questions/75123403/trying-to-get-link-using-beautifulsoup-but-its-not-working

1条答案

按热度按时间

0yg35tkg1#

您可以使用Beautiful Soup文档的“提取所有URL部分”中所示的方法get()
因此，您可以将代码的findAll部分修改为如下所示：

...
links = soup.findAll('a', href=re.compile('https://specificlink/'))

for link in links:
    #you can do something else instead of printing here
    print(link.get('href'))

赞(0）回复(0）举报 2023-01-16

我来回答

python 尝试使用BeautifulSoup获取链接，但它不工作？

1条答案

相关问题

热门标签

最新问答