这篇文章已经发布超过一年了，内容可能已经过时，请谨慎参考。

BS4

2024/10/6

BeautifulSoup 基础

bs4 用于解析 HTML，常见用法如下：

常用方法

find()：返回第一个匹配的标签
find_all()：返回所有匹配标签
attrs：获取标签属性字典

示例

from bs4 import BeautifulSoup

html = "<a class='a1' href='https://example.com'>link</a>"
soup = BeautifulSoup(html, "html.parser")

soup.find("a")
soup.find_all("a")
soup.find("a", class_="a1")
soup.find_all("li", limit=2)
soup.a.attrs
soup.a.get("href")

文章目录

BeautifulSoup 基础
- 常用方法
- 示例