您的位置：首页 > 编程语言 > Python开发

Python爬虫（2）--BeautifulSoup的使用

2017-07-28 14:50 316 查看

# -*- coding: utf-8 -*-
import urllib
from bs4 import BeautifulSoup

url = "http://www.baidu.com"
page = urllib.urlopen(url)
soup = BeautifulSoup(page，"html.parser")
print soup

# -*- coding: utf-8 -*-from bs4 import BeautifulSouphelloworld = '<p>Hello World</p>'soup_string = BeautifulSoup(helloworld, "html.parser")print soup_string

先安装：http://www.crummy.com/software/BeautifulSoup/bs4/download/4.3/beautifulsoup4-4.3.2.tar.gz

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

python3实现网络爬虫（6）--正则表达式和BeautifulSoup配合使用
使用python语言结合beautifulsoup编写简单的网络爬虫
Python3.7 爬虫（三）使用 Urllib2 与 BeautifulSoup4 爬取网易云音乐歌单
python爬虫——beautifulsoup4使用学习
Python使用BeautifulSoup进行爬虫
Python爬虫包 BeautifulSoup 学习（十）各种html解析器的比较及使用
Python爬虫之使用BeautifulSoup解析HTML文本
python爬虫由浅入深3--BeautifulSoup的使用的基本方法
Python 爬虫---（6） beautifulSoup 库的使用
python爬虫主要就是五个模块：爬虫启动入口模块，URL管理器存放已经爬虫的URL和待爬虫URL列表，html下载器，html解析器，html输出器同时可以掌握到urllib2的使用、bs4（BeautifulSoup）页面解析器、re正则表达式、urlparse、python基础知识回顾（set集合操作）等相关内容。
Python爬虫学习---------使用beautifulSoup4爬取名言网
简单爬虫python实现02——BeautifulSoup的使用
python爬虫之BeautifulSoup 使用select方法详解
【Python3.6爬虫学习记录】（二）使用BeautifulSoup爬取简单静态网页文章
Python使用BeautifulSoup爬虫，和pyspider框架的使用
Python 爬虫实战（一）：使用 requests 和 BeautifulSoup
python3实现网络爬虫（3）--BeautifulSoup使用（2）
python爬虫：BeautifulSoup 使用select方法的使用
python3实现网络爬虫（4）--BeautifulSoup使用（3）
Python3.7 爬虫（二）使用 Urllib2 与 BeautifulSoup4 抓取解析网页

新的分享

#新闻拍一拍# 微软推出 Pylance，改善 VS Code 中的 Python 体验
跟我学Python图像处理丨5种图像阈值化处理及算法对比
基于Python设计一个具有基本功能的通讯录
liunx上升级python2至python3
es的查询、排序查询、分页查询、布尔查询、查询结果过滤、高亮查询、聚合函数、python操作es
python常用标准库（时间模块time和datetime）
python之logging日志
python之configparser类的使用
Python常用标准库（pickle序列化和JSON序列化）
MySQL（12） - Python+MySQL读取写入图片
MySQL（11） - Python+MySQL开发新闻管理系统
Python 什么是flask框架？快速入门(flask安装，登录，新手三件套，登录认证装饰器，配置文件，路由系统，CBV)

章节导航