python爬虫(Xpath)
2018-01-10 17:24
246 查看
import requests
from lxml import etree
url = 'http://tieba.baidu.com/p/2166231880'
header = {'User-Agent':'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Safari/537.36'}
r = requests.get(url,headers=header).content
s = etree.HTML(r)
a=s.xpath('//div/img/@src')
b=0
for i in a:
try:
with open('C:\\Users\Administrator\\Desktop\\Python\\实写爬虫\\图片\\'+i[-9:-4]+'.jpg','wb') as f:
print(i)
text=(requests.get(i,headers=header).content)
f.write(text)
b=b+1
except:
print('完毕')
break
from lxml import etree
url = 'http://tieba.baidu.com/p/2166231880'
header = {'User-Agent':'Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Safari/537.36'}
r = requests.get(url,headers=header).content
s = etree.HTML(r)
a=s.xpath('//div/img/@src')
b=0
for i in a:
try:
with open('C:\\Users\Administrator\\Desktop\\Python\\实写爬虫\\图片\\'+i[-9:-4]+'.jpg','wb') as f:
print(i)
text=(requests.get(i,headers=header).content)
f.write(text)
b=b+1
except:
print('完毕')
break
相关文章推荐
- Python爬虫学习笔记(3)-XPath与多线程爬虫
- Python爬虫:Xpath语法笔记
- Python爬虫利器三之Xpath语法与lxml库的用法
- 【python爬虫】scrapy框架笔记(一):创建工程,使用scrapy shell,xpath
- Python爬虫利器三之Xpath语法与lxml库的用法
- 关于Python爬虫学习进步(xpath处理的小插曲--xpath如同“失灵”)
- Python爬虫抓取马蜂窝游记的照片 基于xpath
- python爬虫提取信息:正则表达式和xpath
- python爬虫之xpath
- python3爬虫必学Xpath,快速使用lxml.etree
- python3 [入门基础实战] 爬虫入门之xpath的学习
- [Python实战项目] - xpath 爬虫实战,获取纵横小说网连载小说最新章节(一)
- python爬虫入门(三)XPATH和BeautifulSoup4
- Python--通过XPath实现网络爬虫
- python.scrapy爬虫-xpath查询语法
- Python爬虫知识(3)—— xpath 选择器
- python中的爬虫神器 XPath 介绍
- Python爬虫(入门+进阶)学习笔记 1-4 使用Xpath解析豆瓣短评
- python爬虫之xpath
- 芝麻HTTP:Python爬虫利器之Xpath语法与lxml库的用法