您的位置:首页 > 编程语言 > Python开发

Python 爬虫:2.几个爬虫实例

2018-02-27 11:39 337 查看
#百度搜索全代码
import requests
try:
kv={'wd':'Python'}
r=requests.get('http://www.baidu.com/s',params=kv)
print(r.request.url)
r.raise_for_status()
print(len(r.text))
except:
print('ERROR')
#爬取搜狗搜索全代码
import requests
try:
kv={'query':'Python'}
r=requests.get('http://www.sogou.com/web',params=kv)
print(r.request.url)
r.raise_for_status()
print(len(r.text))
except:
print('ERROR')
#爬取网页上的图片
import requests
path='D:/abc.jpg'
url='https://p1.ssl.qhmsg.com/t01b91ce68adb052a9e.jpg'
r=requests.get(url)
print(r.status_code)
with open(path,'wb') as f:
f.write(r.content)
f.close()
import requests
path='D:/a.png'
url='http://p0.ifengimg.com/pmop/2018/0227/13CB6A9806BFBFA31E2D3EE039923442A9669FD0_size25_w318_h300.png'
r=requests.get(url)
with open(path,'wb') as f:
f.write(r.content)
f.close()
#爬取网页上的图片,并用原本的名称命名
import requests
import  os
url='https://p1.ssl.qhmsg.com/t01b91ce68adb052a9e.jpg'
root='D://pics//'
path=root+url.split('/')[-1]
try:
if not os.path.exists(root):
os.mkdir(root)
if not os.path.exists(path):
r=requests.get(url)
with open(path,'wb') as f:
f.write(r.content)
f.close()
print('Success!')
else:
print('Already exits')
except:
print('ERROR')
结果:
http://www.baidu.com/s?wd=Python
334987 http://www.sogou.com/web?query=Python 313179
200
Success!
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: