python 学习笔记
2016-07-14 18:16
405 查看
新手刚接触python 做了下面的例子,这个例子的功能下载指定网页上的图片:
import urllib.request import re import threading from time import sleep,ctime from html import parser #connect to a URL website = urllib.request.urlopen("http://sc.chinaz.com/tupian/taikongtupian_3.html") #read html code html = website.read().decode() #use re.findall to get all the links links = re.findall(r'(http:[^\s]*?(jpg|png|gif|JPG|PNG|GIF))', html) print(links) print ("网页抓取成功,开始下载图片:") def downjpg( filepath,FileName ="default.jpg" ): try: print(str(filepath)) web = urllib.request.urlopen(filepath) print("访问网络文件"+"\n") jpg = web.read() DstDir="G:\\image\\" print("保存文件"+DstDir+FileName+"\n") try: File = open( DstDir+FileName,"wb" ) File.write( jpg) File.close() return except IOError: print("error\n") return except Exception: print("error\n") return def upload(url,name): path = r"G:\\image\\"+name data = urllib.request.urlopen(url).read() print(url) f = open(path,"wb") print(path) f.write(data) print(data) f.close() i=150 it = iter(links) # 创建迭代器对象 for x in it: i+=1 name=str(i)+".jpg" upload(x[0],name) #downjpg(x[0],name) #print(re.findall('"http:.*"', x)[0]+name)
相关文章推荐
- python基础语法
- Python最好用的模板引擎Jinja
- Python机器登陆新浪微博代码示例
- Python yield
- python传说中的验证码识别
- python 模块学习_os
- Learning Python 002 print() 和 input()
- Learning Python 002 print() 和 input()
- Python基础语法
- 用python写一个简单的推荐系统
- 完全理解Python迭代对象、迭代器、生成器
- Python解析excel文件并存入sqlite数据库
- python笔记 - day4-之装饰器
- python简要
- Python 库大全
- 详解Python中re.sub
- 习题5 更多的变量和打印
- scrapy抓取csdn中标题带有“语义”关键字的文章的标题和链接
- Python.Lists
- python制作二维码