
Crawling Google for PDFs by keyword

2015-11-05 14:21
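import google
import requests


def download_file(url, index):
    # Prefix the remote file name with the result index so names stay unique.
    local_filename = index + "-" + url.split("/")[-1]
    # stream=True avoids loading the whole PDF into memory at once.
    r = requests.get(url, stream=True)
    with open(local_filename, "wb") as f:
        for chunk in r.iter_content(chunk_size=1024):
            if chunk:  # skip empty keep-alive chunks
                f.write(chunk)
                f.flush()
    return local_filename


# Search Google for PDF files hosted on *.gov.ph domains.
g = google.search('site:*.gov.ph filetype:pdf', tld='com.hk')
index = 1
for url in g:
    if url.endswith(".pdf"):
        file_path = download_file(url, str(index))
        print("downloading:" + url + "->" + file_path)
        index += 1
print("all download finished")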
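
The script takes the local file name from url.split("/")[-1] and filters results with url.endswith(".pdf"). Both can misbehave when a result URL carries a query string such as ?download=1. Below is a minimal sketch of a more tolerant variant, assuming Python 3 and only the standard library. The helper names local_filename_for and is_pdf_url and the fallback name download.pdf are hypothetical and not part of the original script.

```python
import posixpath
from urllib.parse import unquote, urlsplit


def local_filename_for(url, index):
    """Build a local file name such as '3-report.pdf' from a PDF URL."""
    # Only the path component is used, so '?query' and '#fragment' are dropped.
    path = urlsplit(url).path
    # Decode percent-escapes (e.g. '%20') and keep the last path segment.
    name = posixpath.basename(unquote(path))
    if not name:
        # Hypothetical fallback for URLs whose path ends in '/'.
        name = "download.pdf"
    return "{}-{}".format(index, name)


def is_pdf_url(url):
    """Return True when the URL path, ignoring any query string, ends in .pdf."""
    return urlsplit(url).path.lower().endswith(".pdf")
```

In the download loop, is_pdf_url(url) could stand in for url.endswith(".pdf"), and local_filename_for(url, index) for the string concatenation inside download_file.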