[Python爬虫]1.豆瓣电影Top250
2017-06-24 21:32
871 查看
# 豆瓣电影Top250 import requests from bs4 import BeautifulSoup for page in range(10): page = page*25 url = "https://movie.douban.com/top250?start={}".format(page) response = requests.get(url).text bsObj = BeautifulSoup(response, 'html.parser') div_tags = bsObj.find_all('div', {'class': 'info'}) try: for div_tag in div_tags: movie_name = div_tag.find('a').get_text().strip('\n').replace('\n','') actors = div_tag.find('p').get_text().strip('\n').replace(' ','').replace('\n',' ') rating_num = div_tag.find('div',{'class':'star'}).find_all('span')[1].get_text() rating_people = div_tag.find('div',{'class':'star'}).find_all('span')[3].get_text() jianjie = div_tag.find('span',{'class':'inq'}).get_text() #print(movie_name + '\n' +actors + '\n' +rating_num + ' '+rating_people + '\n' + "简介:"+jianjie + '\n') with open ('E:/douban250.txt','a+',encoding='utf-8') as f: content = movie_name + '\n' +actors + '\n' +rating_num + ' '+rating_people + '\n' + "简介:"+jianjie+'\n'+'\n' f.write(content) except: continue
相关文章推荐
- 运维学python之爬虫高级篇(五)scrapy爬取豆瓣电影TOP250
- 1.【python爬虫学习笔记】爬取豆瓣电影top250
- Python爬虫豆瓣电影top250
- [Python/爬虫]利用xpath爬取豆瓣电影top250
- Python爬虫----抓取豆瓣电影Top250
- Python 采用Scrapy爬虫框架爬取豆瓣电影top250
- Python爬虫实战——豆瓣电影Top250
- python爬虫 Scrapy2-- 爬取豆瓣电影TOP250
- 萌新的Python学习日记 - 爬虫无影 - 爬取豆瓣电影top250并入库:豆瓣电影top250
- Python爬虫实战——豆瓣电影top250
- Python爬虫初学(2)豆瓣电影top250评论数
- [python爬虫] BeautifulSoup和Selenium对比爬取豆瓣Top250电影信息
- 实践Python的爬虫框架Scrapy来抓取豆瓣电影TOP250
- Python爬虫获取豆瓣电影TOP250
- [python爬虫入门]爬取豆瓣电影排行榜top250
- python3[爬虫基础入门实战] 爬取豆瓣电影排行top250
- (7)Python爬虫——爬取豆瓣电影Top250
- 实践Python的爬虫框架Scrapy来抓取豆瓣电影TOP250
- Python爬虫小案例:豆瓣电影TOP250
- python 爬虫 保存豆瓣TOP250电影海报及修改名称