Parsing Multi-Level Nested JSON with a Python Crawler
2020-04-01 19:04
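Personal blog: https://nfreak-man.cn
I have only recently started learning to write crawlers in Python. With the pneumonia outbreak spreading across the country, I decided to crawl real-time outbreak data from a website and extract just the fields I care about. The JSON data is served from https://m.look.360.cn/events/feiyan
Fetching the URL: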
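def main():
    url = 'https://m.look.360.cn/events/feiyan'
    headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36 QIHU 360EE'}
    # parse_page is not shown in this post; it is assumed here to accept the
    # headers and send them with the request, otherwise they have no effect.
    parse_page(url, headers)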
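The post does not show parse_page itself. As a rough sketch of what the fetching step could look like, the block below uses only the standard library (urllib) rather than whatever HTTP library the original code used. The names build_request and the timeout parameter are assumptions made for illustration, and whether the endpoint still returns JSON is not something this sketch can guarantee.

```python
import json
import urllib.request

URL = "https://m.look.360.cn/events/feiyan"
HEADERS = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 "
                  "(KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36"
}


def build_request(url, headers):
    """Create a request object that carries the custom headers (hypothetical helper)."""
    return urllib.request.Request(url, headers=headers)


def parse_page(url, headers, timeout=10):
    """Fetch the URL and decode the body as JSON.

    Assumes the server answers with UTF-8 encoded JSON.
    """
    request = build_request(url, headers)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request does not touch the network; only parse_page does.
request = build_request(URL, HEADERS)
```

Calling parse_page(URL, HEADERS) would then return the decoded Python object, provided the site is reachable and still serves JSON at that address.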
Parse the response and save it as a JSON file:
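import json

# response is the HTTP response obtained when fetching the URL above
state = json.loads(response.content)
with open("feiyan.json", "w", encoding='utf-8') as f:
    # ensure_ascii=False keeps Chinese characters readable in the file
    f.write(json.dumps(state, indent=2, ensure_ascii=False))
print("Saved successfully")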
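The saving step can be checked in isolation with a small round trip. The following is a minimal sketch, assuming a tiny made-up sample in place of the real response; save_json and load_json are hypothetical helper names, and the file is written to a temporary directory so nothing in the working directory is overwritten.

```python
import json
import os
import tempfile


def save_json(state, path):
    """Write a Python object to disk as indented, human-readable JSON."""
    with open(path, "w", encoding="utf-8") as f:
        # ensure_ascii=False writes Chinese text as-is instead of \uXXXX escapes
        json.dump(state, f, indent=2, ensure_ascii=False)


def load_json(path):
    """Read a JSON file back into a Python object."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


# Made-up sample, not real data from the site.
sample_state = {"data": [{"data": {"provinceName": "湖北"}, "cities": []}]}

sample_path = os.path.join(tempfile.mkdtemp(), "feiyan.json")
save_json(sample_state, sample_path)
restored = load_json(sample_path)

with open(sample_path, encoding="utf-8") as f:
    raw_text = f.read()
```

Opening the file with an explicit encoding on both the write and the read side avoids depending on the platform default, which matters on Windows in particular.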
The saved JSON turns out to be nested. You can open the URL above to see the full structure; in outline it looks like this:
{
  "data": [
    {
      "data": {province},
      "cities": [ {city}, {city}, ... ]
    },
    {
      "data": {province},
      "cities": [ {city}, {city}, ... ]
    }
  ],
  "country": [ {other country}, ... ]
}
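When the structure of a response is not known in advance, it can help to print an outline of its keys before writing any parsing code. The sketch below is one way to do that; the describe function and the sample dictionary are illustrative assumptions, not part of the original code, and the sample only mimics the shape outlined above.

```python
def describe(node, depth=0, max_depth=3):
    """Return indented lines listing keys and value types of a nested JSON value.

    For lists, only the first element is inspected, on the assumption that
    all elements share the same shape.
    """
    lines = []
    if depth > max_depth:
        return lines
    indent = "  " * depth
    if isinstance(node, dict):
        for key, value in node.items():
            lines.append(f"{indent}{key}: {type(value).__name__}")
            lines.extend(describe(value, depth + 1, max_depth))
    elif isinstance(node, list) and node:
        lines.append(f"{indent}[0] of {len(node)}: {type(node[0]).__name__}")
        lines.extend(describe(node[0], depth + 1, max_depth))
    return lines


# Made-up sample that follows the outline above; not real data.
sample = {
    "data": [
        {
            "data": {"provinceName": "Province A"},
            "cities": [{"cityName": "City A1"}],
        }
    ],
    "country": [{"provinceName": "Country X"}],
}

outline = describe(sample)
print("\n".join(outline))
```

Passing the real state object instead of sample would show at a glance which keys hold lists and which hold objects, which is what determines how many loops the parsing needs.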
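Parsing the data
I only want each city's name together with its confirmed, cured, and death counts, so the values have to be read with nested loops and printed one by one. In the snippets below, i is the index of the province, j the index of the city within that province, and c the index of the country. The code is as follows:
Province-level data: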
provincename = state['data'][i]['data']['provinceName']
confirmedCount = state['data'][i]['data']['confirmedCount']
curedCount = state['data'][i]['data']['curedCount']
deadCount = state['data'][i]['data']['deadCount']
City-level data:
cityName = state['data'][i]['cities'][j]['cityName']
diagnosed = state['data'][i]['cities'][j]['diagnosed']
cured = state['data'][i]['cities'][j]['cured']
died = state['data'][i]['cities'][j]['died']
Data for other countries:
countryname = state['country'][c]['provinceName']
diagnosed = state['country'][c]['diagnosed']
cured = state['country'][c]['cured']
died = state['country'][c]['died']
That is how the nested JSON is parsed.
Output: [screenshot]
Source code: github
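Putting the three lookups together, the nested loops might look like the sketch below. It iterates over the items directly instead of using the indices i, j, and c, and it uses .get() so that a missing key yields None rather than raising KeyError. The function name extract_rows and the sample values are assumptions for illustration; the numbers are placeholders, not real figures.

```python
def extract_rows(state):
    """Flatten the nested structure into (level, name, confirmed, cured, dead) tuples."""
    rows = []
    for province in state.get("data", []):
        info = province.get("data", {})
        rows.append((
            "province",
            info.get("provinceName"),
            info.get("confirmedCount"),
            info.get("curedCount"),
            info.get("deadCount"),
        ))
        # "or []" also covers the case where the key exists but holds null
        for city in province.get("cities") or []:
            rows.append((
                "city",
                city.get("cityName"),
                city.get("diagnosed"),
                city.get("cured"),
                city.get("died"),
            ))
    for country in state.get("country") or []:
        rows.append((
            "country",
            country.get("provinceName"),
            country.get("diagnosed"),
            country.get("cured"),
            country.get("died"),
        ))
    return rows


# Placeholder sample with the same key names as the snippets above.
sample_state = {
    "data": [
        {
            "data": {"provinceName": "Province A", "confirmedCount": 10,
                     "curedCount": 4, "deadCount": 1},
            "cities": [
                {"cityName": "City A1", "diagnosed": 6, "cured": 3, "died": 1},
                {"cityName": "City A2", "diagnosed": 4, "cured": 1, "died": 0},
            ],
        },
        {"data": {"provinceName": "Province B"}, "cities": None},
    ],
    "country": [
        {"provinceName": "Country X", "diagnosed": 7, "cured": 2, "died": 0},
    ],
}

rows = extract_rows(sample_state)
for row in rows:
    print(row)
```

Collecting the rows in a list first, rather than printing inside the loops, keeps the parsing separate from the output, so the same rows could later be written to a CSV file or a database.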