UnicodeDecodeError: 'gb2312' codec can't decode byte 0x88 in position 164111: illegal multibyte sequ
2017-10-10 14:35
While using Python I ran into this error: UnicodeDecodeError: 'gb2312' codec can't decode byte 0x88 in position 164111: illegal multibyte sequence
    # Scrape fund data
    from urllib import request
    import chardet

    page1_url = "http://fund.eastmoney.com/fund.html"

    def getHtml(pageUrl):
        response = request.urlopen(pageUrl)
        raw_html = response.read()
        # Guess the page encoding from the raw bytes
        getEncoding = chardet.detect(raw_html)['encoding']
        # Strict decoding: raises UnicodeDecodeError on the first invalid byte
        src = raw_html.decode(getEncoding)
        print(src)

    getHtml(page1_url)
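What should I do? The error roughly means that the page contains bytes that are illegal in the detected encoding (gb2312), so you need to pass 'ignore' as the error handler when decoding: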
    # Scrape fund data
    from urllib import request
    import chardet

    page1_url = "http://fund.eastmoney.com/fund.html"

    def getHtml(pageUrl):
        response = request.urlopen(pageUrl)
        raw_html = response.read()
        # Guess the page encoding from the raw bytes
        getEncoding = chardet.detect(raw_html)['encoding']
        # 'ignore' silently drops any bytes that cannot be decoded
        src = raw_html.decode(getEncoding, 'ignore')
        print(src)

    getHtml(page1_url)
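Note that 'ignore' discards the offending bytes, so some characters may silently disappear from the output. A possible alternative, sketched below, relies on the fact that GB18030 is a superset of GBK, which in turn is a superset of GB2312: a page reported as GB2312 may really contain GBK characters, and decoding with the wider codec may recover them. The helper name `decode_html` and the `_WIDEN` table are hypothetical, introduced only for illustration, and whether this applies to the page above is an assumption that has not been verified.

```python
from typing import Optional

# Hypothetical mapping: widen narrow Chinese codecs to their superset.
_WIDEN = {"gb2312": "gb18030", "gbk": "gb18030"}


def decode_html(raw: bytes, detected: Optional[str]) -> str:
    """Decode raw bytes using a detected encoding name (e.g. from chardet)."""
    # chardet may return None as the encoding; fall back to UTF-8 (an assumption).
    encoding = (detected or "utf-8").lower()
    encoding = _WIDEN.get(encoding, encoding)
    try:
        # Try a strict decode first so nothing is lost silently.
        return raw.decode(encoding)
    except UnicodeDecodeError:
        # Last resort: keep going, but mark undecodable bytes with U+FFFD
        # instead of dropping them as 'ignore' would.
        return raw.decode(encoding, errors="replace")


# "镕" exists in GBK but not in GB2312, so a strict gb2312 decode would fail here.
sample = "朱镕基".encode("gbk")
print(decode_html(sample, "GB2312"))
```

In the original function, this would replace the `raw_html.decode(getEncoding, 'ignore')` call with `decode_html(raw_html, getEncoding)`. If the replacement character still shows up in the output, the page most likely contains genuinely corrupt bytes rather than just characters outside GB2312.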