解决Python中pandas读取*.csv文件出现编码问题
2019-07-12 09:56
1206 查看
1、问题
在使用Python中pandas读取csv文件时,由于文件编码格式出现以下问题:
Traceback (most recent call last): File "pandas\_libs\parsers.pyx", line 1134, in pandas._libs.parsers.TextReader._convert_tokens File "pandas\_libs\parsers.pyx", line 1240, in pandas._libs.parsers.TextReader._convert_with_dtype File "pandas\_libs\parsers.pyx", line 1256, in pandas._libs.parsers.TextReader._string_convert File "pandas\_libs\parsers.pyx", line 1494, in pandas._libs.parsers._string_box_utf8 UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 19: invalid start byte During handling of the above exception, another exception occurred: Traceback (most recent call last): File "E:\PyCharm 2017.3.4\helpers\pydev\pydevd.py", line 1668, in <module> main() File "E:\PyCharm 2017.3.4\helpers\pydev\pydevd.py", line 1662, in main globals = debugger.run(setup['file'], None, None, is_module) File "E:\PyCharm 2017.3.4\helpers\pydev\pydevd.py", line 1072, in run pydev_imports.execfile(file, globals, locals) # execute the script File "E:\PyCharm 2017.3.4\helpers\pydev\_pydev_imps\_pydev_execfile.py", line 18, in execfile exec(compile(contents+"\n", file, 'exec'), glob, loc) File "F:/OneDrive - emails.bjut.edu.cn/Program/Python/DCAE/test.py", line 18, in <module> load_phenotypes_ABIDE2_RfMRIMaps() File "F:/OneDrive - emails.bjut.edu.cn/Program/Python/DCAE\Data\load_data.py", line 109, in load_phenotypes_ABIDE2_RfMRIMaps pheno = pd.read_csv(pheno_path) File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 678, in parser_f return _read(filepath_or_buffer, kwds) File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 446, in _read data = parser.read(nrows) File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 1036, in read ret = self._engine.read(nrows) File "E:\Python\Python35\lib\site-packages\pandas\io\parsers.py", line 1848, in read data = self._reader.read(nrows) File "pandas\_libs\parsers.pyx", line 876, in pandas._libs.parsers.TextReader.read File "pandas\_libs\parsers.pyx", line 891, in pandas._libs.parsers.TextReader._read_low_memory File "pandas\_libs\parsers.pyx", line 968, in pandas._libs.parsers.TextReader._read_rows File "pandas\_libs\parsers.pyx", line 1094, in pandas._libs.parsers.TextReader._convert_column_data File "pandas\_libs\parsers.pyx", line 1141, in pandas._libs.parsers.TextReader._convert_tokens File "pandas\_libs\parsers.pyx", line 1240, in pandas._libs.parsers.TextReader._convert_with_dtype File "pandas\_libs\parsers.pyx", line 1256, in pandas._libs.parsers.TextReader._string_convert File "pandas\_libs\parsers.pyx", line 1494, in pandas._libs.parsers._string_box_utf8 UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 19: invalid start byte
我认为该问题是由于文件编码格式不是'utf-8'所导致的,因此,尝试将文件格式进行转换,转换方式如下:
首先使用txt文本打开文件,然后另存为,在右下角将编码改为‘UTF-8',点击保存即可
总结
以上所述是小编给大家介绍的解决Python中pandas读取*.csv文件出现编码问题 ,希望对大家有所帮助,如果大家有任何疑问请给我留言,小编会及时回复大家的。在此也非常感谢大家对脚本之家网站的支持!
如果你觉得本文对你有帮助,欢迎转载,烦请注明出处,谢谢!
您可能感兴趣的文章:
相关文章推荐
- Python里解决写入csv文件时出现多余空行的问题及看文件或数据编码方式
- 解决问题:pandas读取或者写入csv文件会多出现一列----Unnamed:0
- python中写入csv,excel显示、pandas读取csv文件的编码问题
- php读取csv文件后,uft8 bom导致在页面上显示出现问题的解决方法
- php读取csv文件后,uft8 bom导致在页面上显示出现问题的解决方法
- 解决pandas使用read_csv()读取文件遇到的问题
- IO 流读取文件时候出现乱码 文件编码格式问题 怎么转换解决方法
- 我用python将结果写入txt文件出现的编码问题及其解决方法
- 用Python3读取CSV类型文件时出现无效字节延续的问题
- rood-Python 3读取.CSV文件遇到的编码问题
- python读取文件中的第一行出现编码问题
- Python文件读取编码错误问题解决之(PyCharm开发工具默认设置的坑。。。)
- IO 流读取文件时候出现乱码 文件编码格式问题 怎么转换解决方法
- spark不同版本读取csv文件出现的编码问题
- Learning Python 015 Python3解决问题:读取文件时,出现乱码或者“UnicodeDecodeError 'gbk' codec can't decode” 错误
- Python将字典写入csv文件时出现每隔一行会空一行问题的解决办法
- 解决python3.6下scrapy中xpath.extract()匹配出来的内容转成json与.csv文件没有编码(unicode)的问题
- Python处理unicode编码的txt文件(Python中文处理)——解决to_excel()和to_csv()导出文件内容为空的问题
- 解决pandas中读取中文名称的csv文件报错的问题
- fgetcsv 读取csv文件出现问题解决办法