Handling illegal characters when reading a file in Python: UnicodeDecodeError: 'gbk' codec can't decode bytes in position

Reading the file fails with: UnicodeDecodeError: 'gbk' codec can't decode bytes in position

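# -*- coding: utf-8 -*-
# Python 2 code (unicode() does not exist in Python 3)
import codecs

ipath = 'D:/学习/语料库/SogouC.mini/Sample/C000013/18.txt'
# Decode the UTF-8 byte-string path to unicode so the non-ASCII path resolves
uipath = unicode(ipath, "utf8")
# uipath = ipath.decode('utf8')  # equivalent alternative
# Open as GBK; the 'ignore' error handler drops bytes that cannot be decoded
f = codecs.open(uipath, 'r', 'gbk', 'ignore')
text = f.read()
f.close()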

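The 'ignore' argument tells the decoder to skip illegal bytes instead of raising an error. Note that the skipped bytes are silently dropped from the returned text.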
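
For readers on Python 3, here is a minimal sketch of the same idea using the built-in open() and its errors parameter. The sample bytes and the temporary file are made up for illustration and are not from the original corpus. It also shows errors="replace", an alternative that keeps a visible U+FFFD marker where a byte could not be decoded.

```python
import os
import tempfile

# Hypothetical sample: valid GBK text with one byte (0xFF) that GBK cannot decode.
raw = "中文语料".encode("gbk") + b"\xff" + "结束".encode("gbk")

# Write the bytes to a temporary file so the example is self-contained.
with tempfile.NamedTemporaryFile(delete=False, suffix=".txt") as tmp:
    tmp.write(raw)
    path = tmp.name

try:
    # Default (strict) error handling raises UnicodeDecodeError.
    try:
        with open(path, "r", encoding="gbk") as f:
            f.read()
        strict_failed = False
    except UnicodeDecodeError:
        strict_failed = True

    # errors="ignore" silently drops the undecodable byte.
    with open(path, "r", encoding="gbk", errors="ignore") as f:
        ignored = f.read()

    # errors="replace" substitutes U+FFFD so the damage stays visible.
    with open(path, "r", encoding="gbk", errors="replace") as f:
        replaced = f.read()
finally:
    os.remove(path)

print(strict_failed)
print(ignored)
print(replaced)
```

If it matters where the bad bytes were, "replace" is usually the safer choice, since "ignore" leaves no trace of what was lost.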

posted on 2015-03-19 22:18 by 长456风
