代碼如下:# -*- coding: utf-8 -*-#http://www.qiushibaike.com/8hr/page/1?s=4603425import urllib2from bs4 import BeautifulSouppage=1xiubai=open(r'xiubai.txt','w+')for page in range(1,11): url="http://www.qiushibaike.com/8hr/page/"+str(page)+"?s=4603425" soup = BeautifulSoup(urllib2.urlopen(url).read()) for result in soup.findAll("div", "content", title=True): xiubai.write(result.text)報錯:Traceback (most recent call last):
File "D:\python\python2.7.5\pythontest\爬蟲\kanxiushibaike.py", line 13, in <module>
xiubai.write(result.text)
UnicodeEncodeError: 'ascii' codec can't encode characters in position 2-10: ordinal not in range(128)不用result.text 直接result雖然包含了html的部分代碼但是又可以轉為str。之后就可以隨便用了。但是直接str(result.text)就會報同樣錯誤
python爬蟲保存為txt的編碼問題求解決?
慕尼黑8549860
2018-07-19 04:05:25