已解決430363個問題，去搜搜看，總會有你想問的

字節對象以“repr 格式”存儲為 b'foo' 而不是 encode()-ing

首頁猿問字節對象以“repr...

字節對象以“repr 格式”存儲為 b'foo' 而不是 encode()-ing

Python

開心每一天1111 2021-09-14 16:23:01

一些倒霉的同事將一些數據保存到這樣的文件中：s = b'The em dash: \xe2\x80\x94'with open('foo.txt', 'w') as f: f.write(str(s))當他們應該使用s = b'The em dash: \xe2\x80\x94'with open('foo.txt', 'w') as f: f.write(s.decode())現在foo.txt看起來像b'The em-dash: \xe2\x80\x94'代替The em dash: —我已經將此文件作為字符串讀取：with open('foo.txt') as f: bad_foo = f.read()現在如何將bad_foo錯誤保存的格式轉換為正確保存的字符串？

查看完整描述

3 回答

忽然笑

TA貢獻1806條經驗獲得超5個贊

您可以嘗試文字 eval

from ast import literal_eval

test = r"b'The em-dash: \xe2\x80\x94'"

print(test)

res = literal_eval(test)

print(res.decode())

反對回復 2021-09-14

BIG陽

TA貢獻1859條經驗獲得超6個贊

如果您相信輸入不是惡意的，則可以ast.literal_eval在損壞的字符串上使用。

import ast

# Create a sad broken string

s = "b'The em-dash: \xe2\x80\x94'"

# Parse and evaluate the string as raw Python source, creating a `bytes` object

s_bytes = ast.literal_eval(s)

# Now decode the `bytes` as normal

s_fixed = s_bytes.decode()

否則，您將不得不手動解析并刪除或替換有問題的重復轉義。

反對回復 2021-09-14

3 回答
0 關注
255 瀏覽

關注

添加回答

舉報

0/150

提交

取消

亚洲在线久爱草,狠狠天天香蕉网,天天搞日日干久草,伊人亚洲日本欧美

熱搜

最近搜索清空

字節對象以“repr 格式”存儲為 b'foo' 而不是 encode()-ing

字節對象以“repr 格式”存儲為 b'foo' 而不是 encode()-ing

3 回答

添加回答