1 Answer

BeautifulSoup does not parse JavaScript, so you need another tool for this. For example, you can extract the information with the re module:
import re
from ast import literal_eval
txt = '''<script type="text/javascript">
//COMMENT// Some data already here
$(document).ready(function() {
name.init("<website Link>")
lang.init("en", "GB")
data.init("hello", "", "AT3K21SDV", "YIERE34ITEW832WCNG3VMASJKHO345JKRELRK5", "")
});
</script>'''
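# Capture the parenthesized argument list of data.init(...)
data = re.search(r'data\.init(\(.*?\))', txt).group(1)
# The captured text is also a valid Python tuple literal, so it can be parsed safely
data = literal_eval(data)
print(data[2], data[3])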
Prints:
AT3K21SDV YIERE34ITEW832WCNG3VMASJKHO345JKRELRK5
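One thing to watch for: re.search() returns None when the call is not in the page, so .group(1) would raise AttributeError. Below is a minimal sketch of a guarded version. The helper name extract_call_args is hypothetical (it is not part of re or ast), and it assumes the arguments are plain string or number literals with no ")" inside them.

```python
import re
from ast import literal_eval

# Sample input, taken from the script shown above.
sample = '''<script type="text/javascript">
$(document).ready(function() {
name.init("<website Link>")
lang.init("en", "GB")
data.init("hello", "", "AT3K21SDV", "YIERE34ITEW832WCNG3VMASJKHO345JKRELRK5", "")
});
</script>'''


def extract_call_args(source, call_name):
    """Hypothetical helper: return the arguments of call_name(...) as a tuple,
    or None if the call is missing or its arguments are not plain literals."""
    # re.escape makes the dot in a name like "data.init" match literally.
    pattern = re.escape(call_name) + r'(\(.*?\))'
    match = re.search(pattern, source)
    if match is None:
        return None
    try:
        value = literal_eval(match.group(1))
    except (ValueError, SyntaxError):
        return None
    # A single argument such as ("x") evaluates to a bare value, so wrap it.
    return value if isinstance(value, tuple) else (value,)


print(extract_call_args(sample, "lang.init"))
print(extract_call_args(sample, "missing.call"))
```

Returning None instead of raising lets the caller decide how to handle pages where the script has changed.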
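Edit: if the argument list inside the init(...) call spans multiple lines, you must pass flags=re.DOTALL to re.search(), because by default "." does not match a newline: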
import re
from ast import literal_eval
txt = '''<script type="text/javascript">
//COMMENT// Some data already here
$(document).ready(function() {
ab.info.init("sv", "pp", "f", "NONE",
"rw", "3r7u6565667",
"3435345")
});
</script>'''
# re.DOTALL lets "." match newlines, so the pattern can span several lines
data = re.search(r'info\.init(\(.*?\))', txt, flags=re.DOTALL).group(1)
data = literal_eval(data)
print(data)
Prints:
('sv', 'pp', 'f', 'NONE', 'rw', '3r7u6565667', '3435345')
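To see why the flag matters, here is a small sketch that runs the same pattern with and without re.DOTALL. The snippet string is made up for illustration.

```python
import re
from ast import literal_eval

# Made-up input: the argument list is split across two lines.
snippet = 'init("a",\n"b")'
pattern = r'init(\(.*?\))'

# Without the flag, "." stops at the newline, so the closing ")" is never reached.
without_flag = re.search(pattern, snippet)

# With re.DOTALL, "." also matches "\n" and the whole argument list is captured.
with_flag = re.search(pattern, snippet, flags=re.DOTALL)

print(without_flag)
print(literal_eval(with_flag.group(1)))
```

literal_eval copes with the embedded newline because line breaks are allowed inside parentheses in a Python literal.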