已解決430363個問題，去搜搜看，總會有你想問的

requests 和 bs4 無法讀取整個 html

首頁猿問 requests 和 bs4...

requests 和 bs4 無法讀取整個 html

Html5

呼啦一陣風 2023-09-04 16:09:58

我試圖獲取此網站列表中的所有 href：https://nihongonosei.net/?page_id =10246 。該網站非常簡單干凈。查看源代碼后，我發現沒有任何動態。但是，如果我這樣做import requestsurl = 'https://nihongonosensei.net/?page_id=10246'r = requests.get(url)r.textr.text僅包含約20000個字符信息。超過一半的 html 丟失了。我嘗試從“查看頁面源代碼”復制整個 HTML 并直接加載到 Beautifulsoup：from bs4 import BeautifulSouphtml = '' # too long to copy. Here is the link: view-source:https://nihongonosensei.net/?page_id=10246 soup = BeautifulSoup(html, 'html.parser')仍然只保留了大約 20000 個字符，并且 html 的上半部分丟失了。這是我的問題：requests和是否有字符限制BeautifulSoup？如果是這樣，我該如何取消限制？如果沒有，為什么我無法獲取完整的 html？多謝！

查看完整描述

目前暫無任何回答

0 回答
0 關注
188 瀏覽

關注

添加回答

舉報

0/150

提交

取消

亚洲在线久爱草,狠狠天天香蕉网,天天搞日日干久草,伊人亚洲日本欧美

熱搜

最近搜索清空

requests 和 bs4 無法讀取整個 html

requests 和 bs4 無法讀取整個 html

添加回答