我想使用 Python 中的 MechanicalSoup 包在此ONS 網(wǎng)頁上下載 Excel 文件。我已閱讀 MechanicalSoup文檔。我在 StackOverflow 和其他地方廣泛搜索了一個例子,但沒有運氣。我的嘗試是:# Install dependencies# pip install requests# pip install BeautifulSoup4# pip install MechanicalSoup# Import librariesimport mechanicalsoupimport urllib.requestimport requestsfrom bs4 import BeautifulSoup# Create a browser object that can collect cookiesbrowser = mechanicalsoup.StatefulBrowser()browser.open("https://www.ons.gov.uk/economy/grossdomesticproductgdp/timeseries/l2kq/qna")browser.download_link("https://www.ons.gov.uk/generator?format=xls&uri=/economy/grossdomesticproductgdp/timeseries/l2kq/qna")在最后一行中,我也嘗試過:browser.download_link(link="https://www.ons.gov.uk/generator?format=xls&uri=/economy/grossdomesticproductgdp/timeseries/l2kq/qna",file="c:/test/filename.xls")2019 年 1 月 25 日更新:感謝 AKX 在下面的評論,我已經(jīng)嘗試過browser.download_link(re.escape("https://www.ons.gov.uk/generator?format=xls&uri=/economy/grossdomesticproductgdp/timeseries/l2kq/qna"))在每種情況下,我都會收到錯誤消息:mechanicalsoup.utils.LinkNotFoundError然而鏈接確實存在。嘗試將其粘貼到您的地址欄中以確認:https://www.ons.gov.uk/generator?format=xls&uri=/economy/grossdomesticproductgdp/timeseries/l2kq/qna我究竟做錯了什么?2019 年 1 月 25 日更新 2:感謝下面 AKX 的回答,這是回答我的問題的完整 MWE(為以后遇到相同困難的任何人發(fā)帖):# Install dependencies# pip install requests# pip install BeautifulSoup4# pip install MechanicalSoup# Import librariesimport mechanicalsoupimport urllib.requestimport requestsfrom bs4 import BeautifulSoupimport re# Create a browser object that can collect cookiesbrowser = mechanicalsoup.StatefulBrowser()browser.open("https://www.ons.gov.uk/economy/grossdomesticproductgdp/timeseries/l2kq/qna")browser.download_link(link_text=".xls",file="c:/py/ONS_Data.xls" )
添加回答
舉報
0/150
提交
取消