首頁(yè) 猿問(wèn) 在Python上使用seleniu...

在Python上使用selenium或beautifulsoup從帶有鏈接的頁(yè)面中抓取數(shù)據(jù)

Python

慕田峪7331174 2023-08-08 10:00:19

我想知道如何抓取這個(gè)網(wǎng)站：https://1997-2001.state.gov/briefings/statements/2000/2000_index.html它只包含'a'和'href'，沒(méi)有類或ID，結(jié)構(gòu)非常簡(jiǎn)單。我想運(yùn)行一個(gè)字符串來(lái)抓取頁(yè)面上所有鏈接的內(nèi)容。我已經(jīng)使用 chromedriver 嘗試過(guò)這段代碼，但它只打印了鏈接列表（我在網(wǎng)絡(luò)抓取方面非常業(yè)余）。任何幫助都會(huì)很棒。 >>> elems = driver.find_elements_by_xpath("//a[@href]") >>> for elem in elems: ... print(elem.get_attribute("href"))

查看完整描述

目前暫無(wú)任何回答