from bs4 import BeautifulSoup
url = 'C:\\Users\\Zandrio\\Documents\\Python-Selexion\\HTML-localhost\\Selexion.html'
page = open(url)
soup = BeautifulSoup(page.read(), features="lxml")
prettify = soup.prettify
Model = "".join([div.text for div in soup.find_all('div', {'class' : 'title-options'})])
print(Model)

Output:
PS C:\Users\Zandrio> & C:/Users/Zandrio/AppData/Local/Programs/Python/Python38/python.exe c:/Users/Zandrio/Documents/Requests/selexion.py
SQQE55Q90R Merk: Samsung Afdrukken

Web page:
<div class="title-options">
<span> SQQE55Q90R</span>
<span> Merk: Samsung </span>
<span class="print"> <a href="/nl/pr/samsung-qled-qe55q90r/product-sheet" class="product-file-pdf" target="_blank" title="Afdrukken">Afdrukken</a></span>
</div>

In this case I only want the model number, which here is SQQE55Q90R. Please suggest a solution.
1 Answer

嗶嗶one
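from bs4 import BeautifulSoup
# Path to the locally saved copy of the product page
url = 'C:\\Users\\Zandrio\\Documents\\Python-Selexion\\HTML-localhost\\Selexion.html'
# Read and parse the file; the with-block closes it afterwards
with open(url) as page:
    soup = BeautifulSoup(page.read(), features="lxml")
# The model number sits in the first <span> of the title-options div
div = soup.body.find('div', attrs={'class': 'title-options'})
model_number = div.span.text.strip()  # text of first span
print(model_number)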
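The reason this works: joining div.text collects the text of every nested tag, while div.span returns only the first <span> inside the div. Below is a minimal, self-contained sketch of the same idea that can be run without the local file. It parses the HTML snippet from the question directly. The helper name first_span_text is hypothetical, and the built-in "html.parser" is swapped in for "lxml" only to avoid the extra dependency; neither comes from the original answer.

```python
from bs4 import BeautifulSoup

# HTML snippet copied from the question, used in place of the local file
HTML = (
    '<div class="title-options"><span> SQQE55Q90R</span>'
    '<span> Merk: Samsung </span><span class="print"> '
    '<a href="/nl/pr/samsung-qled-qe55q90r/product-sheet" '
    'class="product-file-pdf" target="_blank" title="Afdrukken">Afdrukken</a>'
    '</span></div>'
)


def first_span_text(html, css_class="title-options"):
    """Return the stripped text of the first <span> in the matching div.

    Hypothetical helper for illustration; returns None when the div or
    the span is missing instead of raising AttributeError.
    """
    # "html.parser" is an assumption here; the answer above uses "lxml"
    soup = BeautifulSoup(html, "html.parser")
    div = soup.find("div", class_=css_class)
    if div is None or div.span is None:
        return None
    # div.span is the first <span> descendant, so the sibling spans
    # ("Merk: Samsung", "Afdrukken") are never touched
    return div.span.get_text(strip=True)


model_number = first_span_text(HTML)
print(model_number)
```

The None check is a defensive choice: if the page layout changes and the div is no longer present, the caller gets None rather than an exception.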