我正在嘗試從SEC存檔網(wǎng)站中提取以下信息。1)大型加速文件管理器 2)加速文件管理器 3)非加速文件管理器 4)小型報(bào)告公司 5)新興成長(zhǎng)型公司這是它在網(wǎng)站上的顯示方式:Large accelerated filer ? Accelerated filer ?Non-accelerated filer ? (Do not check if a smaller reporting company) Smaller reporting company ?Emerging growth company ?在某些情況下,一個(gè)或多個(gè)項(xiàng)目可能不存在。我想編寫(xiě)一個(gè)通用代碼,可以為許多公司提取這些值?,F(xiàn)在我面臨的問(wèn)題是HTML的結(jié)構(gòu)正在從一個(gè)公司到另一個(gè)公司發(fā)生變化。到目前為止,我已經(jīng)遇到了3種不同的結(jié)構(gòu)(請(qǐng)參閱下面的HTML結(jié)構(gòu))。如何編寫(xiě)代碼以跨不同結(jié)構(gòu)進(jìn)行泛化?<td valign="bottom">Large accelerated filer</td><td valign="bottom"> </td><td valign="bottom">?</td><td valign="bottom"> </td><td valign="bottom">Accelerated filer</td><td valign="bottom"> </td><td valign="bottom">?</td></tr><tr style="page-break-inside:avoid ; font-family:Times New Roman; font-size:10pt"><td valign="bottom"><font style="white-space:nowrap">Non-accelerated filer</font></td><td valign="bottom"> </td><td valign="bottom">? (Do not check if a smaller reporting company)</td><td valign="bottom"> </td><td valign="bottom">Smaller reporting company</td><td valign="bottom"> </td><td valign="bottom">?</td></tr><tr style="page-break-inside:avoid ; font-family:Times New Roman; font-size:10pt"><td valign="bottom">Emerging growth company</td><td valign="bottom"> </td><td valign="bottom">?</td><td valign="bottom"> </td><td valign="bottom"></td><td valign="bottom"> </td><td valign="bottom"></td></tr>另一種結(jié)構(gòu):filer <font style="FONT-FAMILY:WINGDINGS">x</font> Accelerated filer <font style="FONT-FAMILY:WINGDINGS">¨</font> Non-accelerated filer <font style="FONT-FAMILY:WINGDINGS">¨</font> Smaller reporting company <font style="FONT-FAMILY:WINGDINGS">¨</font> </font>
Python + Selenium Web 抓取動(dòng)態(tài)元素
30秒到達(dá)戰(zhàn)場(chǎng)
2022-08-25 15:10:12