首頁猿問 Python中通過絕對值獲取XML...

Python中通過絕對值獲取XML文件中的值

Python

慕桂英3389331 2023-10-25 10:49:16

我有一個我想要檢索的 XML 文件值的絕對路徑。絕對路徑的格式為“A/B/C”。我怎樣才能在Python中做到這一點(diǎn)？

查看完整描述

3 回答

GCT1015

TA貢獻(xiàn)1827條經(jīng)驗獲得超4個贊

使用ElementTree庫（請注意，我的答案使用核心 python 庫，而其他答案使用外部庫。）

要抓取前三個句子，只需將這些行添加到您的代碼中：

section = soup.find('section',class_ = "article_text post") #Finds the section tag with class "article_text post"

txt = section.p.text #Gets the text within the first p tag within the variable section (the section tag)

print(txt)

輸出：

Many people will land on this page after learning that their email address has appeared in a data breach I've called "Collection #1". Most of them won't have a tech background or be familiar with the concept of credential stuffing so I'm going to write this post for the masses and link out to more detailed material for those who want to go deeper.

希望這有幫助！

反對回復(fù) 2023-10-25

繁星coding

TA貢獻(xiàn)1797條經(jīng)驗獲得超4個贊

另一種方法。

from simplified_scrapy import SimplifiedDoc, utils, req

# Basic

xml = '''<ROOT><A><C>The Value</C></A></ROOT>'''

doc = SimplifiedDoc(xml)

print (doc.select('A>B>C'))

# Multiple

xml = '''<ROOT><A><C>The Value 1</C></A><A><C>The Value 2</C></A></ROOT>'''

doc = SimplifiedDoc(xml)

# print (doc.selects('A').select('B').select('C'))

print (doc.selects('A').select('B>C'))

# Mixed structure

xml = '''<ROOT><A><other>no B</other></A><A><other></other>no C</A><A><C>The Value</C></A></ROOT>'''

doc = SimplifiedDoc(xml)

nodes = doc.selects('A').selects('B').select('C')

for node in nodes:

for c in node:

if c:

print (c)

結(jié)果：

{'tag': 'C', 'html': 'The Value'}

[{'tag': 'C', 'html': 'The Value 1'}, {'tag': 'C', 'html': 'The Value 2'}]

{'tag': 'C', 'html': 'The Value'}

反對回復(fù) 2023-10-25

慕慕森

TA貢獻(xiàn)1856條經(jīng)驗獲得超17個贊

您可以使用lxml，您可以通過安裝pip install lxml。

from simplified_scrapy import SimplifiedDoc, utils, req

# Basic

xml = '''<ROOT><A><C>The Value</C></A></ROOT>'''

doc = SimplifiedDoc(xml)

print (doc.select('A>B>C'))

# Multiple

xml = '''<ROOT><A><C>The Value 1</C></A><A><C>The Value 2</C></A></ROOT>'''

doc = SimplifiedDoc(xml)

# print (doc.selects('A').select('B').select('C'))

print (doc.selects('A').select('B>C'))

# Mixed structure

xml = '''<ROOT><A><other>no B</other></A><A><other></other>no C</A><A><C>The Value</C></A></ROOT>'''

doc = SimplifiedDoc(xml)

nodes = doc.selects('A').selects('B').select('C')

for node in nodes:

? for c in node:

? ? if c:

? ? ? print (c)

結(jié)果：

{'tag': 'C', 'html': 'The Value'}

[{'tag': 'C', 'html': 'The Value 1'}, {'tag': 'C', 'html': 'The Value 2'}]

{'tag': 'C', 'html': 'The Value'}

反對回復(fù) 2023-10-25

3 回答
0 關(guān)注
201 瀏覽

關(guān)注

添加回答

舉報

0/150

提交

取消

使用 Ctrl+D 可將網(wǎng)站添加到書簽

微信客服

購課補(bǔ)貼
聯(lián)系客服咨詢優(yōu)惠詳情

幫助反饋 APP下載

慕課網(wǎng)APP
您的移動學(xué)習(xí)伙伴

公眾號

掃描二維碼
關(guān)注慕課網(wǎng)微信公眾號

第七色在线视频,2021少妇久久久久久久久久,亚洲欧洲精品成人久久av18,亚洲国产精品特色大片观看完整版,孙宇晨将参加特朗普的晚宴

熱搜

最近搜索清空

Python中通過絕對值獲取XML文件中的值

Python中通過絕對值獲取XML文件中的值

3 回答

添加回答