第七色在线视频,2021少妇久久久久久久久久,亚洲欧洲精品成人久久av18,亚洲国产精品特色大片观看完整版,孙宇晨将参加特朗普的晚宴

為了賬號(hào)安全,請(qǐng)及時(shí)綁定郵箱和手機(jī)立即綁定
已解決430363個(gè)問(wèn)題,去搜搜看,總會(huì)有你想問(wèn)的

從我的 XML 文件中提取信息并為其分配一個(gè)向量

從我的 XML 文件中提取信息并為其分配一個(gè)向量

蠱毒傳說(shuō) 2023-12-05 15:01:47
我想用 python 解析我的計(jì)算機(jī)上的一些 XML 文件并從每個(gè)文件中提取一些信息這是我的其中之一的 xml 文件:(如果您想要文本在這里: https://github.com/peldszus/arg-microtexts/blob/master/corpus/en/micro_b002.xml)作為第一級(jí),我已經(jīng)完成了第一級(jí):myList = []                #read the whole text from for root, dirs, files in os.walk(path):    for file in files:        if file.endswith('.xml'):            with open(os.path.join(root, file), encoding="UTF-8") as content:                tree = ET.parse(content)                myList.append(tree)在 myList 中,我有一些 XMl 文件 <xml.etree.ElementTree.ElementTree at 0x1f0fb1f8430>現(xiàn)在對(duì)于根“邊緣”,它們沒(méi)有 type="seg" <edge id="c1" src="a1" trg="a3" type="sup"/>  <edge id="c2" src="a2" trg="a3" type="sup"/>  <edge id="c4" src="a4" trg="a3" type="reb"/>  <edge id="c5" src="a5" trg="c4" type="und"/>我想提取標(biāo)簽“src”,我想提取標(biāo)簽=Src,  src="a1"    src="a2"    src="a4"   src="a5" 然后我想分配的數(shù)字不在src中,因?yàn)檫@句話稱為前提,例如這里...我想說(shuō)“a3”是所謂的“前提”(因?yàn)樗皇菢?biāo)簽src)例如這里(0,0,1,0,0) 應(yīng)該是我的過(guò)程的結(jié)果,因?yàn)?a3 沒(méi)有被應(yīng)用,我將第三個(gè)數(shù)組設(shè)置為 1,其余的設(shè)置為零一般來(lái)說(shuō),我想提取信息以注釋我的文本,該文本已使用 xml 進(jìn)行了一些注釋
查看完整描述

3 回答

?
德瑪西亞99

TA貢獻(xiàn)1770條經(jīng)驗(yàn) 獲得超3個(gè)贊

您的問(wèn)題中并非所有內(nèi)容都清楚...

以下是數(shù)據(jù)提取部分


import xml.etree.ElementTree as ET


xml = '''<?xml version='1.0' encoding='UTF-8'?>

<arggraph id="micro_b002" topic_id="higher_dog_poo_fines" stance="pro">

  <edu id="e1"><![CDATA[One can hardly move in Friedrichshain or Neuk?lln these days without permanently scanning the ground for dog dirt.]]></edu>

  <edu id="e2"><![CDATA[And when bad luck does strike and you step into one of the many 'land mines' you have to painstakingly scrape the remains off your soles.]]></edu>

  <edu id="e3"><![CDATA[Higher fines are therefore the right measure against negligent, lazy or simply thoughtless dog owners.]]></edu>

  <edu id="e4"><![CDATA[Of course, first they'd actually need to be caught in the act by public order officers,]]></edu>

  <edu id="e5"><![CDATA[but once they have to dig into their pockets, their laziness will sure vanish!]]></edu>

  <adu id="a1" type="pro"/>

  <adu id="a2" type="pro"/>

  <adu id="a3" type="pro"/>

  <adu id="a4" type="opp"/>

  <adu id="a5" type="pro"/>

  <edge id="c6" src="e1" trg="a1" type="seg"/>

  <edge id="c7" src="e2" trg="a2" type="seg"/>

  <edge id="c8" src="e3" trg="a3" type="seg"/>

  <edge id="c9" src="e4" trg="a4" type="seg"/>

  <edge id="c10" src="e5" trg="a5" type="seg"/>

  <edge id="c1" src="a1" trg="a3" type="sup"/>

  <edge id="c2" src="a2" trg="a3" type="sup"/>

  <edge id="c4" src="a4" trg="a3" type="reb"/>

  <edge id="c5" src="a5" trg="c4" type="und"/>

</arggraph>'''

root = ET.fromstring(xml)

interesting_edges_src = [e.attrib['src'] for e in root.findall('.//edge') if e.attrib['type'] != 'seg' ]

print(interesting_edges_src)

輸出


['a1', 'a2', 'a4', 'a5']


查看完整回答
反對(duì) 回復(fù) 2023-12-05
?
手掌心

TA貢獻(xiàn)1942條經(jīng)驗(yàn) 獲得超3個(gè)贊

這里可以被認(rèn)為是某種接近最終答案的答案


myList = []??

myEdgesList=[]

#read the whole text from?

for root, dirs, files in os.walk(path):

? ? for file in files:

? ? ? ? if file.endswith('.xml'):

? ? ? ? ? ? with open(os.path.join(root, file), encoding="UTF-8") as content:

? ? ? ? ? ? ? ? tree = ET.parse(content)

? ? ? ? ? ? ? ? myList.append(tree)

? ? ? ? ? ? ? ??

for k in myList:

? ? Edge= [e.attrib['src'] for e in k.findall('.//edge') if e.attrib['type'] != 'seg' ]

? ? myEdgesList.append(Edge)

這提供


['a1', 'a2', 'a4', 'a5'] 對(duì)于上面的示例以及所有其他示例的列表


[['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a4', 'a5'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a1', 'a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3', 'a4', 'a5'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a1', 'a2', 'a3'],

?['a2', 'a3', 'a4', 'a5'],

.

.

.


只剩下將此列表轉(zhuǎn)換為



(0,0,0,0,1) <----- ['a1', 'a2', 'a3', 'a4']


#as a5 is missing?



(0,0,1,0,0) <------? ['a1', 'a2', 'a4', 'a5']


#as a3 is misisng?

.

.

.

(0,0,1)? ? <-------? ?['a2', 'a3']


#as a1 is missing?

等等


如果您有任何想法請(qǐng)告訴我,我也在努力


查看完整回答
反對(duì) 回復(fù) 2023-12-05
?
牧羊人nacy

TA貢獻(xiàn)1862條經(jīng)驗(yàn) 獲得超7個(gè)贊

對(duì)于下一個(gè)問(wèn)題


myEdgtlistmap=[]

for lst in myEdgesList:

    tp=[]

    for el in lst:

        if el=="a1":

            tp.append(1)

        if el=="a2":

            tp.append(2)

        if el=="a3":

            tp.append(3)

        if el=="a4":

            tp.append(4)

        if el=="a5":

            tp.append(5)

        if el=="a6":

            tp.append(6)

    myEdgtlistmap.append(tp)

label=[]

for le in myEdgtlistmap:

    b=[1]*(len(le)+1)

    for v in le: 

        b[v-1]=0

    label.append(b)

y=[l for lab in label for l in lab ]


查看完整回答
反對(duì) 回復(fù) 2023-12-05
  • 3 回答
  • 0 關(guān)注
  • 190 瀏覽
慕課專欄
更多

添加回答

舉報(bào)

0/150
提交
取消
微信客服

購(gòu)課補(bǔ)貼
聯(lián)系客服咨詢優(yōu)惠詳情

幫助反饋 APP下載

慕課網(wǎng)APP
您的移動(dòng)學(xué)習(xí)伙伴

公眾號(hào)

掃描二維碼
關(guān)注慕課網(wǎng)微信公眾號(hào)