2 回答

TA貢獻(xiàn)1780條經(jīng)驗 獲得超5個贊
該Counter數(shù)項的頻率,所以會告訴你什么比這一次更出現(xiàn)。data從你的字典中取出:
from Collections import Counter
data = [
['00000000B42852FA', 'ADM_EIG', 'Administratiefeigenaar', 'ADM_EIG', 'ADM_EIG'],
['000000005880959E', 'OPZ', 'Opzeggingen', 'STANDAARD', nan]
]
您需要展平列表列表:
[item for sublist in data for item in sublist]
計數(shù)器將為您提供每個項目的頻率:
>>> Counter([item for sublist in data for item in sublist])
Counter({'ADM_EIG': 3, '00000000B42852FA': 1, 'Administratief eigenaar': 1, '000000005880959E': 1, 'OPZ': 1, 'Opzeggingen': 1, 'STANDAARD': 1, nan: 1})
然后您可以過濾您需要的內(nèi)容:
counter = Counter([item for sublist in data for item in sublist])
[value for value, count in counter.items() if count > 1]
這使 ['ADM_EIG']
編輯以匹配問題編輯
要查看所有行,請獲取所有數(shù)據(jù)并查找重復(fù)項:
data = []
for key, value in files_dict.items():
data.extend(value['data'])
counter = Counter([item for sublist in data for item in sublist])
print([value for value, count in counter.items() if count > 1])
添加回答
舉報