I have data for machine learning research, but I am stuck on its string features. I want to map them from object to numbers (int64). For example, for the feature workclass, I would build a map (dict) such as {'private': 0, 'State-gov': 1, etc}. How should I handle this in a DataFrame? Should I write a for loop that finds the n distinct classes in a feature and builds an n-key mapping for each object feature?

    # Code for reading the data
    import pandas as pd

    df_trainFeatures = pd.read_csv('data/trainFeatures.csv')
    object_features = ['workclass', 'education', 'Marital-status', 'occupation',
                       'relationship', 'race', 'sex', 'native-country']

    # List the dtype of every column
    for i in df_trainFeatures:
        print(df_trainFeatures[i].dtype, i)

Output:

    int64 age
    object workclass
    int64 fnlwgt
    object education
    int64 education-num
    object Marital-status
    object occupation
    object relationship
    object race
    object sex
    int64 capital-gain
    int64 capital-loss
    int64 hours-per-week
    object native-country

The sub-DataFrame looks like this:
How to map object dtype to int64 in a pandas.DataFrame with a large dataset
慕無忌1623718
2021-07-07 13:09:04
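One possible approach, offered as a sketch rather than a definitive answer: `pd.factorize` finds the distinct values of a column and returns an integer code for each row, so no hand-written loop over the classes is needed. The small DataFrame below is made-up sample data standing in for `data/trainFeatures.csv`, and the helper name `encode_columns` is hypothetical. The codes follow order of first appearance, so the exact numbers depend on the data.

```python
import pandas as pd

# Made-up sample data standing in for data/trainFeatures.csv (assumption).
df = pd.DataFrame({
    "age": [39, 50, 38, 53],
    "workclass": ["State-gov", "Private", "Private", "State-gov"],
    "sex": ["Male", "Male", "Female", "Male"],
})
object_features = ["workclass", "sex"]


def encode_columns(frame, columns):
    """Return an encoded copy of frame and a {column: {value: code}} dict.

    Hypothetical helper: codes are assigned in order of first appearance.
    Missing values (NaN) receive the code -1 and are left out of the dict.
    """
    encoded = frame.copy()
    mappings = {}
    for col in columns:
        codes, uniques = pd.factorize(frame[col])
        encoded[col] = codes.astype("int64")
        mappings[col] = {value: code for code, value in enumerate(uniques)}
    return encoded, mappings


encoded_df, mappings = encode_columns(df, object_features)
print(encoded_df.dtypes)
print(mappings["workclass"])
```

Keeping the returned dict matters if the same encoding must later be applied to a test set, for example with `df_test[col].map(mappings[col])`. If the mapping itself is not needed, `df[col].astype('category').cat.codes` is a shorter alternative, though it numbers the categories in sorted order rather than order of appearance.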