我是 Python 新手,如果太簡(jiǎn)單,請(qǐng)?zhí)崆爸虑?。我的代碼是# Split datay = starbucks_smote.iloc[:, -1]X = starbucks_smote.drop('label', axis = 1)# Count labels by typecounter = Counter(y)print(counter)Counter({0: 9634, 1: 2895})# Transform the datasetoversample = SMOTE()X, y = oversample.fit_resample(X, y)# Print the oversampled datasetcounter = Counter(y)print(counter)Counter({0: 9634, 1: 9634})如何保存過采樣數(shù)據(jù)集以備將來使用?我試過data_res = np.concatenate((X, y), axis = 1)data_res.to_csv('sample_smote.csv')出錯(cuò)了ValueError: all the input arrays must have same number of dimensions,?but the array at index 0 has 2 dimension(s) and the array at index 1 has 1 dimension(s)感謝任何提示!
1 回答

紫衣仙女
TA貢獻(xiàn)1839條經(jīng)驗(yàn) 獲得超15個(gè)贊
您可以創(chuàng)建數(shù)據(jù)框:
data_res?=?pd.DataFrame(X) data_res['y']?=?y
然后保存data_res
到 CSV。
基于連接 od 的解決方案numpy.arrays
也是可能的,但np.vstack
需要使維度兼容:
data_res?=?np.concatenate((X,?np.vstack(y)),?axis?=?1) data_res?=?pd.DataFrame(data_res)
添加回答
舉報(bào)
0/150
提交
取消