machine learning in coding(python):根据关键字合并多个表(

machine learning in coding(python):根据关键字合并多个表(构建组合feature)

分类:scikit-learnmachine learning in coding

三张表;train_set.csv;test_set.csv;feature.csv。三张表通过object_id关联。

import pandas as pdimport numpy as np# load training and test datasetstrain = pd.read_csv('../input/train_set.csv')test = pd.read_csv('../input/test_set.csv')features = pd.read_csv('../input/feature.csv')train = pd.merge(train,features,on='object_id',how='inner')test = pd.merge(test,features,on='object_id',how='inner')# drop useless columns and create labelstest = test.drop(['id', 'object_id'], axis = 1)labels = train.cost.valuestrain = train.drop(['object_id', 'cost'], axis = 1)# convert data to numpy array train = np.array(train) test = np.array(test)from:kaggle

版权声明:本文为博主原创文章,未经博主允许不得转载。

上一篇scikit-learn:External Resources, Videos and Talks下一篇scikit-learn(工程中用的相对较多的模型介绍):1.4. Support Vector Machines

顶1踩0

,怪天怪地,我都不会怪你,你有选择幸福的权利…

machine learning in coding(python):根据关键字合并多个表(

相关文章:

你感兴趣的文章:

标签云: