集成模型python实现,随机森林,梯度提升决策树
2017-06-02 23:36
302 查看
import pandas as pd; titanic = pd.read_csv('http://biostat.mc.vanderbilt.edu/wiki/pub/Main/DataSets/titanic.txt') X = titanic[['pclass', 'age', 'sex']] y = titanic['survived'] X['age'].fillna(X['age'].mean(), inplace = True) from sklearn.cross_validation import train_test_split X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.25, random_state = 33) from sklearn.feature_extraction import DictVectorizer vec = DictVectorizer(sparse = False) X_train = vec.fit_transform(X_train.to_dict(orient = 'record')) X_test = vec.transform(X_test.to_dict(orient = 'record')) from sklearn.tree import DecisionTreeClassifier dtc = DecisionTreeClassifier() dtc.fit(X_train, y_train) dtc_y_pred = dtc.predict(X_test) from sklearn.ensemble import RandomForestClassifier rfc = RandomForestClassifier() rfc.fit(X_train, y_train) rfc_y_pred = rfc.predict(X_test) from sklearn.ensemble import GradientBoostingClassifier gbc = GradientBoostingClassifier() gbc.fit(X_train, y_train) gbc_y_pred = gbc.predict(X_test) from sklearn.metrics import classification_report print 'The accuracy of decision tree is: ', dtc.score(X_test, y_test) print classification_report(dtc_y_pred, y_test) print 'The accuracy of random forest tree is: ', rfc.score(X_test, y_test) print classification_report(rfc_y_pred, y_test) print 'The accuracy of gradient tree boosting is: ', gbc.score(X_test, y_test) print classification_report(gbc_y_pred, y_test)
相关文章推荐
- python中使用集成模型,随机森林分类器,梯度提升决策树性能模型分析 可视化
- 单一决策树与集成模型(随机森林分类器、梯度提升决策树)的比较
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- 【ML笔记】梯度提升决策树(GBDT)和随机森林(RF)的异同
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- 决策树(CART)、随机森林、GBDT(GBRT)新手导读及资料推荐,附加python实现代码
- 决策树与随机森林相关概念及其Python实现
- 机器学习总结(四)——随机森林与GBDT(梯度提升决策树)
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- python实现集成回归算法,包括随机森林,极端随机森林,梯度boosting算法
- CART分类决策树、回归树和模型树算法详解及Python实现
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- 决策树与随机森林相关概念及其Python实现
- 干货|从决策树到随机森林:树型算法的实现原理与Python 示例