【Python学习系列二十】scikit-learn库模型持久化
2017-06-30 11:49
441 查看
场景:需要将模型保存到内存,或磁盘。
代码:
执行结果:
1)采用Python内建的持久性模型 pickle 保存scikit的模型,字符串形式;
2)用joblib(joblib.dump&joblib.load)保存到磁盘,文件形式;
代码:
# -*- coding: utf-8 -*- import pandas as pd import pickle as pkl from sklearn.externals import joblib from sklearn import svm #加载数据 label_ds=pd.read_csv(r"D:\\tmp\\sam_11.csv",sep=',',encoding='utf8',\ names=['u_spu_num','u_brand_num','u_cat_num','u_cat_spu','u_brand_spu','u_spu_date','action_type']) print "训练集,有", label_ds.shape[0], "行", label_ds.shape[1], "列" #模型训练 label_X = label_ds[['u_spu_num','u_brand_num','u_cat_num','u_cat_spu','u_brand_spu','u_spu_date']] label_y = label_ds['action_type']#类别 model = svm.SVC() model.fit(label_X, label_y) print model #模型导出导入磁盘 joblib.dump(model, 'D:\\tmp\\model.pkl') model2 = joblib.load('D:\\tmp\\model.pkl') print model2 #模型保存 s = pkl.dumps(model) model3 = pkl.loads(s) print model3数据集:
0,0,6,6,0,0,1 0,0,2,2,0,0,1 0,0,3,3,0,0,1 0,0,2,2,0,0,1 0,0,0,0,0,0,1 0,0,1,1,0,0,0 0,0,9,9,0,0,0 0,0,1,1,0,0,0 0,0,3,3,0,0,0
执行结果:
训练集,有 9 行 7 列 SVC(C=1.0, cache_size=200, class_weight=None, coef0=0.0, decision_function_shape=None, degree=3, gamma='auto', kernel='rbf', max_iter=-1, probability=False, random_state=None, shrinking=True, tol=0.001, verbose=False) SVC(C=1.0, cache_size=200, class_weight=None, coef0=0.0, decision_function_shape=None, degree=3, gamma='auto', kernel='rbf', max_iter=-1, probability=False, random_state=None, shrinking=True, tol=0.001, verbose=False) SVC(C=1.0, cache_size=200, class_weight=None, coef0=0.0, decision_function_shape=None, degree=3, gamma='auto', kernel='rbf', max_iter=-1, probability=False, random_state=None, shrinking=True, tol=0.001, verbose=False)
1)采用Python内建的持久性模型 pickle 保存scikit的模型,字符串形式;
2)用joblib(joblib.dump&joblib.load)保存到磁盘,文件形式;
相关文章推荐
- 【Python学习系列十七】基于scikit-learn库逻辑回归训练模型(delta比赛代码2)
- 【Python学习系列十六】基于scikit-learn库逻辑回归训练模型(delta比赛代码)
- 【Python学习系列十八】基于scikit-learn库逻辑回归训练模型(delta比赛代码3)
- python机器学习系列教程——深度学习框架比较TensorFlow、Theano、Caffe、SciKit-learn、Keras
- 【Python学习系列二十四】scikit-learn库逻辑回归实现唯品会用户购买行为预测
- 【Python学习系列二十三】Scikit_Learn库降维方法(矩阵分解)-PCA&FA
- Python scikit-learn 学习笔记—鸢尾花模型
- [Python & Machine Learning] 学习笔记之scikit-learn机器学习库
- 【scikit-learn】学习Python来分类现实世界的数据
- 【Python学习系列十】Python机器学习库scikit-learn实现Decision Trees案例
- scikit-learn使用joblib.dump()持久化模型过程中的问题详解--python
- Scikit-learn(python)学习笔记 (不定期更新)
- Python下的机器学习工具scikit-learn(学习笔记4)
- Python下的机器学习工具scikit-learn(学习笔记3--数据预处理)
- Python scikit-learn机器学习工具包学习笔记:cross_validation模块
- 【Python学习系列十三】Python机器学习库scikit-learn实现逻辑回归
- python中sklearn-learn模型持久化
- Python scikit-learn机器学习工具包学习笔记:cross_validation模块
- Python scikit-learn机器学习工具包学习笔记:cross_validation模块
- [Python][MachineLeaning]Python Scikit-learn学习笔记1-Datasets&Estimators