libsvm 之 easy.py（流程化脚本）注释

鉴于该脚本的重要性，很有必要对该脚本做一个全面的注释，以便可以灵活的使用libsvm。

#!/usr/bin/env python

# 这种设置python路径的方法更为科学

import sys

import os

from subprocess import *

# 输入参数太少就会提示程序用法

if len(sys.argv) <= 1:

    print('Usage: {0} training_file [testing_file]'.format(sys.argv[0]))

    raise SystemExit

# svm, grid, and gnuplot executable files

is_win32 = (sys.platform == 'win32')

if not is_win32:

    # Linux系统下的程序路径配置

    svmscale_exe = "../svm-scale"

    svmtrain_exe = "../svm-train"

    svmpredict_exe = "../svm-predict"

    grid_py = "./grid.py"

    gnuplot_exe = "/usr/bin/gnuplot"   #需要修改次路径，gnuplot为可执行程序的路径，不是文件夹路径

else:

    # windows系统下的程序路径配置

    svmscale_exe = r"..\windows\svm-scale.exe"

    svmtrain_exe = r"..\windows\svm-train.exe"

    svmpredict_exe = r"..\windows\svm-predict.exe"

    gnuplot_exe = r"C:\gnuplot\bin\gnuplot.exe"

    grid_py = r".\grid.py"

assert os.path.exists(svmscale_exe),"svm-scale executable not found"

assert os.path.exists(svmtrain_exe),"svm-train executable not found"

assert os.path.exists(svmpredict_exe),"svm-predict executable not found"

assert os.path.exists(gnuplot_exe),"gnuplot executable not found"

assert os.path.exists(grid_py),"grid.py not found"

# 创建训练数据集相关的文件：".scale"，".model"，".range"三个文件

train_pathname = sys.argv[1]

assert os.path.exists(train_pathname),"training file not found"

file_name = os.path.split(train_pathname)[1]

scaled_file = file_name + ".scale"

model_file = file_name + ".model"

range_file = file_name + ".range"

# 创建测试数据集相关文件：".scale"，".predict"两个文件

if len(sys.argv) > 2:

    test_pathname = sys.argv[2]

    file_name = os.path.split(test_pathname)[1]

    assert os.path.exists(test_pathname),"testing file not found"

    scaled_test_file = file_name + ".scale"

    predict_test_file = file_name + ".predict"

# 流程化命令一：svm-scale缩放，训练集缩放，参数如下：

cmd = '{0} -s "{1}" "{2}" > "{3}"'.format(svmscale_exe, range_file, train_pathname, scaled_file)

print('Scaling training data...')

Popen(cmd, shell = True, stdout = PIPE).communicate()    

# 流程化命令二：参数选优，使用grid.py脚本，进行交叉验证，参数如下:

cmd = '{0} -svmtrain "{1}" -gnuplot "{2}" "{3}"'.format(grid_py, svmtrain_exe, gnuplot_exe, scaled_file)

print('Cross validation...')

f = Popen(cmd, shell = True, stdout = PIPE).stdout

line = ''

while True:

    last_line = line

    line = f.readline()

    if not line: break

c,g,rate = map(float,last_line.split())

# 输出最优参数c，g

print('Best c={0}, g={1} CV rate={2}'.format(c,g,rate))

# 流程化命令三：svm-train训练，参数设置如下

cmd = '{0} -c {1} -g {2} "{3}" "{4}"'.format(svmtrain_exe,c,g,scaled_file,model_file)

print('Training...')

Popen(cmd, shell = True, stdout = PIPE).communicate()

print('Output model: {0}'.format(model_file))

if len(sys.argv) > 2:

    # 流程化命令四：svm-scale缩放，测试数据缩放，参数设置如下：

    cmd = '{0} -r "{1}" "{2}" > "{3}"'.format(svmscale_exe, range_file, test_pathname, scaled_test_file)

    print('Scaling testing data...')

    Popen(cmd, shell = True, stdout = PIPE).communicate()    

    # 流程化命令五：svm-predict预测，参数设置如下：

    cmd = '{0} "{1}" "{2}" "{3}"'.format(svmpredict_exe, scaled_test_file, model_file, predict_test_file)

    print('Testing...')

    Popen(cmd, shell = True).communicate()    

    print('Output prediction: {0}'.format(predict_test_file))

libsvm 之 easy.py（流程化脚本）注释的更多相关文章

【转】Windows下使用libsvm中的grid.py和easy.py进行参数调优
libsvm中有进行参数调优的工具grid.py和easy.py可以使用,这些工具可以帮助我们选择更好的参数,减少自己参数选优带来的烦扰. 所需工具:libsvm.gnuplot 本机环境:Windo ...
libsvm easy.py ValueError: need more than 0 values to unpack windows下终极解决
现象是: python easy.py train test 输出: Scaling training data...WARNING: original #nonzeros 100389 new #n ...
Atitit usrqbg1834 html的逻辑化流程化规范标准化解决方案
Atitit usrqbg1834 html的逻辑化流程化规范标准化解决方案常用指令1 ..v-if.v-else指令2 v-for指令3 MVVM大比拼4 常用指令本来按照Vue文档说明,常用 ...
Selenium2学习-018-WebUI自动化实战实例-016-自动化脚本编写过程中的登录验证码问题
日常的 Web 网站开发的过程中,为提升登录安全或防止用户通过脚本进行黄牛操作(宇宙最贵铁皮天朝魔都的机动车牌照竞拍中),很多网站在登录的时候,添加了验证码验证,而且验证码的实现越来越复杂,对其进行脚 ...
easy.py使用中ValueError: could not convert string to float: svm_options错误问题解决
在使用easy.py中出现如下图所示问题解决方法: 1.找到cmd = '{0} -svmtrain "{1}" -gnuplot "{2}" "{ ...
Centos7 系统初试化脚本
系统初始化设置 # 设置主机名,永久修改,再次登陆生效 hostnamectl set-hostname xxxxx # 安装eprl源,常用命令 yum install -y wget && ...
Wifite.py 修正版脚本代码
Kali2.0系统自带的WiFite脚本代码中有几行错误,以下是修正后的代码: #!/usr/bin/python # -*- coding: utf-8 -*- """ ...
JsDoc脚本注释文档生成
使用jsDoc可使用特定注释,将注释的内容生成文档,可用于生成脚本库的API文档 jsdoc 文档: http://usejsdoc.org/
Linux启动流程和脚本服务-6
授课笔记:----------------------------------- linux系统启动流程:一.初始化阶段:1.grub引导界面2.识别硬件3.初始化驱动二.加载/etc/rc.d/r ...

随机推荐

2016年10月17日星期一 --出埃及记 Exodus 19:1
2016年10月17日星期一 --出埃及记 Exodus 19:1 In the third month after the Israelites left Egypt--on the very d ...
3094 寻找sb4
3094 寻找sb4 时间限制: 1 s 空间限制: 32000 KB 题目等级 : 黄金 Gold 题解查看运行结果题目描述 Description sb有一天和sml吵架了,她 ...
QT笔记之实现阴影窗口
方法一: 代码实现在窗口构造函数中加入:setAttribute(Qt::WA_TranslucentBackground),保证不被绘制上的部分透明重写void paintEvent(QPain ...
iOS静态库和动态库的区别
一.什么是库? 库是共享程序代码的方式,一般分为静态库和动态库. 静态库:链接时完整地拷贝至可执行文件中,被多次使用就有多份冗余拷贝. 动态库:链接时不复制,程序运行时由系统动态加载到内存,供程序调用 ...
SqlSever基础一个条件group by 一列有两个内容，分组并查看每个内容有多少行，并用as起名
镇场诗:---大梦谁觉,水月中建博客.百千磨难,才知世事无常.---今持佛语,技术无量愿学.愿尽所学,铸一良心博客.------------------------------------------ ...
如何重置CentOS/RHEL 7中遗忘的根用户帐户密码
你有没有遇到过这种情况:想不起来Linux系统上的用户帐户密码?要是你忘了根用户密码,情况就更为糟糕.你无法执行任何面向整个系统的变更.要是你忘了用户密码,很容易使用根帐户来重置密码. 可要是你忘了根 ...
（1）创建一个叫做机动车的类：属性：车牌号(String)，车速(int)，载重量(double) 功能：加速(车速自增)、减速(车速自减)、修改车牌号，查询车的载重量。编写两个构造方法：一个没有形参，在方法中将车牌号设置“XX1234”，速度设置为100，载重量设置为100；另一个能为对象的所有属性赋值；（2）创建主类：在主类中创建两个机动车对象。
package a; public class Jidongche { private String chepaihao; private int chesu; private double zaiz ...
node-webkit中使用sqlite3（MAC平台）
前言最近使用node-webkit开发一款博客发布软件,来替换难用的Windows Live Writer(主要是对Markdown标签的支持很差劲).为了解决博文信息临时保存的问题,想到了使用sq ...
CUBRID学习笔记 41 sql语法之select
cubrid的中sql查询语法 SELECT [ ] [{TO | INTO} ][FROM ] [WHERE ][GROUP BY {col_name | expr} [ASC | DESC], . ...
hdu 1348 （凸包求周长）
链接:http://acm.hdu.edu.cn/showproblem.php?pid=1348 Wall Time Limit: 2000/1000 MS (Java/Others) Mem ...

libsvm 之 easy.py（流程化脚本）注释

libsvm 之 easy.py（流程化脚本）注释的更多相关文章

随机推荐

热门专题