RankLib参数翻译

写在前面，metric2t指标详解：

NDCG（Normalized discounted cumulative gain）即DCG/IDCG
CG（cumulative gain）
DCG（Discounted Cumulative Gain）
MAP(Mean Average Precision)
MRR(Mean Reciprocal Rank)

Usage: java -jar RankLib.jar <Params>
Params:
[+] Training (+ tuning and evaluation)
-train <file> 训练数据
-ranker <type> 指定要使用的排名算法
0: MART (gradient boosted regression tree)
1: RankNet
2: RankBoost
3: AdaRank
4: Coordinate Ascent
6: LambdaMART
7: ListNet
8: Random Forests
[ -feature <file> ] 特征描述文件：列出要学习的特征，如果不指定，默认使用所有特征。
[ -metric2t <metric> ] Metric to optimize on the training data. Supported: MAP, NDCG@k, DCG@k, P@k, RR@k, ERR@k (default=ERR@10)
[ -gmax <label> ] Highest judged relevance label. It affects the calculation of ERR (default=4, i.e. 5-point scale {0,1,2,3,4})
[ -silent ] Do not print progress messages (which are printed by default)
[ -validate <file> ] 是否在验证数据集上调整模型 Specify if you want to tune your system on the validation data (default=unspecified). If specified, the final model will be the one that performs best on the validation data
[ -tvs <x \in [0..1]> ] 训练-验证数据集的分割比例 Set train-validation split to be (x)(1.0-x)
[ -save <model> ] 学习的模型保存到指定文件 Save the learned model to the specified file (default=not-save)
[ -test <file> ] 是否要在数据上测试训练的模型 Specify if you want to evaluate the trained model on this data (default=unspecified)
[ -tts <x \in [0..1]> ] 训练-测试数据集的分割比例 Set train-test split to be (x)(1.0-x). -tts will override -tvs
[ -metric2T <metric> ] 默认与metric2t一样 Metric to evaluate on the test data (default to the same as specified for -metric2t)
[ -norm <method>] 归一化特征向量，方法包括求和归一化，均值/标准差归一化，最大值/最小值归一化 Normalize feature vectors (default=no-normalization). Method can be:
sum: normalize each feature by the sum of all its values
zscore: normalize each feature by its mean/standard deviation
linear: normalize each feature by its min/max values
[ -kcv <k> ] 在训练数据集上执行交叉验证 Specify if you want to perform k-fold cross validation using ONLY the specified training data (default=NoCV)
-tvs can be used to further reserve a portion of the training data in each fold for validation
[ -kcvmd <dir> ] 交叉验证训练模型的目录 Directory for models trained via cross-validation (default=not-save)
[ -kcvmn <model> ] Name for model learned in each fold. It will be prefix-ed with the fold-number (default=empty)
[-] RankNet-specific parameters RankNet特定参数
[ -epoch <T> ] 训练迭代次数 The number of epochs to train (default=100)
[ -layer <layer> ] 隐含层个数 The number of hidden layers (default=1)
[ -node <node> ] 每层隐含节点个数 The number of hidden nodes per layer (default=10)
[ -lr <rate> ] 学习率 Learning rate (default=0.00005)
[-] RankBoost-specific parameters RankBoost特定参数
[ -round <T> ] 训练迭代次数 The number of rounds to train (default=300)
[ -tc <k> ] 搜索的阈值候选个数 Number of threshold candidates to search. -1 to use all feature values (default=10)
[-] AdaRank-specific parameters AdaRank特定参数
[ -round <T> ] 训练迭代次数 The number of rounds to train (default=500)
[ -noeq ] Train without enqueuing too-strong features (default=unspecified)
[ -tolerance <t> ] 连续两轮学习之间的误差 Tolerance between two consecutive rounds of learning (default=0.002)
[ -max <times> ] 一个特征可以被连续选择而不改变性能的最大次数 The maximum number of times can a feature be consecutively selected without changing performance (default=5)
[-] Coordinate Ascent-specific parameters
[ -r <k> ] The number of random restarts (default=5)
[ -i <iteration> ] The number of iterations to search in each dimension (default=25)
[ -tolerance <t> ] Performance tolerance between two solutions (default=0.001)
[ -reg <slack> ] Regularization parameter (default=no-regularization)
[-] {MART, LambdaMART}-specific parameters LanbdaMART特定参数
[ -tree <t> ] 树的个数 Number of trees (default=1000)
[ -leaf <l> ] 每个树的叶子个数 Number of leaves for each tree (default=10)
[ -shrinkage <factor> ] 学习率 Shrinkage, or learning rate (default=0.1)
[ -tc <k> ] 树分割时的候选特征个数 Number of threshold candidates for tree spliting. -1 to use all feature values (default=256)
[ -mls <n> ] 一个叶子最少的样本个数 Min leaf support -- minimum #samples each leaf has to contain (default=1)
[ -estop <e> ] Stop early when no improvement is observed on validaton data in e consecutive rounds (default=100)
[-] Random Forests-specific parameters 随机森林特定参数
[ -bag <r> ] Number of bags (default=300)
[ -srate <r> ] Sub-sampling rate (default=1.0)子集采样率
[ -frate <r> ] Feature sampling rate (default=0.3) 特征采样率
[ -rtype <type> ] Ranker to bag (default=0, i.e. MART)
[ -tree <t> ] Number of trees in each bag (default=1) 树个数
[ -leaf <l> ] Number of leaves for each tree (default=100) 每个树的叶节点个数
[ -shrinkage <factor> ] Shrinkage, or learning rate (default=0.1) 学习率
[ -tc <k> ] 树分割时使用的候选特征阈值个数 Number of threshold candidates for tree spliting. -1 to use all feature values (default=256)
[ -mls <n> ] Min leaf support -- minimum #samples each leaf has to contain (default=1)
[ -estop <e> ] Stop early when no improvement is observed on validaton data in e consecutive rounds (default=100)
[+] Testing previously saved models 测试已经保存的模型
-load <model> 加载模型 The model to load
-test <file> 测试数据Test data to evaluate the model (specify either this or -rank but not both)
-rank <file> 对制定文件中的样本排序，与-test不能同时指定 Rank the samples in the specified file (specify either this or -test but not both)
[ -metric2T <metric> ] Metric to evaluate on the test data (default=ERR@10)
[ -gmax <label> ] Highest judged relevance label. It affects the calculation of ERR (default=4, i.e. 5-point scale {0,1,2,3,4})
[ -score <file> ] Store ranker's score for each object being ranked (has to be used with -rank)
[ -idv ] 打印单个排名列表上的性能（必须与-test一起使用） Print model performance (in test metric) on individual ranked lists (has to be used with -test)
[ -norm ] 特征归一化 Normalize feature vectors (similar to -norm for training/tuning)

RankLib参数翻译的更多相关文章

JVM 参数翻译汉化解释
博客搬家,新地址:http://www.zicheng.net/article/38.htm Behavioral Options(行为参数) Option and Default Value Des ...
hbase-default.xml（Hbase 默认参数翻译）
hbase.tmp.dir ${java.io.tmpdir}/hbase-${user.name} 本地文件系统上的临时目录.将'/tmp'改为其他可以持久保存文件的位置,通常能够解决java. ...
以太坊客户端Geth命令用法-参数详解
Geth在以太坊智能合约开发中最常用的工具(必备开发工具),一个多用途的命令行工具. 熟悉Geth可以让我们有更好的效率,大家可收藏起来作为Geth命令用法手册. 本文主要是对geth help的翻译 ...
以太坊客户端Geth命令用法-参数详解【转载】
原文链接:http://www.cnblogs.com/tinyxiong/p/7918706.html Geth在以太坊智能合约开发中最常用的工具(必备开发工具),一个多用途的命令行工具.熟悉Get ...
Spark配置参数的三种方式
1.Spark 属性Spark应用程序的运行是通过外部参数来控制的,参数的设置正确与否,好与坏会直接影响应用程序的性能,也就影响我们整个集群的性能.参数控制有以下方式:(1)直接设置在SparkCon ...
nginx 常见参数以及重定向参数配置
nginx 各参数翻译,作用 $arg_PARAMETER #这个变量包含GET请求中,如果有变量PARAMETER时的值. $args #这个变量等于请求行中(GET请求)的参数,例如foo=123 ...
PHP的CURL方法curl_setopt()函数案例介绍(抓取网页,POST数据)
通过curl_setopt()函数可以方便快捷的抓取网页(采集很方便),curl_setopt 是php的一个扩展库使用条件:需要在php.ini 中配置开启.(PHP 4 >= 4.0.2) ...
nginx常用命令
ps -ef | grep nginx在进程列表里面找master进程,它的编号就是主进程号了. 步骤2:发送信号从容停止Nginx: kill -QUIT 主进程号快速停止Nginx: kill ...
Linux磁盘与文件系统概念理解
磁盘级别概念这里讲的主要是网上所谓的老式磁盘,它是由一个个盘片组成的,我们先从个盘片结构讲起.如图1所示,图中的一圈圈灰色同心圆为一条条磁道,从圆心向外画直线,可以将磁道划分为若干个弧段,每 ...

随机推荐

getParameterMap()的返回值为Map<String, String[]>，从其中取得请求参数转为Map<String, String>的方法如下：
直接遍历报错:[Ljava.lang.String;@44739f3f Map<String, String> tempMap = new HashMap<String, Strin ...
RDLC开发笔记
1.当报表中使用FormatPercent求百分数时,若分母有小数,务必使用CDbl将其转为double类型,否则会报“错误号”.
Linux下如何修改Apache根目录
创建自己需要更改的新目录,并更改宿主权限为apache用户 chown -R apache:apache /mnt/www 停止Apache服务 service httpd stop 3 修改Ap ...
python SimpleHTTPRequestHandler初探
1,转自 https://blog.gtwang.org/web-development/python-simplehttpserver-web-server/ 如果你急需一个简单的Web Serve ...
sqlserver 事务日志过大收缩方法解决方案
sqlserver 事务日志过大,可能会导致备份失败或者数据库出现异常,所以要定期清除sqlserver 事务日志建议:为了防止日志文件无限扩大,可以对日志文件作一些限制. 清除sqlserver事 ...
Zotero 使用指南
DownLoad Page: https://onedrive.live.com/redir?resid=5084666E7B16AA85!109&authkey=!ABHQp7yfMnLpE ...
FlexBox标准及兼容写法速查表
FlexBox标准写法: 支持浏览器: IE11, Chrome29+, FireFox 20+, Safari加前缀 -webkit- 概述: 总的来说就是12个属性; 关于容器的6个,5个单一属 ...
category - junit用例分组执行
一.category 和 testSuite的比较 (1)testSuite是类级分组(xx.class) (2)category是用例级分组(@Test) (3)category是testSuite ...
绑定多个ddl
添加材料,需要绑定材料类型.设备名称.省份和所属终端客户等信息,前台页面如下: 前台.aspx <asp:Content ID="Content2" ContentPlace ...
The different of mouseover and mouseenter
l论事件onmouseover 和 onmouseenter:同类比较onmouseout 和 onmouseleave; 使用onmouseover时,鼠标除了被设置事件的元素,还会触发其子元素: ...

RankLib参数翻译

RankLib参数翻译的更多相关文章

随机推荐

热门专题