显著性,又称统计显著性(Statistical significance), 是指零假设为真的情况下拒绝零假设所要承担的风险水平,又叫概率水平,或者显著水平. [1] 显著性的含义是指两个群体的态度之间的任何差异是由于系统因素而不是偶然因素的影响.我们假定控制了可能影响两个群体之间差异的所有其他因素,因此,余下的解释就是我们所推断的因素,而这个因素不能够100%保证,所以有一定的概率值,叫显著性水平(Significant level). 总的来说,它表示群体之间得以相互区别的能力.在统计假设检…
Fisher's exact test[1][2][3] is a statistical significance test used in the analysis ofcontingency tables. Although in practice it is employed when sample sizes are small, it is valid for all sample sizes. Fisher's exact test( 费希尔精确检验)      是用于分析列联表(…
虽然我们现在完全没有甲基化的数据,但还是可以了解一下. 什么是DNA甲基化,与组蛋白修饰有什么联系? DNA Methylation and Its Basic Function 表观的定义就是DNA序列不变.表型变了.最重要的就是DNA的修饰(甲基化)和组蛋白的修饰(chip-seq里面的那几种) 为什么说甲基化比SNP在早筛方面要靠谱得多? DNA甲基化测序的基本原理?Bisulfite Treatment,会特异的改变原来的甲基化和非甲基化的序列,无甲基化的C全部变为U. Pan-Canc…
INTRODUCTION TO BIOINFORMATICS      这套教程源自Youtube,算得上比较完整的生物信息学领域的视频教程,授课内容完整清晰,专题化的讲座形式,细节讲解比国内的京师大学堂的Mooc教程好过10000倍.下面是视频的快速链接还有文档讲义哦,很好的东东,链接分享给国内的朋友们. =课程主页:http://ocw.metu.edu.tr/course/view.php?id=37,    Instructor: Tolga CAN    Added: 18 Novem…
Source: http://mindhive.mit.edu/book/export/html 1. What is the multiple-comparison problem? What is familywise error correction (FWE)? To start, Nichols and Hayasaka (PthresholdPapers) provide an excellent introduction to the issue of FWE in neuroim…
翻译来自:http://news.csdn.net/article_preview.html?preview=1&reload=1&arcid=2825492 摘要:本文解释了回归分析及其优势,重点总结了应该掌握的线性回归.逻辑回归.多项式回归.逐步回归.岭回归.套索回归.ElasticNet回归等七种最常用的回归技术及其关键要素,最后介绍了选择正确的回归模型的关键因素. [编者按]回归分析是建模和分析数据的重要工具.本文解释了回归分析的内涵及其优势,重点总结了应该掌握的线性回归.逻辑回归…
plot a critical difference diagram , MATLAB code 建立criticaldifference函数 function cd = criticaldifference(s,labels,alpha) % % CRITICALDIFFERNCE - plot a critical difference diagram % % CRITICALDIFFERENCE(S,LABELS) produces a critical difference diagra…
A Complete Tutorial on Tree Based Modeling from Scratch (in R & Python) MACHINE LEARNING PYTHON R   SHARE      MANISH SARASWAT, APRIL 12, 2016 / 52     Introduction Tree based learning algorithms are considered to be one of the best and mostly used s…
1.参考 reference 1. tutorial主页:http://www.bcs.rochester.edu/people/raizada/fmri-matlab.htm. 2.speech_brain_images.mat数据:speech_brain_images.mat. 3.showing_brain_images_tutorial显示大脑图像代码:showing_brain_images_tutorial.m . 4.overlaying_Tmaps_tutorial.m叠加t检…
转化自:https://trainings.analyticsvidhya.com/courses/course-v1:AnalyticsVidhya+LPDS2019+LPDS2019_T1/courseware/73167b5cca8447dfa535a80d3961dc61/1abd27dfd2a140d7b3c252432342cc71/?child=first 什么是决策树?它是如何工作的? 决策树是一种监督学习算法,常用于分类问题,可以工作于类别(categorical)和连续(co…
python信用评分卡(附代码,博主录制) https://study.163.com/course/introduction.htm?courseId=1005214003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share 变量筛选Variables Selection in Predictive Analytics Predictive Analytics: Variables Sele…
  欢迎关注博主主页,学习python视频资源,还有大量免费python经典文章 python风控评分卡建模和风控常识 https://study.163.com/course/introduction.htm?courseId=1005214003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share Toby,项目合作QQ:231469242 Credit Scorecards – Intr…
sklearn实战-乳腺癌细胞数据挖掘(博客主亲自录制视频教程) https://study.163.com/course/introduction.htm?courseId=1005269003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share https://en.wikipedia.org/wiki/Omnibus_test Omnibus tests are a kind of st…
更多来自:   www.vipcoursea.com   Ethics 部分 Objective of codes and standard:永远是为了maintain public trust in 1.Financial market  2.Investment profession 6个code of ethics 1.Code 1—ethics and pertinent d persons a. 2.Code 2---primacy of client’s interest a.Int…
在做基因表达分析时必然会要做差异分析(DE) DE的方法主要有两种: Fold change t-test fold change的意思是样本质检表达量的差异倍数,log2 fold change的意思是取log2,这样可以可以让差异特别大的和差异比较小的数值缩小之间的差距. Let's say there are 50 read counts in control and 100 read counts in treatment for gene A. This means gene A is…
Connect to Spark from R. The sparklyr package provides a complete dplyr backend. Filter and aggregate Spark datasets then bring them into R for analysis and visualization. Use Spark<u+2019>s distributed machine learning library from R. Create extens…
Week 2 OverviewHelp Center Week 2 On this page: Instructional Activities Time Goals and Objectives Key Phrases/Concepts Guiding Questions Readings and Resources Video Lectures Tips for Success Getting and Giving Help Instructional Activities Below is…
Multiple Regression What is multiple regression? Multiple regression is regression analysis with more than one independent variable. It is used to quantify the influence of two or more independent variables on a dependent variable. The general multip…
Correlation and Regression Sample Covariance The covariance between two random variables is a statistical measure of the degree to which the two variables move together. The covariance captures the linear relationship between two variables. A positiv…
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录视频) https://study.163.com/course/introduction.htm?courseId=1005269003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share 医药统计项目QQ:231469242 fisher's exact test算法来自超几何分布 python代码 https://docs.scipy.o…
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录视频) https://study.163.com/course/introduction.htm?courseId=1005269003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share 医药统计项目联系:QQ231469242 # -*- coding: utf-8 -*- # Import standard packages impor…
sklearn实战-乳腺癌细胞数据挖掘(博主亲自录制视频教程) https://study.163.com/course/introduction.htm?courseId=1005269003&utm_campaign=commission&utm_source=cp-400000000398149&utm_medium=share 医药统计项目联系QQ:231469242 P值:观察到极端值的概率 观察到的概率越低,结果就越显著.观察到概率低于P值时,认为足够证据支持H1(显著…
R in Nutshell 前言 例子(nutshell包) 本书中的例子包括在nutshell的R包中,使用数据,需加载nutshell包 install.packages("nutshell") 第一部分:基础 第一章 批处理(Batch Mode) R provides a way to run a large set of commands in sequence and save the results to a file. 以batch mode运行R的一种方式是:使用系统…
Written by Khang Nguyen Vo, khangvo88@gmail.com, for the RobustTechHouse blog. Khang is a graduate from the Masters of Quantitative and Computational Finance Program, John Von Neumann Institute 2014. He is passionate about research in machine learnin…
Chip-seq peak annontation Chip-seq peak annontation PeRl narrowPeak/boardPeak narrowPeak/boardPeak 是ENCODE可提供下载的两种 Chip-seq 经过参考人类基因组mapping后的关于peak的数据. 其他类型的seq数据储存个数可以参看FAQformat narrowPeak 数据按照以下规则储存: 1. string chrom: "Reference sequence chromosom…
T distribution 定义 在概率论和统计学中,学生t-分布(t-distribution),可简称为t分布,用于根据小样本来估计 呈正态分布且方差未知的总体的均值.如果总体方差已知(例如在样本数量足够多时),则应该用正态分布来估计总体均值. In probability and statistics, Student's t-distribution (or simply the t-distribution) is any member of a family of continuo…
Enhancements in AWR Baselines A baseline is any set of snapshots taken over a period of time. The snapshots are selected such that they yield a set of baselines that change during the period of time that you're collecting data. The baseline captures…
Case Study The case study Physicians' Reactions sought to determine whether physicians spend less time with obese patients. Physicians were sampled randomly and each was shown a chart of a patient complaining of a migraine headache. They were then as…
笔记-python-standard library-9.6 random 1.      random source code:Lib/random.py 1.1.    functions for integers random.randrange(stop) random.randrange(start, stop[, step]) 从range(start, stop, step)中返回一个随机选择的元素.注意并不会生成一个range对象. random.randint(a,b) ret…
https://www.analyticsvidhya.com/blog/2015/08/comprehensive-guide-regression/ What is Regression Analysis? Why do we use Regression Analysis? What are the types of Regressions? Linear Regression Logistic Regression Polynomial Regression Stepwise Regre…