数据挖掘方面重要会议的最佳paper集合,兴许将陆续分析一下内容:

主要有KDD、SIGMOD、VLDB、ICML、SIGIR

KDD (Data Mining)

2013

Simple and Deterministic Matrix
Sketching

Edo Liberty, Yahoo! Research

2012

Searching
and Mining Trillions of Time Series Subsequences under Dynamic Time Warping

Thanawin Rakthanmanon, University of California Riverside; et al.

2011

Leakage
in Data Mining: Formulation, Detection, and Avoidance

Shachar Kaufman, Tel-Aviv University; et al.

2010

Large linear classification
when data cannot fit in memory

Hsiang-Fu Yu, National Taiwan University; et al.

Connecting the dots between news
articles

Dafna Shahaf & Carlos Guestrin, Carnegie Mellon University

2009

Collaborative Filtering with
Temporal Dynamics

Yehuda Koren, Yahoo! Research

2008

Fastanova:
an efficient algorithm for genome-wide association study

Xiang Zhang, University of North Carolina at Chapel Hill; et al.

2007

Predictive
discrete latent factor models for large scale dyadic data

Deepak Agarwal & Srujana Merugu, Yahoo! Research

2006

Training linear SVMs in linear time

Thorsten Joachims, Cornell University

2005

Graphs
over time: densification laws, shrinking diameters and possible explanations

Jure Leskovec, Carnegie Mellon University; et al.

2004

A probabilistic
framework for semi-supervised clustering

Sugato Basu, University of Texas at Austin; et al.

2003

Maximizing the
spread of influence through a social network

David Kempe, Cornell University; et al.

2002

Pattern discovery
in sequences under a Markov assumption

Darya Chudova & Padhraic Smyth, University of California Irvine

2001

Robust space
transformations for distance-based operations

Edwin M. Knorr, University of British Columbia; et al.

2000

Hancock:
a language for extracting signatures from data streams

Corinna Cortes, AT&T Laboratories; et al.

1999

MetaCost:
a general method for making classifiers cost-sensitive

Pedro Domingos, Universidade Técnica de Lisboa

1998

Occam's Two Razors: The
Sharp and the Blunt

Pedro Domingos, Universidade Técnica de Lisboa

1997

Analysis
and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Di...

Foster Provost & Tom Fawcett, NYNEX Science and Technology

SIGMOD (Databases)

2013

Massive Graph Triangulation

Xiaocheng Hu, The Chinese University of Hong Kong; et al.

2012

High-Performance
Complex Event Processing over XML Streams

Barzan Mozafari, Massachusetts Institute of Technology; et al.

2011

Entangled
Queries: Enabling Declarative Data-Driven Coordination

Nitin Gupta, Cornell University; et al.

2010

FAST:
fast architecture sensitive tree search on modern CPUs and GPUs

Changkyu Kim, Intel; et al.

2009

Generating example data for
dataflow programs

Christopher Olston, Yahoo! Research; et al.

2008

Serializable isolation for
snapshot databases

Michael J. Cahill, University of Sydney; et al.

Scalable Network
Distance Browsing in Spatial Databases

Hanan Samet, University of Maryland; et al.

2007

Compiling mappings
to bridge applications and databases

Sergey Melnik, Microsoft Research; et al.

Scalable Approximate
Query Processing with the DBO Engine

Christopher Jermaine, University of Florida; et al.

2006

To
search or to crawl?: towards a query optimizer for text-centric tasks

Panagiotis G. Ipeirotis, New York University; et al.

2004

Indexing
spatio-temporal trajectories with Chebyshev polynomials

Yuhan Cai & Raymond T. Ng, University of British Columbia

2003

Spreadsheets in RDBMS for OLAP

Andrew Witkowski, Oracle; et al.

2001

Locally
adaptive dimensionality reduction for indexing large time series databases

Eamonn Keogh, University of California Irvine; et al.

2000

XMill: an efficient compressor
for XML data

Hartmut Liefke, University of Pennsylvania

Dan Suciu, AT&T Laboratories

1999

DynaMat:
a dynamic view management system for data warehouses

Yannis Kotidis & Nick Roussopoulos, University of Maryland

1998

Efficient
transparent application recovery in client-server information systems

David Lomet & Gerhard Weikum, Microsoft Research

Integrating
association rule mining with relational database systems: alternatives and implications

Sunita Sarawagi, IBM Research; et al.

1997

Fast parallel
similarity search in multimedia databases

Stefan Berchtold, University of Munich; et al.

1996

Implementing data cubes efficiently

Venky Harinarayan, Stanford University; et al.

VLDB (Databases)

2013

DisC
Diversity: Result Diversification based on Dissimilarity and Coverage

Marina Drosou & Evaggelia Pitoura, University of Ioannina

2012

Dense
Subgraph Maintenance under Streaming Edge Weight Updates for Real-time Story Identification

Albert Angel, University of Toronto; et al.

2011

RemusDB: Transparent
High-Availability for Database Systems

Umar Farooq Minhas, University of Waterloo; et al.

2010

Towards Certain Fixes
with Editing Rules and Master Data

Shuai Ma, University of Edinburgh; et al.

2009

A Unified Approach
to Ranking in Probabilistic Databases

Jian Li, University of Maryland; et al.

2008

Finding Frequent Items in Data
Streams

Graham Cormode & Marios Hadjieleftheriou, AT&T Laboratories

Constrained Physical Design Tuning

Nicolas Bruno & Surajit Chaudhuri, Microsoft Research

2007

Scalable
Semantic Web Data Management Using Vertical Partitioning

Daniel J. Abadi, Massachusetts Institute of Technology; et al.

2006

Trustworthy
Keyword Search for Regulatory-Compliant Records Retention

Soumyadeb Mitra, University of Illinois at Urbana-Champaign; et al.

2005

Cache-conscious
Frequent Pattern Mining on a Modern Processor

Amol Ghoting, Ohio State University; et al.

2004

Model-Driven Data Acquisition
in Sensor Networks

Amol Deshpande, University of California Berkeley; et al.

2001

Weaving Relations for Cache Performance

Anastassia Ailamaki, Carnegie Mellon University; et al.

1997

Integrating Reliable Memory in Databases

Wee Teck Ng & Peter M. Chen, University of Michigan

ICML (Machine Learning)

2013

Vanishing Component Analysis

Roi Livni, The Hebrew University of Jerusalum; et al.

Fast Semidifferential-based
Submodular Function Optimization

Rishabh Iyer, University of Washington; et al.

2012

Bayesian
Posterior Sampling via Stochastic Gradient Fisher Scoring

Sungjin Ahn, University of California Irvine; et al.

2011

Computational
Rationalization: The Inverse Equilibrium Problem

Kevin Waugh, Carnegie Mellon University; et al.

2010

Hilbert Space Embeddings
of Hidden Markov Models

Le Song, Carnegie Mellon University; et al.

2009

Structure preserving embedding

Blake Shaw & Tony Jebara, Columbia University

2008

SVM Optimization:
Inverse Dependence on Training Set Size

Shai Shalev-Shwartz & Nathan Srebro, Toyota Technological Institute at Chicago

2007

Information-theoretic metric learning

Jason V. Davis, University of Texas at Austin; et al.

2006

Trading convexity for scalability

Ronan Collobert, NEC Labs America; et al.

2005

A support
vector method for multivariate performance measures

Thorsten Joachims, Cornell University

1999

Least-Squares Temporal Difference
Learning

Justin A. Boyan, NASA Ames Research Center

SIGIR (Information Retrieval)

2013

Beliefs and Biases in Web Search

Ryen W. White, Microsoft Research

2012

Time-Based Calibration
of Effectiveness Measures

Mark Smucker & Charles Clarke, University of Waterloo

2011

Find
It If You Can: A Game for Modeling Different Types of Web Search Success Using Interaction Data

Mikhail Ageev, Moscow State University; et al.

2010

Assessing
the Scenic Route: Measuring the Value of Search Trails in Web Logs

Ryen W. White, Microsoft Research

Jeff Huang, University of Washington

2009

Sources of evidence for vertical
selection

Jaime Arguello, Carnegie Mellon University; et al.

2008

Algorithmic
Mediation for Collaborative Exploratory Search

Jeremy Pickens, FX Palo Alto Lab; et al.

2007

Studying
the Use of Popular Destinations to Enhance Web Search Interaction

Ryen W. White, Microsoft Research; et al.

2006

Minimal Test Collections
for Retrieval Evaluation

Ben Carterette, University of Massachusetts Amherst; et al.

2005

Learning
to estimate query difficulty: including applications to missing content detection and dis...

Elad Yom-Tov, IBM Research; et al.

2004

A Formal Study of Information
Retrieval Heuristics

Hui Fang, University of Illinois at Urbana-Champaign; et al.

2003

Re-examining
the potential effectiveness of interactive query expansion

Ian Ruthven, University of Strathclyde

2002

Novelty and redundancy
detection in adaptive filtering

Yi Zhang, Carnegie Mellon University; et al.

2001

Temporal summaries of new topics

James Allan, University of Massachusetts Amherst; et al.

2000

IR
evaluation methods for retrieving highly relevant documents

Kalervo Järvelin & Jaana Kekäläinen, University of Tampere

1999

Cross-language
information retrieval based on parallel texts and automatic mining of parallel text...

Jian-Yun Nie, Université de Montréal; et al.

1998

A theory
of term weighting based on exploratory data analysis

Warren R. Greiff, University of Massachusetts Amherst

1997

Feature
selection, perceptron learning, and a usability case study for text categorization

Hwee Tou Ng, DSO National Laboratories; et al.

1996

Retrieving
spoken documents by combining multiple index sources

Gareth Jones, University of Cambridge; et al.

推荐一个站点,感谢作者的努力搜集,主要是各种顶级会议的最佳论文集合。

http://jeffhuang.com/best_paper_awards.html

数据挖掘方面重要会议的最佳paper集合的更多相关文章

  1. C#最佳工具集合:IDE、分析、自动化工具等

    C#是企业中广泛使用的编程语言,特别是那些依赖微软的程序语言.如果您使用C#构建应用程序,则最有可能使用Visual Studio,并且已经寻找了一些扩展来对您的开发进行管理.但是,这个工具列表可能会 ...

  2. InfoQ一波文章:AdaSearch/JAX/TF_Serving/leon.bottou.org/Neural_ODE/NeurIPS_2018最佳论文

    和 Nested Partition 有相通之处? 伯克利提出 AdaSearch:一种用于自适应搜索的逐步消除方法 在机器学习领域的诸多任务当中,我们通常希望能够立足预先给定的固定数据集找出问题的答 ...

  3. paper 59:招聘

     借Valse宝地发条招聘广告:D[腾讯优图]技术大咖招聘 欢迎各位技术大咖尤其应届优秀毕业生投递简历.简历投递:youtu@tencent.com简历投递,邮件标题请按照以下格式:[腾讯_上海_招聘 ...

  4. CCKS 2018 | 最佳论文:南京大学提出DSKG,将多层RNN用于知识图谱补全

    作者:Lingbing Guo.Qingheng Zhang.Weiyi Ge.Wei Hu.Yuzhong Qu 2018 年 8 月 14-17 日,主题为「知识计算与语言理解」的 2018 全国 ...

  5. FPGA 17最佳论文导读 ESE: Efficient Speech Recognition Engine with Compressed LSTM on FPGA

    欢迎转载,转载请注明:本文出自Bin的专栏blog.csdn.net/xbinworld. 技术交流QQ群:433250724,欢迎对算法.机器学习技术感兴趣的同学加入. 后面陆续写一些关于神经网络加 ...

  6. 数据挖掘学习指引<一>

    对于当前热门的大数据.云计算等技术,被百度.阿里等国内互联网巨头炒的非常火,数据挖掘作为一门非常有用的技术,在商业管理.市场分析.科学计算等大数据方面发挥着大作用. 数据挖掘技术也变得非常火,why? ...

  7. 【转载】R中有关数据挖掘的包

    下面列出了可用于数据挖掘的R包和函数的集合.其中一些不是专门为了数据挖掘而开发,但数据挖掘过程中这些包能帮我们不少忙,所以也包含进来. 1.聚类 常用的包: fpc,cluster,pvclust,m ...

  8. sprint3 【每日scrum】 TD助手站立会议第十天

    站立会议 组员 昨天 今天 困难 签到 刘铸辉 (组长) 团队进入最终的功能测试阶段,准备发布Beta版 和团队发布Beta版,并开总结会议 总结会议 Y 刘静 团队集合软件测试 软件发布 没有 Y ...

  9. 老哥,您看我这篇Java集合,还有机会评优吗?

    集合在我们日常开发使用的次数数不胜数,ArrayList/LinkedList/HashMap/HashSet······信手拈来,抬手就拿来用,在 IDE 上龙飞凤舞,但是作为一名合格的优雅的程序猿 ...

随机推荐

  1. 原型链和new

    http://www.cnblogs.com/objectorl/archive/2010/01/11/Object-instancof-Function-clarification.html 构造器 ...

  2. HTML5 表单与文件

    -新增元素与属性 form.formaction.formmethod.placeholder(处于未输入状态时文本框显示的输入提示).autofocus(自动获取光标焦点).list(该属性的值为某 ...

  3. ThinkPHP接入支付宝支付功能

    最近做系统,需要实现在线支付功能,毫不犹豫,选择的是支付宝的接口支付功能.这里我用的是即时到帐的接口,具体实现的步骤如下: 一.下载支付宝接口包 下载地址:https://b.alipay.com/o ...

  4. mysql中显示方式的切换

    1. mysql中如果使用\G,则':'不用写.如果\G后面跟':'则会报"error:no query specified"错误.请知晓. 2. mysql在登陆时,mysql ...

  5. WPF会重写Windows GUI的历史吗?

    原文地址:http://tech.it168.com/zx/2007-09-15/200709141320653.shtml 你可能对微软的.NET框架3.0版本的最近的一次更新感到有点奇怪.主版本指 ...

  6. OpenSource.com 评出 2014 年十佳开源软件

    Docker 应用容器平台 “电源管理和虚拟化以相同的方式允许我们从服务器利用率中获取最大的利益.如何真正的解决虚拟化,这世界第一难题仍然是普遍存在的.Docker 自从 2013 年开源以来,刚好在 ...

  7. Dungeon Master

    poj2251:http://poj.org/problem?id=2251 题意:给你一个三维的立方体,然后给你一个起点,和终点的坐标.然后让你求从起点到终点的最短路程.题解:该题就是求三维的最短路 ...

  8. libevent带负载均衡的多线程使用示例

    功能: 主线程根据负载工作线程负载均衡算法,每隔一秒钟向特定的工作线程发送一条字符串信息,工作线程简单的把字符串信息打开出来.   Makefile   eventtest : eventtest.c ...

  9. c# 基础连接已经关闭: 连接被意外关闭,错误的解决

    原文:c# 基础连接已经关闭: 连接被意外关闭,错误的解决 调试一个使用HttpWebRequest模拟提交表单的程序的时候频繁出现上述错误提示,google了一下发现了几个解决方案.1.在appli ...

  10. Java switch-case

    首先从原理上来阐述这个问题: switch(表达式){case 常量表达式1:语句1;....case 常量表达式2:语句2;default:语句;}1.default就是如果没有符合的case就执行 ...