Understand User's Intent from Speech and Text

http://research.microsoft.com/en-us/projects/IntentUnderstanding/

Understanding what users want to do or need to obtain is critical in human-computer interaction. When a natural user interface such as speech or natural language is used, as in a spoken dialogue system or an internet search engine, language understanding becomes an important issue. Intent understanding is about identifying the action a user wants a computer to take, or the information he or she would like to obtain, as conveyed in a spoken utterance or a text query.

In this project, we develop robust data-driven technologies applicable to different domains and make them more practical by leveraging large amounts of unlabeled data via unsupervised/semi-supervised machine learning; by innovating machine learning algorithms that work better with less data or mismatched data; and by augmenting statistical models with domain knowledge obtained in a semi-supervised fashion. Research activities fall into the following areas:

Data-Driven Approaches to Spoken Language/Query Understanding
Unsupervised/Semi-Supervised Learning
Automatic/Semi-automatic Acquisition of Domain Knowledge
Authoring Tools for Spoken Language Understanding
Application of Intent Understanding Technology
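
To make the idea of leveraging unlabeled data concrete, here is a minimal sketch of self-training for query intent classification. It is an illustration only and not the method used in this project: the classifier is a toy naive Bayes model, the function names (train, predict, self_train), the intent labels, the queries, and the 0.7 confidence threshold are all assumptions chosen for the example.

```python
import math
from collections import Counter, defaultdict


def train(labeled):
    """Fit a toy multinomial naive Bayes model from (query, intent) pairs."""
    word_counts = defaultdict(Counter)   # intent -> word -> count
    intent_counts = Counter()            # intent -> number of queries
    for text, intent in labeled:
        intent_counts[intent] += 1
        word_counts[intent].update(text.lower().split())
    return word_counts, intent_counts


def predict(model, text):
    """Return (most likely intent, its posterior probability)."""
    word_counts, intent_counts = model
    vocab = {w for counts in word_counts.values() for w in counts}
    total = sum(intent_counts.values())
    log_scores = {}
    for intent, n in intent_counts.items():
        # Add-one smoothing, with one extra slot reserved for unseen words.
        denom = sum(word_counts[intent].values()) + len(vocab) + 1
        score = math.log(n / total)
        for w in text.lower().split():
            score += math.log((word_counts[intent][w] + 1) / denom)
        log_scores[intent] = score
    # Normalize log scores into probabilities.
    top = max(log_scores.values())
    exp = {k: math.exp(v - top) for k, v in log_scores.items()}
    best = max(exp, key=exp.get)
    return best, exp[best] / sum(exp.values())


def self_train(labeled, unlabeled, threshold=0.7, rounds=3):
    """Grow the labeled set with confident predictions on unlabeled queries."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        model = train(labeled)
        confident, rest = [], []
        for text in pool:
            intent, prob = predict(model, text)
            if prob >= threshold:
                confident.append((text, intent))
            else:
                rest.append(text)
        if not confident:
            break  # nothing new to learn from
        labeled.extend(confident)
        pool = rest
    return train(labeled), labeled


# Hypothetical seed data: one labeled query per intent.
seed = [("play some jazz music", "play_music"),
        ("weather in seattle today", "get_weather")]
unlabeled = ["play rock music", "weather forecast today", "forecast for boston"]

model, grown = self_train(seed, unlabeled)
print(predict(train(seed), "forecast for boston"))  # seed model: no evidence either way
print(predict(model, "forecast for boston"))        # self-trained model leans get_weather
```

In this toy run the seed model has never seen the word "forecast", so it cannot separate the two intents for the last query. After self-training absorbs "weather forecast today" as a confident get_weather example, the retrained model prefers get_weather for that query, although still below the threshold, so the query itself stays unlabeled.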

We have contributed to Microsoft products from the following teams:

Microsoft Live Search/Commerce Search
Microsoft adCenter
Microsoft Speech Component Group
Tellme

People

Li Deng

Y. C. Ju

Dong Yu

Geoffrey Zweig

Publications

Larry Heck, Dilek Hakkani-Tur, Madhu Chinthakunta, Gokhan Tur, Rukmini Iyer, Partha Parthasarathy, Lisa Stifelman, Elizabeth Shriberg, and Ashley Fidler, Multimodal Conversational Search and Browse, IEEE Workshop on Speech, Language and Audio in Multimedia, August 2013
Malcolm Slaney, Pay Attention, Please: Attention at the Telluride Neuromorphic Cognition Workshop, in IEEE SLTC Newsletter, IEEE, November 2012
Jingjing Liu, Xiao Li, Alex Acero, and Ye-Yi Wang, Lexicon Modeling for Query Understanding, in ICASSP, IEEE, May 2011
Xiao Li, Understanding the Semantic Structure of Noun Phrase Queries, in ACL, Association for Computational Linguistics, July 2010
Xiao Li, Ye-Yi Wang, Dou Shen, and Alex Acero, Learning with Click Graph for Query Intent Classification, in ACM Transactions on Information Systems, vol. 28, no. 3, Association for Computing Machinery, Inc., June 2010
Mehdi Hafezi Manshadi and Xiao Li, Semantic Tagging of Web Search Queries, in ACL, Association for Computational Linguistics, August 2009
Xiao Li, On the Use of Virtual Evidence in Conditional Random Fields, in EMNLP, August 2009
Xiao Li, Ye-Yi Wang, and Alex Acero, Extracting Structured Information from User Queries with Semi-Supervised Conditional Random Fields, in SIGIR, July 2009
Young-In Song, Ye-Yi Wang, Yun-Cheng Ju, Mike Seltzer, Ivan Tashev, and Alex Acero, Voice Search of Structured Media Data, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Taipei, Taiwan, April 2009
Ye-Yi Wang, Xiao Li, and Alex Acero, Inductive and Example-Based Learning for Text Classification, in Interspeech, International Speech Communication Association, Brisbane, Australia, September 2008
