https://www.programmableweb.com/news/how-5-natural-language-processing-apis-stack/analysis/2014/07/28

The world is awash in digital data. The challenge: making sense of that data. To tackle that challenge, a growing number of companies are turning to natural language processing technology to understand and monetize their data.

Natural language processing, or NLP, refers to a field of technology focused on the application of algorithms and mathematical models to analyze human language. Its use has grown sharply as companies grapple with data volumes that make it virtually impossible to perform data analysis using techniques that require significant human involvement. Popular uses of NLP include content classification, sentiment analysis and automated summarization. For instance, media organizations may use NLP-based platforms to categorize, tag and summarize content, and many brands commonly employ tools that use NLP to determine if the social media buzz around their marketing campaigns is positive or negative.

Fortunately, what is a technically complicated field of computing is now accessible to even the smallest of businesses thanks to the existence of companies that provide NLP as a service. This article explores and compares five of the leading NLP service providers that offer API integration.

These service providers were selected based on the following criteria:

  • A live NLP-focused API offering that gives users access to at least several common low-level NLP functions.
  • Availability of public documentation and pricing information.
  • Self-serve registration/subscription.

AlchemyAPI

Founded in 2005, AlchemyAPI is one of the oldest players in the NLP-as-a-service space. Calling itself the "world's most popular natural language processing service," the company's claims more than 40,000 developers and says its technology is used to process more than 3.5 billion API calls per month.Track this API

Features

AlchemyAPI's AlchemyLanguage offering supports 12 text analysis functions: entity extraction, sentiment analysis, keyword extraction, concept tagging, relation extraction, taxonomy classification, author extraction, language detection, text extraction, microformats parsing, feed detection and linked data support.

The company’s REST API offers users the ability to receive responses in a number of formats, including XML, JSON, RDF and microformats. AlchemyAPI offers SDKs for Java, Perl, Ruby, Python, PHP, C/C++, C#, Node.js and Android, and its developer portal contains tutorials and sample projects for several common NLP use cases.

Pricing

AlchemyAPI offers a free usage tier that provides up to 1,000 transactions per day. Paid plans offer 90,000, 300,000 and 3 million transactions per month for $250, $750 and $1,750, respectively. For customers requiring high volumes, the company can create custom plans that support billions of transactions a month and can also offer its technology as an on-premises appliance.

Notable Differentiator

Earlier this year, AlchemyAPI launched a computer vision offering, AlchemyVision, which allows users to automatically extract and tag images. Using the AlchemyVision API companies can, for instance, identify the names of products in a photo without needing to supply text-based clues. With photos playing such a prominent role on the web today, having the ability to analyze image-based content could be an attractive option for some companies.

Aylien

more recent entry into the natural language processing space, Aylien's platform is designed to help media organizations and consumers extract intelligence from the web's never-ending and constantly expanding stream of content.Track this API

Features

Aylien's Text Analysis API supports a number of common functions based on NLP and machine learning technology, including classification, sentiment analysis and entity extraction. The company's API, which is RESTful and served by Mashape, also offers a summarization endpoint, which can be used to summarize long articles, and a hashtag suggestion endpoint, which can suggest appropriate hashtags for a piece of content.

Pricing

Aylien offers a basic free tier that allows for 1,000 requests per day with the ability to make extra requests at a cost of 1 cent per. Its Small, Medium and Large plans provide for 6,000, 80,000 and 180,000 requests per day for $199, $649 and $1,399, respectively. All paid plans include email support. An enterprise plan, which offers unlimited requests and telephone support, is also available.

Notable Differentiator

Aylien is planning to release a News API that will allow users to obtain new stories in real time from more than 50 popular sources. Users will be able to filter stories in a variety of ways, including by keyword, topic, category and social popularity, and stories will be enhanced with metadata, including entity extraction and sentiment analysis. This new API, coupled with the company's existing hashtag suggestion feature, could make Aylien especially appealing to users looking to employ NLP in social applications.

Fluxifi

Founded this year, Fluxifi is focused on providing tools that help companies analyze and make the most of social media content. As part of that, it offers an API that customers can use to perform natural language processing.Track this API

Features

Fluxifi's NLP API supports common NLP functions such as tokenization, sentiment analysis, language detection and part-of-speech tagging. The API is RESTful and supports XML and JSON response formats.

Pricing

Fluxifi offers two pricing plans for its NLP API: a £250 per month (approximately $335 per month based on exchange rates at time of publication) Professional tier that provides for up to 40,000 API calls per day and an Enterprise plan staring at £500 per month (approximately $672 per month) that provides for upward of 150,000 calls per day. The company can also structure custom plans for higher-volume customers. Enterprise and custom plans come with telephone support and an SLA.

Notable Differentiator

Fluxifi's NLP API is just one component of its platform. In addition to this API, the company offers a broader social monitoring and analytics platform that is connected to firehoses from popular social networks including Twitter, YouTube and Instagram. That may make Fluxifi a more attractive option for companies aiming to obtain and analyze social data.

Textalytics

Textalytics offers "meaning as a service" using its text analysis engine, which the company bills as the "most user-friendly" in the space.Track this API

Features

Textalytics' core API allows users to perform a variety of commonly used low-level natural language processing functions, including topic extraction, text classification, sentiment analysis and language identification.

In addition, in an effort to help companies in specific verticals, the company has two other APIs:

  • A media analysis API, which is designed to provide a high-level analysis of "mentions, topics, opinions and facts." This API combines thematic classification, key information identification and sentiment analysis.
  • A semantic publishing API, which combines a number of natural language processing functions that can help publishers more efficiently categorize, manage and produce content.

The Textalytics APIs are RESTful and support JSON and XML formats. The company offers SDKs for PHP, Java, Python and Visual Basic.

Pricing

Textalytics operates under a credits system with variable pricing for different types of requests. For instance, a request to the Language Identification API uses one credit, every two words processed through the Topics Extraction API use two credits, and every minute of speech run through the Speech Recognition API uses 10,000 credits.

Textalytics has a free plan that offers 500,000 credits per month. Professional and Business plans up the credits to 2 million and 10 million for €149 and €499 per month (approximately $253 and $847 per month), respectively. All plans permit up to five requests per second. For customers requiring more credits or higher throughput, custom enterprise plans are also available.

Notable Differentiator

For customers not wanting to integrate with the Textalytics API or perform experimentation before integration, the company offers a plug-in that allows them to perform analysis directly in Microsoft Excel.

TextRazor

Founded by a former Bloomberg employee who worked in search R&D for the financial giant, TextRazor aims to help customers "extract and understand the who, what, why and how" of their content. To do this, the company built its own natural language processing and machine learning stack from the ground up, which it offers in cloud-based and self-hosted packages.Track this API

Features

Using TextRazor's API, customers can perform core natural language processing functions, including entity recognition and enrichment, topic tagging, relationship extraction, and entailment. Through its indexing of information from Freebase, TextRazor can enrich entities with information such as location data and birth dates.

The company's platform can automatically detect 142 languages and provides entity recognition and topic detection for 10 languages, including English, Spanish, German, French and Russian. The TextRazor API is RESTful, returns responses in JSON format, can be accessed over HTTP or HTTPS, and supports optional GZIP compression. Official SDKs are provided for Python, PHP and Java.

Pricing

TextRazor offers four pricing plans for its cloud-based platform. These range in price from free to $1,200 per month. The free plan allows 500 requests per day and two simultaneous requests, while the $1,200-per-month Pro plan provides 120,000 requests per day and up to 15 simultaneous requests. The company can also create custom enterprise plans for customers who need millions of daily requests or hundreds of simultaneous requests.

Notable Differentiator

TextRazor has a Prolog-based rules engine that developers can tap into to customize its natural language processing algorithms. As the company explains, "Customization and domain adaptation is often crucial to the development of accurate text analytics applications." For example, this capability can be used to add custom ontologies and topic lists for topic classification and entity extraction.

Which Solution Is Right for You?

All but one of the companies discussed in this article give developers the ability to use their APIs free of charge, and a number offer online demo consoles. While many of companies offer similar solutions and support the same core NLP functions, in my own simple tests I did observe that some services produced better results for certain sample content types than others. As such, it is worthwhile to test several products and see which one delivers the highest quality for your specific content and use cases.

How 5 Natural Language Processing APIs Stack Up的更多相关文章

  1. Natural Language Processing with Python - Chapter 0

    一年之前,我做梦也想不到会来这里写技术总结.误打误撞来到了上海西南某高校,成为了文科专业的工科男,现在每天除了膜ha,就是恶补CS.导师是做计算语言学的,所以当务之急就是先自学计算机自然语言处理,打好 ...

  2. spaCy is a library for advanced natural language processing in Python and Cython:spaCy 工业级自然语言处理工具

    spaCy is a library for advanced natural language processing in Python and Cython. spaCy is built on ...

  3. (zhuan) Speech and Natural Language Processing

    Speech and Natural Language Processing obtain from this link: https://github.com/edobashira/speech-l ...

  4. Natural Language Processing 课程,文章,论文

    CS224n: Natural Language Processing with Deep Learning http://cs224d.stanford.edu/syllabus.html http ...

  5. [C5W2] Sequence Models - Natural Language Processing and Word Embeddings

    第二周 自然语言处理与词嵌入(Natural Language Processing and Word Embeddings) 词汇表征(Word Representation) 上周我们学习了 RN ...

  6. 图书分享 -《Natural Language Processing with Python》

    -<Natural Language Processing with Python> 链接:https://pan.baidu.com/s/1_oalRiUEw6bXbm2dy5q_0Q ...

  7. 吴恩达《深度学习》-课后测验-第五门课 序列模型(Sequence Models)-Week 2: Natural Language Processing and Word Embeddings (第二周测验:自然语言处理与词嵌入)

    Week 2 Quiz: Natural Language Processing and Word Embeddings (第二周测验:自然语言处理与词嵌入) 1.Suppose you learn ...

  8. 吴恩达《深度学习》-第五门课 序列模型(Sequence Models)-第二周 自然语言处理与词嵌入(Natural Language Processing and Word Embeddings)-课程笔记

    第二周 自然语言处理与词嵌入(Natural Language Processing and Word Embeddings) 2.1 词汇表征(Word Representation) 词汇表示,目 ...

  9. Natural Language Processing Computational Linguistics

    http://www.nltk.org/book/ch00.html After this, the pace picks up, and we move on to a series of chap ...

随机推荐

  1. 存储那些事儿(三):OpenStack的块存储Cinder与商业存储的融合

    OpenStack是一个美国国家航空航天局和Rackspace合作研发的云端运算‎软件,以Apache许可证授权,并且是一个自由软件和开放源代码项目.OpenStack是IaaS(基础设施即服务)‎软 ...

  2. XML解析之sax解析案例(二)使用sax解析把 xml文档封装成对象

    Demo1类: import java.io.File; import java.util.List; import javax.xml.parsers.SAXParser; import javax ...

  3. 学习TensorFlow,邂逅MNIST数据集

    如果说"Hello Word!"是程序员的第一个程序,那么MNIST数据集,毫无疑问是机器学习者第一个训练的数据集,本文将使用Google公布的TensorFLow来学习训练MNI ...

  4. Linux多线程实践(8) --Posix条件变量解决生产者消费者问题

    Posix条件变量 int pthread_cond_init(pthread_cond_t *cond, pthread_condattr_t *cond_attr); int pthread_co ...

  5. OJ题:字符串最后一个单词的长度

    题目描述 计算字符串最后一个单词的长度,单词以空格隔开. 输入描述: 一行字符串,非空,长度小于5000. 输出描述: 整数N,最后一个单词的长度. 输入例子: hello world 输出例子: 5 ...

  6. OJ题:奇偶归一猜想——求归一过程中的最大值

    题目: 题目内容: 奇偶归一猜想--对于每一个正整数,如果它是奇数,则对它乘3再加1,如果它是偶数,则对它除以2,如此循环,最终都能够得到1. 如n = 11,得序列:11, 34, 17, 52, ...

  7. 【leetcode】经典算法题-Counting Bits

    题目描述: 给定一个数字n,统计0-n之间的数字二进制的1的个数,并用数组输出 例子: For num = 5 you should return [0,1,1,2,1,2]. 要求: 算法复杂复o( ...

  8. Java对象引用处理机制

    翻译人员: 铁锚 翻译时间: 2013年11月13日 原文链接: How does Java handle aliasing? 什么是Java的引用别名机制 Java的引用别名机制(原文为Aliasi ...

  9. Web报表工具FineReport二次开发JS之字符串

    在报表开发过程中,有些需求可能无法通过现有的功能来实现,需要开发人员二次开发,以FineReport为例,可以使用网页脚本.API接口等进行深入的开发与控制. 考虑到JS脚本开发的使用较多,这里先先简 ...

  10. Shell 正则表达式总结及其含义举例

    基本元字符集及其含义: ^  只匹配行首 $   只匹配行尾 *   只一个单字符后紧跟*,匹配0个或多个此单字符 [ ]   只匹配[ ]内字符.可以是一个单字符,也可以是字符序列.可以使用 - 表 ...