Hadoop, Python, and NoSQL lead the pack for big data jobs

 

Rise in cloud-based analytics could increase demand for employees with more diversified skill sets

The demand for job skills related to data processing -- NoSQL, Apache Hadoop, Python, and a smattering of other such skills -- has hit all-time highs, according to statistics collected by tech job site Dice.com. The biggest gains, though, are for all things NoSQL.

Dice claims the number of job postings for "NoSQL experts" -- those with experience in unstructured data systems like MongoDB -- has risen 54 percent since last year. Other, related skills, such as Apache Hadoop and Python, have also posted significant year-over-year gains (43 percent and 16 percent, respectively). Python has become one of the big go-to languages for data processing, thanks to its simplicity and its wide selection of data-processing libraries.

Indeed.com and its Job Trends graph provide more details about which big data skills were most in demand.  Indeed.com's stats show MongoDB is the most commonly mentioned of the NoSQL variants in job listings, with 4,979 entries as of this writing. Couchbase, Redis, and CouchDB are the three next most common NoSQL variants, with Riak, Hbase, Neo4j, and ElasticSearch all trailing far behind.

When comparing MongoDB, Python, and Hadoop, Python is by far the most in-demand of the three, with some 27,000 jobs. However,  Python developer jobs cover a great deal more than just big data, as expertise in Python can be applied to a broader range of jobs than MongoDB and Hadoop.

That said, the more analytics-related skills appear to command slightly higher pay. Indeed.com estimates that the majority of MongoDB jobs start somewhere north of $60,000, while with Python and Hadoop the majority of the pay is in the $50,000 and up range.

Other, more generic job requests related to big data are also up, with the term "big data" showing a major surge in appearances -- up 46 percent year-over-year. Generic requests for expertise in SaaS and cloud are also up, by 20 percent and 27 percent, respectively. Dice claims one side effect of a rise in cloud-based analytics is a growing demand for employees with multiple skills in this category -- for example, both Hadoop and cloud storage.

Michael Rappa, creator of the first academic program devoted to data analytics, made a similar observation when InfoWorld spoke to him about big data jobs in 2012. Rappa's take at the time was that big data wasn't "a new specialty or suite of tools we have to train people into," but rather a "new organizational reality that everyone will need to adjust to occupationally," where multiple occupations across an organization would require new awareness of how to work with big data.

This story, "Hadoop, Python, and NoSQL lead the pack for big data jobs" was originally published by InfoWorld .

Hadoop, Python, and NoSQL lead the pack for big data jobs的更多相关文章

  1. python 内存NoSQL数据库

    python 内存NoSQL数据库 来自于网络,经过修改,秉承Open Source精神,回馈网络! #!/usr/bin/python #-*- coding: UTF-8 -*- # # memd ...

  2. Python爬虫学习:四、headers和data的获取

    之前在学习爬虫时,偶尔会遇到一些问题是有些网站需要登录后才能爬取内容,有的网站会识别是否是由浏览器发出的请求. 一.headers的获取 就以博客园的首页为例:http://www.cnblogs.c ...

  3. hadoop datanode启动失败(All directories in dfs.data.dir are invalid)

    由于hadoop节点的磁盘满了,导致节点死掉,今天对其进行扩容.首先,将原节点的数据拷贝到目标节点下,从而避免数据的丢失,但是在执行hadoop_daemon.sh start datanode后没有 ...

  4. Python学习——struct模块的pack、unpack示例

    he struct module includes functions for converting between strings of bytes and native Python data t ...

  5. Python使用struct处理二进制(pack和unpack用法)

    转载自:http://www.cnblogs.com/gala/archive/2011/09/22/2184801.html 这篇文章写的很好,所以无耻的转了.. 有的时候需要用python处理二进 ...

  6. Python学习笔记 - day12 - Python操作NoSQL

    NoSQL(非关系型数据库) NoSQL,指的是非关系型的数据库.NoSQL有时也称作Not Only SQL的缩写,是对不同于传统的关系型数据库的数据库管理系统的统称.用于超大规模数据的存储.(例如 ...

  7. Python操作nosql数据库之redis

    一.NoSQL的操作 NoSQL,泛指非关系型的数据库.随着互联网web2.0网站的兴起,传统的关系数据库在应付web2.0网站,特别是超大规模和高并发的SNS类型的web2.0纯动态网站已经显得力不 ...

  8. Python:struct模块的pack、unpack

    mport struct pack.unpack.pack_into.unpack_from 1 # ref: http://blog.csdn<a href="http://lib. ...

  9. [Python] How to unpack and pack collection in Python?

    It  is a pity that i can not add the video here. As a result, i offer the link as below: How to unpa ...

随机推荐

  1. PyDev+eclipse的编码问题

    1.在代码的开始声明编码为utf-8

  2. uva 1030 - Image Is Everything(迭代更新)

    题目链接:uva 1030 - Image Is Everything 题目大意:有一个最大为n*n*n的立方体的一个不规整立体,由若干个1*1*1的小正方体构成(每一个小正方体被涂成不同的颜色),给 ...

  3. 遮罩层的实现(纯js兼容版)

    这个代码是我以前测试时候的代码了,主要用到的知识点是opacity和filer分别实现 “标准浏览器”和IE浏览器下的半透明,使用js的document.body.offsetwidth 和scree ...

  4. spring 整合JDBC

    使用Spring提供的三个JDBC模板类(JdbcTemplate.NamedParameterJdbcTemplate.SimpleJdbcTemplate)操作数据库 一.JdbcTemplate ...

  5. iBatis自动生成的主键 (Oracle,MS Sql Server,MySQL)【转】

    iBatis的sqlMap配置文件的selectKey元素有个type属性,可以指定pre或者post表示前生成(pre)还是后生成(post). Oracle设置 <!-- Oracle SE ...

  6. JAVA 上加密算法的实现用例---转载

    通常 , 使用的加密算法 比较简便高效 , 密钥简短,加解密速度快,破译极其困难.本文介绍了 MD5/SHA1,DSA,DESede/DES,Diffie-Hellman 的使用. 第 1 章基础知识 ...

  7. python学习笔记--Django入门三 Django 与数据库的交互:数据建模

    把数据存取逻辑.业务逻辑和表现逻辑组合在一起的概念有时被称为软件架构的 Model-View-Controller (MVC)模式.在这个模式中, Model 代表数据存取层,View 代表的是系统中 ...

  8. java 流程执行 循环 foreach循环

    一. if分支 1. 结构  if  else if   else 2.执行原则 if  if  if 结构  会一直去执行()里的判断语句 if else if  else if 结构  只要一条( ...

  9. hdu2035java

    人见人爱A^B Time Limit: 2000/1000 MS (Java/Others)    Memory Limit: 65536/32768 K (Java/Others)Total Sub ...

  10. vi 使用笔记

    基本A 当前行追加J 去除本行和下一行之间的换行符(写CSS利器)~ 光标所在处的字符进行大小写互换* 向前搜索目前光标所在的单词# 向后搜索目前光标所在的单词% 查找与光标所在处相匹配的反括号, 包 ...