Hadoop, Python, and NoSQL lead the pack for big data jobs
Hadoop, Python, and NoSQL lead the pack for big data jobs
Rise in cloud-based analytics could increase demand for employees with more diversified skill sets
The demand for job skills related to data processing -- NoSQL, Apache Hadoop, Python, and a smattering of other such skills -- has hit all-time highs, according to statistics collected by tech job site Dice.com. The biggest gains, though, are for all things NoSQL.
Dice claims the number of job postings for "NoSQL experts" -- those with experience in unstructured data systems like MongoDB -- has risen 54 percent since last year. Other, related skills, such as Apache Hadoop and Python, have also posted significant year-over-year gains (43 percent and 16 percent, respectively). Python has become one of the big go-to languages for data processing, thanks to its simplicity and its wide selection of data-processing libraries.
Indeed.com and its Job Trends graph provide more details about which big data skills were most in demand. Indeed.com's stats show MongoDB is the most commonly mentioned of the NoSQL variants in job listings, with 4,979 entries as of this writing. Couchbase, Redis, and CouchDB are the three next most common NoSQL variants, with Riak, Hbase, Neo4j, and ElasticSearch all trailing far behind.
When comparing MongoDB, Python, and Hadoop, Python is by far the most in-demand of the three, with some 27,000 jobs. However, Python developer jobs cover a great deal more than just big data, as expertise in Python can be applied to a broader range of jobs than MongoDB and Hadoop.
That said, the more analytics-related skills appear to command slightly higher pay. Indeed.com estimates that the majority of MongoDB jobs start somewhere north of $60,000, while with Python and Hadoop the majority of the pay is in the $50,000 and up range.
Other, more generic job requests related to big data are also up, with the term "big data" showing a major surge in appearances -- up 46 percent year-over-year. Generic requests for expertise in SaaS and cloud are also up, by 20 percent and 27 percent, respectively. Dice claims one side effect of a rise in cloud-based analytics is a growing demand for employees with multiple skills in this category -- for example, both Hadoop and cloud storage.
Michael Rappa, creator of the first academic program devoted to data analytics, made a similar observation when InfoWorld spoke to him about big data jobs in 2012. Rappa's take at the time was that big data wasn't "a new specialty or suite of tools we have to train people into," but rather a "new organizational reality that everyone will need to adjust to occupationally," where multiple occupations across an organization would require new awareness of how to work with big data.
This story, "Hadoop, Python, and NoSQL lead the pack for big data jobs" was originally published by InfoWorld .
Hadoop, Python, and NoSQL lead the pack for big data jobs的更多相关文章
- python 内存NoSQL数据库
python 内存NoSQL数据库 来自于网络,经过修改,秉承Open Source精神,回馈网络! #!/usr/bin/python #-*- coding: UTF-8 -*- # # memd ...
- Python爬虫学习:四、headers和data的获取
之前在学习爬虫时,偶尔会遇到一些问题是有些网站需要登录后才能爬取内容,有的网站会识别是否是由浏览器发出的请求. 一.headers的获取 就以博客园的首页为例:http://www.cnblogs.c ...
- hadoop datanode启动失败(All directories in dfs.data.dir are invalid)
由于hadoop节点的磁盘满了,导致节点死掉,今天对其进行扩容.首先,将原节点的数据拷贝到目标节点下,从而避免数据的丢失,但是在执行hadoop_daemon.sh start datanode后没有 ...
- Python学习——struct模块的pack、unpack示例
he struct module includes functions for converting between strings of bytes and native Python data t ...
- Python使用struct处理二进制(pack和unpack用法)
转载自:http://www.cnblogs.com/gala/archive/2011/09/22/2184801.html 这篇文章写的很好,所以无耻的转了.. 有的时候需要用python处理二进 ...
- Python学习笔记 - day12 - Python操作NoSQL
NoSQL(非关系型数据库) NoSQL,指的是非关系型的数据库.NoSQL有时也称作Not Only SQL的缩写,是对不同于传统的关系型数据库的数据库管理系统的统称.用于超大规模数据的存储.(例如 ...
- Python操作nosql数据库之redis
一.NoSQL的操作 NoSQL,泛指非关系型的数据库.随着互联网web2.0网站的兴起,传统的关系数据库在应付web2.0网站,特别是超大规模和高并发的SNS类型的web2.0纯动态网站已经显得力不 ...
- Python:struct模块的pack、unpack
mport struct pack.unpack.pack_into.unpack_from 1 # ref: http://blog.csdn<a href="http://lib. ...
- [Python] How to unpack and pack collection in Python?
It is a pity that i can not add the video here. As a result, i offer the link as below: How to unpa ...
随机推荐
- [Node] 逃离回调地狱
逃离Node回调地狱 Background : 在Node中,函数的返回结果大多利用回调的方式处理.如简单的判断文件是否存在并读取内容: var fs = require('fs'); fs.exis ...
- hdu 4523 威威猫系列故事——过生日 小模拟
威威猫系列故事——过生日 Time Limit: 500/200 MS (Java/Others) Memory Limit: 65535/32768 K (Java/Others) Total ...
- SSH开源框架考试题
一.选择题 1.不属于Action接口中定义的字符串常量的是____B___. A.SUCCESS B.FAILURE C.ERROR ...
- iOS APNS远程推送(史上最全步骤)
/*****************************************1************************************************/ waterma ...
- System.currentTimeMillis();
1. 意义: currentTimeMillis()返回以毫秒为单位的当前时间,返回的是当前时间与协调世界时 1970 年 1 月 1 日午夜之间的时间差(以毫秒为单位測量).注意,当返回值的时间单 ...
- C语言中volatilekeyword的作用
一.前言 1.编译器优化介绍: 由于内存訪问速度远不及CPU处理速度,为提高机器总体性能,在硬件上引入硬件快速缓存Cache,加速对内存的訪问.另外在现代CPU中指令的运行并不一定严格依照顺序运行,没 ...
- 服务器后端开发系列——《实战FastDFS分布式文件系统》[转]
1.FastDFS的配置.部署与API使用解读(1)Get Started with FastDFS 内容:讲解FastDFS的背景.基本原理,并讲述基本的配置.部署和测试的内容. 2.FastDFS ...
- O(1)调度器的时间计算公式与CFS调度器
http://blog.csdn.net/dog250/article/details/48750809 O(1): 优先级计算: 进程优先级公式:prio=MAX_RT_PRIO+nice+20其中 ...
- Day05 - Python 常用模块
1. 模块简介 模块就是一个保存了 Python 代码的文件.模块能定义函数,类和变量.模块里也能包含可执行的代码. 模块也是 Python 对象,具有随机的名字属性用来绑定或引用. 下例是个简单的模 ...
- sqlite 获取数据库中的所有表
SELECT name from sqlite_master where type='table'