Daily English: The Risks of Big Data for Companies
Big data. It's the latest IT buzzword, and it isn't hard to see why. The ability to parse more information, faster and deeper, is allowing companies, governments, researchers and others to understand the world in a way they could only dream about before.
buzzword: a fashionable term; parse: to analyze, to break down
All that is true. And yet . . .
It's also true that in our rush to embrace the possibilities of big data, we may be overlooking the challenges that big data poses -- including the way companies interpret the information, manage the politics of data and find the necessary talent to make sense of the flood of new information.
overlooking: failing to notice, neglecting
Big data, in other words, introduces high stakes to the data-analytics game. There's a greater potential for privacy invasion, greater financial exposure in fast-moving markets, greater potential for mistaking noise for true insight, and a greater risk of spending lots of money and time chasing poorly defined problems or opportunities.
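high stakes: high risk, much to lose; invasion: intrusion, encroachment; exposure: the state of being exposed (here, to financial risk); disclosure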
Unless we understand, and deal with, these challenges, we risk turning all that data from something that has the potential to enhance our organizations into a diversion, an illusion or a paralyzing turf battle.
diversion: a distraction; a pastime; paralyzing: crippling, numbing
Let me be more specific about those challenges.
These Are Great Tools, but Who Has the Skills to Use Them?
Getting people qualified to work in such data-analytical tools as Hive, Pig, Cassandra, MongoDB or Hadoop is only the first layer of this onion. Few companies have in-house experts who can even make a business case to justify the cost of hiring big-data experts, let alone assess the quality of the applicants. Many managers also lack basic numeracy, so getting decision makers who can grasp more sophisticated statistical mechanics can be a challenge.
numeracy: the ability to understand and work with numbers
Complicating the matter, big-data tools aren't ready for prime time: They are evolving rapidly, aren't taught in most universities, have less-than-ideal vendor support and require levels of user flexibility that more mature tools don't. That makes finding the right people all the more crucial.
Here's another layer of the onion: For big data to be useful, programmers and analysts also must understand the basics of the industry they are programming for. Imagine, for instance, that data analysts at a pharmaceutical company see a spike in aspirin sales in January as measured by point-of-sale data in near real time. Aha, they say, flu season is intensifying. But before committing sales resources to a big campaign and increasing production, it's worth comparing sales patterns to past years. Maybe lots of people had hangovers after their New Year's Eve parties. If the analysts don't know the business, and the questions to ask, the company risks running down a lot of expensive dead ends.
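The year-over-year comparison suggested above can be sketched in a few lines. This is only an illustration under stated assumptions: the sales figures are invented, the function names `seasonal_z_score` and `is_unusual` are hypothetical, and the two-standard-deviation threshold is an arbitrary choice rather than anything the article prescribes.

```python
from statistics import mean, stdev

# Hypothetical January aspirin unit sales for five past years (invented numbers).
past_januaries = [1180, 1220, 1205, 1250, 1190]
# Hypothetical unit sales for the current January (also invented).
current_january = 1260


def seasonal_z_score(history, current):
    """Return how many sample standard deviations `current` lies from the historical mean."""
    return (current - mean(history)) / stdev(history)


def is_unusual(history, current, threshold=2.0):
    """Flag `current` only if it exceeds the seasonal norm by more than `threshold` deviations."""
    return seasonal_z_score(history, current) > threshold


z = seasonal_z_score(past_januaries, current_january)
print(f"z-score vs. past Januaries: {z:.2f}")
if is_unusual(past_januaries, current_january):
    print("Unusually high for January; worth investigating further.")
else:
    print("Within the usual January range; probably seasonal.")
```

A check like this does not say why sales rose. It only shows whether the rise stands out from what past Januaries looked like, which is the question an analyst who knows the business would ask first.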
One final layer is IT security. If it's true that many companies don't have the skills to work with big-data tools, they certainly don't yet have the skills to keep that data secure. As more information is gathered, that's more information that can be leaked or stolen.
pharmaceutical: relating to drugs or drug manufacturing; hangover: the aftereffects of heavy drinking; a leftover, a remnant
Information Is Power. So a Lot of Information Is a Lot of Power.
Control over information is frequently thought to bring power within an organization. Clearly, whoever gets to make decisions about what gets measured in the big-data era will accumulate even greater power.
Moreover, information sharing across organizational boundaries, which is part of the nature of big data, can upset traditional power relationships.
Consider a company with a Canadian plant and a U.S. plant. Streaming data from sensors shows the Canadian plant is churning out engines with 97% reliability, while the U.S. plant's engines clock in at only 80%. Suddenly, the managers of the Canadian plant may see their star rise in the organization -- to the consternation of those in the U.S.
consternation: dismay, alarm; churn out: to produce in large quantities
The bottom line might benefit, but the internal politics is something that executives must be prepared to manage.
Or consider that big data offers the opportunity to measure what previously was unmeasurable. If a large retailer can now more quickly and easily measure consumer reaction to various marketing campaigns -- whether it be a Super Bowl commercial, magazine coupon or newspaper ad -- the different stakeholders might find their relative positions within the organization change. They also may resent the social-media team for offering proof with click-through data of their ability (or inability) to move the revenue needle.
Such upheavals will be exacerbated by the fact that processes that traditionally take months to plan and execute might be assessed in minutes. People with years of experience with annual sales reviews often struggle with weekly or even daily revenue tallies. Those who ruled under the old way of doing things might find themselves falling behind in the new world.
upheaval: a sudden, drastic change; an uplift; exacerbated: made worse, aggravated
Just Because Something Can Be Measured Doesn't Mean It Should Be Measured
Once people know that information is power, they may try to game the system -- to the detriment of the company.
detriment: harm, damage
Say a large company starts tracking website traffic as a function of Twitter mentions. The results are updated continuously on an executive dashboard.
dashboard: an instrument panel (here, a screen summarizing key metrics)
The manager of one sales team previously had, with great success, generated most of its leads and eventual sales from trade shows and conferences. But once Twitter mentions become the key metric being measured, the manager changes the department's focus, declaring, 'We need to win the dashboard.' The result is that the department may indeed win the dashboard, but it leads to an unprofitable emphasis on website clicks and social-media traffic with unqualified leads rather than on successful events.
What Do We Do With All These Numbers?
Standard databases have been around for about 35 years, so a substantial body of experience makes these tools relatively easy to understand and use. Big data, by contrast, is just being invented, so the techniques for organizing and understanding the underlying meaning are still in their infancy.
What's more, it isn't easy for us to make sense of information at this scale. 'One, two, three, many' is how the security guru Bruce Schneier summarizes many people's math acumen. Spreadsheets, still the main tool for quantitative analysis in many companies, can't remotely scale to convey the number of cars on the road at a given moment in a particular city, or this week's federal spending on transportation projects.
acumen: shrewdness, keen insight
Visualization can be extremely helpful with this kind of data, but the field is still immature, and its special language not widely understood.
The Challenge of Thinking Big
What does it mean to think at such a large scale? How do we learn to ask questions of the transmission of every car on the road in a metropolitan area, of the smartphone of every customer visiting a large retail chain, or of every overnight parcel on a delivery truck? How can more businesspeople learn to think probabilistically rather than anecdotally? Thanks to the book and movie, the Moneyball approach is by now well known among sports fans. But they'll also recall how that approach upended an organization and was copied by competitors.
metropolitan: of a large city; upend: to turn upside down, to overturn
In some ways, it requires a whole new way of looking at the world.
But also, the principles of good management extend to the domain of big data. Before businesses can profit from big data, managers must refuse to get lost in the noise that can obscure the basic forces represented by customers, value and execution. The volume, velocity and variety of big data can feel foreign, and make it easy to be dazzled by numerical tsunamis.
obscure: to hide, to make unclear
So it's always crucial to insist on the basics of sound analytical practice. And to remember: Numbers can tell you things you never even knew to ask. But they never speak for themselves.