熟悉HBase基本操作

ssh localhost

start-dfs.sh

start-hbase.sh

hbase shell

create 'Student', 'S_No', 'S_Name', 'S_Sex', 'S_Age'

put 'Student', '', 'S_No', ''

put 'Student', '', 'S_Name', 'Zhangsan'

put 'Student', '', 'S_Sex', 'male'

put 'Student', '', 'S_Age', ''

put 'Student', '', 'S_No', ''

put 'Student', '', 'S_Name', 'Marry'

put 'Student', '', 'S_Sex', 'female'

put 'Student', '', 'S_Age', ''

put 'Student', '', 'S_No', ''

put 'Student', '', 'S_Name', 'Lisi'

put 'Student', '', 'S_Sex', 'male'

put 'Student', '', 'S_Age', ''

list

scan 'Student'

alter 'Student', NAME=>'S_Course'

put 'Student', '', 'S_Course:math', ''

alter 'Student', {NAME=>'S_Course', METHOD=>'delete'}

count 'Student'

truncate 'Student'

hdfs dfs -rm input/*.txt

hdfs dfs -put ~/lyric.txt input/

import sys

for line in sys.stdin:

    line = line.strip()

    words = line.split()

    for word in words:

        print('%s\t%s' % (word, 1))

from operator import itemgetter

import sys

current_word = None

current_count = 0

word = None

for line in sys.stdin:

    line = line.strip()

    word, count = line.split('\t', 1)

    try:

        count = int(count)

    except ValueError:

        continue

    if current_word == word:

        current_count += count

    else:

        if current_word:

            print '%s\t%s' % (current_word, current_count)

        current_count = count

        current_word = word

if current_word == word:

    print '%s\t%s' % (current_word, current_count)

export HADOOP_HOME=/usr/local/hadoop

export STREAM=$HADOOP_HOME/share/hadoop/tools/lib/hadoop-streaming-*.jar

hadoop jar $STREAM \

-D stream.non.zero.exit.is.failure=false \

-file /home/hadoop/mapper.py \

-mapper 'python /home/hadoop/mapper.py' \

-file /home/hadoop/reducer.py \

-reducer 'python /home/hadoop/reducer.py' \

-input /user/hadoop/input/*.txt \

-output /user/hadoop/wcoutput

熟悉HBase基本操作的更多相关文章

hbase基本操作
public class Demo { private Configuration conf; private Connection conn; @Before public void prepare ...
Hbase记录-HBase基本操作（二）
HBase Exists 可以使用exists命令验证表的存在.下面的示例演示了如何使用这个命令. hbase(main):024:0> exists 'emp' Table emp doe ...
Hbase记录-HBase基本操作（一）
HBase创建表可以使用命令创建一个表,在这里必须指定表名和列族名.在HBase shell中创建表的语法如下所示. create ‘<table name>’,’<column ...
HBase 基本操作
如何添加列族很简单,跟rdbms一样直接用alter,但是alter之前必须先disable这个表 ---->disable 'test' ...
hadoop之hbase基本操作
hbase shell 进入hbase命令行 list 显示HBASE表 status 系统上运行的服务器的细节和系统的状态 version 返回HBase系统使用的版本 table_help 引导如 ...
HBase基本操作-Java实现
创建Table public static void createTable(String tableName){ try { HBaseAdmin hbaseAdmin = new HBaseAdm ...
Hbase设计实战
Hbase设计实战本文通过一个游戏公司客户实际案例的讲解,分析了 Hbase 表设计及开发在实际案例中的运用,对比了不同的 Hbase 设计考量对客户端访问模式及检索性能的差异.读者通过案例中 Hb ...
HBase笔记--编程实战
HBase总结:http://blog.csdn.net/lifuxiangcaohui/article/details/39997205 (very good) Spark使用Java读取hbas ...
HBase零基础高阶应用实战（CDH5、二级索引、实践、DBA）
HBase是一个分布式的.面向列的开源数据库,该技术来源于 Fay Chang 所撰写的Google论文“Bigtable:一个结构化数据的分布式存储系统”.就像Bigtable利用了Google文件 ...

随机推荐

Widows自带系统监控工具——24小时监控服务器性能
博文来源:https://blog.csdn.net/qq_41650233/article/details/84313153 操作步骤1.运行程序perfmon.exe 2.在数据收集器下选择[用户 ...
ollydbg入门记录
1.软件窗口说明 OllyDBG 中各个窗口的名称如下图.简单解释一下各个窗口的功能, 反汇编窗口:显示被调试程序的反汇编代码,标题栏上的地址.HEX 数据.反汇编.注释可以通过在窗口中右击出现的菜单 ...
项目实战-使用PySpark处理文本多分类问题
原文链接:https://cloud.tencent.com/developer/article/1096712 在大神创作的基础上,学习了一些新知识,并加以注释. TARGET:将旧金山犯罪记录(S ...
[NOI2018]你的名字
题解: 前68分非常简单建立SAM 另一个串在上面跑,然后求一个树链的并我们会发现暴力就是min(l^2,n)的所以复杂度最多是nsqrt(n)的当然我们也可以nlogn维护把所有点按照df ...
dbus-launch
NAME dbus-launch - Utility to start a message bus from a shell script dbus-launch - 从shell脚本启动一个消息总线 ...
学习使人快乐7--Mail收发原理+计划
本篇了解邮件收发的原理与机制(smtp,pop3协议).不打算作重点学习辣~~~~~~~~~~~~~~~~ 日常感谢大佬gacl 另:打算把每周计划学的东西发在blog上面,勉励一下本咸鱼本周计划:1 ...
做IT，必备的安全知识！
以前的认知以前刚接触IT行业,而我身为运维,我以为我所需要做的安全就是修改服务器密码为复杂的,ssh端口改为非22,还有就是不让人登录服务器就可以保证我维护的东西安全. 现在的认知工作也好几年了, ...
利用kibana插件对Elasticsearch查询
利用kibana插件对Elasticsearch查询 Elasticsearch是功能非常强大的搜索引擎,使用它的目的就是为了快速的查询到需要的数据. 查询分类: 基本查询:使用Elasticsear ...
python 获取mac地址zz
通过python获取当前mac地址的方法如下:(1)通用方法,借助uuid模块def get_mac_address(): import uuid node = uuid.getnode() ...
MonggoDB(二)
分组聚合如果你有数据存储在MongoDB中,你想做的可能就不仅仅是将数据提取出来这么简单,可能需要对数据进行分析并加以利用. 聚合框架:可以使用多个构件创建一个管道,上一个构件的结果传给下一个构件. ...

熟悉HBase基本操作

熟悉HBase基本操作的更多相关文章

随机推荐

热门专题