Planning

Ansible node: ansible controller (image: RHEL 8.2, with Ansible)
Hadoop cluster nodes: master, slave1, slave2 (image: CentOS 1810)

0. Initialization

Configure the network and set the hostnames on the Hadoop cluster nodes (a command sketch follows the address list):

10.104.44.25   master
10.104.44.49 slave1
10.104.44.36 slave2
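A minimal per-node sketch of that step, assuming the NIC is named ens33 and a /24 network with gateway 10.104.44.1 (both are assumptions, adjust to the lab network); repeat with the matching address and name on slave1 and slave2:

hostnamectl set-hostname master
nmcli connection modify ens33 ipv4.method manual ipv4.addresses 10.104.44.25/24 ipv4.gateway 10.104.44.1   # NIC name and gateway are assumptions
nmcli connection up ens33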

I. Configure Ansible and passwordless access for the cluster's hadoop user

1. Configure /etc/hosts on the Ansible controller

[root@localhost ansible]# cat /etc/hosts
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
10.104.44.25 master
10.104.44.49 slave1
10.104.44.36 slave2

2. Install Ansible and configure the inventory (hosts)

[root@localhost mnt]# ansible --version
ansible 2.9.11
config file = /etc/ansible/ansible.cfg
configured module search path = ['/root/.ansible/plugins/modules', '/usr/share/ansible/plugins/modules']
ansible python module location = /usr/lib/python3.6/site-packages/ansible
executable location = /usr/bin/ansible
python version = 3.6.8 (default, Dec 5 2019, 15:45:45) [GCC 8.3.1 20191121 (Red Hat 8.3.1-5)]
[root@localhost ansible]# cat hosts
master
slave1
slave2
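For completeness, a hedged sketch of installing Ansible 2.9 on the RHEL 8.2 controller (the repo id assumes a standard Red Hat subscription; EPEL is an alternative):

subscription-manager repos --enable ansible-2.9-for-rhel-8-x86_64-rpms   # repo id is an assumption; or: yum install -y epel-release
yum install -y ansible
ansible --version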

3. Add the hadoop user on the cluster

[root@localhost ansible]# cat user_hadoopcreate
- hosts: all
  vars:
    password: '123'
  tasks:
    - name: create
      user:
        name: hadoop
        uid: 1200
        password: "{{ password | password_hash('sha512') }}"
[root@localhost ansible]# ansible-playbook user_hadoopcreate -k
PLAY RECAP *********************************************************************
master : ok=2 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
slave1 : ok=2 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
slave2 : ok=2 changed=1 unreachable=0 failed=0 skipped=0 rescued=0 ignored=0
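A quick check that the account now exists on every node (sketch; -k still prompts for the root SSH password at this stage):

ansible all -m command -a 'id hadoop' -k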

4. Grant the hadoop user sudo privileges

ansible all -m copy -a "dest=/etc/sudoers.d/ansible content='hadoop ALL=\(ALL\)  NOPASSWD:ALL'" -k
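A slightly safer variant (a sketch, not from the original): inside double quotes the parentheses need no backslash escaping, mode 0440 matches sudoers conventions, and validate lets visudo reject a broken file before it is installed:

ansible all -m copy -a "dest=/etc/sudoers.d/ansible content='hadoop ALL=(ALL) NOPASSWD:ALL' mode=0440 validate='visudo -cf %s'" -k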

5. Give the controller's root user passwordless SSH to the cluster's hadoop user

Generate a key pair on the Ansible controller:

[root@localhost ansible]# ssh-keygen

Push the public key so Ansible can control the hadoop user without a password:

[root@localhost .ssh]# ansible all -m copy -a 'src=/root/.ssh/id_rsa.pub dest=/home/hadoop/.ssh/authorized_keys owner=hadoop group=hadoop' -k
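An alternative sketch using the authorized_key module (available in Ansible 2.9), which appends the key and creates ~/.ssh with the right ownership and permissions by itself:

ansible all -m authorized_key -a "user=hadoop key=\"{{ lookup('file', '/root/.ssh/id_rsa.pub') }}\"" -k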

6. Edit the Ansible configuration file

host_key_checking = False  # do not check host keys

Allow privilege escalation:

become=True
become_method=sudo
become_user=root
become_ask_pass=False

Set the remote user to hadoop:

remote_user = hadoop

The same edits applied with sed:

sed -i 's/remote_user = root/remote_user = hadoop/' ansible.cfg.bak
sed -i 's/^#become/become/g' ansible.cfg.bak
sed -i 's/^# host_key_checking/host_key_checking/g' ansible.cfg.bak

7. Sync /etc/hosts to the cluster

[root@localhost ansible]# ansible all -m copy -a 'src=/etc/hosts dest=/etc/hosts'

8. Configure passwordless SSH between the cluster nodes for the hadoop user

Before running the script, download the expect packages on the controller:

yum reinstall --downloadonly --downloaddir=/etc/ansible/ -y libtcl*
yum reinstall --downloadonly --downloaddir=/etc/ansible/ -y expect

Write a script that generates a key pair and sends the public key to the other cluster nodes:

[root@localhost ansible]# cat ask.sh
expect <<EOF
set timeout 10
spawn ssh-keygen
expect {
    "save the key" { send "\n"; exp_continue }
    "passphrase" { send "\n"; exp_continue }
    "passphrase again" { send "\n"; exp_continue }
}
spawn ssh-copy-id hadoop@slave1
expect {
    "yes/no" { send "yes\n"; exp_continue }
    "password" { send "123\n"; exp_continue }
}
spawn ssh-copy-id hadoop@slave2
expect {
    "yes/no" { send "yes\n"; exp_continue }
    "password" { send "123\n"; exp_continue }
}
spawn ssh-copy-id hadoop@master
expect {
    "yes/no" { send "yes\n"; exp_continue }
    "password" { send "123\n"; exp_continue }
}
EOF
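Once the script has run on a node, a quick check from that node that the hadoop user can reach its peers without a password (sketch):

su - hadoop -c 'ssh -o BatchMode=yes slave1 hostname'   # BatchMode makes ssh fail instead of prompting if the key is missing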

Write a playbook that installs the expect packages on the Hadoop node and runs the key-exchange script:

[root@localhost ansible]# cat key_operation.yml
- hosts: slave2
  remote_user: hadoop
  become: false
  vars:
    package: /etc/ansible/tcl-8.6.8-2.el8.x86_64.rpm
    package2: /etc/ansible/expect-5.45.4-5.el8.x86_64.rpm
  tasks:
    - name: deliver
      copy:
        src: "{{ item }}"
        dest: /home/hadoop
      loop:
        - "{{ package }}"
        - "{{ package2 }}"
    - name: rpm
      yum:
        name:
          - /home/hadoop/tcl-8.6.8-2.el8.x86_64.rpm
          - /home/hadoop/expect-5.45.4-5.el8.x86_64.rpm
        state: present
    - name: try script
      script: /etc/ansible/ask.sh
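To run it (sketch): the play as written only targets slave2, so to cover every node either change hosts: slave2 to hosts: all or run it once per host.

ansible-playbook key_operation.yml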

9. Collect the steps above into a first.sh script, so the whole setup can be deployed by running one script

[root@localhost first]# cat first.sh
#!/bin/bash
expect <<EOF
set timeout 10
spawn ansible-playbook user_hadoopcreate.yml -k
expect {
    "password" { send "123\n"; exp_continue }
}
spawn ansible all -m file -a "path=/home/hadoop/.ssh state=directory owner=hadoop group=hadoop" -k
expect {
    "password" { send "123\n"; exp_continue }
}
spawn ansible all -m copy -a "dest=/etc/sudoers.d/ansible content='hadoop ALL=\(ALL\) NOPASSWD:ALL'" -k
expect {
    "password" { send "123\n"; exp_continue }
}
spawn ansible all -m copy -a "src=/root/.ssh/id_rsa.pub dest=/home/hadoop/.ssh/authorized_keys owner=hadoop group=hadoop" -k
expect {
    "password" { send "123\n"; exp_continue }
}
EOF

ansible all -m copy -a 'src=/etc/hosts dest=/etc/hosts'

# change configuration
sed -i 's/remote_user = root/remote_user = hadoop/' ansible.cfg
sed -i 's/^#become/become/g' ansible.cfg
sed -i 's/^# host_key_checking/host_key_checking/g' ansible.cfg

yum reinstall --downloadonly --downloaddir=/etc/ansible/ -y libtcl*
yum reinstall --downloadonly --downloaddir=/etc/ansible/ -y expect

ansible-playbook key_operation.yml
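With the playbooks, ask.sh and ansible.cfg in the current directory (assumed here to be /etc/ansible), the bootstrap then reduces to:

bash first.sh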

II. Basic Hadoop configuration

Upload the JDK and Hadoop tarballs to the Ansible node.
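The playbook later in this section unarchives both tarballs from the controller's /opt directory, so a quick sanity check (file names taken from that playbook):

ls -lh /opt/jdk-8u152-linux-x64.tar.gz /opt/hadoop-2.7.1.tar.gz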

Contents of the four configuration snippets:

[root@localhost second]# cat hdfs.config
<configuration>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:/opt/hadoop/dfs/name</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>file:/opt/hadoop/dfs/data</value>
</property>
<property>
<name>dfs.replication</name>
<value>2</value>
</property>
</configuration>

[root@localhost second]# cat yarn.config
<configuration>
<property>
<name>yarn.resourcemanager.address</name>
<value>master:8032</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value>master:8030</value>
</property>
<property>
<name>yarn.resourcemanager.webapp.address</name>
<value>master:8088</value>
</property>
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value>master:8031</value>
</property>
<property>
<name>yarn.resourcemanager.admin.address</name>
<value>master:8033</value>
</property>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
</configuration>

[root@localhost second]# cat core.config
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://master:9000</value>
</property>
<property>
<name>io.file.buffer.size</name>
<value>131072</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>file:/opt/hadoop/tmp</value>
</property>
</configuration>

[root@localhost second]# cat mapred.config
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapreduce.jobhistory.address</name>
<value>master:10020</value>
</property>
<property>
<name>mapreduce.jobhistory.webapp.address</name>
<value>master:19888</value>
</property>
</configuration>

The ansible-playbook that applies the configuration:

cat clucig.yml 

# disable firewalld and SELinux
- hosts: all
  tasks:
    - name: Disable SELinux
      selinux:
        state: disabled
    - name: stop firewalld
      systemd:
        state: stopped
        name: firewalld
        enabled: no
      ignore_errors: yes

# unpack the archives onto the target hosts
- hosts: all
  tasks:
    - name: tar
      unarchive:
        src: "/opt/{{ item }}"
        dest: /opt/
      loop:
        - jdk-8u152-linux-x64.tar.gz
        - hadoop-2.7.1.tar.gz
    - name: mv
      shell: mv /opt/jdk1.8.0_152 /opt/jdk
    - name: mv2
      shell: mv /opt/hadoop-2.7.1 /opt/hadoop

# set the environment variables
- hosts: all
  tasks:
    - name: copy
      copy:
        dest: /etc/profile.d/hadoop.sh
        content: "export JAVA_HOME=/opt/jdk\nexport HADOOP_HOME=/opt/hadoop\nexport PATH=${JAVA_HOME}/bin:${HADOOP_HOME}/bin:${HADOOP_HOME}/sbin:$PATH\n"
    - name: echo
      shell: echo 'export JAVA_HOME=/opt/jdk' >> /opt/hadoop/etc/hadoop/hadoop-env.sh
    # edit the configuration files
    # 1. hdfs-site.xml
    - name: delete something
      shell: sed -i '/<configuration>/,/<\/configuration>/d' /opt/hadoop/etc/hadoop/hdfs-site.xml
    - name: copy
      copy:
        src: hdfs.config
        dest: /opt/
    - name: add configuration
      shell: cat /opt/hdfs.config >> /opt/hadoop/etc/hadoop/hdfs-site.xml
    - name: shell operate
      shell: mkdir -p /opt/hadoop/dfs/{name,data}
    # 2. core-site.xml
    - name: delete something
      shell: sed -i '/<configuration>/,/<\/configuration>/d' /opt/hadoop/etc/hadoop/core-site.xml
    - name: copy
      copy:
        src: core.config
        dest: /opt/
    - name: add configuration
      shell: cat /opt/core.config >> /opt/hadoop/etc/hadoop/core-site.xml
    - name: shell operate2
      shell: mkdir -p /opt/hadoop/tmp
    # 3. mapred-site.xml
    - name: copy template
      shell: cp /opt/hadoop/etc/hadoop/mapred-site.xml.template /opt/hadoop/etc/hadoop/mapred-site.xml
    - name: delete something
      shell: sed -i '/<configuration>/,/<\/configuration>/d' /opt/hadoop/etc/hadoop/mapred-site.xml
    - name: copy
      copy:
        src: mapred.config
        dest: /opt/
    - name: add configuration
      shell: cat /opt/mapred.config >> /opt/hadoop/etc/hadoop/mapred-site.xml
    # 4. yarn-site.xml
    - name: delete something
      shell: sed -i '/<configuration>/,/<\/configuration>/d' /opt/hadoop/etc/hadoop/yarn-site.xml
    - name: copy
      copy:
        src: yarn.config
        dest: /opt/
    - name: add configuration
      shell: cat /opt/yarn.config >> /opt/hadoop/etc/hadoop/yarn-site.xml
    # configuration finished
    - name: master or slave
      copy:
        dest: /opt/hadoop/etc/hadoop/masters
        content: "10.104.44.25\n"
    - name: master or slave
      copy:
        dest: /opt/hadoop/etc/hadoop/slaves
        content: "10.104.44.36\n10.104.44.49\n"
    - name: chmod
      shell: chown -R hadoop.hadoop /opt/

This playbook produces the four Hadoop configuration files on every node.
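To apply it (sketch; run from the directory that holds clucig.yml and the four *.config snippets):

ansible-playbook clucig.yml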

III. Start Hadoop

Configure hostname resolution on Windows:

c:\windows\system32\drivers\etc\hosts

Start Hadoop manually on master as the hadoop user. Format the NameNode before the first start, then bring the daemons up:

hdfs namenode -format

start-dfs.sh

start-all.sh
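A hedged way to confirm the daemons came up (which processes appear depends on the node you run it on):

jps                      # NameNode/SecondaryNameNode/ResourceManager on master, DataNode/NodeManager on the slaves
hdfs dfsadmin -report    # should list both DataNodes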

Upload a file

hdfs dfs -put rhel-8.2-x86_64-dvd.iso
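A quick confirmation that the file landed in HDFS (sketch):

hdfs dfs -ls -R /        # look for rhel-8.2-x86_64-dvd.iso
hdfs dfs -df -h          # capacity and usage across the DataNodes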
