Yarn【架构、原理、多队列配置】

一.什么是yarn
二.yarn的基本架构和角色
三.yarn的工作机制
四.任务提交流程
五.资源调度器
六.容量调度器多队列提交案例实操
- 1.案例：配置default、hive多队列
- 2.配置Hive的默认提交队列

一.什么是yarn

Yarn是一个资源调度平台，负责为运算程序提供服务器运算资源，相当于一个分布式的操作系统平台，而MapReduce等运算程序则相当于运行于操作系统之上的应用程序。

二.yarn的基本架构和角色

yarn主要由ResourceManager、NodeManager、ApplicationMaster和Container等组件构成。

三.yarn的工作机制

（1）MR程序提交到客户端所在的节点。

（2）YarnRunner向ResourceManager申请一个Application。

（3）RM将该应用程序的资源路径返回给YarnRunner。

（4）该程序将运行所需资源提交到HDFS上。

（5）程序资源提交完毕后，申请运行mrAppMaster。

（6）RM将用户的请求初始化成一个Task。

（7）其中一个NodeManager领取到Task任务。

（8）该NodeManager创建容器Container，并产生MRAppmaster。

（9）Container从HDFS上拷贝资源到本地。

（10）MRAppmaster向RM 申请运行MapTask资源。

（11）RM将运行MapTask任务分配给另外两个NodeManager，另两个NodeManager分别领取任务并创建容器。

（12）MR向两个接收到任务的NodeManager发送程序启动脚本，这两个NodeManager分别启动MapTask，MapTask对数据分区排序。

（13）MrAppMaster等待所有MapTask运行完毕后，向RM申请容器，运行ReduceTask。

（14）ReduceTask向MapTask获取相应分区的数据。

（15）程序运行完毕后，MR会向RM申请注销自己。

四.任务提交流程

作业提交全过程详解

（1）:作业提交

第1步：Client调用job.waitForCompletion方法，向整个集群提交MapReduce作业。

第2步：Client向RM申请一个作业id。

第3步：RM给Client返回该job资源的提交路径和作业id。

第4步：Client提交jar包、切片信息和配置文件到指定的资源提交路径。

第5步：Client提交完资源后，向RM申请运行MrAppMaster。

（2）作业初始化

第6步：当RM收到Client的请求后，将该job添加到容量调度器中。

第7步：某一个空闲的NM领取到该Job。

第8步：该NM创建Container，并产生MRAppmaster。

第9步：下载Client提交的资源到本地。

（3）:任务分配

第10步：MrAppMaster向RM申请运行多个MapTask任务资源。

第11步：RM将运行MapTask任务分配给另外两个NodeManager，另两个NodeManager分别领取任务并创建容器。

（4）:任务运行

第12步：MR向两个接收到任务的NodeManager发送程序启动脚本，这两个NodeManager分别启动MapTask，MapTask对数据分区排序。

第13步：MrAppMaster等待所有MapTask运行完毕后，向RM申请容器，运行ReduceTask。

第14步：ReduceTask向MapTask获取相应分区的数据。

第15步：程序运行完毕后，MR会向RM申请注销自己。

（5）:进度和状态更新

YARN中的任务将其进度和状态(包括counter)返回给应用管理器, 客户端每秒(通过mapreduce.client.progressmonitor.pollinterval设置)向应用管理器请求进度更新, 展示给用户。

（6）:作业完成

除了向应用管理器请求作业进度外, 客户端每5秒都会通过调用waitForCompletion()来检查作业是否完成。时间间隔可以通过mapreduce.client.completion.pollinterval来设置。作业完成之后, 应用管理器和Container会清理工作状态。作业的信息会被作业历史服务器存储以备之后用户核查。

五.资源调度器

Hadoop作业调度器主要有三种：FIFO、Capacity Scheduler和Fair Scheduler。Hadoop3.1.3默认的资源调度器是Capacity Scheduler。

具体设置详见：yarn-default.xml文件

<property>

    <description>The class to use as the resource scheduler.</description>

    <name>yarn.resourcemanager.scheduler.class</name>

<value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler</value>

</property>

FIFO

先进先出，同一时间队列中只有一个任务在执行

容量调度器

多队列；每个队列内部先进先出, 同一时间队列中只有一个任务在执行, 队列的并行度为队列的个数。

公平调度器

多队列；每个队列内部按照缺额大小分配资源启动任务，同一时间队列中有多个任务执行。队列的并行度大于等于队列的个数

六.容量调度器多队列提交案例实操

需求

Yarn默认调度器为Capacity Scheduler（容量调度器），且默认只有一个队列——default。该调度器单个队列内的调度策略为FIFO，故单个队列的任务并行度为1，即一个队列，同一时刻，只能有一个任务运行。

在实际使用中会出现单个任务阻塞整个队列的情况。同时，随着业务的增长，公司需要分业务限制集群使用率。这就需要我们按照业务种类配置多条任务队列。

1.案例：配置default、hive多队列

默认Yarn的配置下，容量调度器只有一条Default队列。在$HADOOP_HOME/etc/hadoop/capacity-schdualer.xml中可以配置多条队列，并降低default队列资源占比：

①增加队列，添加队列的属性配置

<!--

  Licensed under the Apache License, Version 2.0 (the "License");

  you may not use this file except in compliance with the License.

  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software

  distributed under the License is distributed on an "AS IS" BASIS,

  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

  See the License for the specific language governing permissions and

  limitations under the License. See accompanying LICENSE file.

-->

<configuration>

   <!-- 容量调度器最多可以容纳多少个job-->

  <property>

    <name>yarn.scheduler.capacity.maximum-applications</name>

    <value>10000</value>

    <description>

      Maximum number of applications that can be pending and running.

    </description>

  </property>

  <!-- 当前队列中启动的MRAppMaster进程，所占用的资源可以达到队列总资源的多少

		通过这个参数可以限制队列中提交的Job数量

  -->

  <property>

    <name>yarn.scheduler.capacity.maximum-am-resource-percent</name>

    <value>0.1</value>

    <description>

      Maximum percent of resources in the cluster which can be used to run

      application masters i.e. controls number of concurrent running

      applications.

    </description>

  </property>

  <!-- 为Job分配资源时，使用什么策略进行计算

  -->

  <property>

    <name>yarn.scheduler.capacity.resource-calculator</name>

    <value>org.apache.hadoop.yarn.util.resource.DefaultResourceCalculator</value>

    <description>

      The ResourceCalculator implementation to be used to compare

      Resources in the scheduler.

      The default i.e. DefaultResourceCalculator only uses Memory while

      DominantResourceCalculator uses dominant-resource to compare

      multi-dimensional resources such as Memory, CPU etc.

    </description>

  </property>

   <!-- root队列中有哪些子队列-->

  <property>

    <name>yarn.scheduler.capacity.root.queues</name>

    <value>default,hive</value>

    <description>

      The queues at the this level (root is the root queue).

    </description>

  </property>

  <!-- root队列中default队列占用的容量百分比

		所有子队列的容量相加必须等于100

  -->

  <property>

    <name>yarn.scheduler.capacity.root.default.capacity</name>

    <value>30</value>

    <description>Default queue target capacity.</description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.root.hive.capacity</name>

    <value>70</value>

    <description>Default queue target capacity.</description>

  </property>

    <!-- 队列中用户能使用此队列资源的极限百分比

  -->

  <property>

    <name>yarn.scheduler.capacity.root.default.user-limit-factor</name>

    <value>1</value>

    <description>

      Default queue user limit a percentage from 0.0 to 1.0.

    </description>

  </property>

   <property>

    <name>yarn.scheduler.capacity.root.hive.user-limit-factor</name>

    <value>1</value>

    <description>

      Default queue user limit a percentage from 0.0 to 1.0.

    </description>

  </property>

  <!-- root队列中default队列占用的容量百分比的最大值

  -->

  <property>

    <name>yarn.scheduler.capacity.root.default.maximum-capacity</name>

    <value>100</value>

    <description>

      The maximum capacity of the default queue.

    </description>

  </property>

   <property>

    <name>yarn.scheduler.capacity.root.hive.maximum-capacity</name>

    <value>100</value>

    <description>

      The maximum capacity of the default queue.

    </description>

  </property>

    <!-- root队列中每个队列的状态

  -->

  <property>

    <name>yarn.scheduler.capacity.root.default.state</name>

    <value>RUNNING</value>

    <description>

      The state of the default queue. State can be one of RUNNING or STOPPED.

    </description>

  </property>

    <property>

    <name>yarn.scheduler.capacity.root.hive.state</name>

    <value>RUNNING</value>

    <description>

      The state of the default queue. State can be one of RUNNING or STOPPED.

    </description>

  </property>

  <!-- 限制向default队列提交的用户-->

  <property>

    <name>yarn.scheduler.capacity.root.default.acl_submit_applications</name>

    <value>*</value>

    <description>

      The ACL of who can submit jobs to the default queue.

    </description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.root.hive.acl_submit_applications</name>

    <value>*</value>

    <description>

      The ACL of who can submit jobs to the default queue.

    </description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.root.default.acl_administer_queue</name>

    <value>*</value>

    <description>

      The ACL of who can administer jobs on the default queue.

    </description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.root.hive.acl_administer_queue</name>

    <value>*</value>

    <description>

      The ACL of who can administer jobs on the default queue.

    </description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.node-locality-delay</name>

    <value>40</value>

    <description>

      Number of missed scheduling opportunities after which the CapacityScheduler

      attempts to schedule rack-local containers.

      Typically this should be set to number of nodes in the cluster, By default is setting

      approximately number of nodes in one rack which is 40.

    </description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.queue-mappings</name>

    <value></value>

    <description>

      A list of mappings that will be used to assign jobs to queues

      The syntax for this list is [u|g]:[name]:[queue_name][,next mapping]*

      Typically this list will be used to map users to queues,

      for example, u:%user:%user maps all users to queues with the same name

      as the user.

    </description>

  </property>

  <property>

    <name>yarn.scheduler.capacity.queue-mappings-override.enable</name>

    <value>false</value>

    <description>

      If a queue mapping is present, will it override the value specified

      by the user? This can be used by administrators to place jobs in queues

      that are different than the one specified by the user.

      The default is false.

    </description>

  </property>

</configuration>

②分发配置文件到集群，重启Yarn

③测试，向default ，hive队列分别提交任务

提交到default队列：

hadoop jar /opt/module/hadoop-3.1.3/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.1.3.jar pi -Dmapreduce.job.queuename=hive 1 1

提交到hive队列

hadoop jar /opt/module/hadoop-3.1.3/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.1.3.jar pi -Dmapreduce.job.queuename=hive 1 1

查看yarn任务ui界面:http://hadoop103:8088/cluster/scheduler

提交就可以查看了，不然任务跑完就看不到了....

2.配置Hive的默认提交队列

方式一

当前hive的shell窗口有效，关闭就不行了

SET mapreduce.job.queuename=hive;

SET mapreduce.job.priority=HIGH;

方式二

修改配置文件，永久生效

<!--配置hive默认的提交队列-->

<property>

    <name>mapreduce.job.queuename</name>

    <value>hive</value>

</property>