Spark源码分析

ExecutorBackend

很简单的接口

package org.apache.spark.executor

/**

 * A pluggable interface used by the Executor to send updates to the cluster scheduler.

 */

private[spark] trait ExecutorBackend {

  def statusUpdate(taskId: Long, state: TaskState, data: ByteBuffer)

}

StandaloneExecutorBackend

维护executor, 并负责注册executor以及executor和driver之间的通信

private[spark] class StandaloneExecutorBackend(

    driverUrl: String,

    executorId: String,

    hostPort: String,

    cores: Int)

  extends Actor

  with ExecutorBackend

  with Logging {

  var executor: Executor = null

  var driver: ActorRef = null

  override def preStart() {

    logInfo("Connecting to driver: " + driverUrl)

    driver = context.actorFor(driverUrl) // 创建driver actor ref, 以便于和driver通信

    driver ! RegisterExecutor(executorId, hostPort, cores) // 向driver注册executor

  }

  override def receive = {

    case RegisteredExecutor(sparkProperties) =>

      logInfo("Successfully registered with driver")

      // Make this host instead of hostPort ?

      executor = new Executor(executorId, Utils.parseHostPort(hostPort)._1, sparkProperties) // 当注册成功后, 创建Executor

    case RegisterExecutorFailed(message) =>

      logError("Slave registration failed: " + message)

      System.exit(1)

    case LaunchTask(taskDesc) =>

      logInfo("Got assigned task " + taskDesc.taskId)

      if (executor == null) {

        logError("Received launchTask but executor was null")

        System.exit(1)

      } else {

        executor.launchTask(this, taskDesc.taskId, taskDesc.serializedTask) // 调用executor.launchTask,启动task

      }

    case Terminated(_) | RemoteClientDisconnected(_, _) | RemoteClientShutdown(_, _) =>

      logError("Driver terminated or disconnected! Shutting down.")

      System.exit(1)

  }

  override def statusUpdate(taskId: Long, state: TaskState, data: ByteBuffer) {

    driver ! StatusUpdate(executorId, taskId, state, data) // 当task状态变化时, 报告给driver actor

  }

}

Executor

对于Executor, 维护一个threadPool, 可以run多个task, 取决于core的个数

所以对于launchTask, 就是在threadPool中挑个thread去run TaskRunner

private[spark] class Executor(

    executorId: String,

    slaveHostname: String,

    properties: Seq[(String, String)])

  extends Logging

{

  // Initialize Spark environment (using system properties read above)

  val env = SparkEnv.createFromSystemProperties(executorId, slaveHostname, 0, false, false)

  SparkEnv.set(env)

  // Start worker thread pool

  val threadPool = new ThreadPoolExecutor(

    1, 128, 600, TimeUnit.SECONDS, new SynchronousQueue[Runnable])

  def launchTask(context: ExecutorBackend, taskId: Long, serializedTask: ByteBuffer) {

    threadPool.execute(new TaskRunner(context, taskId, serializedTask))

  }

TaskRunner

  class TaskRunner(context: ExecutorBackend, taskId: Long, serializedTask: ByteBuffer)

    extends Runnable {

    override def run() {

      try {

        SparkEnv.set(env)

        Accumulators.clear()

        val (taskFiles, taskJars, taskBytes) = Task.deserializeWithDependencies(serializedTask) // 反序列化

        updateDependencies(taskFiles, taskJars)

        val task = ser.deserialize[Task[Any]](taskBytes, Thread.currentThread.getContextClassLoader) // 反序列化

        attemptedTask = Some(task)

        logInfo("Its epoch is " + task.epoch)

        env.mapOutputTracker.updateEpoch(task.epoch)

        taskStart = System.currentTimeMillis()

        val value = task.run(taskId.toInt)  // 调用task.run执行真正的逻辑

        val taskFinish = System.currentTimeMillis()

        val accumUpdates = Accumulators.values

        val result = new TaskResult(value, accumUpdates, task.metrics.getOrElse(null)) // 生成TaskResult

        val serializedResult = ser.serialize(result) // 将TaskResult序列化

        logInfo("Serialized size of result for " + taskId + " is " + serializedResult.limit)

        context.statusUpdate(taskId, TaskState.FINISHED, serializedResult) // 将任务完成和taskresult,通过statusUpdate报告给driver

        logInfo("Finished task ID " + taskId)

      } catch { // 处理各种fail, 同样也要用statusUpdate event通知driver

        case ffe: FetchFailedException => {

          val reason = ffe.toTaskEndReason

          context.statusUpdate(taskId, TaskState.FAILED, ser.serialize(reason))

        }

        case t: Throwable => {

          val serviceTime = (System.currentTimeMillis() - taskStart).toInt

          val metrics = attemptedTask.flatMap(t => t.metrics)

          for (m <- metrics) {

            m.executorRunTime = serviceTime

            m.jvmGCTime = getTotalGCTime - startGCTime

          }

          val reason = ExceptionFailure(t.getClass.getName, t.toString, t.getStackTrace, metrics)

          context.statusUpdate(taskId, TaskState.FAILED, ser.serialize(reason))

          // TODO: Should we exit the whole executor here? On the one hand, the failed task may

          // have left some weird state around depending on when the exception was thrown, but on

          // the other hand, maybe we could detect that when future tasks fail and exit then.

          logError("Exception in task ID " + taskId, t)

          //System.exit(1)

        }

      }

    }

  }

Spark源码分析 – Executor的更多相关文章

Spark源码分析 – 汇总索引
http://jerryshao.me/categories.html#architecture-ref http://blog.csdn.net/pelick/article/details/172 ...
Spark源码分析（三）-TaskScheduler创建
原创文章,转载请注明: 转载自http://www.cnblogs.com/tovin/p/3879151.html 在SparkContext创建过程中会调用createTaskScheduler函 ...
【转】Spark源码分析之-deploy模块
原文地址:http://jerryshao.me/architecture/2013/04/30/Spark%E6%BA%90%E7%A0%81%E5%88%86%E6%9E%90%E4%B9%8B- ...
Spark源码分析：多种部署方式之间的区别与联系（转）
原文链接:Spark源码分析:多种部署方式之间的区别与联系(1) 从官方的文档我们可以知道,Spark的部署方式有很多种:local.Standalone.Mesos.YARN.....不同部署方式的 ...
Spark 源码分析 -- task实际执行过程
Spark源码分析 – SparkContext 中的例子, 只分析到sc.runJob 那么最终是怎么执行的? 通过DAGScheduler切分成Stage, 封装成taskset, 提交给Task ...
Spark源码分析 – BlockManager
参考, Spark源码分析之-Storage模块对于storage, 为何Spark需要storage模块?为了cache RDD Spark的特点就是可以将RDD cache在memory或dis ...
Spark源码分析 -- TaskScheduler
Spark在设计上将DAGScheduler和TaskScheduler完全解耦合, 所以在资源管理和task调度上可以有更多的方案现在支持, LocalSheduler, ClusterSched ...
Spark源码分析 – SchedulerBackend
SchedulerBackend, 两个任务, 申请资源和task执行和管理对于SparkDeploySchedulerBackend, 基于actor模式, 主要就是启动和管理两个actor De ...
Spark源码分析 – Deploy
参考, Spark源码分析之-deploy模块 Client Client在SparkDeploySchedulerBackend被start的时候, 被创建, 代表一个application和s ...

随机推荐

219. Insert Node in Sorted Linked List【Naive】
Insert a node in a sorted linked list. Example Given list = 1->4->6->8 and val = 5. Return ...
windows phone 切换多语言时，商店标题显示错误的问题
前段时间,用业余时间写了一款 wp8 app(“超级滤镜”商店,中文地址:英文地址),在多语言的时候,给 app title 和 app tile title 进行多语言时(参考 MSDN),中文商店 ...
html-blogsdemo
博客标题小样,代码预览是有动态效果的,但在博客园发布就没动画了,知道的大神麻烦告知下,谢谢. code <!DOCTYPE html> <html lang="en&quo ...
【转】logstash配置java环境
1.bin/logstash,新增 JAVA_CMD=/home/admin/soft/jdk1.8.0_121/bin JAVA_HOME=/home/admin/soft/jdk1.8.0_121 ...
替换元素节点replaceChild()
替换元素节点replaceChild() replaceChild 实现子节点(对象)的替换.返回被替换对象的引用. 语法: node.replaceChild (newnode,oldnew ) 参 ...
Windows Phone Unit Test 环境搭建
单元测试对工程质量带来的作用就不详细说明了,本文只讨论如何在WP开发环境下搭建测试工程历史从WP7时代官方是不支持UnitTest工程的,因此需要采用WPToolkitTest这个工程来实 ...
sqlite3命令读出sqlite3格式的文件内容案例
/********************************************************************* * Author : Samson * Date ...
关于Cocos2d-x开发一个游戏的过程自述
我在2016年12月6号完成了我的第一个自己独立完成的游戏,期间遇到各种各样的问题和困难,但是幸运的是问题都一一被解决了,现在我想总结一个整个的制作游戏的过程使用的环境是VS2013+cocos2d ...
【cf492】E. Vanya and Field（拓展欧几里得）
http://codeforces.com/contest/492/problem/E 一开始没时间想,,诶真是.. 挺水的一道题.. 将每个点的横坐标都转换成0,然后找纵坐标有多少即可..即解方程 ...
【BZOJ】1684: [Usaco2005 Oct]Close Encounter（暴力+c++）
http://www.lydsy.com/JudgeOnline/problem.php?id=1684 这货完全在考精度啊.. 比如奇葩 (llf)a/b*i (llf)(a/b*i)和(llf)( ...

Spark源码分析 – Executor

ExecutorBackend

StandaloneExecutorBackend

Executor

Spark源码分析 – Executor的更多相关文章

随机推荐

热门专题