Hadoop：开发机运行spark程序，抛出异常：ERROR Shell: Failed to locate the winutils binary in the hadoop binary path

问题：

windows开发机运行spark程序，抛出异常：ERROR Shell: Failed to locate the winutils binary in the hadoop binary path，但是可以正常执行，并不影响结果。

// :: WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable

// :: ERROR Shell: Failed to locate the winutils binary in the hadoop binary path

java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.

    at org.apache.hadoop.util.Shell.getQualifiedBinPath(Shell.java:)

    at org.apache.hadoop.util.Shell.getWinUtilsPath(Shell.java:)

    at org.apache.hadoop.util.Shell.<clinit>(Shell.java:)

    at org.apache.hadoop.util.StringUtils.<clinit>(StringUtils.java:)

    at org.apache.hadoop.security.Groups.parseStaticMapping(Groups.java:)

    at org.apache.hadoop.security.Groups.<init>(Groups.java:)

    at org.apache.hadoop.security.Groups.<init>(Groups.java:)

    at org.apache.hadoop.security.Groups.getUserToGroupsMappingService(Groups.java:)

    at org.apache.hadoop.security.UserGroupInformation.initialize(UserGroupInformation.java:)

    at org.apache.hadoop.security.UserGroupInformation.ensureInitialized(UserGroupInformation.java:)

    at org.apache.hadoop.security.UserGroupInformation.loginUserFromSubject(UserGroupInformation.java:)

    at org.apache.hadoop.security.UserGroupInformation.getLoginUser(UserGroupInformation.java:)

    at org.apache.hadoop.security.UserGroupInformation.getCurrentUser(UserGroupInformation.java:)

    at org.apache.spark.util.Utils$$anonfun$getCurrentUserName$.apply(Utils.scala:)

    at org.apache.spark.util.Utils$$anonfun$getCurrentUserName$.apply(Utils.scala:)

    at scala.Option.getOrElse(Option.scala:)

    at org.apache.spark.util.Utils$.getCurrentUserName(Utils.scala:)

    at org.apache.spark.SparkContext.<init>(SparkContext.scala:)

    at org.apache.spark.api.java.JavaSparkContext.<init>(JavaSparkContext.scala:)

    at com.lm.sparkLearning.utils.SparkUtils.getJavaSparkContext(SparkUtils.java:)

    at com.lm.sparkLearning.rdd.RddLearning.main(RddLearning.java:)

// :: WARN RddLearning: singleOperateRdd mapRdd->[, , , ]

// :: WARN RddLearning: singleOperateRdd flatMapRdd->[, , , , , , , ]

// :: WARN RddLearning: singleOperateRdd filterRdd->[, ]

// :: WARN RddLearning: singleOperateRdd distinctRdd->[, , ]

// :: WARN RddLearning: singleOperateRdd sampleRdd->[, ]

// :: WARN RddLearning: the program end

这里所执行的程序是：

package com.lm.sparkLearning.rdd;

import java.util.Arrays;

import java.util.Iterator;

import java.util.List;

import org.apache.spark.api.java.JavaPairRDD;

import org.apache.spark.api.java.JavaRDD;

import org.apache.spark.api.java.JavaSparkContext;

import org.apache.spark.api.java.function.FlatMapFunction;

import org.apache.spark.api.java.function.Function;

import org.apache.spark.api.java.function.Function2;

import org.apache.spark.api.java.function.VoidFunction;

import org.slf4j.Logger;

import org.slf4j.LoggerFactory;

import com.lm.sparkLearning.utils.SparkUtils;

public class RddLearning {

    private static Logger logger = LoggerFactory.getLogger(RddLearning.class);

    public static void main(String[] args) {

        JavaSparkContext jsc = SparkUtils.getJavaSparkContext("RDDLearning", "local[2]", "WARN");

        SparkUtils.createRddExternal(jsc, "D:/README.txt");

        singleOperateRdd(jsc);

        jsc.stop();

        logger.warn("the program end");

    }

    public static void singleOperateRdd(JavaSparkContext jsc) {

        List<Integer> nums = Arrays.asList(new Integer[] { 1, 2, 3, 3 });

        JavaRDD<Integer> numsRdd = SparkUtils.createRddCollect(jsc, nums);

        // map

        JavaRDD<Integer> mapRdd = numsRdd.map(new Function<Integer, Integer>() {

            private static final long serialVersionUID = 1L;

            @Override

            public Integer call(Integer v1) throws Exception {

                return (v1 + 1);

            }

        });

        logger.warn("singleOperateRdd mapRdd->" + mapRdd.collect().toString());

        JavaRDD<Integer> flatMapRdd = numsRdd.flatMap(new FlatMapFunction<Integer, Integer>() {

            private static final long serialVersionUID = 1L;

            @Override

            public Iterable<Integer> call(Integer t) throws Exception {

                return Arrays.asList(new Integer[] { 2, 3 });

            }

        });

        logger.warn("singleOperateRdd flatMapRdd->" + flatMapRdd.collect().toString());

        JavaRDD<Integer> filterRdd = numsRdd.filter(new Function<Integer, Boolean>() {

            private static final long serialVersionUID = 1L;

            @Override

            public Boolean call(Integer v1) throws Exception {

                return v1 > 2;

            }

        });

        logger.warn("singleOperateRdd filterRdd->" + filterRdd.collect().toString());

        JavaRDD<Integer> distinctRdd = numsRdd.distinct();

        logger.warn("singleOperateRdd distinctRdd->" + distinctRdd.collect().toString());

        JavaRDD<Integer> sampleRdd = numsRdd.sample(false, 0.5);

        logger.warn("singleOperateRdd sampleRdd->" + sampleRdd.collect().toString());

    }

}

解决方案：

1.下载winutils的windows版本
GitHub上，有人提供了winutils的windows的版本，项目地址是：https://github.com/srccodes/hadoop-common-2.2.0-bin,直接下载此项目的zip包，下载后是文件名是hadoop-common-2.2.0-bin-master.zip,随便解压到一个目录。
2.配置环境变量
增加用户变量HADOOP_HOME，值是下载的zip包解压的目录，然后在系统变量path里增加$HADOOP_HOME\bin 即可。

添加“%HADOOP%\bin”到path

再次运行程序，正常执行。

Hadoop：开发机运行spark程序，抛出异常：ERROR Shell: Failed to locate the winutils binary in the hadoop binary path的更多相关文章

ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
文章发自:http://www.cnblogs.com/hark0623/p/4170172.html 转发请注明 14/12/17 19:18:53 ERROR Shell: Failed to ...
Spark- ERROR Shell: Failed to locate the winutils binary in the hadoop binary path java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.
运行 mport org.apache.log4j.{Level, Logger} import org.apache.spark.rdd.RDD import org.apache.spark.{S ...
Windows本地运行调试Spark或Hadoop程序失败：ERROR util.Shell: Failed to locate the winutils binary in the hadoop binary path
报错内容 ERROR util.Shell: Failed to locate the winutils binary in the hadoop binary path java.io.IOExce ...
idea 提示：ERROR util.Shell: Failed to locate the winutils binary in the hadoop binary path java.io.IOException解决方法
Windows系统中的IDEA链接Linux里面的Hadoop的api时出现的问题提示:ERROR util.Shell: Failed to locate the winutils binary ...
windows本地调试安装hadoop（idea） : ERROR util.Shell: Failed to locate the winutils binary in the hadoop binary path
1,本地安装hadoop https://mirrors.tuna.tsinghua.edu.cn/apache/hadoop/common/ 下载hadoop对应版本 (我本意是想下载hadoop ...
ERROR [org.apache.hadoop.util.Shell] - Failed to locate the winutils binary in the hadoop binary path
错误日志如下: -- ::, DEBUG [org.apache.hadoop.metrics2.lib.MutableMetricsFactory] - field org.apache.hadoo ...
WIN7下运行hadoop程序报：Failed to locate the winutils binary in the hadoop binary path
之前在mac上调试hadoop程序(mac之前配置过hadoop环境)一直都是正常的.因为工作需要,需要在windows上先调试该程序,然后再转到linux下.程序运行的过程中,报Failed to ...
Windows7系统运行hadoop报Failed to locate the winutils binary in the hadoop binary path错误
程序运行的过程中,报Failed to locate the winutils binary in the hadoop binary path Java.io.IOException: Could ...
Spark报错：Failed to locate the winutils binary in the hadoop binary path
之前在mac上调试hadoop程序(mac之前配置过hadoop环境)一直都是正常的.因为工作需要,需要在windows上先调试该程序,然后再转到linux下.程序运行的过程中,报 Failed to ...

随机推荐

centos7 打造基于python语言Selenium2自动化开发环境
1. 准备安装模块 # yum groupinstall "Development tools" # yum install zlib-devel bzip2-devel ope ...
使用 IntraWeb (20) - 基本控件之 TIWGrid
TIWGrid 最终通过 Html Table 呈现; 其每个 Cell 都是一个 TIWGridCell 对象, Cell 对象的 Control 属性非常好, 可以非常方便地嵌入其他控件. TIW ...
IBM MR10i阵列卡配置Raid0/Raid1/Raid5（转）
RAID5配置: 其实RAID0/RAID1都基本一致,只是选择的类型不同. 1. 开机看到ctrl+h的提示按下相应的键,等ServerRaid 10-i卡初始化完成则进入WebBIOS 配置界面: ...
MikroTik RouterOS授权级别
抄了一份来自淘宝代理商的说明: P系列许可级别(适用于联网的虚拟机,如:云主机,虚拟机,VPS等) 您必须在MikroTik官网 https://mikrotik.com/client/ 上拥有一个帐 ...
HDU 4786 Fibonacci Tree （2013成都1006题）
Fibonacci Tree Time Limit: 4000/2000 MS (Java/Others) Memory Limit: 32768/32768 K (Java/Others)To ...
python及扩展程序安装
安装从官方网站下载python程序,我下载的是python-3.3.2.msi 然后下载python扩展程序,我下载的是pywin32-218.win32-py3.3.exe 最后下载wmi插件,我 ...
Revit API布置卫浴装置
//放置卫浴装置 [Transaction(TransactionMode.Manual)] [Regeneration(RegenerationOption.Manual)] public clas ...
Ubuntu上的Hadoop安装教程
Install Hadoop 2.2.0 on Ubuntu Linux 13.04 (Single-Node Cluster) This tutorial explains how to insta ...
Subversion detected an unsupported working copy version
关于这个错误:Subversion detected an unsupported working copy version while checking the status of 'XXXX'. ...
I/O会一直占用CPU吗？【转载】
转自:https://www.zhihu.com/question/27734728 知乎上看到的一个提问,可以参考如下图:(图片摘自网络) 在进行I/O操作的时候,是将任务交给DMA来处理,请求发 ...

Hadoop：开发机运行spark程序，抛出异常：ERROR Shell: Failed to locate the winutils binary in the hadoop binary path

问题：

解决方案：

Hadoop：开发机运行spark程序，抛出异常：ERROR Shell: Failed to locate the winutils binary in the hadoop binary path的更多相关文章

随机推荐

热门专题