【原创】大叔问题定位分享（20）hdfs文件create写入正常，append写入报错

最近在hdfs写文件的时候发现一个问题，create写入正常，append写入报错，每次都能重现，代码示例如下：

        FileSystem fs = FileSystem.get(conf);

        OutputStream out = fs.create(file);

        IOUtils.copyBytes(in, out, 4096, true); //正常

        out = fs.append(file);

        IOUtils.copyBytes(in, out, 4096, true); //报错

通过hdfs fsck命令检查出问题的文件，发现只有一个副本，难道是因为这个？

看FileSystem.append执行过程：

org.apache.hadoop.fs.FileSystem

    public abstract FSDataOutputStream append(Path var1, int var2, Progressable var3) throws IOException;

实现类在这里：

org.apache.hadoop.hdfs.DistributedFileSystem

    public FSDataOutputStream append(Path f, final int bufferSize, final Progressable progress) throws IOException {

        this.statistics.incrementWriteOps(1);

        Path absF = this.fixRelativePart(f);

        return (FSDataOutputStream)(new FileSystemLinkResolver<FSDataOutputStream>() {

            public FSDataOutputStream doCall(Path p) throws IOException, UnresolvedLinkException {

                return DistributedFileSystem.this.dfs.append(DistributedFileSystem.this.getPathName(p), bufferSize, progress, DistributedFileSystem.this.statistics);

            }

            public FSDataOutputStream next(FileSystem fs, Path p) throws IOException {

                return fs.append(p, bufferSize);

            }

        }).resolve(this, absF);

    }

这里会调用DFSClient.append方法

org.apache.hadoop.hdfs.DFSClient

    private DFSOutputStream append(String src, int buffersize, Progressable progress) throws IOException {

        this.checkOpen();

        DFSOutputStream result = this.callAppend(src, buffersize, progress);

        this.beginFileLease(result.getFileId(), result);

        return result;

    }

    private DFSOutputStream callAppend(String src, int buffersize, Progressable progress) throws IOException {

        LocatedBlock lastBlock = null;

        try {

            lastBlock = this.namenode.append(src, this.clientName);

        } catch (RemoteException var6) {

            throw var6.unwrapRemoteException(new Class[]{AccessControlException.class, FileNotFoundException.class, SafeModeException.class, DSQuotaExceededException.class, UnsupportedOperationException.class, UnresolvedPathException.class, SnapshotAccessControlException.class});

        }

        HdfsFileStatus newStat = this.getFileInfo(src);

        return DFSOutputStream.newStreamForAppend(this, src, buffersize, progress, lastBlock, newStat, this.dfsClientConf.createChecksum());

    }

DFSClient.append最终会调用NameNodeRpcServer的append方法

org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer

    public LocatedBlock append(String src, String clientName) throws IOException {

        this.checkNNStartup();

        String clientMachine = getClientMachine();

        if (stateChangeLog.isDebugEnabled()) {

            stateChangeLog.debug("*DIR* NameNode.append: file " + src + " for " + clientName + " at " + clientMachine);

        }

        this.namesystem.checkOperation(OperationCategory.WRITE);

        LocatedBlock info = this.namesystem.appendFile(src, clientName, clientMachine);

        this.metrics.incrFilesAppended();

        return info;

    }

这里调用到FSNamesystem.append

org.apache.hadoop.hdfs.server.namenode.FSNamesystem

    LocatedBlock appendFile(String src, String holder, String clientMachine) throws AccessControlException, SafeModeException,

...

                lb = this.appendFileInt(src, holder, clientMachine, cacheEntry != null);

    private LocatedBlock appendFileInt(String srcArg, String holder, String clientMachine, boolean logRetryCache) throws

...

                lb = this.appendFileInternal(pc, src, holder, clientMachine, logRetryCache);

    private LocatedBlock appendFileInternal(FSPermissionChecker pc, String src, String holder, String clientMachine, boolean logRetryCache) throws AccessControlException, UnresolvedLinkException, FileNotFoundException, IOException {

        assert this.hasWriteLock();

        INodesInPath iip = this.dir.getINodesInPath4Write(src);

        INode inode = iip.getLastINode();

        if (inode != null && inode.isDirectory()) {

            throw new FileAlreadyExistsException("Cannot append to directory " + src + "; already exists as a directory.");

        } else {

            if (this.isPermissionEnabled) {

                this.checkPathAccess(pc, src, FsAction.WRITE);

            }

            try {

                if (inode == null) {

                    throw new FileNotFoundException("failed to append to non-existent file " + src + " for client " + clientMachine);

                } else {

                    INodeFile myFile = INodeFile.valueOf(inode, src, true);

                    BlockStoragePolicy lpPolicy = this.blockManager.getStoragePolicy("LAZY_PERSIST");

                    if (lpPolicy != null && lpPolicy.getId() == myFile.getStoragePolicyID()) {

                        throw new UnsupportedOperationException("Cannot append to lazy persist file " + src);

                    } else {

                        this.recoverLeaseInternal(myFile, src, holder, clientMachine, false);

                        myFile = INodeFile.valueOf(this.dir.getINode(src), src, true);

                        BlockInfo lastBlock = myFile.getLastBlock();

                        if (lastBlock != null && lastBlock.isComplete() && !this.getBlockManager().isSufficientlyReplicated(lastBlock)) {

                            throw new IOException("append: lastBlock=" + lastBlock + " of src=" + src + " is not sufficiently replicated yet.");

                        } else {

                            return this.prepareFileForWrite(src, iip, holder, clientMachine, true, logRetryCache);

                        }

                    }

                }

            } catch (IOException var11) {

                NameNode.stateChangeLog.warn("DIR* NameSystem.append: " + var11.getMessage());

                throw var11;

            }

        }

    }

    public boolean isSufficientlyReplicated(BlockInfo b) {

        int replication = Math.min(this.minReplication, this.getDatanodeManager().getNumLiveDataNodes());

        return this.countNodes(b).liveReplicas() >= replication;

    }

在append文件的时候，会首先取出这个文件最后一个block，然后会检查这个block是否满足副本要求，如果不满足就抛出异常，如果满足就准备写入；
看来原因确实是因为文件只有1个副本导致append报错，那为什么新建文件只有1个副本，后来找到原因是因为机架配置有问题导致的，详见 https://www.cnblogs.com/barneywill/p/10114504.html

【原创】大叔问题定位分享（20）hdfs文件create写入正常，append写入报错的更多相关文章

【报错】spring整合activeMQ,pom.xml文件缺架包，启动报错：Caused by: java.lang.ClassNotFoundException: org.apache.xbean.spring.context.v2.XBeanNamespaceHandler
spring版本:4.3.13 ActiveMq版本:5.15 ======================================================== spring整合act ...
（未解决）flume监控目录，抓取文件内容推送给kafka，报错
flume监控目录,抓取文件内容推送给kafka,报错: /export/datas/destFile/220104_YT1013_8c5f13f33c299316c6720cc51f94f7a0_2 ...
【原创】大叔问题定位分享（5）Kafka客户端报错SocketException: Too many open files 打开的文件过多
kafka0.8.1 一问题 10月22号应用系统忽然报错: [2014/12/22 11:52:32.738]java.net.SocketException: 打开的文件过多 [2014/12/ ...
【原创】大叔问题定位分享（13）HBase Region频繁下线
问题现象:hive执行sql报错 select count(*) from test_hive_table; 报错 Error: java.io.IOException: org.apache.had ...
【原创】大叔问题定位分享（3）Kafka集群broker进程逐个报错退出
kafka0.8.1 一问题现象生产环境kafka服务器134.135.136分别在10月11号.10月13号挂掉: 134日志 [2014-10-13 16:45:41,902] FATAL [ ...
【原创】大叔问题定位分享（32）mysql故障恢复
mysql启动失败,一直crash,报错如下: 2019-03-14T11:15:12.937923Z 0 [Note] InnoDB: Uncompressed page, stored check ...
【原创】大叔问题定位分享（30）mesos agent启动失败：Failed to perform recovery: Incompatible agent info detected
mesos agent启动失败,报错如下: Feb 15 22:03:18 server1.bj mesos-slave[1190]: E0215 22:03:18.622994 1192 slave ...
【原创】大叔问题定位分享（28）openssh升级到7.4之后ssh跳转异常
服务器集群之间忽然ssh跳转不通 # ssh 192.168.0.1The authenticity of host '192.168.0.1 (192.168.0.1)' can't be esta ...
【原创】大叔问题定位分享（25）ambari metrics collector内置standalone hbase启动失败
ambari metrics collector内置hbase目录位于 /usr/lib/ams-hbase 配置位于 /etc/ams-hbase/conf 通过ruby启动 /usr/lib/am ...

随机推荐

building 'twisted.test.raiser' extension error: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual C++ Build Tools": http://landinghub.visualstudio.com/visual-cpp-build-tools
Error msg: building 'twisted.test.raiser' extension error: Microsoft Visual C++ 14.0 is required. Ge ...
Django之自带的认证系统 auth模块
01-Django自带的用户认证我们在开发一个网站的时候,无可避免的需要设计实现网站的用户系统.此时我们需要实现包括用户注册.用户登录.用户认证.注销.修改密码等功能,这还真是个麻烦的事情呢. Dj ...
xshell连接虚拟机ubuntu
在ubuntu界面,打开终端terminal,输入: ifconfig 出现如下界面: fb993608316@ubuntu:/$ ifconfig eth0 Link encap:Ethernet ...
[转帖]Linux中的15个基本‘ls’命令示例
Linux中的15个基本‘ls’命令示例 https://linux.cn/article-5109-1.html ls -lt 和 ls -ltr 来查看文件新旧顺序. list time rese ...
Win10 登陆密码不正确（安全模式仍然启动不了）
今天朋友重启Win10后,登陆密码显示不正确,是用了很多方法都不行然后就瞎捣鼓就进去进入BIOS将启动模式调为USB模式重启启动不了后再改回系统启动就进去了(好神奇)
LODOP设置打印设计返回JS代码是变量
前面有一篇博文是介绍JS模版的加载和赋值,赋值有两种,详细可查看本博客的那篇博文:LodopJS代码模版的加载和赋值简单来说,就是打印项的值是变量,在添加打印项前进行赋值:打印项的值是字符串,给打印项 ...
luogu P1613 跑路
一开始看这道题时,发现是最短路,可是搜的又是倍增的题无可分说这是倍增+最短路但是Dijkstra,SPFA我又不熟,可是看了数据范围心中萌生一种用Floyd做的方法不扯了先设一个三维bool数组 ...
wireshark分析dhcp过程
---恢复内容开始--- DHCP DHCP(Dynamic Host Configuration Protocol)是一个用于主机动态获取IP地址的配置解析,使用UDP报文传送,端口号为67何68 ...
mysql慢查询日志按天切割归纳
问题描述: mysql开启慢查询功能,再正常不过,那么存在这样一种情况:慢查询写入的文件位置和文件名是指定好的,如果慢查询时间设定严苛,不出意外,记录慢查询的单个文件大小会日益增大,几十兆或者上百兆, ...
hdu 2829 Lawrence(四边形不等式优化dp)
T. E. Lawrence was a controversial figure during World War I. He was a British officer who served in ...

【原创】大叔问题定位分享（20）hdfs文件create写入正常，append写入报错

【原创】大叔问题定位分享（20）hdfs文件create写入正常，append写入报错的更多相关文章

随机推荐

热门专题