kafka源码分析之二客户端分析

客户端由两种：生产者和消费者

1. 生产者

先看一下生产者的构造方法：

private KafkaProducer(ProducerConfig config, Serializer<K> keySerializer, Serializer<V> valueSerializer) {

        try {

            log.trace("Starting the Kafka producer");

            Map<String, Object> userProvidedConfigs = config.originals();

            this.producerConfig = config;

            this.time = new SystemTime();

            MetricConfig metricConfig = new MetricConfig().samples(config.getInt(ProducerConfig.METRICS_NUM_SAMPLES_CONFIG))

                    .timeWindow(config.getLong(ProducerConfig.METRICS_SAMPLE_WINDOW_MS_CONFIG),

                            TimeUnit.MILLISECONDS);

            clientId = config.getString(ProducerConfig.CLIENT_ID_CONFIG);

            if (clientId.length() <= 0)

                clientId = "producer-" + PRODUCER_CLIENT_ID_SEQUENCE.getAndIncrement();

            List<MetricsReporter> reporters = config.getConfiguredInstances(ProducerConfig.METRIC_REPORTER_CLASSES_CONFIG,

                    MetricsReporter.class);

            reporters.add(new JmxReporter(JMX_PREFIX));

            this.metrics = new Metrics(metricConfig, reporters, time);

            this.partitioner = config.getConfiguredInstance(ProducerConfig.PARTITIONER_CLASS_CONFIG, Partitioner.class);

            long retryBackoffMs = config.getLong(ProducerConfig.RETRY_BACKOFF_MS_CONFIG);

            this.metadata = new Metadata(retryBackoffMs, config.getLong(ProducerConfig.METADATA_MAX_AGE_CONFIG));

            this.maxRequestSize = config.getInt(ProducerConfig.MAX_REQUEST_SIZE_CONFIG);

            this.totalMemorySize = config.getLong(ProducerConfig.BUFFER_MEMORY_CONFIG);

            this.compressionType = CompressionType.forName(config.getString(ProducerConfig.COMPRESSION_TYPE_CONFIG));

            /* check for user defined settings.

             * If the BLOCK_ON_BUFFER_FULL is set to true,we do not honor METADATA_FETCH_TIMEOUT_CONFIG.

             * This should be removed with release 0.9 when the deprecated configs are removed.

             */

            if (userProvidedConfigs.containsKey(ProducerConfig.BLOCK_ON_BUFFER_FULL_CONFIG)) {

                log.warn(ProducerConfig.BLOCK_ON_BUFFER_FULL_CONFIG + " config is deprecated and will be removed soon. " +

                        "Please use " + ProducerConfig.MAX_BLOCK_MS_CONFIG);

                boolean blockOnBufferFull = config.getBoolean(ProducerConfig.BLOCK_ON_BUFFER_FULL_CONFIG);

                if (blockOnBufferFull) {

                    this.maxBlockTimeMs = Long.MAX_VALUE;

                } else if (userProvidedConfigs.containsKey(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG)) {

                    log.warn(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG + " config is deprecated and will be removed soon. " +

                            "Please use " + ProducerConfig.MAX_BLOCK_MS_CONFIG);

                    this.maxBlockTimeMs = config.getLong(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG);

                } else {

                    this.maxBlockTimeMs = config.getLong(ProducerConfig.MAX_BLOCK_MS_CONFIG);

                }

            } else if (userProvidedConfigs.containsKey(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG)) {

                log.warn(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG + " config is deprecated and will be removed soon. " +

                        "Please use " + ProducerConfig.MAX_BLOCK_MS_CONFIG);

                this.maxBlockTimeMs = config.getLong(ProducerConfig.METADATA_FETCH_TIMEOUT_CONFIG);

            } else {

                this.maxBlockTimeMs = config.getLong(ProducerConfig.MAX_BLOCK_MS_CONFIG);

            }

            /* check for user defined settings.

             * If the TIME_OUT config is set use that for request timeout.

             * This should be removed with release 0.9

             */

            if (userProvidedConfigs.containsKey(ProducerConfig.TIMEOUT_CONFIG)) {

                log.warn(ProducerConfig.TIMEOUT_CONFIG + " config is deprecated and will be removed soon. Please use " +

                        ProducerConfig.REQUEST_TIMEOUT_MS_CONFIG);

                this.requestTimeoutMs = config.getInt(ProducerConfig.TIMEOUT_CONFIG);

            } else {

                this.requestTimeoutMs = config.getInt(ProducerConfig.REQUEST_TIMEOUT_MS_CONFIG);

            }

            Map<String, String> metricTags = new LinkedHashMap<String, String>();

            metricTags.put("client-id", clientId);

            this.accumulator = new RecordAccumulator(config.getInt(ProducerConfig.BATCH_SIZE_CONFIG),

                    this.totalMemorySize,

                    this.compressionType,

                    config.getLong(ProducerConfig.LINGER_MS_CONFIG),

                    retryBackoffMs,

                    metrics,

                    time,

                    metricTags);

            List<InetSocketAddress> addresses = ClientUtils.parseAndValidateAddresses(config.getList(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG));

            this.metadata.update(Cluster.bootstrap(addresses), time.milliseconds());

            ChannelBuilder channelBuilder = ClientUtils.createChannelBuilder(config.values());

            NetworkClient client = new NetworkClient(

                    new Selector(config.getLong(ProducerConfig.CONNECTIONS_MAX_IDLE_MS_CONFIG), this.metrics, time, "producer", metricTags, channelBuilder),

                    this.metadata,

                    clientId,

                    config.getInt(ProducerConfig.MAX_IN_FLIGHT_REQUESTS_PER_CONNECTION),

                    config.getLong(ProducerConfig.RECONNECT_BACKOFF_MS_CONFIG),

                    config.getInt(ProducerConfig.SEND_BUFFER_CONFIG),

                    config.getInt(ProducerConfig.RECEIVE_BUFFER_CONFIG),

                    this.requestTimeoutMs, time);

            this.sender = new Sender(client,

                    this.metadata,

                    this.accumulator,

                    config.getInt(ProducerConfig.MAX_REQUEST_SIZE_CONFIG),

                    (short) parseAcks(config.getString(ProducerConfig.ACKS_CONFIG)),

                    config.getInt(ProducerConfig.RETRIES_CONFIG),

                    this.metrics,

                    new SystemTime(),

                    clientId,

                    this.requestTimeoutMs);

            String ioThreadName = "kafka-producer-network-thread" + (clientId.length() > 0 ? " | " + clientId : "");

            this.ioThread = new KafkaThread(ioThreadName, this.sender, true);

            this.ioThread.start();

            this.errors = this.metrics.sensor("errors");

            if (keySerializer == null) {

                this.keySerializer = config.getConfiguredInstance(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG,

                        Serializer.class);

                this.keySerializer.configure(config.originals(), true);

            } else {

                config.ignore(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG);

                this.keySerializer = keySerializer;

            }

            if (valueSerializer == null) {

                this.valueSerializer = config.getConfiguredInstance(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG,

                        Serializer.class);

                this.valueSerializer.configure(config.originals(), false);

            } else {

                config.ignore(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG);

                this.valueSerializer = valueSerializer;

            }

            config.logUnused();

            AppInfoParser.registerAppInfo(JMX_PREFIX, clientId);

            log.debug("Kafka producer started");

        } catch (Throwable t) {

            // call close methods if internal objects are already constructed

            // this is to prevent resource leak. see KAFKA-2121

            close(0, TimeUnit.MILLISECONDS, true);

            // now propagate the exception

            throw new KafkaException("Failed to construct kafka producer", t);

        }

    }

很多代码是读取配置文件，但红色部分才是主要：

调用Sender线程的run方法

/**

     * Run a single iteration of sending

     *

     * @param now

     *            The current POSIX time in milliseconds

     */

    public void run(long now) {

        Cluster cluster = metadata.fetch();

        // get the list of partitions with data ready to send

        RecordAccumulator.ReadyCheckResult result = this.accumulator.ready(cluster, now);

        // if there are any partitions whose leaders are not known yet, force metadata update

        if (result.unknownLeadersExist)

            this.metadata.requestUpdate();

        // remove any nodes we aren't ready to send to

        Iterator<Node> iter = result.readyNodes.iterator();

        long notReadyTimeout = Long.MAX_VALUE;

        while (iter.hasNext()) {

            Node node = iter.next();

            if (!this.client.ready(node, now)) {

                iter.remove();

                notReadyTimeout = Math.min(notReadyTimeout, this.client.connectionDelay(node, now));

            }

        }

        // create produce requests

        Map<Integer, List<RecordBatch>> batches = this.accumulator.drain(cluster,

                                                                         result.readyNodes,

                                                                         this.maxRequestSize,

                                                                         now);

        List<RecordBatch> expiredBatches = this.accumulator.abortExpiredBatches(this.requestTimeout, cluster, now);

        // update sensors

        for (RecordBatch expiredBatch : expiredBatches)

            this.sensors.recordErrors(expiredBatch.topicPartition.topic(), expiredBatch.recordCount);

        sensors.updateProduceRequestMetrics(batches);

        List<ClientRequest> requests = createProduceRequests(batches, now);

        // If we have any nodes that are ready to send + have sendable data, poll with 0 timeout so this can immediately

        // loop and try sending more data. Otherwise, the timeout is determined by nodes that have partitions with data

        // that isn't yet sendable (e.g. lingering, backing off). Note that this specifically does not include nodes

        // with sendable data that aren't ready to send since they would cause busy looping.

        long pollTimeout = Math.min(result.nextReadyCheckDelayMs, notReadyTimeout);

        if (result.readyNodes.size() > 0) {

            log.trace("Nodes with data ready to send: {}", result.readyNodes);

            log.trace("Created {} produce requests: {}", requests.size(), requests);

            pollTimeout = 0;

        }

        for (ClientRequest request : requests)

            client.send(request, now);

        // if some partitions are already ready to be sent, the select time would be 0;

        // otherwise if some partition already has some data accumulated but not ready yet,

        // the select time will be the time difference between now and its linger expiry time;

        // otherwise the select time will be the time difference between now and the metadata expiry time;

        this.client.poll(pollTimeout, now);

    }

调用NetworkClient的send方法

    /**

     * Queue up the given request for sending. Requests can only be sent out to ready nodes.

     *

     * @param request The request

     * @param now The current timestamp

     */

    @Override

    public void send(ClientRequest request, long now) {

        String nodeId = request.request().destination();

        if (!canSendRequest(nodeId))

            throw new IllegalStateException("Attempt to send a request to node " + nodeId + " which is not ready.");

        doSend(request, now);

    }

    private void doSend(ClientRequest request, long now) {

        request.setSendTimeMs(now);

        this.inFlightRequests.add(request);

        selector.send(request.request());

    }

selector调用channel来发送：

    /**

     * Queue the given request for sending in the subsequent {@poll(long)} calls

     * @param send The request to send

     */

    public void send(Send send) {

        KafkaChannel channel = channelOrFail(send.destination());

        try {

            channel.setSend(send);

        } catch (CancelledKeyException e) {

            this.failedSends.add(send.destination());

            close(channel);

        }

    }

调用channel的send方法：

    public void setSend(Send send) {

        if (this.send != null)

            throw new IllegalStateException("Attempt to begin a send operation with prior send operation still in progress.");

        this.send = send;

        this.transportLayer.addInterestOps(SelectionKey.OP_WRITE);

    }

这里TransportLayer封装了通信的细节

2. 消费者

kafka源码分析之二客户端分析的更多相关文章

Kafka源码解析（二）---Log分析
上一篇文章讲了LogSegment和Log的初始化,这篇来讲讲Log的主要操作有哪些. 一般来说Log 的常见操作分为 4 大部分. 高水位管理操作日志段管理关键位移值管理读写操作其中关键位移 ...
Kafka源码分析(二) - 生产者
系列文章目录 https://zhuanlan.zhihu.com/p/367683572 目录系列文章目录一. 使用方式 step 1: 设置必要参数 step 2: 创建KafkaProduc ...
Kafka源码分析(一) - 概述
系列文章目录 https://zhuanlan.zhihu.com/p/367683572 目录系列文章目录一. 实际问题二. 什么是Kafka, 如何解决这些问题的三. 基本原理 1. 基本 ...
kafka源码分析之一server启动分析
0. 关键概念关键概念 Concepts Function Topic 用于划分Message的逻辑概念,一个Topic可以分布在多个Broker上. Partition 是Kafka中横向扩展和一 ...
# Volley源码解析（二）没有缓存的情况下直接走网络请求源码分析#
Volley源码解析(二) 没有缓存的情况下直接走网络请求源码分析 Volley源码一共40多个类和接口.除去一些工具类的实现,核心代码只有20多个类.所以相对来说分析起来没有那么吃力.但是要想分析透 ...
Kafka源码分析系列-目录(收藏不迷路)
持续更新中,敬请关注! 目录 <Kafka源码分析>系列文章计划按"数据传递"的顺序写作,即:先分析生产者,其次分析Server端的数据处理,然后分析消费者,最后再补充 ...
Kafka源码分析(三) - Server端 - 消息存储
系列文章目录 https://zhuanlan.zhihu.com/p/367683572 目录系列文章目录一. 业务模型 1.1 概念梳理 1.2 文件分析 1.2.1 数据目录 1.2.2 . ...
Apache Kafka源码分析 – Broker Server
1. Kafka.scala 在Kafka的main入口中startup KafkaServerStartable, 而KafkaServerStartable这是对KafkaServer的封装 1: ...
Spring源码分析之IOC的三种常见用法及源码实现（二）
Spring源码分析之IOC的三种常见用法及源码实现(二) 回顾上文我们研究的是 AnnotationConfigApplicationContext annotationConfigApplica ...

随机推荐

（转）python requests的安装与简单运用
requests是python的一个HTTP客户端库,跟urllib,urllib2类似,那为什么要用requests而不用urllib2呢?官方文档中是这样说明的: python的标准库urllib ...
markdown预览-快速入门
最近要写文档,领导指定用markdown. 这个两三年前用过两次的神器工具,都忘的差不多了. 为了熟练一点这个技能,决定好好的重新学一次. 于是乎:看快速入门文档 ...30分钟...看完文档发现要来 ...
java并发之volatile
volatile是轻量级的synchronized,它在多处理器应用开发中保证了共享变量的“可见性”(可见性指当一个线程修改共享变量后,其它线程可以看到这个修改). volatile如果使用合理会比s ...
Opencv算法学习二
1.直方图:图片中像素值分布情况的坐标图. 直方图均衡化:按一定规律拉伸像素值,提高像素值少的点,增加原图的对比度,使人感觉更清晰的函数. equalizeHist( src, dst ); 2.ha ...
HTML基础篇之视频音频
<audio src="song.ogg" controls="controls"></audio> <!-- 兼容的音频格式og ...
IOS网络第七天WebView-02WebView和网页的交互2，删除大众点评多余文字，加上蒙版进度
************ #import "HMViewController.h" @interface HMViewController () <UIWebViewDele ...
[翻译]理解Swift中的Optional
原文出处:Understanding Optionals in Swift 苹果新的Swift编程语言带来了一些新的技巧,能使软件开发比以往更方便.更安全.然而,一个很有力的特性Optional,在你 ...
IE10，11下_doPostBack未定义错误的解决方法
出现的原因 .NET2.0和.NET4.0一起发布的浏览器定义文件中有一个错误,它们保存相当一部分浏览器版本的定义.但是浏览器的有些版本(比如IE10,11)则不再在这个范围之内.因此,ASP.NET ...
Google分布式构建软件之四：分发构建结果
注:本文英文原文在google开发者工具组的博客上[需要FQ],以下是我的翻译,欢迎转载,但请尊重作者版权,注名原文地址. 之前的文章,介绍了Google在分布式构建软件过程中,如何把构建过程分发到许 ...
玩转Windows服务系列——Windows服务小技巧
伴随着研究Windows服务,逐渐掌握了一些小技巧,现在与大家分享一下. 将Windows服务转变为控制台程序由于默认的Windows服务程序,编译后为Win32的窗口程序.我们在程序启动或运行过程 ...

kafka源码分析之二客户端分析

kafka源码分析之二客户端分析的更多相关文章

随机推荐

热门专题