Kafka 源代码分析之FileMessageSet

这里主要分析FileMessageSet类

这个类主要是管理log消息的内存对象和文件对象的类.源代码文件在log目录下.这个类被LogSegment类代理调用用来管理分片.

下面是完整代码.代码比较简单.就不做过多说明了.这个类是MessageSet抽象类的实现类.

class FileMessageSet private[kafka](@volatile var file: File,

                                    private[log] val channel: FileChannel,

                                    private[log] val start: Int,

                                    private[log] val end: Int,

                                    isSlice: Boolean) extends MessageSet with Logging {

  /* the size of the message set in bytes */

  private val _size =

    if(isSlice)

      new AtomicInteger(end - start) // don't check the file size if this is just a slice view

    else

      new AtomicInteger(math.min(channel.size().toInt, end) - start)

  /* if this is not a slice, update the file pointer to the end of the file */

  if (!isSlice)

    /* set the file position to the last byte in the file */

    channel.position(channel.size)

  /**

   * Create a file message set with no slicing.

   */

  def this(file: File, channel: FileChannel) =

    this(file, channel, start = 0, end = Int.MaxValue, isSlice = false)

  /**

   * Create a file message set with no slicing

   */

  def this(file: File) =

    this(file, Utils.openChannel(file, mutable = true))

  /**

   * Create a file message set with mutable option

   */

  def this(file: File, mutable: Boolean) = this(file, Utils.openChannel(file, mutable))

  /**

   * Create a slice view of the file message set that begins and ends at the given byte offsets

   */

  def this(file: File, channel: FileChannel, start: Int, end: Int) =

    this(file, channel, start, end, isSlice = true)

  /**

   * Return a message set which is a view into this set starting from the given position and with the given size limit.

   *

   * If the size is beyond the end of the file, the end will be based on the size of the file at the time of the read.

   *

   * If this message set is already sliced, the position will be taken relative to that slicing.

   *

   * @param position The start position to begin the read from

   * @param size The number of bytes after the start position to include

   *

   * @return A sliced wrapper on this message set limited based on the given position and size

   */

  def read(position: Int, size: Int): FileMessageSet = {  //返回读取段对象

    if(position < 0)

      throw new IllegalArgumentException("Invalid position: " + position)

    if(size < 0)

      throw new IllegalArgumentException("Invalid size: " + size)

    new FileMessageSet(file,

                       channel,

                       start = this.start + position,

                       end = math.min(this.start + position + size, sizeInBytes()))

  }

  /**

   * Search forward for the file position of the last offset that is greater than or equal to the target offset

   * and return its physical position. If no such offsets are found, return null.

   * @param targetOffset The offset to search for.

   * @param startingPosition The starting position in the file to begin searching from.

   */

  def searchFor(targetOffset: Long, startingPosition: Int): OffsetPosition = {  //搜索读写点的方法

    var position = startingPosition

    val buffer = ByteBuffer.allocate(MessageSet.LogOverhead)

    val size = sizeInBytes()

    while(position + MessageSet.LogOverhead < size) {

      buffer.rewind()

      channel.read(buffer, position)

      if(buffer.hasRemaining)

        throw new IllegalStateException("Failed to read complete buffer for targetOffset %d startPosition %d in %s"

                                        .format(targetOffset, startingPosition, file.getAbsolutePath))

      buffer.rewind()

      val offset = buffer.getLong()

      if(offset >= targetOffset)

        return OffsetPosition(offset, position)

      val messageSize = buffer.getInt()

      if(messageSize < Message.MessageOverhead)

        throw new IllegalStateException("Invalid message size: " + messageSize)

      position += MessageSet.LogOverhead + messageSize

    }

    null

  }

  /**

   * Write some of this set to the given channel.

   * @param destChannel The channel to write to.

   * @param writePosition The position in the message set to begin writing from.

   * @param size The maximum number of bytes to write

   * @return The number of bytes actually written.

   */

  def writeTo(destChannel: GatheringByteChannel, writePosition: Long, size: Int): Int = {  //主要写方法

    // Ensure that the underlying size has not changed.

    val newSize = math.min(channel.size().toInt, end) - start

    if (newSize < _size.get()) {

      throw new KafkaException("Size of FileMessageSet %s has been truncated during write: old size %d, new size %d"

        .format(file.getAbsolutePath, _size.get(), newSize))

    }

    val bytesTransferred = channel.transferTo(start + writePosition, math.min(size, sizeInBytes), destChannel).toInt

    trace("FileMessageSet " + file.getAbsolutePath + " : bytes transferred : " + bytesTransferred

      + " bytes requested for transfer : " + math.min(size, sizeInBytes))

    bytesTransferred

  }

  /**

   * Get a shallow iterator over the messages in the set.

   */

  override def iterator() = iterator(Int.MaxValue)

  /**

   * Get an iterator over the messages in the set. We only do shallow iteration here.

   * @param maxMessageSize A limit on allowable message size to avoid allocating unbounded memory.

   * If we encounter a message larger than this we throw an InvalidMessageException.

   * @return The iterator.

   */

  def iterator(maxMessageSize: Int): Iterator[MessageAndOffset] = {  //主要构造器.

    new IteratorTemplate[MessageAndOffset] {

      var location = start

      val sizeOffsetBuffer = ByteBuffer.allocate(12)

      override def makeNext(): MessageAndOffset = {

        if(location >= end)

          return allDone()

        // read the size of the item

        sizeOffsetBuffer.rewind()

        channel.read(sizeOffsetBuffer, location)

        if(sizeOffsetBuffer.hasRemaining)

          return allDone()

        sizeOffsetBuffer.rewind()

        val offset = sizeOffsetBuffer.getLong()

        val size = sizeOffsetBuffer.getInt()

        if(size < Message.MinHeaderSize)

          return allDone()

        if(size > maxMessageSize)

          throw new InvalidMessageException("Message size exceeds the largest allowable message size (%d).".format(maxMessageSize))

        // read the item itself

        val buffer = ByteBuffer.allocate(size)

        channel.read(buffer, location + 12)

        if(buffer.hasRemaining)

          return allDone()

        buffer.rewind()

        // increment the location and return the item

        location += size + 12

        new MessageAndOffset(new Message(buffer), offset) //在这里做映射.同ByteBufferMessageSet里的实现方法类似.

      }

    }

  }

  /**

   * The number of bytes taken up by this file set

   */

  def sizeInBytes(): Int = _size.get()

  /**

   * Append these messages to the message set

   */

  def append(messages: ByteBufferMessageSet) {  //追加message的方法.被上层的append方法调用.

    val written = messages.writeTo(channel, 0, messages.sizeInBytes)

    _size.getAndAdd(written)

  }

  /**

   * Commit all written data to the physical disk

   */

  def flush() = {  //上层刷新方法的最终实现.

    channel.force(true)

  }

  /**

   * Close this message set

   */

  def close() {

    flush()

    channel.close()

  }

  /**

   * Delete this message set from the filesystem

   * @return True iff this message set was deleted.

   */

  def delete(): Boolean = {  //上层delete函数的最终实现方法

    Utils.swallow(channel.close()) //关闭内存数据

    file.delete()  //删除文件

  }

  /**

   * Truncate this file message set to the given size in bytes. Note that this API does no checking that the

   * given size falls on a valid message boundary.

   * @param targetSize The size to truncate to.

   * @return The number of bytes truncated off

   */

  def truncateTo(targetSize: Int): Int = {

    val originalSize = sizeInBytes

    if(targetSize > originalSize || targetSize < 0)

      throw new KafkaException("Attempt to truncate log segment to " + targetSize + " bytes failed, " +

                               " size of this log segment is " + originalSize + " bytes.")

    channel.truncate(targetSize)

    channel.position(targetSize)

    _size.set(targetSize)

    originalSize - targetSize

  }

  /**

   * Read from the underlying file into the buffer starting at the given position

   */

  def readInto(buffer: ByteBuffer, relativePosition: Int): ByteBuffer = {

    channel.read(buffer, relativePosition + this.start)

    buffer.flip()

    buffer

  }

  /**

   * Rename the file that backs this message set

   * @return true iff the rename was successful

   */

  def renameTo(f: File): Boolean = {

    val success = this.file.renameTo(f)

    this.file = f

    success

  }

}

object LogFlushStats extends KafkaMetricsGroup {

  val logFlushTimer = new KafkaTimer(newTimer("LogFlushRateAndTimeMs", TimeUnit.MILLISECONDS, TimeUnit.SECONDS))

}

Kafka 源代码分析之FileMessageSet的更多相关文章

Kafka 源代码分析之LogManager
这里分析kafka 0.8.2的LogManager logmanager是kafka用来管理log文件的子系统.源代码文件在log目录下. 这里会逐步分析logmanager的源代码.首先看clas ...
Kafka 源代码分析.
这里记录kafka源代码笔记.(代码版本是0.8.2.1) kafka的源代码如何下载.这里简单说一下. git clone https://git-wip-us.apache.org/repos/a ...
Kafka 源代码分析之LogSegment
这里分析kafka LogSegment源代码通过一步步分析LogManager,Log源代码之后就会发现,最终的log操作都在LogSegment上实现.LogSegment负责分片的读写恢复刷新 ...
kafka 源代码分析之Message（v0.10）
这里主要更新一下kafka 0.10.0版本的message消息格式的变化. message 的格式在0.10.0的版本里发生了一些变化(相对于0.8.2.1的版本)这里把0.10.0的message ...
Kafka 源代码分析之ByteBufferMessageSet
这里分析一下message的封装类ByteBufferMessageSet类 ByteBufferMessageSet类的源代码在源代码目录message目录下.这个类主要封装了message,mes ...
Kafka 源代码分析之Log
这里分析Log对象本身的源代码. Log类是一个topic分区的基础类.一个topic分区的所有基本管理动作.都在这个对象里完成.类源代码文件为Log.scala.在源代码log目录下. Log类是L ...
Kafka 源代码分析之Message
这里主要分析一下message的格式. 一条message的构成由以下部分组成 val CrcOffset = 0 //crc校验部分和字长 val CrcLength = 4 val MagicOf ...
Kafka 源代码分析之MessageSet
这里分析MessageSet类 MessageSet是一个抽象类,定义了一条log的一些接口和常量,FileMessageSet就是MessageSet类的实现类.一条日志中存储的log完整格式如下 ...
Kafka 源代码分析之log框架介绍
这里主要介绍log管理,读写相关的类的调用关系的介绍. 在围绕log的实际处理上.有很多层的封装和调用.这里主要介绍一下调用结构和顺序. 首先从LogManager开始. 调用关系简单如下:LogMa ...

随机推荐

swap与dd命令使用详解
处理交换文件和分区交换分区是系统RAM 的补充基本设置包括: 创建交换分区或者文件使用mkswap 写入特殊签名在/etc/fstab 文件中添加适当的条目使用swapon -a 挂载交换分 ...
Maven学习-Profile详解
Profile能让你为一个特殊的环境自定义一个特殊的构建:profile使得不同环境间构建的可移植性成为可能.Maven中的profile是一组可选的配置,可以用来设置或者覆盖配置默认值.有了prof ...
STM 8s 外部中断寄存器无法写入
虽然说单片机开发就是对手册的研究,但是开发过程中,还是要做些笔记的,方便以后注意那些坑. 工作要求所以接触了一下STM328s00f3这个芯片,配置外部中断的时候遇到了一点问题 PS:IAR这个开发软 ...
System.arraycopy(src, srcPos, dest, destPos, length) 与 Arrays.copyOf(original, newLength)区别
//System.arraycopy,只拷贝已存在的数组元素 int[] src = {0, 1, 2}; int[] dest = new int[3]; System.arraycopy(src, ...
VBS
1.msgbox脚本显示回车语法:msgbox"123"&vbcrlf&"456"
.NET中webservice如何使用，调用
webservice 只是"面向服务"编程的一种方式,现在把所有的方式都合在一起,就叫做WCF,,,,,, 1.创建 webservice服务,在web项目中添加"web ...
centos7搭建SVN+Apache+IF.svnadmin实现web管理SVN
阅读目录 1. 介绍 2. 软件准备 3. 建立SVN Server仓库 4. 配置安装PHP&IF.SVNadmin 5. 启动服务 1.介绍公司最近想把Windows server平台的 ...
利用formatter原理自动化参数化查询
前言:对于经常忙于服务端开发的小伙伴来说,与DB层打交道是在正常不过的事了,但是每次页面的查询条件新增往往意味着后端代码参数化同比增长,当然你可以不使用sqlhelper自带的参数化条件查询,可以直接 ...
JS实现鼠标移上去图片停止滚动移开恢复滚动效果
这是在做个人站的时候展示项目成果,因为不光需要展示,还需要介绍详细内容,就在滚动展示的地方做了这个效果以便于点开想要看的项目. 首先,要做的是一个需要滚动的区域.我前边写过一个关于图片循环滚动的示例, ...
今天重装系统后，Wdows更新提示“windows update当前无法检查更新，因为未运行服务。您可能需要重新启动计算机”
到百度搜了常用的解决方法,就是用命令提示符,但对我的情况不管用,提示“拒绝访问”.后来在08绿软站的一篇文章中找到了解决办法.原文如下(我本人也是用的第四种方法解决的): 试了下面几种解决方法,第四种 ...

Kafka 源代码分析之FileMessageSet

这里主要分析FileMessageSet类

Kafka 源代码分析之FileMessageSet的更多相关文章

随机推荐

热门专题