Download Article and Source Code

Download Integrating Intel® Media SDK with FFmpeg for mux/demuxing and audio encode/decode usages (PDF 568KB)
Download Source Code. (ZIP 98KB) (Note: Licensing terms match Media SDK 2012)

Introduction

The provided samples intend to illustrate how Intel® Media SDK can be used together with the popular FFmpeg suite of components to perform container muxing and demuxing (splitting). The samples also showcase integration of rudimentary FFmpeg audio decode and encode.

The sample projects are based on the Intel Media SDK 2012 R3 samples (http://software.intel.com/en-us/articles/vcsource-tools-media-sdk/) with only small modifications to original code such as adding new mux/demux command line directives.

Modified areas of the code with new integration code are tagged with

“// =========== ffmpeg … integration ============”.

FFmpeg integration functionality resides in the FFMPEGWriter and FFMPEGReader classes, which are subclasses of the generic Intel Media SDK sample file reader/writer CSmplBitstreamWriter and CSmplBitstreamReader classes, respectively.

To enable simplistic implementation of FFmpeg audio processing functionality, just enable the DECODE_AUDIO orENCODE_AUDIO define directive.

Since the provided samples are based on the Intel Media SDK decode/encode samples where video stream processing is in focus, the integration of FFmpeg container handling and audio processing may seem somewhat artificial. However, the samples should sufficiently illustrate the required Intel Media SDK and FFmpeg integration points making it quite straightforward to adapt to real-life applications, which would likely entail more sophisticated approaches such as threading, etc.

The provided samples illustrate the following use cases:

  1. Demux “mp4″ container file containing AVC(H.264) video stream and any FFmpeg supported audio stream. Decode the AVC(H.264) video stream and audio stream.
    (using “-split” option)
  2. Encode to AVC(H.264) video stream and AAC audio stream. Mux streams into “mp4″ container.
    (using “-mux” option)
  3. Demux “mpeg” container file containing MPEG-2 video stream and any FFmpeg supported audio stream. Decode the MPEG-2 video stream and audio stream.
    (using “-split” option)
  4. Encode to MPEG-2 video stream and MPEG audio stream. Mux streams into “mpeg” container.
    (using “-mux” option)
  5. Encode to AVC(H.264) or MPEG-2 video stream and Ogg Vorbis audio stream. Mux streams into “mkv” container (Matroska).
    (using “-mkv” option)

The audio encode part of the sample assumes input of raw PCM data as follows:

  • Using “mp4″ or “mpeg” container: 16-bit signed integer samples, 2 channels
  • Using “mkv” container: 32-bit float samples, 2 channels

You can generate raw PCM audio input file by demuxing a 2 channel @ 44100Hz audio stream using the provided decode sample. The encode sample can also easily be modified to support other audio input configurations. You can use such tools as “Audacity” to convert to/from different raw formats.

The current set of samples were tested and integrated using build “2012-08-27″ of FFmpeg from: http://ffmpeg.zeranoe.com/builds/

Please understand that this set of samples provides a snapshot of FFmpeg integration. The FFmpeg interfaces may change at any time, thus requiring modifications to the integration code.

Project file structure

Folder Content Notes
sample_decode – ffmpeg – sample_decode.sln

– include

– src

Contains decode/demux sample project. FFMPEGReader class located in pipeline_decode.h/cpp
sample_encode – ffmpeg – sample_encode.sln

– include

– src

Contains encode/mux sample project. FFMPEGWriter class located in pipeline_encode.h/cpp
ffmpeg – include

– lib_win32

– lib_x64

Contains FFmpeg component include files and pre-built binaries. See below “Requirements” section for details
sample_common – include

– src

Contains common Intel® Media SDK sample code functionality (this is a copy of the Intel Media SDK 2012 R3 sample_common folder)

Requirements

Intel Media SDK

The sample projects depend on Intel Media SDK API include files and dispatcher library. To be able to build the provided sample code, Intel Media SDK 2012 R2 or later must be installed. The SDK can be found here: http://software.intel.com/en-us/articles/vcsource-tools-media-sdk/

For more details about the Intel Media SDK samples, and Media SDK specific requirements or limitations, please refer to documentation and manuals of the SDK package.

FFmpeg

The provided sample projects do not include FFmpeg include and binary files. To be able to build the projects the required FFmpeg files for Windows* must therefore be downloaded from http://ffmpeg.zeranoe.com/builds/ (other Windows builds of FFmpeg also exists, but these have not been verified with this integration code)

Download the following packages:

  1. “<build_id>-<arch>-dev.7z” archive file (from “<arch>-bit Builds (Dev)” section on the webpage)
  2. “<build_id>-<arch>-shared.7z” archive file (from “<arch>-bit Builds (Shared)” section on the webpage)

From the above packages:

  • Copy content of “include” folder in (1) to “ffmpeg/include” folder
  • Copy *.lib content of “lib” folder in (1) to “ffmpeg/lib_<arch>” folder
  • Copy *.dll content of “bin” folder in (2) to “ffmpeg/lib_<arch>” folder

Note: “<arch>” is either win32 or x64.

msinttypes

The folder “ffmpeg/include/msinttypes-r26″ contains parameter type bridge required to be able to build FFmpeg project in Microsoft Visual Studio.

Download the required include files from http://code.google.com/p/msinttypes/

Note: The solution/projects were created using Microsoft Visual Studio* 2010, but there is nothing preventing the environment to be back-ported to older versions of Visual Studio, if needed.

How to build

  1. Open the solution (“.sln”) file in either “sample_decode – ffmpeg” or “sample_encode – ffmpeg” folder
  2. Select desired build configuration: Debug/Release, Win32/x64
  3. Build the solution

How to execute workloads

Below are some example command line workloads.

  1. Demux mp4 container and decode the AVC(H.264) video stream. If the container includes audio stream, it will be decoded into “audio.dat” (assuming sample has been built with “DECODE_AUDIO”)

     
    1
    <b>sample_decode.exe h264 –i file.mp4 -hw -d3d -split –o video.yuv</b>
  2. Demux mpeg container and decode the MPEG-2 video stream. If the container includes audio stream, it will be decoded into “audio.dat” (assuming the sample was built with “DECODE_AUDIO”)
     
    1
    <b>sample_decode.exe mpeg2 –i file.mpg -hw -d3d -split –o video.yuv</b>
  3. Encode raw YUV video data into AVC(H.264) video stream. Mux into mp4 container. If the sample was built with “ENCODE_AUDIO” the audio will be encoded using AAC encoder (raw audio PCM data read from file “audio.dat”).
     
    1
    <b>sample_encode.exe h264 -i video.yuv -w 640 -h 480 -o out.mp4 -hw -d3d -mux -b 1000 -f 30</b>
  4. Encode raw YUV video data into MPEG-2 video stream. Mux into mpeg container. If the sample was built with “ENCODE_AUDIO” the audio will be encoded using mpeg encoder (raw audio PCM data read from file “audio.dat”).
     
    1
    <b>sample_encode.exe mpeg2 -i video.yuv -w 640 -h 480 -o out.mpg -hw -d3d -mux -b 1000 -f 30</b>
  5. Encode raw YUV video data into the AVC(H.264) video stream. Mux into Matroska (mkv) container. If the sample was built with “ENCODE_AUDIO” the audio will be encoded using Ogg Vorbis encoder (raw audio PCM data read from file “audio.dat”).
     
    1
    <b>sample_encode.exe h264 -i video.yuv -w 640 -h 480 -o out.mkv -hw -d3d -mkv -b 1000 -f 30</b>

Known issues

– MPEG2 muxing results in “buffer underflow” warning. However, the warning does not seem to impact the content or validity of the resulting mpeg container.

References

This post comes from: http://software.intel.com/en-us/articles/integrating-intel-media-sdk-with-ffmpeg-for-muxdemuxing-and-audio-encodedecode-usages

转自:http://rg4.net/archives/966.html

(转)Integrating Intel® Media SDK with FFmpeg for mux/demuxing and audio encode/decode usages 1的更多相关文章

  1. Intel® Media SDK Media Samples Linux 学习笔记(转)

    最近折腾intel media sdk,主要硬件平台是在HD4600的核显上进行测试,intel media sdk是intel提供的一种基于核显的硬件编解码的解决方案,之前已经有使用ffmpeg进行 ...

  2. Intel Media SDK H264 encoder GOP setting

    1 I帧,P帧,B帧,IDR帧,NAL单元 I frame:帧内编码帧,又称intra picture,I 帧通常是每个 GOP(MPEG 所使用的一种视频压缩技术)的第一个帧,经过适度地压缩,做为随 ...

  3. Getting Started with the Intel Media SDK

    By Gael Hofemeier on March 19, 2015 Follow Gael on Twitter: @GaelHof Media SDK Developer’s Guide Med ...

  4. Intel® Media SDK(一)

    A cross-platform API for developing media applications on Windows* Fast video playback, encode, proc ...

  5. Intel Media SDK安装步骤

    !!!(gcc/g++版本要在4.8以上,本人使用的是5.4版本) 要先安装依赖,按以下步骤依次执行 1.LIBVA git clone https://github.com/intel/libva. ...

  6. Intel Media SDK 性能測试

    经过測试,发如今windows 7上 i3 i5 上Intel Media SDK 1080P仅仅能解6路,720P仅仅能解8路, 不知大家有没有測试过?

  7. 微软商店一直安装不上Intel Media SDK DFP

    具体表现为一直安装失败,但是下载进度条一直在,无法去除. 此方法来自 https://answers.microsoft.com/en-us/windows/forum/all/error-code- ...

  8. Intel® Media Server Studio Support

    复制自网址:https://software.intel.com/en-us/intel-media-server-studio-support/code-samples Code Samples M ...

  9. How to run Media SDK samples on Skylake【转载】

    In the last few days, we have seen lot of concern for using Intel® Media 2016 on 6th generation Inte ...

随机推荐

  1. [Windows Azure] How to Create and Deploy a Cloud Service?

    The Windows Azure Management Portal provides two ways for you to create and deploy a cloud service: ...

  2. CSS中Zen Coding

    值别名 有几个常用的别名: p → % e → em x → ex 可以用这些别名来代替完整的单位: w100p → width: 100% m10p30e5x → margin: 10% 30em ...

  3. MonoBehaviour类Invoke, Coroutine

    异步函数 在一个方法执行时调用另一个方法.而被调用的方法或者其中的某些语句不是立刻执行,而是过一段时间后才执行. MonoBehaviour提供了两种异步方法 调用(Invoke) 协程(Corout ...

  4. Spark SQL inferSchema实现原理探微(Python)【转】

    使用Spark SQL的基础是“注册”(Register)若干表,表的一个重要组成部分就是模式,Spark SQL提供两种选项供用户选择:   (1)applySchema     applySche ...

  5. 【Android】Android消息处理机制

    三大核心类 android的消息处理有三个核心类:Looper,Handler和Message. 其实还有一个Message Queue(消息队列),但是MQ被封装到Looper里面了 Looper ...

  6. 【qt】QT 的信号与槽机制

    QT 是一个跨平台的 C++ GUI 应用构架,它提供了丰富的窗口部件集,具有面向对象.易于扩展.真正的组件编程等特点,更为引人注目的是目前 Linux 上最为流行的 KDE 桌面环境就是建立在 QT ...

  7. 多媒体文件格式之TS

    [时间:2016-07] [状态:Open] TS流是MPEG-2标准中定义一种用于直播的码流结构,具有很好的容错能力.所有跟TS相关的标准可以从ISO/IEC_13818-1中找到. 通常TS流的后 ...

  8. django 利用PIL 保存图片

    在使用django时不知道怎么保存图片,又不想用它的form ,在网上找了许久,终于找到个解决方案,利用PIL.image 将POST上来的图片保存到media目录下,然后再修改models from ...

  9. python.pandas read and write CSV file

    #read and write csv of pandasimport pandas as pd goog =pd.read_csv(r'C:\python\demo\LiaoXueFeng\data ...

  10. [文件]Linux文本处理常用命令总结

    转自:https://www.cnblogs.com/sheeva/p/6406285.html 引子 作为一个偏爱windows的程序员,以前做文本处理的时候总是喜欢在windows下用notepa ...