This document describes the codecs (decoders and encoders) provided by the libavcodec library.
libavcodec provides some generic global options, which can be set on all the encoders and decoders. In addition each codec may support so-called private options, which are specific for a given codec.
Sometimes, a global option may only affect a specific kind of codec, and may be unsensical or ignored by another, so you need to be aware of the meaning of the specified options. Also some options are meant only for decoding or encoding.
Options may be set by specifying -option value in the
FFmpeg tools, or by setting the value explicitly in the
AVCodecContext
options or using the ‘libavutil/opt.h’ API
for programmatic use.
The list of supported options follow:
Set bitrate in bits/s. Default value is 200K.
Set audio bitrate (in bits/s). Default value is 128K.
Set video bitrate tolerance (in bits/s). In 1-pass mode, bitrate tolerance specifies how far ratecontrol is willing to deviate from the target average bitrate value. This is not related to min/max bitrate. Lowering tolerance too much has an adverse effect on quality.
Set generic flags.
Possible values:
Use four motion vector by macroblock (mpeg4).
Use 1/4 pel motion compensation.
Use loop filter.
Use fixed qscale.
Use gmc.
Always try a mb with mv=<0,0>.
Use internal 2pass ratecontrol in first pass mode.
Use internal 2pass ratecontrol in second pass mode.
Only decode/encode grayscale.
Do not draw edges.
Set error[?] variables during encoding.
Normalize adaptive quantization.
Use interlaced DCT.
Force low delay.
Place global headers in extradata instead of every keyframe.
Use only bitexact stuff (except (I)DCT).
Apply H263 advanced intra coding / mpeg4 ac prediction.
Deprecated, use mpegvideo private options instead.
Deprecated, use mpegvideo private options instead.
Apply interlaced motion estimation.
Use closed gop.
Deprecated, currently unused.
Set motion estimation method.
Possible values:
zero motion estimation (fastest)
full motion estimation (slowest)
EPZS motion estimation (default)
esa motion estimation (alias for full)
tesa motion estimation
dia motion estimation (alias for epzs)
log motion estimation
phods motion estimation
X1 motion estimation
hex motion estimation
umh motion estimation
iter motion estimation
Set extradata size.
Set codec time base.
It is the fundamental unit of time (in seconds) in terms of which frame timestamps are represented. For fixed-fps content, timebase should be 1/framerate and timestamp increments should be identically 1.
Set the group of picture size. Default value is 12.
Set audio sampling rate (in Hz).
Set number of audio channels.
Set cutoff bandwidth.
Set audio frame size.
Each submitted frame except the last must contain exactly frame_size samples per channel. May be 0 when the codec has CODEC_CAP_VARIABLE_FRAME_SIZE set, in that case the frame size is not restricted. It is set by some decoders to indicate constant frame size.
Set the frame number.
Set video quantizer scale compression (VBR). It is used as a constant in the ratecontrol equation. Recommended range for default rc_eq: 0.0-1.0.
Set video quantizer scale blur (VBR).
Set min video quantizer scale (VBR). Must be included between -1 and 69, default value is 2.
Set max video quantizer scale (VBR). Must be included between -1 and 1024, default value is 31.
Set max difference between the quantizer scale (VBR).
Set max number of B frames.
Set qp factor between P and B frames.
Set ratecontrol method.
Set strategy to choose between I/P/B-frames.
Set RTP payload size in bytes.
Workaround not auto detected encoder bugs.
Possible values:
some old lavc generated msmpeg4v3 files (no autodetection)
Xvid interlacing bug (autodetected if fourcc==XVIX)
(autodetected if fourcc==UMP4)
padding bug (autodetected)
illegal vlc bug (autodetected per fourcc)
old standard qpel (autodetected per fourcc/version)
direct-qpel-blocksize bug (autodetected per fourcc/version)
edge padding bug (autodetected per fourcc/version)
Workaround various bugs in microsoft broken decoders.
trancated frames
Set single coefficient elimination threshold for luminance (negative values also consider DC coefficient).
Set single coefficient elimination threshold for chrominance (negative values also consider dc coefficient)
Specify how strictly to follow the standards.
Possible values:
strictly conform to a older more strict version of the spec or reference software
strictly conform to all the things in the spec no matter what consequences
allow unofficial extensions
allow non standardized experimental things
Set QP offset between P and B frames.
Set error detection flags.
Possible values:
verify embedded CRCs
detect bitstream specification deviations
detect improper bitstream length
abort decoding on minor error detection
consider things that violate the spec and have not been seen in the wild as errors
consider all spec non compliancies as errors
consider things that a sane encoder should not do as an error
Use MPEG quantizers instead of H.263.
How to keep quantizer between qmin and qmax (0 = clip, 1 = use differentiable function).
Set experimental quantizer modulation.
Set experimental quantizer modulation.
Set rate control equation. When computing the expression, besides the standard functions defined in the section ’Expression Evaluation’, the following functions are available: bits2qp(bits), qp2bits(qp). Also the following constants are available: iTex pTex tex mv fCode iCount mcVar var isI isP isB avgQP qComp avgIITex avgPITex avgPPTex avgBPTex avgTex.
Set max bitrate tolerance (in bits/s). Requires bufsize to be set.
Set min bitrate tolerance (in bits/s). Most useful in setting up a CBR encode. It is of little use elsewise.
Set ratecontrol buffer size (in bits).
Currently useless.
Set QP factor between P and I frames.
Set QP offset between P and I frames.
Set initial complexity for 1-pass encoding.
Set DCT algorithm.
Possible values:
autoselect a good one (default)
fast integer
accurate integer
floating point AAN DCT
Compress bright areas stronger than medium ones.
Set temporal complexity masking.
Set spatial complexity masking.
Set inter masking.
Compress dark areas stronger than medium ones.
Select IDCT implementation.
Possible values:
floating point AAN IDCT
Set error concealment strategy.
Possible values:
iterative motion vector (MV) search (slow)
use strong deblock filter for damaged MBs
Set prediction method.
Possible values:
Set sample aspect ratio.
Print specific debug info.
Possible values:
picture info
rate control
macroblock (MB) type
per-block quantization parameter (QP)
motion vector
error recognition
memory management control operations (H.264)
visualize quantization parameter (QP), lower QP are tinted greener
visualize block types
picture buffer allocations
threading operations
Visualize motion vectors (MVs).
Possible values:
forward predicted MVs of P-frames
forward predicted MVs of B-frames
backward predicted MVs of B-frames
Set full pel me compare function.
Possible values:
sum of absolute differences, fast (default)
sum of squared errors
sum of absolute Hadamard transformed differences
sum of absolute DCT transformed differences
sum of squared quantization errors (avoid, low quality)
number of bits needed for the block
rate distortion optimal, slow
0
sum of absolute vertical differences
sum of squared vertical differences
noise preserving sum of squared differences
5/3 wavelet, only used in snow
9/7 wavelet, only used in snow
Set sub pel me compare function.
Possible values:
sum of absolute differences, fast (default)
sum of squared errors
sum of absolute Hadamard transformed differences
sum of absolute DCT transformed differences
sum of squared quantization errors (avoid, low quality)
number of bits needed for the block
rate distortion optimal, slow
0
sum of absolute vertical differences
sum of squared vertical differences
noise preserving sum of squared differences
5/3 wavelet, only used in snow
9/7 wavelet, only used in snow
Set macroblock compare function.
Possible values:
sum of absolute differences, fast (default)
sum of squared errors
sum of absolute Hadamard transformed differences
sum of absolute DCT transformed differences
sum of squared quantization errors (avoid, low quality)
number of bits needed for the block
rate distortion optimal, slow
0
sum of absolute vertical differences
sum of squared vertical differences
noise preserving sum of squared differences
5/3 wavelet, only used in snow
9/7 wavelet, only used in snow
Set interlaced dct compare function.
Possible values:
sum of absolute differences, fast (default)
sum of squared errors
sum of absolute Hadamard transformed differences
sum of absolute DCT transformed differences
sum of squared quantization errors (avoid, low quality)
number of bits needed for the block
rate distortion optimal, slow
0
sum of absolute vertical differences
sum of squared vertical differences
noise preserving sum of squared differences
5/3 wavelet, only used in snow
9/7 wavelet, only used in snow
Set diamond type & size for motion estimation.
Set amount of motion predictors from the previous frame.
Set pre motion estimation.
Set pre motion estimation compare function.
Possible values:
sum of absolute differences, fast (default)
sum of squared errors
sum of absolute Hadamard transformed differences
sum of absolute DCT transformed differences
sum of squared quantization errors (avoid, low quality)
number of bits needed for the block
rate distortion optimal, slow
0
sum of absolute vertical differences
sum of squared vertical differences
noise preserving sum of squared differences
5/3 wavelet, only used in snow
9/7 wavelet, only used in snow
Set diamond type & size for motion estimation pre-pass.
Set sub pel motion estimation quality.
Set limit motion vectors range (1023 for DivX player).
Set intra quant bias.
Set inter quant bias.
Possible values:
variable length coder / huffman coder
arithmetic coder
raw (no encoding)
run-length coder
deflate-based coder
Set context model.
Set macroblock decision algorithm (high quality mode).
Possible values:
use mbcmp (default)
use fewest bits
use best rate distortion
Set scene change threshold.
Set min lagrange factor (VBR).
Set max lagrange factor (VBR).
Set noise reduction.
Set number of bits which should be loaded into the rc buffer before decoding starts.
Possible values:
allow non spec compliant speedup tricks
Deprecated, use mpegvideo private options instead
skip bitstream encoding
place global headers at every keyframe instead of in extradata
Frame data might be split into multiple chunks
Show all frames before the first keyframe
Deprecated, use mpegvideo private options instead
Deprecated, use mpegvideo private options instead.
Possible values:
detect a good number of threads
Set motion estimation threshold.
Set macroblock threshold.
Set intra_dc_precision.
Set nsse weight.
Set number of macroblock rows at the top which are skipped.
Set number of macroblock rows at the bottom which are skipped.
Possible values:
Possible values:
Decode at 1= 1/2, 2=1/4, 3=1/8 resolutions.
Set frame skip threshold.
Set frame skip factor.
Set frame skip exponent.
Set frame skip compare function.
Possible values:
sum of absolute differences, fast (default)
sum of squared errors
sum of absolute Hadamard transformed differences
sum of absolute DCT transformed differences
sum of squared quantization errors (avoid, low quality)
number of bits needed for the block
rate distortion optimal, slow
0
sum of absolute vertical differences
sum of squared vertical differences
noise preserving sum of squared differences
5/3 wavelet, only used in snow
9/7 wavelet, only used in snow
Increase the quantizer for macroblocks close to borders.
Set min macroblock lagrange factor (VBR).
Set max macroblock lagrange factor (VBR).
Set motion estimation bitrate penalty compensation (1.0 = 256).
Possible values:
Possible values:
Possible values:
Refine the two motion vectors used in bidirectional macroblocks.
Downscale frames for dynamic B-frame decision.
Set minimum interval between IDR-frames.
Set reference frames to consider for motion compensation.
Set chroma qp offset from luma.
Set rate-distortion optimal quantization.
Set value multiplied by qscale for each frame and added to scene_change_score.
Adjust sensitivity of b_frame_strategy 1.
Set GOP timecode frame start number, in non drop frame format.
Set desired number of audio channels.
Possible values:
Possible values:
Set the log level offset.
Number of slices, used in parallelized encoding.
Select multithreading type.
Possible values:
Set audio service type.
Possible values:
Main Audio Service
Effects
Visually Impaired
Hearing Impaired
Dialogue
Commentary
Emergency
Voice Over
Karaoke
Set sample format audio decoders should prefer. Default value is
none
.
Set the input subtitles character encoding.
Decoders are configured elements in FFmpeg which allow the decoding of multimedia streams.
When you configure your FFmpeg build, all the supported native decoders
are enabled by default. Decoders requiring an external library must be enabled
manually via the corresponding --enable-lib
option. You can list all
available decoders using the configure option --list-decoders
.
You can disable all the decoders with the configure option
--disable-decoders
and selectively enable / disable single decoders
with the options --enable-decoder=DECODER
/
--disable-decoder=DECODER
.
The option -codecs
of the ff* tools will display the list of
enabled decoders.
A description of some of the currently available video decoders follows.
Raw video decoder.
This decoder decodes rawvideo streams.
Specify the assumed field type of the input video.
the video is assumed to be progressive (default)
bottom-field-first is assumed
top-field-first is assumed
Internal wave synthetizer.
This decoder generates wave patterns according to predefined sequences. Its use is purely internal and the format of the data it accepts is not publicly documented.
libcelt decoder wrapper
libcelt allows libavcodec to decode the Xiph CELT ultra-low delay audio codec.
Requires the presence of the libcelt headers and library during configuration.
You need to explicitly configure the build with --enable-libcelt
.
libgsm decoder wrapper
libgsm allows libavcodec to decode the GSM full rate audio codec. Requires
the presence of the libgsm headers and library during configuration. You need
to explicitly configure the build with --enable-libgsm
.
This decoder supports both the ordinary GSM and the Microsoft variant.
libilbc decoder wrapper
libilbc allows libavcodec to decode the Internet Low Bitrate Codec (iLBC)
audio codec. Requires the presence of the libilbc headers and library during
configuration. You need to explicitly configure the build with
--enable-libilbc
.
The following option is supported by the libilbc wrapper.
Enable the enhancement of the decoded audio when set to 1. The default value is 0 (disabled).
libopencore-amrnb decoder wrapper
libopencore-amrnb allows libavcodec to decode the Adaptive Multi-Rate
Narrowband audio codec. Using it requires the presence of the
libopencore-amrnb headers and library during configuration. You need to
explicitly configure the build with --enable-libopencore-amrnb
.
An FFmpeg native decoder for AMR-NB exists, so users can decode AMR-NB without this library.
libopencore-amrwb decoder wrapper.
libopencore-amrwb allows libavcodec to decode the Adaptive Multi-Rate
Wideband audio codec. Using it requires the presence of the
libopencore-amrwb headers and library during configuration. You need to
explicitly configure the build with --enable-libopencore-amrwb
.
An FFmpeg native decoder for AMR-WB exists, so users can decode AMR-WB without this library.
libopus decoder wrapper.
libopus allows libavcodec to decode the Opus Interactive Audio Codec.
Requires the presence of the libopus headers and library during
configuration. You need to explicitly configure the build with
--enable-libopus
.
This codec decodes the bitmap subtitles used in DVDs; the same subtitles can also be found in VobSub file pairs and in some Matroska files.
Specify the global palette used by the bitmaps. When stored in VobSub, the palette is normally specified in the index file; in Matroska, the palette is stored in the codec extra-data in the same format as in VobSub. In DVDs, the palette is stored in the IFO file, and therefore not available when reading from dumped VOB files.
The format for this option is a string containing 16 24-bits hexadecimal
numbers (without 0x prefix) separated by comas, for example 0d00ee,
ee450d, 101010, eaeaea, 0ce60b, ec14ed, ebff0b, 0d617a, 7b7b7b, d1d1d1,
7b2a0e, 0d950c, 0f007b, cf0dec, cfa80c, 7c127b
.
Encoders are configured elements in FFmpeg which allow the encoding of multimedia streams.
When you configure your FFmpeg build, all the supported native encoders
are enabled by default. Encoders requiring an external library must be enabled
manually via the corresponding --enable-lib
option. You can list all
available encoders using the configure option --list-encoders
.
You can disable all the encoders with the configure option
--disable-encoders
and selectively enable / disable single encoders
with the options --enable-encoder=ENCODER
/
--disable-encoder=ENCODER
.
The option -codecs
of the ff* tools will display the list of
enabled encoders.
A description of some of the currently available audio encoders follows.
AC-3 audio encoders.
These encoders implement part of ATSC A/52:2010 and ETSI TS 102 366, as well as the undocumented RealAudio 3 (a.k.a. dnet).
The ac3 encoder uses floating-point math, while the ac3_fixed
encoder only uses fixed-point integer math. This does not mean that one is
always faster, just that one or the other may be better suited to a
particular system. The floating-point encoder will generally produce better
quality audio for a given bitrate. The ac3_fixed encoder is not the
default codec for any of the output formats, so it must be specified explicitly
using the option -acodec ac3_fixed
in order to use it.
The AC-3 metadata options are used to set parameters that describe the audio, but in most cases do not affect the audio encoding itself. Some of the options do directly affect or influence the decoding and playback of the resulting bitstream, while others are just for informational purposes. A few of the options will add bits to the output stream that could otherwise be used for audio data, and will thus affect the quality of the output. Those will be indicated accordingly with a note in the option list below.
These parameters are described in detail in several publicly-available documents.
Allow Per-Frame Metadata. Specifies if the encoder should check for changing metadata for each frame.
The metadata values set at initialization will be used for every frame in the stream. (default)
Metadata values can be changed before encoding each frame.
Center Mix Level. The amount of gain the decoder should apply to the center channel when downmixing to stereo. This field will only be written to the bitstream if a center channel is present. The value is specified as a scale factor. There are 3 valid values:
Apply -3dB gain
Apply -4.5dB gain (default)
Apply -6dB gain
Surround Mix Level. The amount of gain the decoder should apply to the surround channel(s) when downmixing to stereo. This field will only be written to the bitstream if one or more surround channels are present. The value is specified as a scale factor. There are 3 valid values:
Apply -3dB gain
Apply -6dB gain (default)
Silence Surround Channel(s)
Audio Production Information is optional information describing the mixing environment. Either none or both of the fields are written to the bitstream.
Mixing Level. Specifies peak sound pressure level (SPL) in the production
environment when the mix was mastered. Valid values are 80 to 111, or -1 for
unknown or not indicated. The default value is -1, but that value cannot be
used if the Audio Production Information is written to the bitstream. Therefore,
if the room_type
option is not the default value, the mixing_level
option must not be -1.
Room Type. Describes the equalization used during the final mixing session at
the studio or on the dubbing stage. A large room is a dubbing stage with the
industry standard X-curve equalization; a small room has flat equalization.
This field will not be written to the bitstream if both the mixing_level
option and the room_type
option have the default values.
Not Indicated (default)
Large Room
Small Room
Copyright Indicator. Specifies whether a copyright exists for this audio.
No Copyright Exists (default)
Copyright Exists
Dialogue Normalization. Indicates how far the average dialogue level of the program is below digital 100% full scale (0 dBFS). This parameter determines a level shift during audio reproduction that sets the average volume of the dialogue to a preset level. The goal is to match volume level between program sources. A value of -31dB will result in no volume level change, relative to the source volume, during audio reproduction. Valid values are whole numbers in the range -31 to -1, with -31 being the default.
Dolby Surround Mode. Specifies whether the stereo signal uses Dolby Surround (Pro Logic). This field will only be written to the bitstream if the audio stream is stereo. Using this option does NOT mean the encoder will actually apply Dolby Surround processing.
Not Indicated (default)
Not Dolby Surround Encoded
Dolby Surround Encoded
Original Bit Stream Indicator. Specifies whether this audio is from the original source and not a copy.
Not Original Source
Original Source (default)
The extended bitstream options are part of the Alternate Bit Stream Syntax as
specified in Annex D of the A/52:2010 standard. It is grouped into 2 parts.
If any one parameter in a group is specified, all values in that group will be
written to the bitstream. Default values are used for those that are written
but have not been specified. If the mixing levels are written, the decoder
will use these values instead of the ones specified in the center_mixlev
and surround_mixlev
options if it supports the Alternate Bit Stream
Syntax.
Preferred Stereo Downmix Mode. Allows the user to select either Lt/Rt (Dolby Surround) or Lo/Ro (normal stereo) as the preferred stereo downmix mode.
Not Indicated (default)
Lt/Rt Downmix Preferred
Lo/Ro Downmix Preferred
Lt/Rt Center Mix Level. The amount of gain the decoder should apply to the center channel when downmixing to stereo in Lt/Rt mode.
Apply +3dB gain
Apply +1.5dB gain
Apply 0dB gain
Apply -1.5dB gain
Apply -3.0dB gain
Apply -4.5dB gain (default)
Apply -6.0dB gain
Silence Center Channel
Lt/Rt Surround Mix Level. The amount of gain the decoder should apply to the surround channel(s) when downmixing to stereo in Lt/Rt mode.
Apply -1.5dB gain
Apply -3.0dB gain
Apply -4.5dB gain
Apply -6.0dB gain (default)
Silence Surround Channel(s)
Lo/Ro Center Mix Level. The amount of gain the decoder should apply to the center channel when downmixing to stereo in Lo/Ro mode.
Apply +3dB gain
Apply +1.5dB gain
Apply 0dB gain
Apply -1.5dB gain
Apply -3.0dB gain
Apply -4.5dB gain (default)
Apply -6.0dB gain
Silence Center Channel
Lo/Ro Surround Mix Level. The amount of gain the decoder should apply to the surround channel(s) when downmixing to stereo in Lo/Ro mode.
Apply -1.5dB gain
Apply -3.0dB gain
Apply -4.5dB gain
Apply -6.0dB gain (default)
Silence Surround Channel(s)
Dolby Surround EX Mode. Indicates whether the stream uses Dolby Surround EX (7.1 matrixed to 5.1). Using this option does NOT mean the encoder will actually apply Dolby Surround EX processing.
Not Indicated (default)
Dolby Surround EX Off
Dolby Surround EX On
Dolby Headphone Mode. Indicates whether the stream uses Dolby Headphone encoding (multi-channel matrixed to 2.0 for use with headphones). Using this option does NOT mean the encoder will actually apply Dolby Headphone processing.
Not Indicated (default)
Dolby Headphone Off
Dolby Headphone On
A/D Converter Type. Indicates whether the audio has passed through HDCD A/D conversion.
Standard A/D Converter (default)
HDCD A/D Converter
Stereo Rematrixing. Enables/Disables use of rematrixing for stereo input. This is an optional AC-3 feature that increases quality by selectively encoding the left/right channels as mid/side. This option is enabled by default, and it is highly recommended that it be left as enabled except for testing purposes.
These options are only valid for the floating-point encoder and do not exist for the fixed-point encoder due to the corresponding features not being implemented in fixed-point.
Enables/Disables use of channel coupling, which is an optional AC-3 feature that increases quality by combining high frequency information from multiple channels into a single channel. The per-channel high frequency information is sent with less accuracy in both the frequency and time domains. This allows more bits to be used for lower frequencies while preserving enough information to reconstruct the high frequencies. This option is enabled by default for the floating-point encoder and should generally be left as enabled except for testing purposes or to increase encoding speed.
Selected by Encoder (default)
Disable Channel Coupling
Enable Channel Coupling
Coupling Start Band. Sets the channel coupling start band, from 1 to 15. If a value higher than the bandwidth is used, it will be reduced to 1 less than the coupling end band. If auto is used, the start band will be determined by the encoder based on the bit rate, sample rate, and channel layout. This option has no effect if channel coupling is disabled.
Selected by Encoder (default)
LAME (Lame Ain’t an MP3 Encoder) MP3 encoder wrapper
Requires the presence of the libmp3lame headers and library during
configuration. You need to explicitly configure the build with
--enable-libmp3lame
.
The following options are supported by the libmp3lame wrapper, the LAME-equivalent options follow the FFmpeg ones.
FFmpeg | LAME |
b | b
FFmpeg b option is expressed in bits/s, lame bitrate
in kilobits/s. |
q | V Quality setting for VBR. |
compression_level | q Algorithm quality. Valid options are integers from 0-9. |
reservoir | N.A. Enable use of bit reservoir. LAME has this enabled by default. |
joint_stereo | -m j Enables the the encoder to use (on a frame by frame basis) either L/R stereo or mid/side stereo. |
OpenCORE Adaptive Multi-Rate Narrowband encoder.
Requires the presence of the libopencore-amrnb headers and library during
configuration. You need to explicitly configure the build with
--enable-libopencore-amrnb --enable-version3
.
This is a mono-only encoder. Officially it only supports 8000Hz sample rate, but you can override it by setting ‘strict’ to ‘unofficial’ or lower.
Set bitrate in bits per second. Only the following bitrates are supported, otherwise libavcodec will round to the nearest valid bitrate.
Allow discontinuous transmission (generate comfort noise) when set to 1. The default value is 0 (disabled).
TwoLAME MP2 encoder wrapper
Requires the presence of the libtwolame headers and library during
configuration. You need to explicitly configure the build with
--enable-libtwolame
.
The following options are supported by the libtwolame wrapper. The TwoLAME-equivalent options follow the FFmpeg ones and are in parentheses.
(b) Set bitrate in bits/s. Note that FFmpeg b
option is
expressed in bits/s, twolame b
in kilobits/s. The default
value is 128k.
(V) Set quality for experimental VBR support. Maximum value range is from -50 to 50, useful range is from -10 to 10.
(mode) Set MPEG mode. Possible values:
Choose mode automatically based on the input. This is the default.
Stereo
Joint stereo
Dual channel
Mono
(psyc-mode) Set psychoacoustic model to use in encoding. The argument must be an integer between -1 and 4, inclusive. The higher the value, the better the quality. The default value is 3.
(energy) Enable energy levels extensions when set to 1. The default value is 0 (disabled).
(protect) Enable CRC error protection when set to 1. The default value is 0 (disabled).
(copyright) Set MPEG audio copyright flag when set to 1. The default value is 0 (disabled).
(original) Set MPEG audio original flag when set to 1. The default value is 0 (disabled).
VisualOn AAC encoder
Requires the presence of the libvo-aacenc headers and library during
configuration. You need to explicitly configure the build with
--enable-libvo-aacenc --enable-version3
.
The VisualOn AAC encoder only support encoding AAC-LC and up to 2 channels. It is also CBR-only. It is considered to be worse than the native experimental FFmpeg AAC encoder.
Bitrate.
VisualOn Adaptive Multi-Rate Wideband encoder
Requires the presence of the libvo-amrwbenc headers and library during
configuration. You need to explicitly configure the build with
--enable-libvo-amrwbenc --enable-version3
.
This is a mono-only encoder. Officially it only supports 16000Hz sample rate, but you can override it by setting ‘strict’ to ‘unofficial’ or lower.
Set bitrate in bits/s. Only the following bitrates are supported, otherwise libavcodec will round to the nearest valid bitrate.
Allow discontinuous transmission (generate comfort noise) when set to 1. The default value is 0 (disabled).
libopus Opus Interactive Audio Codec encoder wrapper.
Requires the presence of the libopus headers and library during
configuration. You need to explicitly configure the build with
--enable-libopus
.
Most libopus options are modeled after the opusenc
utility from
opus-tools. The following is an option mapping chart describing options
supported by the libopus wrapper, and their opusenc
-equivalent
in parentheses.
Set the bit rate in bits/s. FFmpeg’s ‘b’ option is
expressed in bits/s, while opusenc
’s ‘bitrate’ in
kilobits/s.
Set VBR mode. The FFmpeg ‘vbr’ option has the following
valid arguments, with the their opusenc
equivalent options
in parentheses:
Use constant bit rate encoding.
Use variable bit rate encoding (the default).
Use constrained variable bit rate encoding.
Set encoding algorithm complexity. Valid options are integers in the 0-10 range. 0 gives the fastest encodes but lower quality, while 10 gives the highest quality but slowest encoding. The default is 10.
Set maximum frame size, or duration of a frame in milliseconds. The
argument must be exactly the following: 2.5, 5, 10, 20, 40, 60. Smaller
frame sizes achieve lower latency but less quality at a given bitrate.
Sizes greater than 20ms are only interesting at fairly low bitrates.
The default of FFmpeg is 10ms, but is 20ms in opusenc
.
Set expected packet loss percentage. The default is 0.
Set intended application type. Valid options are listed below:
Favor improved speech intelligibility.
Favor faithfulness to the input (the default).
Restrict to only the lowest delay modes.
Set cutoff bandwidth in Hz. The argument must be exactly one of the following: 4000, 6000, 8000, 12000, or 20000, corresponding to narrowband, mediumband, wideband, super wideband, and fullband respectively. The default is 0 (cutoff disabled).
A description of some of the currently available video encoders follows.
Theora format supported through libtheora.
Requires the presence of the libtheora headers and library during
configuration. You need to explicitly configure the build with
--enable-libtheora
.
The following global options are mapped to internal libtheora options which affect the quality and the bitrate of the encoded stream.
Set the video bitrate, only works if the qscale
flag in
‘flags’ is not enabled.
Used to enable constant quality mode encoding through the
‘qscale’ flag, and to enable the pass1
and pass2
modes.
Set the GOP size.
Set the global quality in lambda units, only works if the
qscale
flag in ‘flags’ is enabled. The value is clipped
in the [0 - 10*FF_QP2LAMBDA
] range, and then multiplied for 6.3
to get a value in the native libtheora range [0-63]. A higher value
corresponds to a higher quality.
For example, to set maximum constant quality encoding with
ffmpeg
:
ffmpeg -i INPUT -flags:v qscale -global_quality:v "10*QP2LAMBDA" -codec:v libtheora OUTPUT.ogg
VP8 format supported through libvpx.
Requires the presence of the libvpx headers and library during configuration.
You need to explicitly configure the build with --enable-libvpx
.
Mapping from FFmpeg to libvpx options with conversion notes in parentheses.
g_threads
g_profile
rc_target_bitrate
kf_max_dist
kf_min_dist
rc_min_quantizer
rc_max_quantizer
rc_buf_sz
(bufsize * 1000 / vb)
rc_buf_optimal_sz
(bufsize * 1000 / vb * 5 / 6)
rc_buf_initial_sz
(rc_init_occupancy * 1000 / vb)
rc_undershoot_pct
rc_dropframe_thresh
rc_2pass_vbr_bias_pct
rc_2pass_vbr_maxsection_pct
(maxrate * 100 / vb)
rc_2pass_vbr_minsection_pct
(minrate * 100 / vb)
VPX_CBR
(minrate == maxrate == vb)
VPX_CQ
, VP8E_SET_CQ_LEVEL
VPX_DL_BEST_QUALITY
VPX_DL_GOOD_QUALITY
VPX_DL_REALTIME
VP8E_SET_CPUUSED
VP8E_SET_NOISE_SENSITIVITY
VP8E_SET_STATIC_THRESHOLD
VP8E_SET_TOKEN_PARTITIONS
VP8E_SET_MAX_INTRA_BITRATE_PCT
VPX_EFLAG_FORCE_KF
VP8E_SET_ENABLEAUTOALTREF
VP8E_SET_ARNR_MAXFRAMES
VP8E_SET_ARNR_TYPE
VP8E_SET_ARNR_STRENGTH
g_lag_in_frames
g_error_resilient
For more information about libvpx see: http://www.webmproject.org/
x264 H.264/MPEG-4 AVC encoder wrapper
This encoder requires the presence of the libx264 headers and library
during configuration. You need to explicitly configure the build with
--enable-libx264
.
libx264 supports an impressive number of features, including 8x8 and 4x4 adaptive spatial transform, adaptive B-frame placement, CAVLC/CABAC entropy coding, interlacing (MBAFF), lossless mode, psy optimizations for detail retention (adaptive quantization, psy-RD, psy-trellis).
Many libx264 encoder options are mapped to FFmpeg global codec
options, while unique encoder options are provided through private
options. Additionally the ‘x264opts’ and ‘x264-params’
private options allows to pass a list of key=value tuples as accepted
by the libx264 x264_param_parse
function.
The x264 project website is at http://www.videolan.org/developers/x264.html.
The following options are supported by the libx264 wrapper. The
x264
-equivalent options or values are listed in parentheses
for easy migration.
To reduce the duplication of documentation, only the private options and some others requiring special attention are documented here. For the documentation of the undocumented generic options, see the Codec Options chapter.
To get a more accurate and extensive documentation of the libx264
options, invoke the command x264 --full-help
or consult
the libx264 documentation.
Set bitrate in bits/s. Note that FFmpeg’s ‘b’ option is
expressed in bits/s, while x264
’s ‘bitrate’ is in
kilobits/s.
Set motion estimation method. Possible values in the decreasing order of speed:
Diamond search with radius 1 (fastest). ‘epzs’ is an alias for ‘dia’.
Hexagonal search with radius 2.
Uneven multi-hexagon search.
Exhaustive search.
Hadamard exhaustive search (slowest).
Set entropy encoder. Possible values:
Enable CABAC.
Enable CAVLC and disable CABAC. It generates the same effect as
x264
’s ‘--no-cabac’ option.
Set full pixel motion estimation comparation algorithm. Possible values:
Enable chroma in motion estimation.
Ignore chroma in motion estimation. It generates the same effect as
x264
’s ‘--no-chroma-me’ option.
Set multithreading technique. Possible values:
Slice-based multithreading. It generates the same effect as
x264
’s ‘--sliced-threads’ option.
Frame-based multithreading.
Set encoding flags. It can be used to disable closed GOP and enable
open GOP by setting it to -cgop
. The result is similar to
the behavior of x264
’s ‘--open-gop’ option.
Set the encoding preset.
Set tuning of the encoding params.
Set profile restrictions.
Enable fast settings when encoding first pass, when set to 1. When set
to 0, it has the same effect of x264
’s
‘--slow-firstpass’ option.
Set the quality for constant quality mode.
In CRF mode, prevents VBV from lowering quality beyond this point.
Set constant quantization rate control method parameter.
Set AQ method. Possible values:
Disabled.
Variance AQ (complexity mask).
Auto-variance AQ (experimental).
Set AQ strength, reduce blocking and blurring in flat and textured areas.
Use psychovisual optimizations when set to 1. When set to 0, it has the
same effect as x264
’s ‘--no-psy’ option.
Set strength of psychovisual optimization, in psy-rd:psy-trellis format.
Set number of frames to look ahead for frametype and ratecontrol.
Enable weighted prediction for B-frames when set to 1. When set to 0,
it has the same effect as x264
’s ‘--no-weightb’ option.
Set weighted prediction method for P-frames. Possible values:
Disabled
Enable only weighted refs
Enable both weighted refs and duplicates
Enable calculation and printing SSIM stats after the encoding.
Enable the use of Periodic Intra Refresh instead of IDR frames when set to 1.
Set the influence on how often B-frames are used.
Set method for keeping of some B-frames as references. Possible values:
Disabled.
Strictly hierarchical pyramid.
Non-strict (not Blu-ray compatible).
Enable the use of one reference per partition, as opposed to one
reference per macroblock when set to 1. When set to 0, it has the
same effect as x264
’s ‘--no-mixed-refs’ option.
Enable adaptive spatial transform (high profile 8x8 transform)
when set to 1. When set to 0, it has the same effect as
x264
’s ‘--no-8x8dct’ option.
Enable early SKIP detection on P-frames when set to 1. When set
to 0, it has the same effect as x264
’s
‘--no-fast-pskip’ option.
Enable use of access unit delimiters when set to 1.
Enable use macroblock tree ratecontrol when set to 1. When set
to 0, it has the same effect as x264
’s
‘--no-mbtree’ option.
Set loop filter parameters, in alpha:beta form.
Set fluctuations reduction in QP (before curve compression).
Set partitions to consider as a comma-separated list of. Possible values in the list:
8x8 P-frame partition.
4x4 P-frame partition.
4x4 B-frame partition.
8x8 I-frame partition.
4x4 I-frame partition. (Enabling ‘p4x4’ requires ‘p8x8’ to be enabled. Enabling ‘i8x8’ requires adaptive spatial transform (‘8x8dct’ option) to be enabled.)
Do not consider any partitions.
Consider every partition.
Set direct MV prediction mode. Possible values:
Disable MV prediction.
Enable spatial predicting.
Enable temporal predicting.
Automatically decided.
Set the limit of the size of each slice in bytes. If not specified but RTP payload size (‘ps’) is specified, that is used.
Set the file name for multi-pass stats.
Set signal HRD information (requires ‘vbv-bufsize’ to be set). Possible values:
Disable HRD information signaling.
Variable bit rate.
Constant bit rate (not allowed in MP4 container).
Set any x264 option, see x264 --fullhelp
for a list.
Argument is a list of key=value couples separated by ":". In filter and psy-rd options that use ":" as a separator themselves, use "," instead. They accept it as well since long ago but this is kept undocumented for some reason.
For example to specify libx264 encoding options with ffmpeg
:
ffmpeg -i foo.mpg -vcodec libx264 -x264opts keyint=123:min-keyint=20 -an out.mkv
Override the x264 configuration using a :-separated list of key=value parameters.
This option is functionally the same as the ‘x264opts’, but is duplicated for compability with the Libav fork.
For example to specify libx264 encoding options with ffmpeg
:
ffmpeg -i INPUT -c:v libx264 -x264-params level=30:bframes=0:weightp=0:\ cabac=0:ref=1:vbv-maxrate=768:vbv-bufsize=2000:analyse=all:me=umh:\ no-fast-pskip=1:subq=6:8x8dct=0:trellis=0 OUTPUT
Encoding ffpresets for common usages are provided so they can be used with the general presets system (e.g. passing the ‘pre’ option).
Xvid MPEG-4 Part 2 encoder wrapper.
This encoder requires the presence of the libxvidcore headers and library
during configuration. You need to explicitly configure the build with
--enable-libxvid --enable-gpl
.
The native mpeg4
encoder supports the MPEG-4 Part 2 format, so
users can encode to this format without this library.
The following options are supported by the libxvid wrapper. Some of the following options are listed but are not documented, and correspond to shared codec options. See the Codec Options chapter for their documentation. The other shared options which are not listed have no effect for the libxvid encoder.
Set specific encoding flags. Possible values:
Use four motion vector by macroblock.
Enable high quality AC prediction.
Only encode grayscale.
Enable the use of global motion compensation (GMC).
Enable quarter-pixel motion compensation.
Enable closed GOP.
Place global headers in extradata instead of every keyframe.
Set motion estimation method. Possible values in decreasing order of speed and increasing order of quality:
Use no motion estimation (default).
Enable advanced diamond zonal search for 16x16 blocks and half-pixel refinement for 16x16 blocks. ‘x1’ and ‘log’ are aliases for ‘phods’.
Enable all of the things described above, plus advanced diamond zonal search for 8x8 blocks, half-pixel refinement for 8x8 blocks, and motion estimation on chroma planes.
Enable all of the things described above, plus extended 16x16 and 8x8 blocks search.
Set macroblock decision algorithm. Possible values in the increasing order of quality:
Use macroblock comparing function algorithm (default).
Enable rate distortion-based half pixel and quarter pixel refinement for 16x16 blocks.
Enable all of the things described above, plus rate distortion-based half pixel and quarter pixel refinement for 8x8 blocks, and rate distortion-based search using square pattern.
ffmpeg, ffplay, ffprobe, ffserver, libavcodec
The FFmpeg developers.
For details about the authorship, see the Git history of the project
(git://source.ffmpeg.org/ffmpeg), e.g. by typing the command
git log
in the FFmpeg source directory, or browsing the
online repository at http://source.ffmpeg.org.
Maintainers for the specific components are listed in the file ‘MAINTAINERS’ in the source code tree.