Avoiding time_increment_bits problem when encoding bad header MPEG4 videos to Ogg Theora

There is some debate going on lately about the migration of YouTube to [[HTML5]], and whether they (i.e. YouTube’s owner, Google) should support [[H.264]] or [[Theora]] as standard codecs for the upcoming <video> tag. See, for example, how the FSF asks for support for Theora.

The thing is, I discovered [[x264]] not so long ago, and I thought it was a “free version” of H.264. I began using it to reencode the medium-to-low quality videos I keep (e.g., movies and series). The resulting quality/file size ratio stunned me. I could reencode most material downloaded from e.g. p2p sources to 2/3 of their size, keeping the copy indistinguishable from the original with the bare eye.

However, after realizing that x264 is just a free implementation of the proprietary H.264 codec, and in the wake of the H.264/Theora debate, I decided to give Ogg Theora a go. I expected a fair competitor to H.264, although still noticeably behind in quality/size ratio. And that I found. I for one do not care if I need a 10% larger file to attain the same quality, if it means using free formats, so I decided to adopt Theora for everyday reencoding.

After three paragraphs of introduction, let’s get to the point. Which is that reencoding some files with [[ffmpeg2theora]] I would get the following error:

% ffmpeg2theora -i example_video.avi -o output.ogg
[avi @ 0x22b7560]Something went wrong during header parsing, I will ignore it and try to continue anyway.
[NULL @ 0x22b87f0]hmm, seems the headers are not complete, trying to guess time_increment_bits
[NULL @ 0x22b87f0]my guess is 15 bits ;)
[NULL @ 0x22b87f0]looks like this file was encoded with (divx4/(old)xvid/opendivx) -> forcing low_delay flag
Input #0, avi, from 'example_video.avi':
  Metadata:
    Title           : example_video.avi
  Duration: 00:44:46.18, start: 0.000000, bitrate: 1093 kb/s
    Stream #0.0: Video: mpeg4, yuv420p, 624x464, 23.98 tbr, 23.98 tbn, 23.98 tbc
    Stream #0.1: Audio: mp3, 48000 Hz, 2 channels, s16, 32 kb/s
  .

[mpeg4 @ 0x22b87f0]hmm, seems the headers are not complete, trying to guess time_increment_bits
[mpeg4 @ 0x22b87f0]my guess is 16 bits ;)
[mpeg4 @ 0x22b87f0]hmm, seems the headers are not complete, trying to guess time_increment_bits
[mpeg4 @ 0x22b87f0]my guess is 16 bits ;)
[mpeg4 @ 0x22b87f0]looks like this file was encoded with (divx4/(old)xvid/opendivx) -> forcing low_delay flag
    Last message repeated 1 times
[mpeg4 @ 0x22b87f0]warning: first frame is no keyframe

I searched the web for solutions, but to no avail. Usually pasting literal errors in Google yields good results, but in this case I only found developer forums where this bug was discussed. What I haven’t found is simple instructions on how to avoid it in practice.

Well, here it goes my simple solution: pass it through [[MEncoder]] first. Where the following fails:

% ffmpeg2theora -i input.avi -o output.ogg

the following succeeds:

% mencoder input.avi -ovc copy -oac copy -o filtered.avi
% ffmpeg2theora -i filtered.avi -o output.ogg

I guess that what happens is basically that mencoder takes the “raw” video data in input.avi and makes a copy into filtered.avi (which ends up being exactly the same video), building sane headers in the process.

3 Comments »

  1. Inigo said,

    February 25, 2010 @ 23:11 pm

    I may be wrong, but as far as I know MPEG-4, and so H.264 as well, are both open standards.

  2. Inigo said,

    February 25, 2010 @ 23:45 pm

    Hi again,

    mmm it seems you’re totally rigth. The MPEG-LA hides its traces quite well, I had to dig a lot to find references specific to it. My fault.

    ps: next time come and say hi if you’re around ;-)

  3. isilanes said,

    March 1, 2010 @ 11:06 am

    Hi Iñigo,

    I am no expert on “legal” things, copyright and so on, but I think the H.264 codec is patented software. This means that the format might be public, and that there might be free encoders and decoders (actually x264 is free software, and is regarded as the best H.264 encoder available). It also means that where software patents are not allowed (i.e. outside of the USA, as of today), H.264 is effectively as free as Theora or Dirac.

    The bad news is that, since the USA is so prevalent in IT, and since the cancer of software patents could spread, once H.264 is widely used, the patent holders could start charging web sites (well, they do it right now) for the right to serve H.264-encoded media. They could also start charging embedded devices capable of running H.264-encoded media via hardware decoding. They could even try to charge software companies for making H.264 decoders, so vlc or ffmpeg would not be able to play H.264 unless they paid the patent fee. I don’t know how much of this is actually going to happen, or how much besides this can happen, but I sure know that I don’t want to find out.

RSS feed for comments on this post · TrackBack URI

Leave a Comment