Recent Posts

M3-CVC: A Glimpse into the Future of AI-Driven Video Compression

A new AI-based codec proved 18% more efficient than VVC but substantial decoding requirements will limit short-term commercial application. Here’s a summary of the white paper.  In December 2024, researchers from Fudan University introduced M3-CVC, an AI-based video compression framework that combines large multimodal models (LMMs) for semantic understanding and conditional diffusion models (CDMs) for high-fidelity reconstruction. The framework employs …

Read More »

Comparing Fixed GOPs to Variable GOPs with I-Frames at Scene Changes

I first encountered the line, “Anything worth doing is worth overdoing,” in the Robert Heinlein novel Time Enough for Love. I bring this up because this is my third recent article on GOP size, and I think I’m close to beating this topic into the ground. I’ll let you be the judge. To recount, I reported on testing in an …

Read More »

Real-World Perspectives on Choosing the Optimal GOP Size

One of the most fundamental encoding decisions is the size of the Group of Pictures (GOP) or the frequency of I-frames within an encoded file. I-frames, also known as keyframes, are the starting points for groups of pictures, consisting of I-, B-, and P-frames. Traditionally, the GOP size is directed by adaptive bitrate streaming considerations, such as ensuring an I-frame …

Read More »