Archive for August 28th, 2011
After I get MP4 dedup working, FLV files are the next step. Some resources that showed up:
TL;DR: I have a bunch of MP4 video files that have identical content, but the internal metadata differs. I’m writing a Python function/module/program that’ll strip out the metadata, so hashing functions can find the duplicates.