I think you underestimate complexity of audio & video encoding standards. There are hundreds and hundreds of pages of specification. How many times do you need to execute real ffmpeg to get all tiny details?
It's certainly possible to reverse-engineer it from a blackbox access, but it would take *years* and this test has a time limit.
It's certainly possible to reverse-engineer it from a blackbox access, but it would take *years* and this test has a time limit.