Check if it's possible to improve audio handling via Mutagen

It would be nice to read Mutagen's documentation, and see if we're missing some metadata, like in #78 (closed).