Could this approach be used for media compression? I've wondered how compressible a popular-music track could be if you had a sufficiently rich language to describe it. This seems like a method to answer that question.
Or sheet music. It always amazed me that humans came up with any solution at all to "here's a piece of paper, tell me what your song sounds like" to say nothing of one that actually works to some degree.
I've always wondered how much classical music sounds the way it does because sheet music is the way it is.
An example of this is Chinese guqin tablature. It can be centuries old and includes a lot of detail on where to place fingers and how to strike the strings, which can give you hints about pitch and timbre when combined with knowing the tuning, strings, etc. But the tablature has almost nothing to say about the LENGTH of each note, so rhythm has to be inferred by the performer from what they know about the culture.
Program Change + the General MIDI instrument set is implementation-dependent but was pretty common in the 90s, and encodes timbre in an extremely limited way.
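To make "extremely limited" concrete: a MIDI Program Change message is just two bytes, a status byte and one program number, and under General MIDI that single number is the entire timbre description. A minimal sketch (the helper name is mine, not from any library):

```python
def program_change(channel: int, program: int) -> bytes:
    """Build a raw MIDI Program Change message.

    Program Change is status byte 0xC0 ORed with the channel (0-15),
    followed by one data byte: the program number (0-127). Under
    General MIDI, program 0 is Acoustic Grand Piano and program 40
    is Violin -- one byte is all the timbre information you get.
    """
    assert 0 <= channel <= 15 and 0 <= program <= 127
    return bytes([0xC0 | channel, program])

# Select Violin on channel 0: the whole "timbre" is two bytes.
msg = program_change(0, 40)
```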
Now of course nobody outside of fringe artists really uses it.
It calls to mind the old joke about how someone wrote a compressor that turns Microsoft Word from a 20MB file into a 1 byte file, except the compressor is 20MB. (Adjust the file name and size until it's funny. When I first heard it, 20MB was an extraordinarily large size.)
But in this case you could imagine the right balance where it does end up with a significant savings.
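The balance is just amortization: a huge shared decoder pays off once its fixed cost is spread over enough tracks. A back-of-the-envelope sketch (every number below is made up for illustration):

```python
import math

def break_even_tracks(decoder_mb: float, codec_kb_per_track: float,
                      baseline_kb_per_track: float) -> int:
    """Smallest library size at which shipping one big shared decoder
    plus tiny per-track payloads beats a conventional codec.

    Total cost of the new scheme: decoder_mb + n * codec_kb_per_track.
    Total cost of the baseline:   n * baseline_kb_per_track.
    Break-even: n >= decoder size / per-track saving.
    """
    saving_kb = baseline_kb_per_track - codec_kb_per_track
    if saving_kb <= 0:
        raise ValueError("new codec must save space per track")
    return math.ceil(decoder_mb * 1024 / saving_kb)

# e.g. a 500 MB learned decoder, 50 KB/track vs ~4 MB/track MP3s:
n = break_even_tracks(500, 50, 4096)  # pays off after ~127 tracks
```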
Would anything approaching the typical bitrates of audio codecs imply an enormous dictionary? I also wonder whether any statement can be made about the learnability of codecs, e.g., are Fourier transforms something deep networks can arrive at?
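On the learnability question, one small observation: the discrete Fourier transform is a fixed linear map, so a single dense layer can represent it exactly; whether gradient descent actually finds it from data is a separate empirical question. A sketch, assuming numpy is available:

```python
import numpy as np

def dft_matrix(n: int) -> np.ndarray:
    """The n x n DFT matrix: W[j, k] = exp(-2*pi*i*j*k / n).

    A linear layer with these (complex) weights computes the DFT,
    so the transform is at least *representable* by a network.
    """
    j, k = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    return np.exp(-2j * np.pi * j * k / n)

n = 64
x = np.random.default_rng(0).standard_normal(n)
# Applying the "layer weights" matches the FFT exactly (up to float error).
assert np.allclose(dft_matrix(n) @ x, np.fft.fft(x))
```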