Sorry for the cliché, but "the most exciting phrase to hear in science, the one that heralds the most discoveries, is not "Eureka!" (I found it!) but 'That's funny...'" --Asimov
You might be missing the point somewhat. With these methods, you could eg. test the vast swaths of potentially overlooked composers to see if any of them merit closer listening.
Very highly unlikely. The method in the paper isn’t measuring quality or novelty or authenticity or listenability or anything useful for evaluating composers without listening to them. It’s measuring the compressibility of MIDI files. We already know that Bach is less compressible than Philip Glass, and more compressible that Charles Ives. The methods in this paper cannot tell you if a composer is boring or derivative, nor whether they’re fresh and innovative for their time. They also can’t tell you anything about a performance. I mean go ahead and try, I’m all for experimenting, but I predict that trying to apply this paper to looking for overlooked composers will be an exercise in sifting through noise, more effort than searching manually, and spending time writing code instead of listening to good music.
I think (have never tried) that analyzing the harmony and rhythm (can you quantify syncopation?) you’d have a good start at determining if a song is worth listening to.
First, the methods of the paper don’t have to be a Mendelssohn replacement to be useful. Second, if you don’t like that potential application, consider all the other predictive models that could benefit from these features.
And likewise, you might be missing the point. This paper doesn’t really seem to add any useful knowledge to the corpus, and its methods are extremely unlikely to be useful at all for any of the purposes you are suggesting or imagining. We’ve already had gzip for a long time, and we already know it does not make a good predictive model for anything except storage space.
Like I would totally agree that there’s value in predictive models. I just don’t think the work we’re commenting on is one of those, nor headed toward making one.
My point is just that proving known facts can be useful and interesting.
As for the paper, network entropy and node heterogeneity seem to be perfectly sensible statistical concepts, and encode useful information. They also dovetail conveniently with powerful tools in machine learning. Criticizing this paper for lack of potential applications feels unreasonable.
You’re totally right in the abstract, those statements in your comment about concepts and tools are true if not tautological. What’s missing is that this paper provides no useful information about music or composers, and not does not prove anything nor demonstrate anything not already known and/or proven. It’s not a viable path to discriminating the quality of musical compositions, and I think we can prove that (I already suggested known counter-examples).
I didn’t argue with your information-free tautologies about tools, I’m pointing out that they are straw man when it comes to using entropy to identify the quality level of music.
> network entropy doesn’t need to identify the quality level of music on it’s own to be useful.
Okay. Another context-free tautology. So what are we even talking about then, what is your point? You offered above “you could eg. test the vast swaths of potentially overlooked composers to see if any of them merit closer listening.” Are you taking back that suggestion?
Feel free to offer something - anything - more specific on how the entropy can provide useful information about music. What uses are you envisioning? What other metrics in combination with entropy are you thinking of?
What I don’t see in your argument is a single specific reason the specific paper we’re commenting on has value, and what that value is. You’re suggesting that someone else doing something else might someday uncover usefulness or applications, and maybe it will build on this paper. That could happen, and yet measuring entropy is already a well known idea, and the applications to compression have been well explored already, and we can demonstrate that entropy of music has no correlation with quality, therefore the probability of what you suggest actually happening still seems rather low, and the discussion doesn’t seem to be improving the odds.
You said I was using tautologies as straw men, which is incoherent and suggests you’re not arguing in good faith.
Anyhow, of course entropy correlates to music quality; maximum entropy music is white noise! I’ve even had luck finding interesting jazz musicians from the distribution of key signatures they use—- anything more entropic than the Real Book is a great indicator. Similarly, network entropy makes it easier to identify musicians with a flexible arsenal of riffs. You could adapt it to chord progressions to find unusual reharmonizations in live jazz to study and practice. It could be a helpful regularizer for neural network music generation. Entropic methods are among the most powerful in statistics.
It looks like we've hit a reply depth limit, which is maybe for the best, because I don't think we're making any progress here.
> You did use tautologies...
You seem to think calling something a tautology is a way to dismiss it. Almost everything in mathematics is a tautology-- most of what I say is a tautology. Any rigorous argument is tautological; it's the aspiration of literally all formal reasoning.
> Lots of uninteresting and bad music is also entropic.
And here, you seem to think someone is claiming that entropy is equivalent to music quality, not just a useful correlate or eg. indicator of something that might be more likely to show up in good music than bad music. I don't know of anyone making that claim; all the examples I gave require mild correlation.
You did use tautologies, and they are right there above and still irrelevant, and thus straw man arguments in the context of the question what useful information is this specific paper contributing to the corpus of knowledge. The irony of flinging bad faith accusations and ad-hominem when trying to distract from the failure to have a relevant argument isn’t lost on me though.
As you point out, white noise is more entropic than the Real Book. Lots of uninteresting and bad music is also entropic. Why exactly is that a good indicator? I’m glad you finally have some examples, but this doesn’t demonstrate that entropy is a decent discriminator of anything.
Nb. Plato claims Solon found written records of 'Atlantis' on a visit to Egypt, around 600 BC. That seems consistent with the Santorini hypothesis and Egyptian historians.
We know that Minoans were actively trading to Egypt by around 1600-1400BCE before they get suddenly replaced by Mycenaean Greeks, so there seems to be some support for that hypothesis...
Wikipedia says "Elements is the oldest extant large-scale deductive treatment of mathematics."
There is a category for which we can count it as "the oldest"
The only older "important publications in math" Wikiepedia has are
* "Moscow Mathematical Papyrus" from 1850 BC. This is referred to as a manuscript rather than a "text" though.
* "Baudhayana Sulba Sutra" from 8th century BC. Which by all accounts seems to be actually a mass produced text with large reach. But also seems like these general don't contain proofs. and are more like a reference book than an explanation book.
So Elements is several hundred years behind the competition, but for a journalist to simplify "Oldest major math book that made an attempt at explaining the core fundamentals of how math works instead of just giving equations and examples" down to "oldest math text" feels fair enough.
As far as I can see, any older texts basically fall under "here is how the pythagorean theorem works" and not "here are the intrinsic laws that explain how math works, and thus why/how the pythagorean theorem works". The title is true for some definitions of "math text".
I think paleographers think of the text as an abstract object and manuscripts as approximations of it. E.g. you might have a couple hundred manuscripts (and other things like printings, etc) of Aristotle’s Physics, but the text is what you get after you identify and try to correct scribal errors.
whereas I think in common usage text is anything written and manuscript is a subset of document.
Although manuscript in pre-computer days generally meant something handwritten, nowadays I guess you can turn in a manuscript that was written on the computer so not exactly the same.
Veritasium has made a video defending their use of clickbait. They claim that the content is good enough that it justifies of misleading titles, and the titles are necessary to draw viewers.
I assumed so from the title and almost didn't check out the video when YouTube recommended it yesterday. Really glad I still did. As usual from Veritasium the video included a lot of interesting historical context that I had no idea about.