That's assuming it is actual people reading it, and not just click fraud. Amazon has been battling that for years, books filled with literal gibberish reach the top few hundred titles in a category and the only reasonable explanation is that some group(s) are gaming the system by buying/phishing Kindle Unlimited subscriptions then skimming through their own "books" to turn a profit.
Using LLMs to make the books vaguely coherent instead of meaningless word salad adds a layer of plausible deniability which might make Amazon slower to notice what's going on.
It seems like a kind of strange strategy for click fraud. A KU subscription is $10, Amazon takes a 30% cut, and for $700 a month you would need 100 KU bots with unique accounts and payment info.
I don't think this "book" filled the pages with loren ipsum, but it is entirely possible that they fed all of the available news stories into a language model such that someone actually sat there page flipping through chapters of decorated text.
But ultimately I think there is a stark difference between someone who actually paid $7 for an ebook, and someone who downloaded something on Kindle Unlimited and flipped through it. I am curious how KU influences the best seller charts numerically, which I can't find a clarification of.
How hard would it be to write a script that downloads books and "reads" them -- i.e. exercises the Kindle app (say on a desktop-alike) -- giving Amazon plausible excuse to say "n words read"?
Point being: if there's a way to game a system, people will game that system.
> I suspect that anything that rises to the best-seller level gets examined by Amazon very closely.
I think you’re vastly overestimating how much Amazon cares.
See their physical goods store: it’s also a race to the bottom with zero oversight whatsoever (commingling, counterfeits, used/returned/dirty items sold as new…) I don’t see why the virtual books store would be any different.
Amazon wrote the proverbial book on enshitification/race to the bottom :(
It's based on the number of words read, not minutes.
It can't be pure junk, or people wouldn't be reading enough of it to make it climb the charts.