Yeah, I agree. I think that what3words hasn't spent enough effort on this, or perhaps is suffering from trying to cram everywhere into 3 words, which means the wordlist needs to be unmanageably large.
Even for my attempt at the problem, I did various experiments on the word list, but an ideal attempt would check for similarity across common accents, etc and I certainly wasn't able to do that.
Having said that, I think it's a valid and realistic goal for good word encoder systems to aim for good roundtripability via voice or memory.
The problem with any system like this is that for it to be useful you can't change it after its launched. Any problems with the word list or location allocation are permanently baked in.
That's not necessarily true. You can make sure your decoding system understands both the new wordlist and the old wordlist but only gives you new encodings based on the new wordlist.
English has suffered the Great Vowel Shift, and has a wide variety of accents. It's a language that has the Spelling Bee, -teen/-ty numbers, -ough suffix, and "ghoti". There are many languages with much more regular, phonetic spelling and smaller variation of accents.
> It's a language that has the Spelling Bee, -teen/-ty numbers, -ough suffix, and "ghoti".
For the second point, -teen/-ty numbers sound different, are spelled differently, and mean different things. How are they supposed to support your point?
For the fourth, it's just false; "ghoti" in the pronunciation /fɪʃ/ does not come close to being valid written English. There is no such thing as syllable-initial "gh" /f/ or syllable-final "ti" /ʃ/.
The -ough suffix is a real case of one sound diverging into two sounds, but that is obviously not relevant to the problem of determining, from the sound of a word, which word you just heard. It comes up in the opposite problem of determining how to pronounce a word from the spelling, which we aren't talking about here.
The spelling bee is a cultural artifact; every language whose writing system is not extremely recent exhibits the phenomenon that the spelling of a word cannot be predicted from its sound. (In China, where spelling is much, much tougher, they don't have spelling bees. They do have traditional dictation exercises.)
Some words may also be difficult to pronounce/hear/spell by non-native speakers. Unlike regular sentences, there's no context to disambiguate.