Providing the source would be a step toward better transparency and reproducibility (the text does not give enough detail for even someone working in the field to reproduce what they did and arrive at the same results); however, the more crucial thing is the data. Switchboard, Fisher, and WSJ are available (provided you have a few grand to spend), but they say they collected 5000 h of read speech from 9600 speakers. That's a huge effort!