I'm running a pilot project with my former university faculty as a client. I've been told that datasets are a huge problem in terms of man-hours.
Disregarding data that comes from sensors and the like, how much time is spent collecting data from other sources?
My current test case is patent applications. I want to get out of the echo chamber a bit and get some external feedback.
I know how to automate data collection for cases like that, and I was wondering what other pain points might be out there.
Thanks!