Don't forget the graphic design and HCI parts, and definitely do not underestimate them. I often find that people with a scientific/technical background find it hard to visualise and present data well because of their background - 'I love lots of information and data; therefore, everyone else must love lots of information and data'. Wrong. You can number crunch all day long, but if you can't translate the output into something that people can understand (i.e. turn information into knowledge) then there is no point.
What curriculum are you pursuing to become a data scientist? Is it an actual degree program, are you cobbling together your own set of courses, or is it a degree in one of CS/Stats/Visual Design with electives in the others?
I have decided to do a masters degree in computational biology (bioinformatics).
There are a few masters courses that could lead you to a career in data science, but it depends on your background. Most of the courses I found were specialisations, e.g. a masters in computer vision or HCI for those with a computer science background, or a masters in digital media or design and technology for those with an arts background. I had neither, so I thought I might have to do a conversion course into computer science or art and then specialise. However, when I read the article above, I realised I was more on the design side anyway (happier in Photoshop and Illustrator than a text editor) and that I needed to learn the computing and statistics side. This brought me to bioinformatics since I already have a background in biochemistry. The course will teach me programming, data mining and statistics. I also get to do three projects, which I am going to do in visualisation of life science data - which is why I got interested in this topic in the first place.
I think undergraduates have a greater opportunity to 'cobble' a course together tailored towards data science.
I found it useful to do a bit of research on anyone who does a visualisation that you like and see what their background is.
I'm coming at it as someone who was doing NLP a decade ago, and recently came back to it by taking courses part time here at Carnegie Mellon (where I am employed). So I have courses under my belt now covering the intersection of machine learning and NLP (which is pretty much all of NLP these days) and planning to take the Masters Machine Learning course this Fall.
The part I would need to add next, I guess, is the data visualization part.
For the large scale data part, there is this program:
Looks like an interesting course. And you couldn't be in a better place for it. My second choice of course was a masters in information science, which looks very similar to the vlis one.
I think data science is a very broad subject area that covers the science of organising information (information science and vlis), right through to information visualisation.
Information science would have given me a good grounding in the skills I need for my start-up (which is involved in organising research information), but I thought it would be best to build on my expertise and interests (design, HCI, UX, psychology) and team up with someone who knows a lot more about the technical side of building information systems - more than I could ever learn from a one year masters.
It sounds like we are starting at opposite ends of the data/information science spectrum; it will be interesting to see if we meet in the middle after our masters.
You will also find that a lot of the top data scientists/information visualisers actually come from an artistic background, e.g. http://www.visualcomplexity.com/vc/ and http://www.vimeo.com/5091290
The original article inspired me to go back to university so that I can become a data scientist. Can't wait.