Analyze Your HN Posts with Watson User Modeling

Kronopath · on Oct 8, 2014

It's worth keeping in mind that with things like these you get what you put in.

When the service first came up on the front page, I punched in some personal writings from my journal into it. Comparing my results from that to this HN assessment is like night and day. The HN assessment shows me as highly intellectual, imaginative, and adventurous, with a strong value for self-enhancement. The personal stuff showed me as being much more emotional, with a huge emotional range, because—surprise, surprise—I tend to write more about emotional parts of my life in my journal than on HN comments.

While this service might be useful to get a broad sense of people for marketing purposes, using it on an individual basis is like talking to a fortune teller—it could tell you nearly anything about yourself, and you'd be able to come up with an explanation to justify it.

eksith · on Oct 8, 2014

Exactly. We adopt a different tone based on the venue and, since no one shows the same face everywhere, even with the same name, these are going to be wildly different.

Ironically, this really did feel like a machine based cold-reading.

jcomis · on Oct 8, 2014

Seems to be down for me: 404 Not Found: Requested route ('hn.mybluemix.net') does not exist.

Jonovono · on Oct 8, 2014

Ah, it's just hosted with IBM right now so they must limit.

meifun · on Oct 8, 2014

it is going up and down, it seems. I am getting results though

aameek · on Oct 9, 2014

To clarify, hn.mybluemix.net is not an IBM app. A HN user built that app and it appears to crash often

aselzer · on Oct 8, 2014

Did this with Twitter: twurl "/1.1/statuses/user_timeline.json?count=200" | jq -r ".[] | .text" | pbcopy

paste it into http://watson-um-demo.mybluemix.net/demo

87% Openness, 5% Agreeableness, that's funny.

Tiksi · on Oct 9, 2014

This inspired me to do the same with reddit, so I threw together this function in zsh (should work with bash too):

        function redcom(){ ! [ -z "$2" ] && i="&after=$2" || i=""; data=$(curl -s "https://www.reddit.com/user/${1}/comments.json?count=100${i}"); j=$(jq -e -r '.data["after"]' <<<$data); echo $j; (jq -e -r '.data["children"][]["data"]["body"]' <<<$data)>>${1}-redcom.txt; echo $(echo -e $data|wc -l) lines; ( [ "$j" = "null" ] || [ -z "$j" ] ) || redcom $1 $j ;} ;

It's a bit more complicated due to the reddit api, but if you run that, then run redcom <user> and it should throw all your comments into <user>-redcom.txt

Never user/heard of jq before, it's a pretty nice tool.

tim333 · on Oct 8, 2014

I got 92% Openness, 10% Agreeableness on tim333, HN

Maybe they all come out a bit like that?

minimaxir · on Oct 8, 2014

Is there any insight as to how this works? I got an Openness rating of 97% and a Harmony rating of 100%, both of which I know are not true. (I also received a Love rating of 1% under my Needs, although that's pretty accurate.)

jonnathanson · on Oct 8, 2014

I, too, got a "Love" rating in the low single digits. That's probably not too far from the truth in a global sense, but remember that in this case, the API is working with a local context (HN comments). I'm probably unlikely to have discussed the subject of love on HN, or anything tangentially related to it, or anything that would somehow give an indication of my need for love. Now, if you were to run my Facebook history through the same process, I imagine you'd find a slightly different analysis.

To some extent, our personalities are our personalities. We behave, on some level, the same in every context and in every community. But the extent to which that's the case is up for debate. We probably use different approaches, or if you prefer, we show different aspects of our personalities, in different contexts and in front of different groups. That's why it's extremely difficult to take one context (HN, for instance), and extrapolate universal characteristics from it.

Even within the set of HN history, I got some oddball results. For instance, Watson considers me very "Fiery" (51%) here. There are plenty of areas in my life in which the word "Fiery" makes a bit of sense. HN isn't one of them.

bane · on Oct 9, 2014

A problem I've observed with these kind of blackbox systems is that the process from input to output really is a mystery.

When the results are right, they're just "right" so you should accept them, when they're wrong they're actually also right by whatever magical hamster wheel is operating inside of the thing and you just don't "get it".

The problem is that humans like to have some clue as to how the results were derived, something easy to explain that gets the gist across. Something like "Watson counted all the words you use and compared them to different reference lexicons to arrive at the score". This provides a little bit of context so we understand the semantics of the result and how to consider them and reason with them.

But for all we know the results we're seeing are from some arbitrary stochastic method:

openness=rand(90,99) harmony=rand(90,100)

etc.

For things like this to be accepted by the users (humans) there needs to be a quick explanation for how this works otherwise we get head scratchers.

keelyw · on Oct 9, 2014

Please see my other 2 responses in this thread for some insight. I think I posted them about the same time you posted this.

keelyw · on Oct 9, 2014

We plan to add more information to our docs soon about the service, including a description of each of the traits, and possibly reference some of the many data sources used.

Meanwhile, you can do a search on "IBM System U" (the project's not-so-internal code name.) This particular slideshare.net prez has some great info on the methodology, validation tests and references: http://slidesha.re/1ri0vPV

Jonovono · on Oct 8, 2014

It's using this Watson API: http://www.ibm.com/smarterplanet/us/en/ibmwatson/developercl...

and using the sample Node project to show the results. I am still exploring it myself :p

aroch · on Oct 8, 2014

I also get rather high Openess and harmony scores, though I'm particularly amused that Hedonism is scored. How can I be 72% sympathetic and 47% coopoerative but not agreeable?

The IBM documentation doesn't really say anything about how these numbers are calculated

waterlesscloud · on Oct 8, 2014

I don't even have scores for stability and practicality. Hmmm.

ultimoo · on Oct 8, 2014

This is very cool. Although I don't know how accurate this is, if calibrated and tuned to yield a certain degree of accuracy it will have a variety of use cases.

For example -- When interviewing someone, being able to run their github username (if known of course) to analyze their commit messages, comments, discussions. Or even their hn, reddit, twitter user names (if the usernames are linked with their first names, nothing creepy). It will potentially help to identify candidates that are downright rude, arrogant etc.

Or analyze internal mailing lists, hipchat/slack channels for co workers who are potentially burnt out.

smtddr · on Oct 8, 2014

>>It will potentially help to identify candidates that are downright rude, arrogant etc.

This sounds very dangerous to me. I assume when recruiters and/or lead engineers decide to reach out to me via LinkedIn, they did their homework on me. I purposely link to enough stuff for them to realize "smtddr" is my handle. The same blog & youtube channel in my HN profile is also in my LinkedIn. But I expect a human to look, not some computer judging me. I can totally see people getting lazy and just doing stuff like only filtering for people who rate 80% on openness or something. Then everyone will start grooming their posts simply to get positive results... then someone will create a social website that claims to block those scanners so people can say whatever they want.

It just forces people underground and the filters won't work anymore since at that point you might as well assume everyone is gaming the system.

(fwiw, I'm also against standardized tests. Anything that forces a whole group of people to start grooming themselves for a very specific measurement kills diversity, imho. Since the very term "standardized" kinda goes against the concept of diverse... and people become lazy and just rely on such tests to make or break the deal)

htns · on Oct 8, 2014

You can test the accuracy yourself: http://www.outofservice.com/bigfive/ . It's a good idea to take that test before looking at OP's results.

meifun · on Oct 8, 2014

But can you really judge a person based upon commits them might make?

You can read the words they wrote but you cannot perceive the tone they wrote it in....I dont think this is helpful and may dismiss candidates that are otherwise perfect except when Watson analyzes them.

flatline · on Oct 8, 2014

Would love to see something like this for reddit, where I'm a more active poster on a wider variety of issues.

Tiksi · on Oct 9, 2014

Another commenter mentioned http://watson-um-demo.mybluemix.net/demo above, so I threw this together in bash for reddit comments. Bit more of a pain that entering a username, but if you take this:

        function redcom(){ ! [ -z "$2" ] && i="&after=$2" || i=""; data=$(curl -s "https://www.reddit.com/user/${1}/comments.json?count=100${i}"); j=$(jq -e -r '.data["after"]' <<<$data); echo $j; (jq -e -r '.data["children"][]["data"]["body"]' <<<$data)>>${1}-redcom.txt; echo $(echo -e $data|wc -l) lines; ( [ "$j" = "null" ] || [ -z "$j" ] ) || redcom $1 $j ;} ;

run it in bash, then run redcom <user>, it should throw all your comments into <user>-redcom.txt which you can then copy/paste to http://watson-um-demo.mybluemix.net/demo

ljk · on Oct 8, 2014

something like this?

http://www.redditinvestigator.com/

Jonovono · on Oct 8, 2014

Interesting! I like the fun guessed data. I am from Canada, game of choice is ping pong, nor do I have children.

"""

Probably from: Canada

Support OWS: Probably no or doesn't care.

Children: I do not think so...

Gamer: Only pong probably...

Like trees: Must be in a really good mood.

Behavior: Candidate as replace for Good Guy Greg

"""

Jonovono · on Oct 8, 2014

For those that like this. There is a similar thing I saw awhile on here (I am guessing) that analyzes facebook posts and compares you:

http://labs.five.com/

EGreg · on Oct 8, 2014

Gregariousness 8%

Given my name I found that funny Also I know it's wildly inaccurate on several characteristics, but maybe my persona here is like that! Interesting.

waterlesscloud · on Oct 8, 2014

Yeah, I think it's worth keeping in mind it's scoring you based on your comments on a particular kind of site which tends towards certain kinds of interactions.

thegeomaster · on Oct 8, 2014

Does Watson internally use observations drawn from this study[1]?

And another question---is this applicable to non-native English speakers? Do they acquire the same language habits as if English was their mother tongue?

[1]: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3783449/

ColinWright · on Oct 9, 2014

Now I'm consistently getting this:

    502 Bad Gateway: Registered endpoint
                     failed to handle the
                     request.

So, still not working for me.

Edit: In fact, entering someone else's username worked, and another, but mine continues to fail. Is it just me?

aameek · on Oct 9, 2014

I believe you are talking about the hn.mybluemix.net app. That is not an IBM app - a HN user put it together. It appears to be crash often

ColinWright · on Oct 10, 2014

Well, yes, because that's the link in this submission.

Houshalter · on Oct 8, 2014

If you picked random English words and put random numbers next to them, I wouldn't be able to tell.

Sven7 · on Oct 9, 2014

But that's how the stock market works...

chuckcode · on Oct 8, 2014

Not a lot of documentation on the IBM page about what the characteristics mean or how they learn a mapping from text to these categories.

Curious to know what the average hacker news scores look like? I'm imagining it is a pretty small segment of "normal" society.

keelyw · on Oct 9, 2014

See my response to minimaxir above about additional documentation coming soon, and a link with good background info on the technology. Meanwhile, here are brief descriptions of the Big 5 Personality traits: Big 5 Personality: - Openness - associated with curiosity, intellect, and an appreciation for art and adventure - Conscientiousness - associated with organization and industriousness - Extraversion - associated with positive and outgoing attitudes toward other people - Agreeableness - associated with compassion and cooperation toward other people - Emotional Range - associated with a sensitivity to negative emotions

For more information on systematic associations between personality and individual differences in word use, please refer to studies like Tal Yarkoni, "Personality in 100,000 words: A Large scale analysis of personality and word use among bloggers", 2010

ommunist · on Oct 8, 2014

It worked from the UK IP for me, but not from the US one. Anyway - returned 502 after request.

xchaotic · on Oct 8, 2014

I bet Watson now thinks you like UKIP. And now me, so meta.

ommunist · on Oct 10, 2014

I cheated it from Russia. UKIP is not my political fav.

meifun · on Oct 8, 2014

Interesting, I analyzed myself and patio11.

any insight on the tech stack and how it was implemented?

elyrly · on Oct 8, 2014

It would be interesting to see the code.

Jonovono · on Oct 8, 2014

I'll throw it on Github right away. I basically just used IBMs Node sample code which had it all there I just hooked it up to the HN api :p. https://www.ng.bluemix.net/docs/#starters/nodejs/index.html#...

hoopism · on Oct 8, 2014

100% Challenge?

Not sure what that means... but maybe this post is contributing to it?

spindritf · on Oct 8, 2014

Conscientiousness 23%

Ouch! How did it know?

Some are completely off, with plenty of 1% cop-outs but a few — spot on. What's the methodology? What does the need for practicality mean for example?

throwaway344 · on Oct 8, 2014

I would be curious to see if some of these characteristics are correlated with higher average karma/total karma.

ColinWright · on Oct 8, 2014

Hmm.

    404 Not Found:
      Requested route ('hn.mybluemix.net')
        does not exist.

xchaotic · on Oct 8, 2014

Shall we rename slashdot effect?

dwd · on Oct 8, 2014

I wonder how well it detects sarcasm?

arikrak · on Oct 8, 2014

I analyzed my HN posts with the Watson API on an Oculus Rift and the singularity happened.