There's a GREAT book out that demonstrates collecting and analyzing data like this, except it's in python. The concepts are pretty straightforward and very portable though. Check out "Programming collective intelligence by Toby Segaran." http://www.amazon.com/s/ref=nb_sb_ss_i_ ... Caps%2C222
. I don't like his naming conventions in his code (single and two letter identifiers that mean nothing and cloud the purpose) , and there are a couple of typos that stop example code from working if you follow directly in the book, but most errata are documented on the official site, and working example code. Beyond those two caveats though, this is a GREAT book, chock full of huge dataset analytic goodness.