Episode 4: Book Review (Part I): Dataclysm: Who We Are (When We Think No One's Looking) by Christian Rudder
This week I read the book, “Dataclysm - Who We Are (When We Think No One’s Looking)” [1] by Christian Rudder, and this is everything I want to remember from it.
[Background: Christian Rudder is the co-founder of the online dating site OkCupid. He got himself into a mess when he said he used data from his site to experiment on human beings. Earlier, he started a blog [2] to share his reflections from the data collected by OkCupid, which later turned into a book [1]. Rudder grew up in Arkansas, and like other co-founders of OkCupid, he was a math major at Harvard. In his book he explains some useful mathematical and data science concepts, using highly inappropriate examples from the world of online dating.]
Concepts:
- What is variance?
The author explains variance with the help of an unusual example, one that you would probably not use in a classroom, but you can always replace it with a more appropriate scenario that uses the same underlying theory.
The graph below shows how men rated women based on their attractiveness, with 1 and 5 being the score assigned to the least and most attractive women, respectively.
Take the group of women who get the average score of 3.0. The author lists five scenarios describing how men would have voted. The vote patterns become more polarized from A to E, but they all average out to give 3.0 as the final score. Variance is “how widely data is scattered around a central value. It goes up the further the data points fall from the average.” Variance is often used by investors to decide between two companies that return the same amount of profits in the same period of time. The one that has lower variance (and hence, “fewer heart palpitations”) is considered to be a more stable investment.
-
What is the famous “Ballou Letter”?
Major Sullivan Ballou fought on the Potomac under the Union Army in mid-19th century. He wrote a heartfelt farewell letter to his wife a week before he died. The letter was never mailed and was recovered from the battlefield by the Governor of Rhode Island when he visited Virginia. There is also a movie “The Civil War” by Ken Burns, in which the letter is read out loud by one of the characters. The author writes that, the Ballou letter has become so famous that when it searched “famous letter” on Google, it shows up second in the list (though Google doesn’t do the same for me!) [3] -
What is culturomics?
It’s the study of tracking changes in the usage of words through time. The term was first used by the researchers Jean Baptiste Michel and Erez Lieberman Aiden from Harvard and other researchers from MIT, in their paper “Quantitative Analysis of Culture Using Millions of Digitized books.” [4] Rudder writes, “Once language and data come together, it’s that extra dimension, time, that’s so compelling…Google books is working to repair our historical blind spot: in collaboration with libraries around the world, they have digitized 30 million unique books, great and small, and, true to their expertise, they have made the whole searchable. The body of data has created a new field of quantitative cultural studies called culturomics.” The author show a chart that he calls “Pizza Now, Pizza Forever”
-
How have smartphones affected the average message length?
This one in particular made me terribly sad. The data from OkCupid shows that the best messages that get the highest response rate have decreased in length drastically over the years. Nowadays, a good message is considered to be only 40-60 letters long. I already feel misplaced in this time and age, as I am used to sending painstakingly long, hand-written letters and emails to my friends. Seeing these figures makes me wonder if they even read my letters at all.
-
Favorite quote from the book:
“…Writing, like life itself, abides. It changes form, it replicates in odd ways, it finds unexpected niches…it even, like anything alive, occasionaly stinks. But realize this: we are living through writing’s Cambrian explosion, not its mass extinction. Language is more varied than ever before, even if some of it is directly copied from the clipboard – variety is the preservation of an art, not a threat to it. From the high-flown language of literary fiction to the simple, even misspelled, status update, through all this writing runs a common purpose. Whether a friend to friend, stranger to stranger, lober to lover, or author to reader, we use words to connect. And as long as there is a person bored, excited, enraged, transported, in love, curious, or missing his home and afraid for his future, he’ll be writing about it.”
Thought of the Week:
This is how the past few days have looked like and its getting busier.
But one thing that helps me in dealing with all that chaos in my life is imagining myself next to an assembly line, with tasks, big and small, simply whizzing by and all I need to do is simply lift them one or two at a time and get the work done. There are, of course, a number of caveats to this type of sunny-side-up way of thinking and it’s useful to be mindeful of them:
- All the time you spend worrying about how little time you have, is all the time you waste not doing the work.
- Often the only thing that’s stopping us from working is not knowing where to start. That is what stops us from getting up early every day and charging into the day to get the actual work done.
- We counter our anxieties by scrolling through our news feeds, binge-watching TV shows and feeling guilty afterwards, staying up late for no good reason, and then running amok, announcing to the world how little time we have. We all do it, no one’s immune to it but what eventually saves us is having an ideal baseline that we can return to, one that allows us to live up to our highest potential. It’s very important to have that ideal baseline.
See you next week!
References:
[1] Dataclysm
[2] OkCupid Blog
[3] Sullivan Ballou Letter
[4] Culturomics