Twitter analysis reveals global human moodiness
Cornell University social scientists use Twitter and Hadoop to study human behavior
IDG News Service - Twitter, Facebook and other social media sites are often criticized for encouraging people to share thoughts of little consequence, though social scientists are finding these electronic missives, when assembled en masse and analyzed with big data tools, can offer a wealth of new information about how people think and act.
A pair of researchers from Cornell University are the latest to mine social networks for such academic insight. Scott Golder and Michael Macy analyzed 509 million Twitter messages emitted over a period of two years by 2.4 million users across 84 different countries. From this data, they have gleaned that people have the same daily cycle of moods, regardless of their culture or language.
A paper summarizing the work, "Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures," is in Thursday's issue of the journal Science.
Beyond the immediate results, the work points to a possible new path in academic research, that of mining social networks for academic insight, the researchers said.
"The paper illustrates the opportunities for doing social and behavioral science in a new way," Macy said. "The growing tendency for human beings all over the globe to interact with one another using digital devices opens opportunities for research that were unimaginable even five years ago."
"Far from seeing conversations as mundane and useless, we see value in the fact that they are real-time, time-stamped messages produced by people on the spot for sharing with their friends," Golder added.
The researchers used Twitter's API (application programming interface) to gather the messages. They set up a six-node cluster to extract the data, which arrived packaged in XML, and converted the results into flat files. They then used a 55-node Hadoop cluster, running at the Cornell University Center for Advanced Computing, to analyze the dataset.
The analysis tool they used, Linguistic Inquiry and Word Count, links specific words to various positive and negative moods. Messages that might include words such as "awesome," "fantastic" or "pretty" could indicate a positive state, whereas words like "remorse," "abandonment," "fear" or "fury" indicate a negative state of some sort.
The results showed people tend to be more chirpy in the morning and during weekends. The messages revealed that they wake up happy and slowly grow more disgruntled and sour as the day goes on, though their affect usually rebounds in the evening. This behavior happens on both weekdays and weekends, though the weekend tweets usually start about two hours later than the weekday ones, perhaps because people are sleeping in.
Even in countries where the weekend is not Saturday and Sunday (in the United Arab Emirates, for instance, people work Sunday through Thursday), these patterns were evident.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Thinking Outside The Data Warehouse
- This high level, business problem focused eBook uses 5 customer scenarios to show how people and organizations are tackling real issues using IBM...
- Using BD for Smarter Decision Making
- This paper looks at new developments in business analytics and discusses the benefits analyzing big data bring to the business.
- Measuring the Business Value of CI in the Data Center
- One of the key strategies that IT teams are pursuing to reduce capital costs while boosting asset utilization and employee productivity is the...
- Switching Schedulers - Not As Complicated As You Think
- Changing or consolidating job schedulers may seem daunting. However, the benefits of switching to enterprise workload automation outweigh the risks. Read how BMC...
- Capture-Enabled Business Process Management
- Organizations today must deal with a vast amount of incoming information from many different sources. Efficient, automated business processes are critical to managing... All BI and Analytics White Papers
- Live Webcast
Data Privacy and Protection in Production Environments: New Research from Ponemon Institute - Date: Wednesday, June 13, 2012, 1:00 PM EDT / 10:00 AM PDT
In a recent study conducted by Ponemon Institute, fifty-five percent of respondents... - Live Webcast
A Geek's Guide to Presenting to Business People - Live Webcast: Wednesday, June 20th at 1:00 PM EDT
Join this live webinar with Paul Glen, author of Leading Geeks, to learn how to... - Live Webcast
Today's NAS: A Solution Beyond Old Limits - Date: Tuesday, July 17, 2012 2:00 PM EDT
Traditional NAS systems don't scale beyond fixed limits. Proliferation of NAS systems leads to management... - InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
- BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's... All BI and Analytics Webcasts