Twitter Used To Map Happiness In New York
Written by Janet Swift   
Monday, 26 August 2013

Central Park is the happiest spot in New York City according to research that classified over six hundred thousand tweets in order to map people's mood to their time and location.

Researchers at the New England Complex Systems Institute (NECSI) used Twitter data to generate a sentiment map of New York City that provides a time-sensitive and geographically specific analysis of public mood. The data revealed that public mood is generally highest in public parks and lowest at transportation hubs.

Over the course of two weeks in April 2012, the research team supervised by Professor Yaneer Bar-Yam, Founding President of NECSI collected 603,954 tweets via the Twitter API restricted to those which were tagged with geocoordinates around the immediate New York metropolitan area.

Using tweets that contained the following emoticons, the researchers built two classifiers for positive and negative tweets. That is the presence of an emoticon was used to determine if the tweet was positive or negative and these were used to create classifiers using the text. The classifiers could then classify tweets that didn't have emoticons. 

nycemoticons

Then, for each tweet in the full set, URLs and usernames were removed, the text was tokenized and assigned a sentiment score based on the classifiers. Combining the sentiment ratings with geotags resulted in a public sentiment map for the New York City metropolitan area in which cyan represents the most positive sentiment and magenta the most negative. White represents areas with insufficient tweet density for analysis.

 

(click for larger version)

Spatial analysis of the tweets shows that sentiment progressively improves with proximity to Times Square:

nycdist

Periodic patterns of sentiment were also revealed with fluctuations on both a daily and a weekly scale: more positive tweets are posted on weekends than on weekdays, with a daily peak in sentiment around midnight and a low point between 9:00 a.m. and noon:

nyctime

Due to the use of geotagging, the researchers were able to locate specific areas of extreme sentiment - apart from parks and transportation hubs they included cemeteries, medical centers, a jail, and a sewage facility. In the part of the map that shows Manhattan, Central Park (A1) and Highland Park (A9) stand out as positive; Penn Sation (B4) and Brooklyn Bridge (B7) are negative as is Riker's island (D1), New York City's main jail complex. The report also notes:

"One area with markedly negative sentiment is Maspeth Creek in Brooklyn (E1). While its geographic features are unremarkable, this area is one of the most polluted urban water bodies in the country."

and it goes into graphic details about this site so that you are likely to imagine the smell of sludge and untreated sewage.

nyctweetmap

The report concludes with a comment on the advantages of this data mining exercise:

"Our method of public mood analysis has several strengths. By utilizing Twitter's abundance of geotagged data, we can obtain spatial information that is both wide-ranging and fi ne-grained. The brevity of tweets allows for rapid processing and classification, while their frequency produces a time-sensitive picture of public sentiment."

This is a clever methodology and one that produces results that fit in with common sense - parks are positive places and sewage is sad. 

More Information

"Sentiment in New York City: A High Resolution Spatial and Temporal View" by Karla Z. Bertrand, Maya Bialik, Kawandeep Virdee, Andreas Gros, Yaneer Bar-Yam

Related Articles

Analytics Big Bang

Twitter Can't Predict Elections Either

Google Uses Search Data To Predict Box Office Hits

 

To be informed about new articles on I Programmer, install the I Programmer Toolbar, subscribe to the RSS feed, follow us on, Twitter, Facebook, Google+ or Linkedin,  or sign up for our weekly newsletter.

 

espbook

 

Comments




or email your comment to: comments@i-programmer.info

 

Banner


Meta Releases OpenSource Podcast Generating Tool
28/11/2024

Meta has released an open source project that can be used to automatically convert a PDF file into a podcast. Meta says Notebook Llama can be considered an open-source version of Google's NotebookLM.

 [ ... ]



Raspberry Pi CM5 - Expensive And Undocumented
27/11/2024

So the unexpected has happened - the Compute Module 5 has been launched. But it simply emphasises some problems with adopting the Pi as an IoT device.


More News

 

 

 

Last Updated ( Monday, 26 August 2013 )