An analysis of data from programming language subreddits reveals some insights into how programmers feel about the languages they use and the overlap between languages.
Project author, Tobias Hermann (Dobiasd on GitHub) says:
While reading about various programming languages, I developed a hunch about how often different languages are mentioned by other communities and about the average conversational tones used by relative members.
To test his hunch he collected and analysed all comments (about 300k) written to submissions (about 40k) in programming language subreddits from August 2013 to July 2104 using SQLite and PRAW, a Python package that gives access to reddit's API.
One interesting analysis was to discover how much a language is talked about a language compared to how much it is supposedly used according to the TIOBE index:
I guess this illustrates what we all suspect - that Haskell is talked about more often than it is used. There might also be a "name droping" phenomena going on as well and perhaps a bit of Visual Basic shame.
The size of a language is set by how often the others talk about it in sum. One connection represents the mutual mentions of two communities. The widths on each end is determined by the relative frequency of the mentionee being referenced by the respective other community. So PHP talks more about SQL than SQL talks about PHP while Python and PHP discuss each other in a balanced way.
There are lots of inferences to be drawn from this interactive graph. On Hacker News there's a comment on the way it indicates how compatible or co-used two technologies are in practice.
Dobiased also look at the choice of words used when referring to programming language. This revealed an obsession with abstract concepts by the Haskell people and the consideration of hardware issues by people using C and C++, that those talking about PHP were most prone to swearing, while those using Lisp and Clojure seemed to be have the most positive attitudes:
At the end of this analysis Dobiased asks:
But what is up with the Visual Basic community? They are neither angry nor happy. They just ... are? :)"
A comment on Hacker news solves the puzzle:
This is answered by the mentions relative to TIOBE graph. They use VB, but they are careful not to talk about it.
If you click on this chord graph you'll access the interactive version and be able to explore it on one language at a time.
In a world with so much textual/symbolic data to be generated, read, stored and searched, string algorithms are increasingly important. A challenging new short course on the Coursera platform that ope [ ... ]