Digital.Maag Repository

Sentiment analysis on java source code in large software repositories

Show simple item record

dc.contributor.author Sinha, Vinayak en_US
dc.date.accessioned 2016-11-03T17:43:14Z
dc.date.accessioned 2019-09-08T02:58:30Z
dc.date.available 2016-11-03T17:43:14Z
dc.date.available 2019-09-08T02:58:30Z
dc.date.issued 2016
dc.identifier 959237621 en_US
dc.identifier.other b22149934 en_US
dc.identifier.uri http://hdl.handle.net/1989/11975
dc.description x, 61 leaves : illustrations ; 29 cm en_US
dc.description.abstract While developers are writing code to accomplish the task assigned to them, their sentiments play a vital role and have a massive impact on quality and productivity. Sentiments can have either a positive or a negative impact on the tasks being performed by developers. This thesis presents an analysis of developer commit logs for GitHub projects. In particular, developer sentiment in commits is analyzed across 28,466 projects within a seven-year time frame. We use the Boa infrastructure’s online query system to generate commit logs as well as files that were changed during the commit. Two existing sentiment analysis frameworks (SentiStrength and NLTK) are used for sentiment extraction. We analyze the commits in three categories: large, medium, and small based on the number of commits using sentiment analysis tools. In addition, we also group the data based on the day of week the commit was made and map the sentiment to the file change history to determine if there was any correlation. Although a majority of the sentiment was neutral, the negative sentiment was about 10% more than the positive sentiment overall. Tuesdays seem to have the most negative sentiment overall. In addition, we do find a strong correlation between the number of files changed and the sentiment expressed by the commits the files were part of. It was also observed that SentiStrength and NLTK show consistent results and similar trends. Future work and implications of these results are discussed. en_US
dc.description.statementofresponsibility by Vinayak Sinha. en_US
dc.language.iso en_US en_US
dc.subject.lcsh Computer software developers--Attitudes. en_US
dc.subject.lcsh Computer programming. en_US
dc.subject.lcsh Computer science. en_US
dc.title Sentiment analysis on java source code in large software repositories en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital.Maag


Advanced Search

Browse

My Account