By Christian Bird, Alex Gourley, Prem Devanbu, Michael Gertz, and Anand Swaminathan

Published in Proceedings of the Third International Workshop on Mining software repositories

Communication and Coordination activities are central to large software projects, but are difficult to observe and study in traditional (closed-source, commercial) settings because of the prevalence of informal, direct communication modes. OSS projects, on the other hand, use the internet as the communication medium, and typically conduct discussions in an open, public manner. As a result, the email archives of OSS projects provide a useful trace of the communication and co-ordination activities of the participants. However, there are various challenges that must be addressed before this data can be effectively mined. Once this is done, we can construct social networks of email correspondents, and begin to address some interesting questions. These include questions relating to participation in the email; the social status of different types of OSS participants; the relationship of email activity and commit activity (in the CVS repositories) and the relationship of social status with commit activity. In this paper, we begin with a discussion of our infrastructure and then discuss our approach to mining the email archives; and finally we present some preliminary results from our data analysis.


  author = {Christian Bird and Alex Gourley and Prem Devanbu and Michael Gertz
	and Anand Swaminathan},
  title = {{Mining Email Social Networks} },
  booktitle = {Proceedings of the Third International Workshop on Mining software
  year = {2006},
  publisher = {ACM},
  location = {Shanghai, China}

Mining Email Social Networks (MSR 06)