MITAO: A User Friendly and Modular Software for Topic Modelling


  • Paolo Ferri, University of Bologna
  • Ivan Heibi, University of Bologna
  • Luca Pareschi, Tor Vergata University of Rome
  • Silvio Peroni, University of Bologna



text analysis, discourse analysis, topics, frames, discourse, themes, topic modelling, visualization, MITAO, Python


Texts are among the most relevant data sources for social scientists, and researchers have traditionally adopted qualitative methods for dealing with them. Yet new computer-aided techniques offer promising methodological avenues for scholars, who can now deal with large corpora of texts. One technique that has recently gained relevance is Topic Modelling, which extracts bags of words that often co-occur in texts. While Topic Modelling has been used fruitfully in sociology and management, existing software for performing it requires coding skills and is not user friendly. In this paper we present MITAO, a new graphic-based, user friendly, open source software for performing topic modelling and other analyses on textual data.


Blei, D.M., Ng, A.Y., and Jordan, M.I. (2003), “Latent Dirichlet Allocation”, Journal of Machine Learning Research, 3 (1): 993–1022.

Bonilla, T., and Grimmer, J. (2013), “Elevated Threat Levels and Decreased Expectations: How Democracy Handles Terrorist Threats”, Poetics, 41 (6): 650–669.

Cho, Y.-J., Fu, P.-W., and Wu, C.-C. (2017), “Popular Research Topics in Marketing Journals, 1995–2014”, Journal of Interactive Marketing, 40: 52–72.

Croidieu, G., and Kim, P.H. (2018), “Labor of Love: Amateurs and Lay-Expertise Legitimation in the Early U.S. Radio Field”, Administrative Science Quarterly, 63 (1): 1–42.

DiMaggio, P., Nag, M., and Blei, D. (2013), “Exploiting Affinities between Topic Modeling and the Sociological Perspective on Culture: Application to Newspaper Coverage of U.S. Government Arts Funding”, Poetics, 41 (6): 570–606.

Ferri, P., Lusiani, M., and Pareschi, L. (2018), “Accounting for Accounting History: A Topic Modeling Approach (1996–2015)”, Accounting History, 23 (1-2): 173–205.

Flick, U. (2014), An Introduction to Qualitative Research, London: SAGE.

Gamson, W.A. (1992), Talking Politics, Cambridge: Cambridge University Press.

Goffman, E. (1974), Frame Analysis: An Essay on the Organization of Experience, Cambridge, MA: Harvard University Press.

Hannigan, T., Haans, R.F.J., Vakili, K., Tchalian, H., Glaser, V., Wang, M., Kaplan, S., and Devereaux Jennings, P. (2019), “Topic Modeling in Management Research: Rendering New Theory from Textual Data”, Academy of Management Annals, 13 (2): –632. DOI: 10.5465/annals.2017.0099.

Jockers, M.L., and Mimno, D. (2013), “Significant Themes in 19th-Century Literature”, Poetics, 41 (6): 750–769.

Kaplan, S., and Vakili, K. (2015), “The Double-Edged Word of Recombination in Breakthrough Innovation”, Strategic Management Journal, 36 (10): 1435–1457.

Krippendorff, K. (2004), Content Analysis: An Introduction to Its Methodology (2nd edn.), Thousand Oaks, CA: SAGE.

Levy, K.E.C., and Franklin, M. (2014), “Driving Regulation: Using Topic Models to Examine Political Contention in the U.S. Trucking Industry”, Social Science Computer Review, 32 (2): 182–194.

Marshall, E.A. (2013), “Defining Population Problems: Using Topic Models for Cross-National Comparison of Disciplinary Development”, Poetics, 41 (6): 701–724.

McFarland, D.A., Ramage, D., Chuang, J., Heer, J., Manning, C.D., and Jurafsky, D. (2013), “Differentiating Language Usage through Topic Models”, Poetics, 41 (6): 607–625.

Miller, I.M. (2013), “Rebellion, Crime and Violence in Qing China, 1722–1911: A Topic Modeling Approach”, Poetics, 41 (6): 626–649.

Mohr, J.W., and Bogdanov, P. (2013), “Introduction – Topic Models: What They Are and Why They Matter”, Poetics, 41 (6): 545–569.

Silverman, D. (2007), Interpreting Qualitative Data, London: SAGE.

Tangherlini, T.R., and Leonard, P. (2013), “Trawling in the Sea of the Great Unread: Sub-Corpus Topic Modeling and Humanities Research”, Poetics, 41 (6): 725–749.

Thornton, P.H., Ocasio, W., and Lounsbury, M. (2012), The Institutional Logics Perspective: A New Approach to Culture, Structure and Process, New York, NY: Oxford University Press.

Wang, Y., and Chaudhry, A. (2018), “When and How Managers’ Responses to Online Reviews Affect Subsequent Reviews”, Journal of Marketing Research, 55 (2): 163–177.

Whorf, B.L. (1956), Language, Thought, and Reality: Selected Writings of Benjamin Lee Whorf, Cambridge, MA: Technology Press of Massachusetts Institute of Technology.




How to Cite

Ferri, P., Heibi, I., Pareschi, L., & Peroni, S. (2020). MITAO: A User Friendly and Modular Software for Topic Modelling. PuntOorg International Journal, 5(2), 135-149.