Metodyka dla analizy treści w projektach stosujących techniki text mining i rozwiązania CAQDAS piątej generacji
Methodology for Content Analysis in Text Mining Projects and Fifth Generation of CAQDAS

Author(s): Krzysztof Tomanek
Subject(s): Methodology and research technology, ICT Information and Communications Technologies
Published by: Uniwersytet Łódzki - Wydział Ekonomiczno-Socjologiczny
Keywords: Content Analysis; Mixed Methods Approach; Big Data; Text Mining; CAQDAS

Summary/Abstract: Projects that work with large volumes of text data, acquired from various sources and stored in a variety of formats, raise many methodological dilemmas and often require non-standard decisions and solutions. In particular, compiling data of varying quality, of unstructured types, and of both quantitative and qualitative nature requires dynamic strategies, ideas, and methods of analysis. The article describes an example of this approach and shows its value in the classification of written statements. In this context, the author addresses the following questions: (a) can CAQDAS be used so that semi-automatic or automatic work replaces some of the manual work of classifying statements; (b) how should a classification methodology be constructed for data of varying quality; (c) when is automatic classification useful, and when does it have no chance of success? The article marks the moments at which the analyst draws on knowledge typical of qualitative data analysis, and those at which that knowledge is no longer sufficient to classify content and other fields are needed (natural language processing, machine learning). The project that serves as the background of this article required several tools and languages to support work with the data. The transformation, classification, and visualization of results required applications such as MySQL, R, QDA Miner, WordStat, and QlikSense. The role and limits of computer-assisted qualitative data analysis software tools are also noted.

  • Issue Year: XIII/2017
  • Issue No: 2
  • Page Range: 128-143
  • Page Count: 16
  • Language: Polish