Analyzing Quantitative and Qualitative Data

Objective

After completing this lesson, you will be able to describe best practices to deal with quantitative and qualitative data

Classifying different types of data

Difference between quantitative and qualitative data

In the BriscoBikes case study, Sandra already talked about the key characteristics of quantitative and qualitative data.

Banner image of several people sitting side by side using a smartphone, laptop, and tablet, with the headline ‘RECAP: What you’ve already learned’.

Go to the following lesson for a short overview of the characteristics and the added value of quantitative data and qualitative data,: Getting Started with Organizational Change Management. Then scroll down to the video and listen to Sandra’s input, starting at 2:17.

Definition of quantitative and qualitative data

Before focusing on the data analysis, let’s have a look at the definition of the two key data types:

  • Quantitative data refers to information that can be measured and expressed in numerical values or quantities, e.g., counts, percentages, height, temperature, weight, or age. This type of data enables statistical analyses to identify patterns, relationships, and trends.

  • Qualitative data encompasses non-numeric information. It captures the qualities, characteristics, and descriptions of a subject or topic. Thus, it allows a deeper understanding of underlying motivations, attitudes and concepts. Typical examples are interview transcripts, open-ended survey responses, observational notes, or audio and video recordings.

Besides the distinction between quantitative and qualitative data, there are three important ways to classify change survey data:

  • Categorical data can be divided into groups or categories, but is not associated with a specific numerical value. Examples are gender or roles.

  • Ordinal data have a meaningful order or ranking. While the order matters, the difference between the categories might not be consistent. An example is a rating scale for the perceived benefits of a new cloud solution, providing the answering options "no benefits", "few benefits", "some benefits" and "many benefits".

  • Interval data is an important type of quantitative data where the intervals between values are meaningful, but there is no true zero point. Likert scales are a typical example for this data type facilitating an interval level of measurement. An example is the level of satisfaction with training courses, differentiating from 1 to 5, with "1" being "very dissatisfied" and "5" being "very satisfied".

Of course, the different data types have implications for the subsequent data analysis. Therefore, specific attention should be paid on the data type and the respective analysis methods to avoid misleading interpretations and wrong conclusions. This especially applies for advanced analysis methods.

Analyzing quantitative data

Before starting a quantitative data analysis, it is recommended to conduct some preparatory activities, to ensure high data quality. This enhances the accuracy of derived conclusions, and the actions based on it.

Data preparation

An important step after the end of the data collection period is the data cleansing. Quantitative data should be reviewed to detect irregularities and errors and to remove the data of the respective survey participant. Here are some typical data cleansing activities:

  • Review the data for incorrect or inadmissible answers (e.g. numbers beyond the rating scale) and delete them. This might especially occur in paper-based surveys.

  • Verify that values conform to expected relationships and constraints (e.g., ensuring that the sum of parts equals the whole) and make sure to identify and remove inconsistent datasets. An example: If a survey participant answers the question "How many hours per week do you work with the SAP cloud solution?" with "100", this is theoretically possible, but very unlikely; so the value should be deleted.

  • Watch out for incomplete feedback: If a respondent has missed out entire sections of the survey, you need to consider whether the rest of their answers should be included or not. It is recommended to define a threshold of maximum permissible missing values (e.g., 20 percent) and to exclude the data of survey participants that does not meet this criterion.

  • Look for pattern in the data indicating random responses, for example straight-lining (all responses in a straight line) or an alternating clicking of extreme scale poles.

  • Check the metadata to find the fastest respondents. Their answers should be verified thoroughly for any inconsistencies.

Leveraging sophisticated statistical methods and artificial intelligence can significantly facilitate the data cleansing process, for example by automated error detection (e.g., scanning the data for incorrect data and inconsistencies) or pattern recognition, supporting the identification of straight-lining and other distorting systematic data entries.

Definition of evaluation units

To get as many insights as possible out of the collected data, the data is usually structured into different evaluation units. There are many different options to "slice and dice" data. See the following chart for some typical examples in change surveys and practical advice:

“The chart shows how survey data can be structured into different evaluation units. Around the central question ‘How can you structure survey data?’ it lists examples such as geographical distribution, functional units, roles, level of responsibility, hierarchy level, time of measurement, level of experience, and demographic information.

The relevant information for breaking down the data in different categories is usually collected in a "General questions" section of the questionnaire. Invest sufficient time to think about useful sub-group analyses and ask for the respective data. If this opportunity is missed out, these analyses will not be possible. It is tempting to list as many aspects as possible to have a broad range of options for a detailed data analysis. However, the more personal information is requested from the survey participants, the higher is the danger that they fear their anonymity might not be guaranteed, which can lead to insufficient answers or even dropouts.

Additionally, if the number of sub-groups is too high, the complexity of the resulting data is very difficult to manage, and the practical added value is usually limited. Therefore, it is recommended to limit the number of general questions and only ask for information that most probably will have a practical added value for either evaluating previous change management activities, or planning future change management support.

Descriptive statistical analysis

There are many ways to look for insights in quantitative data. The easiest analysis is to sum up the frequency distribution of different answering options. Another very common analysis step is the calculation of mean values.

The mean or arithmetical average requires interval level of measurement. It is determined by adding up all values of the respondents’ answers related to an item and dividing the resulting sum by the count of numbers of your dataset (i.e. number of respondents). The mean is easily calculated and frequently used to analyze change survey data. However, it does not contain information regarding the data distribution. Therefore, it can be biased through statistical outliers. The chart below shows an example for different data equaling the same mean:

The graphic titled ‘Different data distributions resulting in equal mean values’ compares three bar charts of survey answers on a five-point agreement scale (from strongly disagree to strongly agree). In the first chart, the distribution is unimodal and symmetric, with the highest bar at ‘Neutral’ and smaller, roughly equal bars decreasing toward both ends. In the second chart, the distribution is uniform, with all answer options having bars of the same height. In the third chart, the distribution is U-shaped, with high bars at both extremes (‘Strongly disagree’ and ‘Strongly agree’) and the lowest bar in the middle at ‘Neutral’. All three distributions result in the same mean value of 3.0, highlighting that the mean alone does not reflect how responses are distributed.

Hint

Besides the calculation of frequencies and means as most common analyses in the change management context, there are many additional statistical analyses that allow deeper insights into quantitative data, based on interval scale level of measurement (e.g., Likert scales). Some common analyses are listed below:

The standard deviation quantifies how much the various data points differ from the mean of the dataset, provides insight into the spread and consistency of data.

With a correlation analysis, it is possible to examine the strength and direction of the relationship between different variables. It measures how changes in one variable are associated with changes in another. A positive correlation means that if one variable increases, the other increases, too. If one variable increases, and the other decreases, the correlation is negative.

T-tests examine whether different means are only a random result or a real, significant difference.

A regression analysis is a statistical method used to examine the relationship between one outcome (dependent variable) and one or more predictors (independent variables). It helps you understand the following aspects:

  • Whether a relationship exists between variables, e.g. does change communication and change enablement affect change readiness?

  • The direction of the relationship, e.g. positive: As X increases, Y increases or negative: As X increases, Y decreases

  • The strength of the relationship, e.g. how strongly predictors explain changes in the outcome

All of these basic statistical methods can be performed with common statistics programs.

Analyzing qualitative data

As Sandra and Paul have already conducted several change management surveys during the first phase of the S/4HANA implementation of BriscoBikes, Paul feels comfortable with the quantitative data analysis. However, he has still some questions regarding the handling of qualitative data. Therefore, he reaches out to Mira to get his questions answered.