Grounded Theory sounds like a interesting approach, until you try to apply it. Suddenly, you’re drowning in concepts like open coding, categories, and constant comparison. Where do you even start?
If open coding has you scratching your head, don’t worry, you’re in the right place. This guide breaks it down in a way that’s easy to follow, so you can confidently use this method in your research.
What Makes Grounded Theory Different?
Open coding is a part of the data analysis process in the Grounded Theory approach. If you need a refresher on the basics of Grounded Theory, check out the introductory tutorial first.
Typically, when conducting a study, you might create an interview guide, conduct a dozen expert interviews, and then analyze them using content analysis.
In Grounded Theory, things work a bit differently. Here, data collection and data analysis can alternate. You first conduct a few interviews, transcribe them, and then step back to conduct a preliminary analysis. You apply techniques such as open coding (which we will discuss in detail shortly) and develop your initial categories.
Based on these findings, you return to the field and conduct the next round of interviews. This time, your questions build on what you discovered in your first analysis. This allows you to probe more precisely, dive deeper, and identify insights that help refine your emerging mini-theory.
This process, known as theoretical sampling, is the key characteristic that differentiates Grounded Theory from other qualitative research approaches.
Now, let’s get back to open coding.
Open Coding in Grounded Theory
According to Strauss and Corbin (1998), open coding is the first of three phases in the qualitative data analysis process for Grounded Theory:
- Open Coding
- Axial Coding
- Selective Coding
Open coding is the first step in engaging with your data—whether it consists of interview transcripts, social media posts, notes, or other written reflections (referred to as memos in Grounded Theory terminology).
To make this process as easy as possible for you, here are five key goals to keep in mind when conducting open coding. After explaining all the steps, we will go through an example from an interview excerpt to illustrate how to apply open coding in practice.

Step 1: Identify Your Categories
Open coding is essentially the embodiment of inductive category formation, meaning you approach the data with a completely open mind, without considering existing theories.
Some scholars recommend incorporating existing theories at a later stage of your analysis—comparing the categories and relationships you have developed with what has already been established.
For open coding, however, the key is to go through your data without any preconceptions, marking similar content with the same category. In other words, everything that belongs together conceptually should be grouped into the same category.
By the way, the terms “code” and “category” mean the same thing. Coding is simply the process of categorizing.
But how exactly do you form a category?
Step 2: Abstraction Instead of Description
Rather than merely summarizing the content and using the interviewee’s wording, you should abstract the content. For example, consider this sentence from an interview transcript:
“I usually use TikTok to get the latest news.”
Instead of assigning the code “news” to this sentence, a more abstract code like “information acquisition” would be more appropriate.
According to Strauss & Corbin, categories can also have different dimensions. For instance, the category “information acquisition” could include the dimension “frequency”—how often does this information acquisition occur?
Another dimension could be “content”, referring to the type of information being acquired. In this example, it’s news, but in another case, it could be the latest fashion trends.
Dimensions of a category can also be treated as subcategories—a concept you may already be familiar with from qualitative content analysis. If you use software like MAXQDA, creating subcategories can be especially helpful.
Step 3: Constant Comparison
This goal revolves around the following questions:
- Are codes being assigned consistently?
- Are the same criteria being applied to categorize dimensions?
You can achieve consistency by continuously cross-checking previously coded text excerpts to ensure that similar meanings are being coded the same way, or adjusting if differentiation is needed.
At this stage, memos play an important role again. Here, you can jot down all your ideas as you code. What is the broader context? What could be a general explanation for what you are reading? Memos serve as a playground where you draft your initial theory.

Step 4: Achieving Saturation in Open Coding
You’ve probably heard that you should stop conducting interviews when no new information emerges. This principle is particularly associated with Grounded Theory literature.
In this context, the term theoretical saturation means that no new categories, variations, or relationships between categories emerge from your data.
However, there are two challenges with saturation:
- The exact point of saturation is difficult to determine objectively.
Researchers disagree on when and how saturation is reached, and numerous articles (e.g., Saunders et al., 2018) discuss this topic. - In a thesis, you don’t have time for 30 interviews.
If you are working on a thesis, your time is limited. No one expects you to conduct interviews indefinitely until you reach this elusive saturation point. In this case, it is perfectly acceptable—after consulting with your advisor—to set a target number of interviews and stick to it.
For a fully comprehensive Grounded Theory study, the standard 12–15 interviews in a master’s thesis are usually insufficient. Therefore, it is essential to clearly define in your methodology section whether Grounded Theory is being used as a holistic approach or merely as a methodological toolkit without aiming for a fully developed Grounded Theory study.
Alternatively, you could collaborate with someone else and write a joint thesis. This way, you could impress your evaluators with 20–30 interviews and a more comprehensive Grounded Theory study.
For a PhD dissertation, the expectations are different. Here, you must address theoretical saturation and theoretical sampling, ensuring that the Grounded Theory approach is fully implemented from start to finish.
Example of Open Coding (Grounded Theory)
Let’s take this example from a fictional interview transcript:
“Wow, when I first put on the VR headset, it felt like I was in another world. I was totally surprised by how quickly you can connect with others in this Metaverse. I was in a virtual seminar room and later spoke with the professor. She was from Canada and super friendly and open to my ideas.”
The first step is to identify the key W-questions (Who? Where? What? How? Why?).
- Where? Metaverse
- Who? Professors and students
- What? Virtual seminars
- How? Through a VR headset in a virtual seminar room
The most interesting statement here is about quick social connection—which the interviewee has already highlighted by expressing surprise. You could code this part as “social connection”.
“Wow, when I first put on the VR headset, it felt like I was in another world. I was totally surprised by how quickly you can connect with others in this Metaverse. I was in a virtual seminar room and later spoke with the professor. She was from Canada and super friendly and open to my ideas.”
In later interviews, you would then explore how and why this interaction happens so easily. Could it be because users are represented by avatars, which lowers the barrier to approaching others?
Maybe. Maybe not.
Only Grounded Theory can reveal the answer.