Blog - shribe!

7 Undergraduate (Bachelor) Thesis Tips No One Tells You About

Post author By admin
Post date June 27, 2025
No Comments on 7 Undergraduate (Bachelor) Thesis Tips No One Tells You About

You let the first half of your thesis timeline slip by, you’re buried in literature, ChatGPT keeps spitting out nonsense, and your supervisor has basically given up on you? It’s time for some real thesis tips that actually help.

That’s exactly why this post gives you seven tips for your Bachelor’s thesis that no one else will tell you – but that can actually help you reduce the pressure and finally feel like: “Okay, I’ve got this.”

1. Your Bachelor’s Thesis Is Just a Training Exercise

The biggest misconception students have is believing their Bachelor’s thesis needs to be a groundbreaking scientific contribution. Maybe it’s the term “academic thesis,” or maybe it’s just the unrealistic expectations we place on ourselves.

But here’s the truth: your thesis is, first and foremost, proof that you’re capable of doing academic work. Nothing more, but also nothing less.

What does that actually mean?

You need to show that you can develop a meaningful research question.
You need to research, analyze, and correctly cite relevant literature.
You need to apply a method in a clear, comprehensible way to answer your question.

What you don’t need to do:

Develop new theories
Open up an entirely new field of research
Or produce world-changing insights

Once you let go of the idea that your thesis has to be perfect, things get a whole lot easier. The goal isn’t to reinvent the wheel, but to work cleanly and transparently.

Pro tip: Write this sentence on a sticky note and slap it on your laptop screen: “Good is good enough.”

2. Your Research Question Is the Key

A lot of people think the topic is the most important thing. They spend weeks hunting for the perfect one.
“I want to do something with social media… or maybe sustainability?”

But let’s be honest: the topic alone won’t get you far. What really matters is the question you ask.

It’s the clearly defined question that turns a general topic into an actual research project. It gives your thesis direction, narrows the scope, and determines what literature you need and which method you should use. If you linger too long on a vague topic, you’ll lose your thread and start thinking in all directions at once—without making any progress.

Example: “Loneliness in remote work” – sure, that’s a topic. But what exactly is it about?

If you ask: “How does working from home affect the sense of belonging among entry-level employees in service agencies?” – then you suddenly have a clear direction. Something you can actually study.

So it’s really worth putting in extra care at this stage. If you can clearly and precisely formulate your question, everything else will become a lot easier. And your research question can be more niche than you might think – as long as it’s important to a specific target group.

3. Your Supervisor Is Not Your Personal Coach

Many students expect supervision to involve regular meetings, constructive feedback, and guidance during difficult phases. The reality? Usually… far less ideal.

Supervisors are usually extremely busy. They’re working on their own research, teaching classes, writing grant proposals – and supervising ten other theses besides yours. That doesn’t mean they don’t want to help. But they often just don’t have the capacity to walk you through every step.

That’s why one of the most underrated thesis tips is this: don’t wait around for help and manage the relationship actively.

Ask specific questions, be well-prepared, and provide clear interim results when needed. Instead of saying “I’m unsure about my outline,” say something like: “I’m considering combining chapters 2 and 3 – what’s your take on that?” These types of questions are easier to answer and increase the chances of getting genuinely helpful feedback.

The better you communicate, the more likely you are to get useful responses.

You need to act as the project manager of your own thesis. And that means: making your own decisions.

4. Writer’s Block Has Causes

Hardly anyone just breezes through their Bachelor’s thesis. Most people hit a phase where nothing works anymore. You open your document, stare at the screen, and have no clue how to even begin. But these blocks have causes – and they can be deciphered.

It’s usually not laziness or a lack of motivation – it’s uncertainty. You don’t know exactly what to write. Or you’re afraid of getting something wrong. Or maybe you’re just overwhelmed by the amount of information.

So what helps? Just start writing with no pressure.

Type out your thoughts, even if they’re still rough. Or start by jotting down a few bullet points outlining the general direction. The main thing is: get something written. Use an AI tool to get feedback – ChatGPT won’t judge you!

Be patient with yourself and remember: writing helps you think. Not everything needs to be perfect from the beginning. A lot of clarity comes only once things are on the page.

5. Rhythm Over Rigid Plans

The most carefully crafted weekly schedule is pointless if it doesn’t fit into your actual life. Many students plan based on an ideal version of themselves: they assume they’ll be equally productive every day, that motivation will always be available on demand, and that the writing process will unfold in a straight line. But reality is different. Some days you’re in the flow – and some days, you’re not. And that’s totally fine.

One of the most practical thesis tips is this: good time management doesn’t mean scheduling every hour of your day. It means creating realistic time slots when you can work – and building in space for setbacks and breaks. A writing day where you only get one page done can be just as valuable as one where you restructure two chapters or sort through literature.

Grab the books Deep Work and The One Thing – or just a summary. That’s really all you need.

6. The Biggest Time Trap? Literature

You want to start your research – and suddenly you’ve found 50 articles, 20 of which sound super relevant, and each one references 10 more. Welcome to the literature jungle.

What many underestimate: It’s not the writing that eats up your time – it’s the endless searching, reading, and sorting. Eventually, you lose track of everything. AI tools won’t help either – they can’t tell a Organization Science article from some dodgy MDPI fluff piece.

The trick isn’t to read everything – it’s knowing what you don’t need to read. A solid research question helps enormously because it automatically narrows down your literature. You only need to read what brings you closer to answering your research question and what’s valued in your academic field. If not – toss it out, even if it’s thematically related.

A reference management program can also help you keep track and insert citations correctly – without spending hours formatting.

7. The Discussion Is Not a Summary

If you’ve made it to the discussion section, you’ve already come a long way. And yet – this is often the hardest part of the whole thesis. Because now it’s no longer just about presenting results. Now you have to interpret and critically reflect on them.

The discussion is where you show that you truly understand the scientific process. This is your chance to connect the dots and derive meaning. You relate your findings to previous studies or theoretical frameworks. You consider possible societal or practical implications.

Ask yourself: What do my results mean? How do they align with the existing literature – and where do they differ? What can we infer from them?

And yes – you’re allowed to point out uncertainties and limitations. Where might your method have been too narrow? What influencing factors couldn’t you account for? What could have been done differently?

The discussion is not the place for summaries – it’s the place for independent thinking. This is where your thesis starts to become genuinely interesting. Unfortunately, this is also the point where most students stop putting in effort.

One of the most overlooked thesis tips? Make the discussion the heart of your thesis, and you’ll go from borderline pass to top-grade candidate.

Uncategorized

Internal and External Validity (Simply Explained)

Post author By admin
Post date June 14, 2025
No Comments on Internal and External Validity (Simply Explained)

Internal and external validity, these terms pop up all the time in research methods courses, but what do they actually mean?

If you’ve ever wondered how to tell whether a study is truly solid or just looks good on paper, you’re in the right place.

In this article, we’re cutting through the jargon and getting to the heart of what internal and external validity really mean, clearly explained, with examples that actually make sense.

So whether you’re working on a research project or just trying to wrap your head around these concepts for an exam, let’s break it down together.

What was validity again?

Exactly—it’s one of the three quality criteria used to assess the rigor of scientific studies.

Validity answers the question:

How well does a method measure what it is supposed to measure?

Or put differently: Can I answer my research question using this approach?

In another article, I’ve already given a general overview of the three quality criteria: objectivity, reliability, and validity.

But to clarify what validity means, here’s a simple example:

Imagine you step on the scale every morning to check your body weight. However, the scale is off by 2 kg every time. It provides consistent readings, so it’s reliable. But because it doesn’t show your true weight, it lacks validity.

The question of validity mainly comes up in quantitative research. Within this context, the characteristics of validity have been refined over time. In general, we distinguish between internal and external validity.

#1 Internal validity (quantitative research)

Internal validity refers to the degree to which researchers can draw causal conclusions from the results of a particular empirical study.

The most cited work on this is the classic “Experimental and Quasi-Experimental Designs for Research” by Campbell and Stanley (1963). Since experiments are considered the gold standard for testing causal relationships, internal validity is especially important in this context.

In an experiment, we test the causal link between an independent variable and a dependent variable. Internal validity is achieved when valid causal relationships can be derived from the results.

But how do we determine that?

Essentially, by ruling things out. Are there any other factors that might have influenced the dependent variable—factors that could explain the change even without manipulating the independent variable? In other words, are there any confounding variables that blur the causal relationship?

Let’s take an example:

In an experiment, the temperature inside a car is adjusted. People sitting in the car are asked how comfortable they feel at each temperature. Temperature is the independent variable. The reported comfort level is the dependent variable.

If you want to measure the effect of temperature on comfort, you better make sure the seat massage function is turned off! Otherwise, people might feel more comfortable—not because of the heat, but because of the massage. That doesn’t mean temperature has no effect, but it does make it harder to separate the influence of heat from the massage.

So, internal validity is all about minimizing the influence of potential confounding variables (Hussy et al., 2010).

#2 External validity (quantitative research)

Once internal validity is ensured, we can think about external validity. This refers to how well the results can be transferred to other contexts—such as different people, places, or points in time.

More casually, external validity is about whether the findings can be generalized to other situations (Hussy et al., 2010).

The term isn’t only used in experimental research. It’s also relevant in survey research, where generalizability matters too.

So how do we establish external validity?

Again, we use a kind of elimination process to identify and reduce threats to external validity.

One such threat is sampling bias—whether the sample is skewed in some way. This might happen, for example, if participants didn’t volunteer freely, or if certain conditions make the sample unrepresentative of the broader population.

#3 Internal and external validity in qualitative research

In quantitative research, internal validity has top priority. Only once that’s established do we look at external validity.

In qualitative research, it’s the opposite—or more accurately, internal validity doesn’t apply. Qualitative research isn’t about testing causal relationships, so internal validity isn’t relevant.

External validity, however, can play a role. Generalizing to a broader population might be important, especially if the sample is randomly selected.

For example, if you’re conducting a content analysis and select a sample of 100 videos from a population of 1,000 YouTube videos, you might ask whether the findings generalize to the other 900. But this kind of generalization isn’t usually the goal of qualitative studies.

Where external validity does matter is in generalizing to other contexts and situations. That is a goal of qualitative research. Just think about Grounded Theory: the aim is to generalize the findings of an empirical study by developing useful theoretical concepts and identifying their relationships (Hussy et al., 2010).

Research Methods

Topic Modeling Explained (LDA, BERT, Machine Learning)

Post author By admin
Post date May 16, 2025
No Comments on Topic Modeling Explained (LDA, BERT, Machine Learning)

Have you ever wondered how Netflix always seems to know exactly which series you’ll love, or why Spotify suddenly suggests the perfect song just when you wanted to hear it? The answer: topic modeling! And the best part? You can use this method for your own academic projects too.

In this post, I’ll walk you through exactly what topic modeling is, how it works, and why—even with minimal prior knowledge—you can take your term paper or thesis to the next level using this machine learning method.

What is Topic Modeling?

Topic modeling is a computer-assisted research method from the field of machine learning that uncovers hidden thematic structures in large volumes of text. Imagine you have hundreds of articles, books, or social media posts and want to figure out what core themes they cover. This is where topic modeling comes in: it helps you identify patterns in qualitative data—without having to read every single text from start to finish.

A “topic” isn’t a fixed category but rather a statistical distribution over words that frequently appear together. It’s up to you, as the researcher, to interpret these clusters and assign them meaningful labels, depending on your research question and the specific context.

The best-known technique in topic modeling is Latent Dirichlet Allocation (LDA). This method automatically identifies topics by grouping together words that frequently co-occur. In the next step, you can interpret these clusters to uncover underlying meanings—tailored to your research question and the type of texts you’re analyzing. These days, even more powerful models like BERT can capture the contextual meaning of individual words—but more on that in a moment!

How can you use Topic Modeling in your studies?

You might know the struggle: you’re deep into the data analysis phase of your bachelor’s thesis or seminar paper and facing a mountain of qualitative texts—from archive documents and social media posts to open-ended survey responses. Manually analyzing all of this is incredibly time-consuming and prone to errors. That’s exactly where topic modeling comes in: it helps you automatically detect thematic patterns in your data. For example, it can reveal which topics your respondents mention most often or highlight common viewpoints. This allows you to systematically identify trends and structures in large text datasets and draw well-founded conclusions from them. Especially if you’re working with an exploratory research design, topic modeling is a fantastic way to gain an overview of a large data set.

What’s more, topic modeling provides a systematic way to analyze qualitative data—without manually coding it, a technique I’ve discussed many times before on this channel.

How does Topic Modeling work in practice?

Don’t worry, this isn’t going to be a bunch of complicated jargon. Here’s a simple explanation: the basic idea behind topic modeling is that it treats texts as collections of terms (words) and tries to group these words into clusters. Each cluster then forms a topic. This happens using an algorithm that analyzes how often and in what context words appear together.

Here’s an example from social media analysis in the context of a crisis management project:

Imagine you’re studying how users communicate on X during a corporate crisis. You analyze thousands of posts to identify the key topics users are focused on:

Cluster 1: “apology, trust, transparency, accountability, measures” – Topic: company response and crisis communication
Cluster 2: “criticism, disappointment, boycott, mistakes, loss of reputation” – Topic: negative reactions and public perception
Cluster 3: “support, loyalty, understanding, community, solidarity” – Topic: positive user reactions and shows of support

This kind of clustering is done automatically by the algorithm, and in the end, you get a clear overview of the main themes and sentiments during the crisis. In the basic version, the algorithm outputs the word clusters, and it’s up to you to label and interpret them based on your research question.

Step-by-step guide: How to run your own topic modeling

To apply topic modeling yourself, follow these steps:

Step 1: Data collection

Gather all the texts you want to analyze—for example, articles, documents, transcripts, or social media posts. Make sure to save them in a readable format (such as .txt or .csv files).

Step 2: Text preparation (preprocessing)

In this step, you clean and prep your texts by:

Removing punctuation
Reducing words to their base form (lemmatization)
Removing stop words (common words without meaningful content, like “and,” “but,” “or”)

Tip: The quality of your results depends heavily on the quality of your preprocessing. Try out different approaches (for example, with or without lemmatization) to see how they affect your results. The more comfortable you are with programming, the better you’ll be able to automate the preprocessing. And if you’re working with a very large dataset, automation is a must—there’s just no way to do it manually.

Step 3: Choosing the right algorithm

Now it’s time to choose your topic modeling algorithm. For beginners, LDA is a great starting point because it’s easy to understand and reliably highlights general topic structures within text collections. BERT, on the other hand, is a far more complex model, which we’ll dive into in the next section. Your choice really depends on how deep you want to go in terms of content—and what technical resources you have available.

One more thing to keep in mind: with LDA, you need to decide upfront how many topics you want to generate. A sensible starting range is between 5 and 15 topics, depending on the size and thematic diversity of your text corpus.

Now you run the algorithm and let it work on your cleaned dataset. To do this, you’ll need some basic knowledge of R or Python. But don’t worry—you can pick up the necessary skills in just 2–3 days (even I managed it, and I never scored higher than a C in computer science!). There are also low-code options—these are tools that let you perform topic modeling without needing programming skills. I’ll show you the best tools for that at the end of this post.

Step 4: Evaluation and interpretation

After running the algorithm comes the most exciting part: interpreting the results.

What topics emerge from the word distributions?
Which topics are more dominant, and which are less common? (You’ll see this by how often certain words or clusters appear in your dataset.)
Are there overlaps or clear separations between topics?
Which terms appear in multiple topics?

Helpful tools for analysis include pyLDAvis, which lets you explore the topic distributions visually.

The simplest visual is a word cloud—it’s great for presentations, for example. But things get really interesting when you map out the clusters graphically and can show how tightly they group together or drift apart.

Keep in mind: interpretation is a creative process that depends heavily on your research question. Topics are statistically generated patterns—you give them meaning through your content analysis.

Step 5: Validating the results

A fascinating extension is to analyze topic modeling over time. For example, by segmenting social media posts by publication date, you can track how topics evolve—like during the course of a crisis. Which topics gain traction? Which fade away? This gives you more than just a static snapshot; it lets you uncover dynamics and trends—real added value for any data-driven analysis.

Remember, topic models don’t deliver “the truth”—they offer one perspective on your data. The results are strongly influenced by the choices you make: the number of topics, your preprocessing steps, and your data selection. That’s why it’s worth comparing your results with manual coding of a sample or getting expert feedback.

You can also assess the quality of your topics using tools like pyLDAvis or by calculating the coherence score. pyLDAvis helps you explore topic distributions visually and identify overlaps between topics. The coherence score gives you a quantitative measure of how semantically consistent each topic is.

BERT – the state of the art in topic modeling?

BERT (Bidirectional Encoder Representations from Transformers) is a modern algorithm based on neural networks and is currently one of the most powerful models for text analysis. While LDA groups words based solely on their frequency and co-occurrence—without considering their actual meaning—BERT analyzes each word in the context of its entire sentence. This means BERT can distinguish between different meanings of a word depending on how it’s used. For example, BERT recognizes that the word “bank” in “I’m sitting on the bank” refers to something different than in “I’m opening an account at the bank.”

Thanks to this context-based analysis, BERT can capture thematic subtleties and semantic nuances much more effectively, leading to significantly more precise and differentiated topic clusters.

BERT is particularly suited for advanced research projects where context and meaning nuances play a crucial role—for example, in social media analysis, emotionally charged topics, or ambiguous terms. It’s especially useful when you’re not just interested in word frequency but also want to detect underlying tones and sentiments.

Do you need programming skills?

Yes and no. If you just want a quick overview, there are plenty of programs and tools that offer topic modeling without any coding—like InfraNodus or MeaningCloud.

But if you want to dive deeper and run BERT or LDA independently, having basic programming skills in Python is a big plus. Don’t worry—Python is ideal for beginners because it’s very intuitive and comparatively easy to learn.

Common mistakes in topic modeling and how to avoid them

Before you get started, here are a few typical pitfalls from real-world practice:

Too small a data set: Most models won’t deliver stable results if you have fewer than 100–200 documents.
Choosing too many topics: If you specify 30 or more topics, the model may artificially split things that actually belong together.
Poor preprocessing: If you leave typos, emojis, or stop words in your data, the model quality will drop significantly.
No evaluation: Always use coherence scores, visualizations, or expert feedback to check your results.

Conclusion

Topic modeling isn’t just exciting for businesses and researchers—it can really help with your next seminar paper or thesis. You’ll save time, improve the quality of your results, and even pick up some skills in machine learning and programming along the way. Sounds like a pretty good deal, right?

Uncategorized

7 Genius ChatGPT Hacks for Academic Writing (No Plagiarism!)

Post author By admin
Post date May 6, 2025
No Comments on 7 Genius ChatGPT Hacks for Academic Writing (No Plagiarism!)

Learn how to use ChatGPT for Academic Writing – if you’ve ever used AI for academic writing, you’re probably familiar with the dilemma: where’s the line between smart support and academic misconduct?

Plagiarism can have serious consequences – ranging from poor grades to having your entire paper invalidated. But does that mean you shouldn’t use ChatGPT at all? Not at all! In fact, there are many smart and ethical ways to integrate AI into your academic writing process without falling into the plagiarism trap. And that’s exactly what this post is about!

Today, I’ll walk you through seven ways to use ChatGPT effectively and ethically in your academic writing.

1. Identifying Research Gaps and Formulating Hypotheses

You’ve done your research, but it feels like everything has already been said about your topic? ChatGPT can help you uncover unanswered research questions or methodological weaknesses in existing studies. This technique is used by many researchers to develop innovative research ideas – a real advantage if you’re writing your Bachelor’s or Master’s thesis.

ChatGPT is well-known for its ability to generate creative ideas. According to a study by Lee and Chung (2024), ChatGPT outperforms traditional tech-based brainstorming methods – like Googling – by a wide margin. This is because the AI can combine seemingly unrelated concepts in novel ways, sparking new insights.

Another trick: Use ChatGPT as a peer reviewer for your ideas. Ask it to evaluate your research question from the perspective of a critical academic. This can give you valuable insights on where to refine your approach.

Once you’ve got your research question, you can also use ChatGPT to support the hypothesis-building phase. Rather than asking it to generate a ready-made hypothesis, use it to explore relevant variables or test different hypothesis formats. For example, if your topic is the impact of remote work on productivity, you might ask: “What factors influence productivity when working from home?” or “Are there studies showing a positive correlation between remote work and increased productivity? What were the methodological, theoretical, or conceptual limitations of those studies?”

Try to think a little outside the box when writing your prompts – instead of just asking it to generate this or that.

2. Conducting a More Targeted Literature Search

Many students spend hours on Google Scholar or other academic databases without a clear system or search strategy. This typically leads to two major problems: First, relevant studies are often missed because the search terms aren’t well chosen. Second, the overwhelming number of search results can make it difficult to identify the most important and high-quality sources.

Instead of just typing keywords into a search bar, ask ChatGPT for support: Request relevant key terms or alternative expressions that are commonly used in academic literature. For instance, the term “digital learning” might also appear in studies as “e-learning” or “computer-assisted learning”. By asking ChatGPT for these synonyms, you’re less likely to miss key studies.

To narrow down the flood of results, ask ChatGPT which journals are the most reputable in your field. This will help you target your search and find higher-quality sources. Professors want to see papers that cite studies from the journals and conferences they themselves read and publish in. The quality of your references matters more than the topic match. Imagine you find a study that fits your research topic perfectly. If it’s from an obscure journal, it’s practically irrelevant to your academic field. It sounds harsh, but that’s the reality. Your reference list will be judged based on the quality of the sources it includes. So always make sure your argumentation is grounded in the most respected research.

However, having ChatGPT search for studies for you? That’s something you should seriously reconsider. In a study published in the prestigious PNAS journal, Lehr et al. (2024) tested ChatGPT’s performance across various research-related tasks. Literature searching was where the AI performed the worst. Interestingly, it did quite well when it came to advising on research ethics – which is kind of funny, if you think about it.

3. Improving Your Methodology

Choosing the right research method is one of the biggest challenges in academic writing. Instead of being unsure whether to go with a literature-based approach, or a qualitative or quantitative design, you can ask ChatGPT: “Which empirical method is best suited to answer my research question?” This can help you weigh the pros and cons of different methods more effectively.

Ideally, you choose the method that best fits your research question. But in reality, it’s often the other way around. Your supervisor tells you, “Please conduct interviews.” And now the method is fixed – your task becomes finding a research question that fits the method.

Operationalizing your concepts is also essential. ChatGPT can help you identify suitable items or scales that have already been used in previous studies. That way, you ensure your variables are clearly defined and measurable.

You can also use the AI to critically reflect on your methodology. Ask: “What methodological weaknesses might arise in my study?” This will help you identify potential limitations early on – and address them proactively.

If you’re planning an empirical study, ChatGPT can also assist in selecting appropriate statistical techniques or methods of analysis. This ensures that your data is analyzed systematically and with solid reasoning. However, you should not let ChatGPT analyze your data directly – the risk of errors is still too high. Instead, perform your statistical tests yourself and use ChatGPT to review your approach. According to Lehr et al. (2024), ChatGPT is quite good at spotting when you’re not following statistical best practices.

4. Researching Theories

Theories are the backbone of any academic paper. ChatGPT can give you an initial orientation by suggesting theories that might be relevant to your topic. Important: AI is not an academic source! Use ChatGPT as a starting point to explore theoretical frameworks, and once you’ve identified one that fits, switch to scholarly literature for further research.

A clever strategy is to ask ChatGPT to compare different theories, for example: “Compare Theory X with Theory Y – what are their strengths and weaknesses in relation to Topic X?” This kind of overview can make the decision-making process during your “theory casting” much easier.

You can also ask ChatGPT to name specific applications or research fields where a given theory is frequently used. This helps you assess how relevant a theory might be for your project — or, conversely, highlight that a theory has already been thoroughly examined in a specific area.

Another smart approach is to ask the AI about theoretical developments or critical debates: “How has Theory X evolved over the past ten years?” or “What are the main criticisms of Theory Y?” This gives you not just a general overview, but also insight into which aspects of a theory are currently debated or evolving.

Additionally, ChatGPT can help you draw interdisciplinary connections: “Are there overlaps between Theory X from psychology and Theory Y from sociology?” This can help you uncover new perspectives for your research and possibly create original theoretical links in your paper.

5. Strengthening Your Argument

A strong academic paper relies on clear argumentation and thoughtful discussion. And this is where ChatGPT can help in a somewhat unconventional way. Imagine you’ve constructed an argument that seems perfectly logical and convincing to you — but how would a critical reviewer assess it?

Most students overlook potential weaknesses in their arguments. But you can ask ChatGPT to deliberately deconstruct your reasoning. For example, ask: “What arguments could be used to contradict my thesis?” or “How might a critic challenge my reasoning?”

This turns ChatGPT into a “devil’s advocate,” giving you a fresh perspective on your work. It also allows you to proactively incorporate counterarguments — making your argumentation more robust and resistant to criticism.

6. Ensuring Argumentative Consistency

Once you’ve found strong arguments, it’s important to maintain consistency throughout your paper. Especially in longer texts, contradictions or unclear transitions can easily sneak in.

ChatGPT can help identify these inconsistencies if you ask it to systematically analyze your argument and point out any logical breaks. Try prompts like: “Identify inconsistencies or flawed reasoning in my text.”

Another common issue in academic writing is a lack of clear logical structure. Transitions between arguments might be missing, or there may be logical jumps that are hard for the reader to follow. ChatGPT can help by checking your paper for coherence and suggesting ways to improve the flow between points. A simple prompt would be: “Do you notice any logical leaps or missing links between my arguments?”

For smoother transitions, try: “How can I improve the argumentative connection between Section A and Section B?”

7. Improving Academic Writing Style

Another challenge many students face is mastering the academic writing style, and with ChatGPT, many simply ask the tool to rephrase their text to make it sound “better.”

But here’s what most don’t realize: ChatGPT can also help you improve your academic writing more deliberately. Instead of just asking it to rephrase your text, have it analyze and critique your writing style. For instance, you could ask: “What common writing mistakes do students in my field make?” or “Is my writing style appropriate for an academic paper?”

Keep in mind that ChatGPT has been trained on thousands of academic articles. So what it considers “academic” often reflects phrasing that frequently appears in published studies. This has led to a trend where professors joke about texts that overuse words like “crucial” or “key.” The widespread use of ChatGPT has caused an inflation of these terms. Don’t fall into this trap — and don’t let the AI drain the originality from your writing style.

One especially useful feature is that you can ask ChatGPT to adapt your style to a specific journal. If you’re planning to submit your paper to a respected journal, you can ask the AI to revise your text to match the style typically used in that publication. This helps you understand what academic writing looks like across different disciplines.

Final Thoughts – ChatGPT for Academic Writing

ChatGPT can make academic writing easier — but it’s no substitute for your own analysis and critical thinking. If used wisely, it can help you build better structures, sharpen your research question, compare theories, strengthen your arguments, and refine your writing style.

But don’t use ChatGPT to generate entire sections of your paper or accept its arguments uncritically. And above all: don’t use AI-generated citations! ChatGPT often invents sources — and that can lead to serious consequences. If you’re using an AI tool for literature searches, make sure it provides a link to the original source.

Never rely blindly on AI. Always verify academic sources yourself and treat ChatGPT as a research assistant — not a replacement for you as a researcher.

References

Lee, B.C., Chung, J.(. An empirical investigation of the impact of ChatGPT on creativity. Nat Hum Behav 8, 1906–1914 (2024). https://doi.org/10.1038/s41562-024-01953-1

Lehr, S. A., Caliskan, A., Liyanage, S., & Banaji, M. R. (2024). ChatGPT as research scientist: probing GPT’s capabilities as a research librarian, research ethicist, data generator, and data predictor. Proceedings of the National Academy of Sciences, 121(35), 1-9. https://doi.org/10.1073/pnas.2404328121

Research Methods

Axial Coding in Grounded Theory (+ Examples)

Post author By admin
Post date April 13, 2025
No Comments on Axial Coding in Grounded Theory (+ Examples)

You’ve successfully completed the first step of Grounded Theory, open coding, and now your initial codes are ready. But what’s next? How do you connect these codes meaningfully? That’s exactly where axial coding comes in.

In this video, I’ll show you step-by-step how to apply axial coding, systematically structure your findings, and lay the foundation for your own mini-theory.

#1 What is axial coding?

Axial coding is the second step in the three-stage coding process according to Strauss and Corbin (1998):

Open coding (data is broken down and labeled as codes)
Axial coding (relationships between codes are established to form categories)
Selective coding (core categories are developed)

While open coding breaks down data into smaller thematic units (open codes), axial coding goes further. It helps you discover how these relatively concrete codes relate to each other and how they can be grouped.

This means you’re no longer working directly with your data but rather with the open codes you’ve created. Keep in mind that other Grounded Theory books might use different terminology or slightly altered techniques. Here, when we speak about axial coding, we always refer to the approach of Strauss and Corbin (1998).

#2 How does axial coding differ from open coding?

A common misconception is that open and axial coding are entirely separate processes. In reality, they often overlap. While creating your initial codes during open coding, you might already notice certain codes seem related.

Axial coding helps systematically explore these relationships. You deliberately ask questions such as:

Which open codes have a cause-and-effect relationship?
Are there patterns or regularities in the data?
How can the open codes be grouped?

Where open coding primarily serves a descriptive function – breaking your data into meaningful units – axial coding has an analytical function. It allows you to reach the next level of abstraction using your open codes.

Example

Imagine you’re studying how students use AI-supported learning programs. Through open coding, you’ve discovered that AI is used in various contexts, like summarizing texts, creating flashcards, or supporting exam preparation.

Here are 6 of your open codes:

Summarizing lecture scripts
Creating flashcards
Audio transcription of lectures
Live quizzes to revise learning material
Weighing semester goals
Creating study plans

But how exactly do students employ AI? Could there be a temporal relationship? This is where axial coding steps in.

You might now group these 6 open codes into logical categories. For instance, “Summarizing lecture scripts” and “Creating flashcards” fit well together under a category called “Preparation.”

“Audio transcription of lectures” and “Live quizzes to revise learning material” might be grouped as “Active Support.”

The remaining two open codes could form a category called “Strategic Planning.”

If you consider the temporal dimension, you might conclude that the categories follow this sequence: (1) Strategic Planning, (2) Preparation, and then (3) Active Support.

This is already a great connection! But wait—is something missing? Do students not use AI for post-learning review? Oops, maybe you didn’t ask this in your interviews! With this realization, you can gather new data to explore exactly that. Perhaps new open codes and a “Review” category will emerge—or maybe not. The point is, after axial coding, you can always return to data collection or open coding. Grounded Theory is iterative, not linear!

#3 The coding paradigm by Strauss & Corbin (1998)

Strauss and Corbin (1998) offer very concrete guidance. If you’re unsure how to approach axial coding, use their coding paradigm. It helps systematically analyze and structure different aspects of a phenomenon. Let’s clarify this with a practical example.

Imagine you’re analyzing interviews with students about their use of AI tools like ChatGPT for learning. After open coding, you’ve identified several categories, including “Exam preparation,” “Skepticism towards AI,” and “Changes in learning behavior.”

Now you relate these categories to each other. Earlier, we looked at the dimension of time, but there are more aspects. You don’t need to analyze EVERYTHING in the coding paradigm—only what’s necessary for your theory to address your research question.

Phenomenon

The phenomenon describes the central theme or event you’re studying. Example: “Students use AI-supported tools for learning.”

Causal Conditions (Why does it happen?)

These conditions explain what causes or triggers the phenomenon. Example: “Students often have limited time, seeking more efficient learning methods to manage their studies better and cope with exam pressure.”

Context (Under what conditions does the phenomenon occur?)

Context describes specific circumstances or settings where the phenomenon takes place. Example: “In STEM fields such as computer science or engineering, AI usage is more widespread and socially accepted compared to humanities disciplines.”

Intervening Conditions (What additionally influences the phenomenon?)

These factors don’t directly cause but significantly affect how strongly a phenomenon occurs. Example: “Institutional university guidelines, like official prohibitions or recommendations regarding AI use, existing knowledge about AI, or students’ general technical affinity.”

Action Strategies (How do people respond?)

Here you examine how individuals react and what strategies they develop. Example: “Some students use AI tools intensively and regularly, others very selectively or critically reflectively, while still others avoid them completely.”

Consequences (What are the outcomes?)

Consequences describe the outcomes or results of the action strategies applied. Example: “Students alter their learning behavior. This can save time and enhance exam preparation, but might also result in superficial learning or dependency on AI tools.”

#4 Challenges in axial coding

Though axial coding is a powerful technique for theory-building, it can also pose challenges.

Distinguishing between various conditions isn’t always straightforward. Some factors can be interpreted as either causal or intervening conditions. It helps to continuously return to your data, asking: Why is something happening, and what influences it? Also, regularly ask yourself: What exactly should my theory describe? If you’re aiming for a process theory, causal conditions might be less important than mechanisms and temporal progression.

Researchers tend to draw premature conclusions. Once a pattern emerges, you might unintentionally look only for data supporting that pattern. Regularly challenge yourself: Are there alternative explanations? Ideally, use theoretical sampling to collect new data addressing your doubts or gaps in the theory at its current stage.

The coding paradigm should be applied flexibly. Not every study fits perfectly into the provided framework. Some parts may be irrelevant, others crucial. Only you can answer this clearly, keeping your research objective firmly in mind.

#5 Axial coding as the foundation for theory development

A crucial point in axial coding is that it doesn’t just help structure your data—it also paves the way to theory development.

As you identify more relationships through axial coding, a central category often emerges (sometimes two or three). These categories form the heart of selective coding—the third step of Grounded Theory—where you ascend another level of abstraction.

If, for instance, you find the most critical factor influencing AI use isn’t time pressure but trust in the technology, you might develop a theory of “Technology Acceptance in University Learning.” If you choose this direction, your codes and categories related to time pressure might become less relevant or even discarded in selective coding.

Remember, Grounded Theory is iterative. You’ll likely move back and forth between open and axial coding several times before developing a robust theoretical model.

Conclusion

Axial coding is central to Grounded Theory, enabling you to understand relationships between categories. By systematically applying Strauss and Corbin’s (1998) coding paradigm, you can uncover patterns and connections in your data, laying a solid theoretical foundation.

Once you’ve mastered axial coding, you can tackle selective coding—the final step, where you identify core categories and further develop your theory.

Questions or need help with coding? Drop a comment below!

Uncategorized

Open Coding in Grounded Theory (+ Example)

Post author By admin
Post date April 1, 2025
No Comments on Open Coding in Grounded Theory (+ Example)

Grounded Theory sounds like a interesting approach, until you try to apply it. Suddenly, you’re drowning in concepts like open coding, categories, and constant comparison. Where do you even start?

If open coding has you scratching your head, don’t worry, you’re in the right place. This guide breaks it down in a way that’s easy to follow, so you can confidently use this method in your research.

What Makes Grounded Theory Different?

Open coding is a part of the data analysis process in the Grounded Theory approach. If you need a refresher on the basics of Grounded Theory, check out the introductory tutorial first.

Typically, when conducting a study, you might create an interview guide, conduct a dozen expert interviews, and then analyze them using content analysis.

In Grounded Theory, things work a bit differently. Here, data collection and data analysis can alternate. You first conduct a few interviews, transcribe them, and then step back to conduct a preliminary analysis. You apply techniques such as open coding (which we will discuss in detail shortly) and develop your initial categories.

Based on these findings, you return to the field and conduct the next round of interviews. This time, your questions build on what you discovered in your first analysis. This allows you to probe more precisely, dive deeper, and identify insights that help refine your emerging mini-theory.

This process, known as theoretical sampling, is the key characteristic that differentiates Grounded Theory from other qualitative research approaches.

Now, let’s get back to open coding.

Open Coding in Grounded Theory

According to Strauss and Corbin (1998), open coding is the first of three phases in the qualitative data analysis process for Grounded Theory:

Open Coding
Axial Coding
Selective Coding

Open coding is the first step in engaging with your data—whether it consists of interview transcripts, social media posts, notes, or other written reflections (referred to as memos in Grounded Theory terminology).

To make this process as easy as possible for you, here are five key goals to keep in mind when conducting open coding. After explaining all the steps, we will go through an example from an interview excerpt to illustrate how to apply open coding in practice.

Step 1: Identify Your Categories

Open coding is essentially the embodiment of inductive category formation, meaning you approach the data with a completely open mind, without considering existing theories.

Some scholars recommend incorporating existing theories at a later stage of your analysis—comparing the categories and relationships you have developed with what has already been established.

For open coding, however, the key is to go through your data without any preconceptions, marking similar content with the same category. In other words, everything that belongs together conceptually should be grouped into the same category.

By the way, the terms “code” and “category” mean the same thing. Coding is simply the process of categorizing.

But how exactly do you form a category?

Step 2: Abstraction Instead of Description

Rather than merely summarizing the content and using the interviewee’s wording, you should abstract the content. For example, consider this sentence from an interview transcript:

“I usually use TikTok to get the latest news.”

Instead of assigning the code “news” to this sentence, a more abstract code like “information acquisition” would be more appropriate.

According to Strauss & Corbin, categories can also have different dimensions. For instance, the category “information acquisition” could include the dimension “frequency”—how often does this information acquisition occur?

Another dimension could be “content”, referring to the type of information being acquired. In this example, it’s news, but in another case, it could be the latest fashion trends.

Dimensions of a category can also be treated as subcategories—a concept you may already be familiar with from qualitative content analysis. If you use software like MAXQDA, creating subcategories can be especially helpful.

Step 3: Constant Comparison

This goal revolves around the following questions:

Are codes being assigned consistently?
Are the same criteria being applied to categorize dimensions?

You can achieve consistency by continuously cross-checking previously coded text excerpts to ensure that similar meanings are being coded the same way, or adjusting if differentiation is needed.

At this stage, memos play an important role again. Here, you can jot down all your ideas as you code. What is the broader context? What could be a general explanation for what you are reading? Memos serve as a playground where you draft your initial theory.

Step 4: Achieving Saturation in Open Coding

You’ve probably heard that you should stop conducting interviews when no new information emerges. This principle is particularly associated with Grounded Theory literature.

In this context, the term theoretical saturation means that no new categories, variations, or relationships between categories emerge from your data.

However, there are two challenges with saturation:

The exact point of saturation is difficult to determine objectively.
Researchers disagree on when and how saturation is reached, and numerous articles (e.g., Saunders et al., 2018) discuss this topic.
In a thesis, you don’t have time for 30 interviews.
If you are working on a thesis, your time is limited. No one expects you to conduct interviews indefinitely until you reach this elusive saturation point. In this case, it is perfectly acceptable—after consulting with your advisor—to set a target number of interviews and stick to it.

For a fully comprehensive Grounded Theory study, the standard 12–15 interviews in a master’s thesis are usually insufficient. Therefore, it is essential to clearly define in your methodology section whether Grounded Theory is being used as a holistic approach or merely as a methodological toolkit without aiming for a fully developed Grounded Theory study.

Alternatively, you could collaborate with someone else and write a joint thesis. This way, you could impress your evaluators with 20–30 interviews and a more comprehensive Grounded Theory study.

For a PhD dissertation, the expectations are different. Here, you must address theoretical saturation and theoretical sampling, ensuring that the Grounded Theory approach is fully implemented from start to finish.

Example of Open Coding (Grounded Theory)

Let’s take this example from a fictional interview transcript:

“Wow, when I first put on the VR headset, it felt like I was in another world. I was totally surprised by how quickly you can connect with others in this Metaverse. I was in a virtual seminar room and later spoke with the professor. She was from Canada and super friendly and open to my ideas.”

The first step is to identify the key W-questions (Who? Where? What? How? Why?).

Where? Metaverse
Who? Professors and students
What? Virtual seminars
How? Through a VR headset in a virtual seminar room

The most interesting statement here is about quick social connection—which the interviewee has already highlighted by expressing surprise. You could code this part as “social connection”.

“Wow, when I first put on the VR headset, it felt like I was in another world. I was totally surprised by how quickly you can connect with others in this Metaverse. I was in a virtual seminar room and later spoke with the professor. She was from Canada and super friendly and open to my ideas.”

In later interviews, you would then explore how and why this interaction happens so easily. Could it be because users are represented by avatars, which lowers the barrier to approaching others?

Maybe. Maybe not.

Only Grounded Theory can reveal the answer.

Uncategorized

How to Create a Codebook in Qualitative Research

Post author By admin
Post date March 26, 2025
No Comments on How to Create a Codebook in Qualitative Research

Are you looking for a quick yet precise guide on how to create a codebook for your next qualitative research project?

Then you’ve hit the jackpot with this article!

In this post, I’ll explain what a codebook is (in case you’re unfamiliar with it), guide you step by step through the creation process without overwhelming you, and even share how you might be able to skip most of the effort altogether while improving the validity of your qualitative data analysis.

What is a Codebook?

Before we dive in, let’s clarify that we are discussing codebook creation within qualitative research. That means that the data you will be analysing can be interview transcripts, documents, reports, videos, social media postings, and so on.

A codebook is essentially a coding manual that provides structured guidelines for assigning categories which represent broader thematic groupings to units of analysis within a qualitative dataset. Each category or theme consists of specific codes, which serve as labels for classification.

Using a codebook is more common in research projects that analyze qualitative data, but do so from a more quantitative perspective. I’ll explain what this means in a second.

In “hardcore” interpretive qualitative studies, for example when using Glaser’s grounded theory approach or Braun and Clarke’s reflexive thematic analysis, a codebook can also be used—but its purpose here is a little different.

Let’s start with the “quantitative” way of analyzing qualitative data. This is done in methods such as quantitative content analysis or deductive thematic analysis. I’ve made tutorials for both of these methods, so please feel free to check them out.

In a quantitative content analysis, you assign small bits of your qualitative data to certain categories. In the case of this method, you do not develop these categories yourself. Instead, you define them prior to your analysis. And how do you do that?

With a codebook!

The codebook contains all categories and descriptions of the categories, specifying how units of analysis (e.g., sentences, tweets, or images) should be classified.

The codebook may also define a numerical value (an ID) that you can assign per category: Category 1, category 2, category 3, and so on.

Example of a Codebook

Let’s look at a concrete example to make this clearer. Imagine we want to analyze tweets about COVID-19, specifically focusing on misinformation as part of our research question.

A codebook designed for this study would need to contain various categories of misinformation commonly found on social media.

Here’s an example from an actual codebook by Memon & Carley (2020):

The authors defined 16 categories into which they classified their material.

For each category, the codebook provides:

A detailed description
Examples
Justifications for why a particular example was classified under that category

Creating Your Own Codebook

You should only create a new codebook if, after thorough screening of the literature, you can’t find an existing one that suits your study or can be adapted to your needs.

Structure of a Codebook

A codebook, much like a scientific paper, should be well-structured for clarity. If necessary, include a table of contents for easy navigation.

Here’s a suggested structure:

#1 Introduction

A brief paragraph explaining:

The context in which the codebook was developed
What it is suitable for
Whether it builds upon an existing codebook (if so, specify which one)
The dataset used to develop the codebook

#2 Overview of Categories

Include a table summarizing all categories. Sometimes a codebook could have two levels, with categories and subcategories or main codes and sub-codes. Whether you call it codes, categories, or themes depends on the method. In content analysis researcher typically refer to categories, in thematic analysis it’s themes and so on. This means, if you are creating your own codebook, you should stick with the vocabulary of the method you want to apply the codebook to.

#3 Description of Categories

Categories can originate in two ways:

From existing literature or a previously established codebook
- In this case, provide the citation.
Developed based on your own dataset
- If you identify a new category during your analysis, you can add it to the codebook.

Each entry in the codebook should consist of:

Title of the Category
Description in your own words, explaining what the category represents and the conditions under which this category applies
Corresponding sub-categories that might be part of this category and what they represent
Unit of analysis (e.g., tweet, comment, video, text snippet)
At least one example (preferably several) from a real dataset
Explanation of why the example(s) were assigned this category or sub-category

For points 5 and 6, you can use a table format similar to the linked example. The key is to keep the codebook as clear and structured as possible for ease of use.

#4 References

Finally, list all sources used in your codebook, just as you would in any scientific work.

Using an Existing Codebook

Creating a new codebook from scratch can be time-consuming. That’s why it’s worth checking for existing codebooks first.

Where to Find Existing Codebooks

Open-Science Databases: Many researchers share datasets and resources, including codebooks, to support the academic community. Examples:
- Zenodo
- OSF (Open Science Framework)
Contacting Authors: If a paper references a codebook but doesn’t provide it in an appendix, try emailing the authors. Researchers often appreciate interest in their work and may be happy to share their codebook.
Adapting a Codebook: If you find a relevant codebook, you can modify it to fit your study. However, make sure to cite the original source and document any changes you made. If you include your adapted codebook in an appendix, provide a detailed explanation of modifications.

Codebooks in Inductive Qualitative Research

In the beginning, I mentioned that codebooks may also be used in inductive qualitative research, such as Glaserian grounded theory or reflexive thematic analysis.

The main difference here is that you are not looking for pre-defined categories. Instead, you start with a blank canvas and create all categories based on your data. The codebook is simply a tool to document your categories. This will help you and others (such as collaborators or reviewers) to better understand how the categories were built. You are essentially creating a documentation of all your categories and examples. But in contrast to quantitative content analysis and deductive thematic analysis, you are doing it during and after the analysis rather than before.

Final Thoughts

A well-structured codebook is essential for conducting research that aims to assign qualitative data to predefined categories or themes.

Whether you create one from scratch or adapt an existing codebook, being systematic, clear and consistent is key to ensuring valid and replicable results.

Uncategorized

Quantitative Content Analysis (7-Step Tutorial)

Post author By admin
Post date March 16, 2025
No Comments on Quantitative Content Analysis (7-Step Tutorial)

You’ve been trying to figure out quantitative content analysis, but no matter where you look, all you find are books, papers, and information on qualitative content analysis.

Help is on the way.

Quantitative content analysis often takes a backseat to its qualitative counterpart, receiving only a brief mention in methodology books. However, if this is the method you want to apply, you require more guidance. And that’s exactly what you’ll get here.

In this article, you will learn how to conduct a quantitative content analysis in seven steps and understand the key differences from qualitative content analysis.

Quantitative vs. Qualitative Content Analysis: The Key Differences

Quantitative content analysis traces back primarily to a methodology book by social psychologist Bernard Berelson. He defined content analysis as a “research technique for the objective, systematic, and quantitative description of the manifest content of communication” (Berelson, 1952, p. 489).

Note: This definition applies to content analysis in general, not just the quantitative approach. Naturally, this sparks debate, as the very term “quantitative” can provoke strong reactions from advocates of the qualitative research paradigm.

The subject matter of content analysis—whether qualitative or quantitative—is always somehow qualitative in nature. That is because content analysis helps us evaluate qualitative data sources, such as newspaper articles, films, social media posts, or documents. But the method itself is heavily informed by the quantitative paradigm, as its name suggests.

Quantitative content analysis systematically converts qualitative material into quantifiable data by applying structured coding schemes and statistical methods. We’ll explore how that works shortly.

Both quantitative and qualitative content analysis aim to systematically and objectively evaluate content. However, a key distinction is that the quantitative approach allows for greater intersubjective traceability, as it follows a structured and replicable coding process.

While qualitative content analysis relies more on the researcher’s judgment and interpretative creativity, quantitative content analysis follows a strict set of rules. It is designed to test theories by verifying hypotheses rather than generating new ones.

Let’s look at the 7 steps of applying quantitative content analysis.

Step #1: Theoretical Preparation for Quantitative Content Analysis

As with any research project within the quantitative paradigm, engaging with existing theories is crucial. Start by defining your research problem—what exactly do you want to investigate?

Ideally, your problem should focus on the relationship between variables that you can examine through content analysis. For example, you might study “news framing” related to climate change and the “emotions” in social media discussions.

Before conducting your quantitative content analysis, formulate hypotheses—testable assumptions about the relationships between variables. A strong hypothesis clearly defines the dependent and independent variables and ensures that the coding categories reflect these constructs. For example, in a study on climate change news framing, you might hypothesize that news articles from government-funded media use the ‘scientific consensus’ frame more often than private news outlets. Another example: “Climate change news framing (independent variable) influences the emotional responses of social media users (dependent variable).”

For a deeper dive into hypothesis formulation, check out my dedicated tutorial on the topic.

Step #2: Sampling

Now, you need to determine your sample. Suppose you want to analyze how climate change is framed in social media. Your sample could consist of a random selection of 500 tweets from major news outlets (e.g., BBC, CNN, Reuters) over the past six months.

Ensuring that the sample is representative is crucial, for example, by balancing sources from different political perspectives. A quantitative content analysis typically allows for a bit of a larger sample because breadth is more important than depth. For a qualitative content analysis, it’s the exact opposite.

Step #3: Defining the Unit of Analysis

At this stage, you specify the level at which your material will be analyzed. For example, if you are studying how climate change is framed in tweets, your unit of analysis could be (1) entire tweets, (2) individual hashtags, or (3) specific phrases related to emotions (e.g., ‘climate crisis’ vs. ‘climate hoax’). If you’re analyzing a text, the unit could be a full sentence or individual words, depending on your research objective.

If you’re looking for semantic nuances, such as emotional tones, it might make sense to analyze individual words. If you’re investigating broader themes, like news “frames,” analyzing entire sentences or text sections may be more appropriate.

Step #4: Defining Descriptive Categories

Before starting the analysis, you need to establish categories for classifying your units of analysis. This involves researching existing coding manuals or codebooks in the academic literature. If none suit your purpose, you must develop your own.

For example, in a study on news framing, a coding manual would list various frame types such as ‘scientific consensus,’ ‘economic impact,’ or ‘conspiracy theory’ and provide instructions for assigning sentences, tweets, videos, or images to these categories.

Authors of coding manuals typically include example cases and detailed coding guidelines, ensuring clarity and consistency. Think of the coding manual as a structured guide for analysis, whether for your use or for others replicating your study.

If you’d like me to create a video on how to develop a coding manual, let me know in the comments!

Step #5: Quantification

Once you’ve assigned each unit of analysis to a category, count how often each category appears in your sample. For example, if analyzing 500 tweets, you might find that 40% frame climate change as a ‘scientific consensus,’ while 25% present it as a ‘conspiracy theory.’ These frequencies allow for statistical comparison and further quantitative analysis.

The most common technique for evaluation is frequency analysis, which links category occurrences to the variables under investigation.

According to Krippendorff (1980), key techniques in quantitative content analysis include frequency analysis, contingency analysis, and cluster analysis. He emphasizes that quantitative content analysis must ensure reliability through systematic coding procedures and validation techniques. These methods help uncover statistical patterns while ensuring measurement validity and intercoder reliability.

A crucial aspect of any quantitative content analysis is ensuring reliability and validity. Intercoder reliability should be tested using Krippendorff’s Alpha or Cohen’s Kappa to ensure that different coders classify content consistently. Without strong reliability, the statistical findings of the analysis may not be meaningful.

If you are doing the analysis by yourself, you cannot calculate intercoder reliability. For this case, you may look into “intracoder” reliability.

Step #6: Statistical Analysis

Statistical analysis can be either descriptive (e.g., frequency distributions, cross-tabulations, means, and standard deviations) or inferential, depending on the dataset size. Inferential techniques include regression models to test relationships and factor analyses to identify underlying patterns in large datasets. Descriptive statistics summarize patterns within the data, while inferential techniques, such as regression models, examine relationships between variables. Factor analyses can identify latent patterns in large datasets, while contingency analysis tests the association between different categorical variables. For example, contingency analysis can reveal whether certain frames are more common in specific media sources, while a regression model can test how media framing influences audience perceptions.

For meaningful results, your categories must be clearly operationalized and directly related to the variables under examination. Thus, problem formulation, hypothesis generation, and category selection should be well-aligned.

Step #7: Presenting the Results of Quantitative Content Analysis

When reporting your results, tables are your best friend. First, present the absolute frequencies of your categories and describe them in your own words.

Next, outline the results of your statistical tests, explaining why you chose them and what the findings mean.

Finally, state which of your hypotheses were supported and which were rejected.

Conclusion: Why Choose Quantitative Content Analysis?

Quantitative content analysis is an excellent choice when you want to test an exisintg theory or framework with qualitative data. Some research questions cannot be effectively addressed through traditional quantitative methods like surveys or experiments. In such cases, content analysis provides a valuable alternative.

If this sounds like what you’re looking for—then quantitative content analysis is the right method for you!

Literature on Quantitative Content Analysis

Berelson, B. (1954). Content Analysis. In G. Lindzey (Ed.), Handbook of Social Psychology. Vol. 1: Theory and Method (pp. 488–522). London: Addison-Wesley.

Krippendorff, K. (1980). Content Analysis: An Introduction to Its Methodology. Sage Publications.

Uncategorized

Participant Observation (Research Method Explained)

Post author By admin
Post date March 10, 2025
No Comments on Participant Observation (Research Method Explained)

What is participant observation? Where does this research method originate? In which cases is it used? And most importantly: How can you successfully conduct this method yourself, which has sometimes been called “the last great adventure of social science” (Evans-Pritchard, 1973)?

If these questions matter to you, then you’re in the right place. Grab a drink, sit back, and enjoy this article as a smooth introduction to your own ethnographic adventure.

Ethno…what? Don’t worry, we’ll get to that.

Ethnographic Research

Participant observation is a core method in ethnographic research, often simply referred to as fieldwork. The aim is to gain insights into human behavior, group dynamics, and social interactions.

The subject of study can range from an indigenous tribe in Papua New Guinea to a tech startup in a small town in Germany.

The word “ethnos” comes from ancient Greek and roughly means “foreign people.” This research approach has its roots in anthropology and ethnology. Historically, it was used in expeditions to remote regions or isolated islands to study the people, tribes, and cultures living there. Today, ethnographic methods are widely used in various disciplines, including sociology, education, social psychology, and even business studies.

Observation

Observation is probably the most well-known method in ethnography. Spradley (1979) describes it in very simple terms:

“I want to understand the world from your point of view. I want to know what you know in the way you know it. I want to understand the meaning of your experience, to walk in your shoes, to feel things as you feel them, to explain things as you explain them. Will you become my teacher and help me understand?”

If you have developed a research question that can be answered by describing the behavior of individuals in their natural environment, then observation is a suitable method. By observing, you can see with your own eyes what you aim to study.

In contrast, methods like expert interviews or surveys require you to rely on participants’ statements being accurate and honest.

As a result, observation is one of the empirical methods where researcher subjectivity plays the largest role. Subjectivity is common in qualitative research, but in observation, it is even more pronounced, as everything is filtered through the researcher’s own perceptions and senses.

Non-Participant Observation

In non-participant observation—just as the name suggests—you remain an outsider, merely watching without engaging in the activity. Besides the distinction between participant and non-participant observation, another key factor is whether the observation is overt or covert.

In overt observation, you ask for permission beforehand, introduce yourself, and explain why the study is being conducted and how it might be beneficial for the participants.

Covert observation, on the other hand, takes place without the knowledge of those being observed. While this might yield highly authentic insights, it is rarely used—and for good reason. Ethically, covert observation is highly problematic and would have difficulty passing an ethics committee review.

Participant Observation

The “participant” aspect of participant observation refers to the extent to which you, as the researcher, are involved in the situation. There are different roles you can take on.

Gold (1958) identified four different roles that researchers can assume in participant observation:

Complete participation

When you are already a full member of the group you are studying, such as when you observe a company where you work as a student assistant.

Active participation

When you try to engage in the same activities as the group members but are still an outsider.

Moderate participation

When you alternate between observing and participating to maintain a balanced approach.

Passive participation

When you are present but do not engage in the activities, interacting minimally with the group.

For example, if you were studying an indigenous tribe in the Amazon, you might actively take part in a spiritual ritual. This would make you highly involved in the experience, possibly giving you access to insights and conversations you might not otherwise have. This would be considered active participation. Alternatively, you could just follow along quietly, staying in the background while smiling and clapping along—this would be passive participation. In non-participant observation, you would avoid any interaction altogether.

However, active participation also has a significant drawback: the people you observe may alter their behavior simply because you are participating. This effect must always be considered and critically discussed.

Additionally, you can conduct conversations with participants. These ethnographic interviews are quite different from structured expert interviews. There is no pre-defined questionnaire; instead, conversations occur naturally within the setting. The goal is to build a respectful and trusting relationship. These interactions might take place around a campfire outside usual working hours or in an unexpected setting. Instead of recording the conversation, you take notes and later document your insights in a research diary.

The Three Phases of Participant Observation

To help you prepare for your participant observation, here are three key phases that this method can be divided into (Spradley, 1980; Flick, 2019):

Describing the Research Environment

At the beginning of your participation in a group, you are an outsider and need time to acclimate. Your presence is something new for the group members, and they must adjust to having you around. During this phase, it is advisable to remain somewhat in the background and start by thoroughly documenting the environment. Write down everything you observe—what you see, hear, and experience. Simultaneously, take the opportunity to introduce yourself and gradually establish rapport with individual members.

Focused Observations

Once you have become an accepted presence within the group, you can transition to more purposeful observations. At this stage, you can initiate targeted conversations and immerse yourself in situations that directly contribute to answering your research questions. Your observations become more structured as you begin to refine the focus of your study.

Selective Observations

In the final phase of your study, you will have already gathered significant insights and formulated preliminary answers to your research questions. Now, your objective is to seek out specific examples and supporting evidence that substantiate your findings. This phase requires critical thinking and a keen eye for patterns and consistencies in behavior.

Data Collection and Analysis in Participant Observation

When it comes to collecting data, you can take either a structured or unstructured approach. If you have created checklists, formulated guiding questions, or prepared other documentation in advance, you are following a structured approach.

Conversely, if you enter the observation setting with an open mind and an empty notebook, allowing observations to guide your documentation process, your approach is unstructured. Both methods have their advantages and limitations.

Your research diary plays a crucial role in the analysis process. Alongside taking notes during your observations, you should later expand on them in your diary, adding reflections and interpretations. To ensure that you do not overlook documentation, consider setting aside dedicated time—perhaps a few hours or an entire day—away from the field to write down your impressions in detail.

After data collection, qualitative analysis techniques can be applied to make sense of the findings. Common methods include:

Thematic Analysis: Identifying recurring patterns, themes, and categories within the observational data.
Coding: Assigning labels to different aspects of the data to systematically organize insights.
Narrative Analysis: Examining how observed interactions and behaviors construct meaning within a specific social context.

These approaches help translate raw observations into meaningful interpretations, allowing you to draw conclusions from your study.

Now, lace up your boots and embark on your research adventure!

Uncategorized

What is a Histogram? (Statistics Basics)

Post author By admin
Post date February 27, 2025
No Comments on What is a Histogram? (Statistics Basics)

What is a histogram in statistics? How does it visualize data? And how can this visualization help you with data analysis?

In this video, I’ll show you how to ace your next statistics exam and take your data analysis to the next level using histograms.

Histograms are a standard tool in statistics and are essential for many academic papers. To help you understand and use histograms effectively, I’ll walk you through the basics today.

Of course, I’ll also show you how to create a histogram for any dataset in no time.

1. What is a Histogram?

A histogram is a type of chart that represents a frequency distribution. As you can see in the graphic, the x-axis represents intervals, while the y-axis shows their corresponding frequencies.

A key characteristic of a histogram is that the bars are directly adjacent to one another, with no gaps in between. This emphasizes the continuous nature of the data, as each bar represents a range of values rather than discrete categories. This is because histograms are used for continuous data (e.g., measurements like weight, length, or time spans).

In contrast, bar charts represent categorical data (nominal data such as the number of students in different study programs like law, psychology, or business administration). That’s why bars in a bar chart are separated from each other.

It’s also crucial that the y-axis of a histogram starts at a frequency of 0. The height of each bar represents the number of data points in that interval.

If the baseline is altered, the perceived heights of the bars change, potentially distorting the actual distribution of the data. This could lead to an overestimation of low frequencies or an underestimation of high frequencies.

2. Where Are Histograms Used?

Histograms are widely used across various fields. In economics, for example, they help analyze income distribution across different demographic groups. In medicine, they assist in understanding the distribution of measurements like blood pressure or BMI within a population.

They are also crucial for fundamental statistical data analysis, such as checking whether a dataset follows a normal distribution.

3. Creating a Histogram in Statistics

Let’s create a histogram using a real-world example. We have a dataset of exam scores from the last statistics test:

53, 41, 71, 91, 99, 93, 87, 74, 97, 81, 85, 89, 78, 61, 66, 71, 86.

First, you need to create a frequency distribution table and group the scores into intervals.

The intervals must have equal width, ensuring that all bars are the same size. If the intervals are too wide, important details might be lost, whereas too narrow intervals could make the chart too complex. For this example, I’ve chosen intervals of 10 points each (40-49, 50-59, 60-69, etc.).

In statistics, class intervals for histograms are typically chosen so that the lower boundary is inclusive, and the upper boundary is exclusive.

This means that an interval of 60-69 includes all values from 60 up to but not including 69. If we instead used an interval of 60-70, the value 70 would belong to two intervals (both 60-70 and 70-80), leading to ambiguity. To avoid this issue and ensure a clear, unambiguous assignment of data points to intervals, histogram intervals do not overlap.

Now let’s look at the frequencies.

One student scored in the 40-49 range.
Another student scored between 50-59.
Two students scored between 60-69.
Four students scored between 70-79.
And so on…

Now, you need to plot this data using software like Excel or R. The result for our example looks like this:

4. Understanding a Histogram in Statistics

Interpreting a histogram in statistics is a crucial step in understanding your collected data. A histogram provides a visual representation of how data is distributed.

It helps identify patterns and anomalies that may indicate specific trends or issues. Keep in mind that in density histograms, probabilities are represented by the area of the bars, while in frequency histograms, the bar height indicates the number of observations in each interval.

1. Data Distribution

Histograms show the frequency of data within different intervals, making it easy to assess distribution at a glance. Researchers can quickly determine whether the data is normally distributed, skewed left or right, or exhibits other patterns like bimodal distributions.

A normal distribution, often called a bell curve, means that most data points cluster around a central value, with symmetrical tails extending on both sides. In a university setting, this could represent exam scores, where most students achieve average marks, while very high or very low scores are less common.

A skewed distribution indicates that the data is asymmetrically spread. A positively skewed (right-skewed) histogram shows a concentration of low values with a few high values—such as the time students spend studying for a subject. Many may spend only a little time, while a few invest a lot. A negatively skewed (left-skewed) distribution suggests the opposite.

A bimodal distribution, featuring two peaks, may indicate the presence of two distinct groups. For example, in a class attended by both first-year and advanced students, two peaks might suggest that each group tends to score differently.

2. Identifying Anomalies

Visualizing data can reveal outliers, unusual patterns, or anomalies that may warrant further investigation. The width of the intervals shows how data is grouped.

Narrow bars indicate a detailed distribution.
Wider bars provide a more generalized overview.
Bar height represents the number of observations in each interval—taller bars indicate higher frequencies.

3. Comparing Datasets

Histograms allow for easy comparison of two or more datasets. You can use them to examine how data is distributed under different conditions or across different groups.

4. Hypothesis Testing

Histograms can help formulate or test hypotheses about data. For example, if you hypothesize that a particular variable follows a normal distribution, a histogram in statistics can confirm or disprove this assumption.

5. Decision-Making

In practice, such as in quality control, histograms are used to determine whether a business process meets specific specifications.

5. Interpreting Our Example Histogram

To better understand a histogram in statistics, I’ll now pose a few questions about our example. Feel free to pause and try answering before checking the solutions.

Would you say the data is symmetric, or is it skewed left or right?

You can see that the taller bars are on the left side. This suggests a left-skewed distribution, meaning the data has negative skewness. In other words, students scored relatively high in this exam.

What is the mode of this dataset?

The mode is the interval with the highest frequency. In this case, most students scored between 80 and 89, making this the mode.

How many students scored up to 69 points?

Adding the first three bars: 1+1+2 = 4 students scored up to 69 points.

How many students scored at least 80 points?

Adding the last two bars: 5+4 = 9 students scored at least 80 points.

How many students scored between 60 and 89 points?

Adding the middle bars: 2+4+5 = 11 students scored within the intervals 60-69, 70-79, and 80-89.

6. Histograms and Probabilities

Histograms help navigate large datasets. These visual representations display probability distributions, which are essential for understanding a dataset’s dynamics.

Returning to our exam example: the bar heights indicate how many students fall within specific score ranges. But they also reflect the probability of a randomly selected student achieving a particular result.

A clustering of values around a central score suggests a normal distribution, which many statistical tests assume. The histogram helps determine whether this assumption holds or if another testing approach is needed.

Histograms also allow us to infer conclusions about an entire population from a sample, provided that the sample is representative and sufficiently large. For instance, a histogram of a class’s exam scores can provide insights into the performance of all students in the program.

All in all, a histogram is like a Swiss Army knife in a statistics. If you want to dive deeper, I highly recommend Andy Field’s book Discovering Statistics.