Thematic analysis of qualitative interview transcripts follows six phases: familiarize yourself with the data, generate initial codes from meaningful segments, search for themes across codes, review themes against the data, define and name each theme, and write up your findings.
That framework, originally outlined by Braun and Clarke in 2006, has since become one of the most cited methodology papers in social science, with over 300,000 Google Scholar citations.
But the framework only works if the transcripts underneath it are coded well. Every phase depends on how consistently you've tagged your data, from that first read-through all the way to the final write-up.
In this guide, I’ll walk you through each phase with practical coding decisions attached, covering the tools and workflows that support the process, and flagging the mistakes that cost researchers the most time during analysis.
Why your transcripts need a systematic coding workflow
Picture this: you've just wrapped 12 interviews. There are 200+ pages of transcripts on your screen, and your mind is drifting towards the quotes that stood out for you. That instinct might not always be right.
Without a structured coding workflow, analysis defaults to whatever you remember best, which usually means the most recent interviews and the responses that confirm what you expected to find.
And this isn't a fringe risk, either. McMullin (2021) highlights that 40-80% of social science and third-sector studies rely on qualitative methods like interviews, focus groups, and ethnographic observation.
That's an enormous volume of data that depends on the researcher's coding discipline to produce credible findings.
And if you're not careful about biases, you end up building themes on incomplete evidence.
Having said that, a systematic approach to coding qualitative interview data doesn't eliminate interpretation.
What it does is force you to account for the full dataset rather than the parts that feel most relevant in the moment.
If you're still weighing whether to transcribe qualitative research interviews in the first place, that's a separate decision. This guide assumes you already have transcripts and need a reliable way to move from raw text to defensible themes.
Codes, categories, and themes: What each term means in qualitative analysis
Before you start coding themes in qualitative data, it helps to be clear on what each term actually refers to. These three concepts serve separate purposes, and confusing them is one of the most common errors in thematic analysis of interview transcripts.
- A code is a descriptive label you apply to a small segment of data. It captures what that segment is about in a word or short phrase
- A category groups related codes together, which gives you a way to organize your codes without jumping straight to interpretation
- A theme is broader. Themes are patterns of meaning that cut across categories and connect back to your research question. You don't just find themes in the data. You construct them through the process of reading, coding, and re-reading
| Factors | Code | Category | Theme |
|---|---|---|---|
| Definition | A label applied to a meaningful segment of text | A cluster of related codes | A pattern of meaning that addresses the research question |
| Example | "Felt unsupported by manager" | Workplace support experiences | Structural isolation in hybrid teams |
| Level of abstraction | Low, close to the data | Mid-level, organizational | High, interpretive |
One thing worth remembering: if you've used transcription or qualitative data analysis software, you've probably seen the terms "tags," "codes," and "labels" used interchangeably. They refer to the same action. Tags in a transcription tool like HappyScribe are functionally equivalent to codes in qualitative methodology.
I point this out because the distinction is small, but it might confuse researchers who move between tools. A tag you apply in your research transcription software carries the same weight as a code you'd assign in NVivo or ATLAS.ti, so treat it with the same rigor from the start.
The six phases of thematic analysis for interview transcripts
Braun and Clarke's (2006) six-phase framework is the most widely used approach to thematic analysis, and for good reason.
The phases are sequential but not strictly linear. You'll move back and forth between them as your understanding of the data deepens, which is expected and part of the process.
What follows is an adapted walkthrough for researchers working specifically with interview transcripts.
Phase 1: Familiarization
Read your transcripts from start to finish. Then read them again. If you still have access to the original audio, listen back while following the text, because tone, pauses, and emphasis often carry meaning that doesn't reflect on the page. A transcription platform that offers accurate transcripts and synced media playback will speed up your work.
The goal here isn't to start coding. It's to build a broad sense of what's in the data before you make any analytical decisions.
💡Pro tip:
Keep a notebook or separate document for initial impressions, recurring ideas, and anything that surprises you. The output at this stage is informal.
Where this phase goes wrong is when you rush it. 12 interviews can feel familiar after one read-through, but familiarity isn't the same as comprehension. The patterns you notice on a second or third round might just be the ones that end up shaping your themes.
For a deeper look at how familiarization fits into broader research, check our guide on qualitative research methodology.
Phase 2: Generating initial codes
This is where you move from reading to labeling. Work through each transcript line by line or segment by segment, and assign a code to every meaningful unit of data.
- Inductive (or open) coding lets codes emerge from the data itself, without a predefined framework
- Deductive coding works in the other direction, where you derive codes from your research questions, theoretical framework, or existing literature
- In vivo coding uses the participant's own words as the code label, which can be useful when the exact phrasing carries significance
Most researchers blend approaches. You might start with a deductive set of codes drawn from your research questions, then let inductive codes surface as you work through transcripts that take unexpected turns.
Here's what that looks like in practice, using a short passage from a fictional interview about remote work:
Interviewer: How has the shift to remote work affected your relationship with your team?
P7: I don't really talk to my manager unless something's wrong. [limited manager contact] Before, I'd catch him in the hallway and we'd sort things out quickly. [loss of informal communication] Now everything has to be a scheduled meeting, which means small problems add up until they're big enough to justify a calendar invite. [escalation through formality] I've stopped raising things that aren't urgent. [self-censoring]
Two of those codes, "limited manager contact" and "loss of informal communication," could later be grouped under a category like workplace support experiences, which connects to the theme of structural isolation in hybrid teams from the earlier table. That's the trajectory: code to category to theme.
The common mistake at this stage is coding too broadly or too narrowly. I'll cover this in more detail later.
Phase 3: Searching for themes
Once your transcripts are coded, step back and look at the codes as a collection. Group related codes into candidate themes, and use visual tools like thematic maps or code trees to see how clusters relate to each other.
This is an exploratory phase. You're testing groupings, noticing which codes work together naturally and which ones don’t belong in any categories. Don’t commit to final themes yet.
Phase 4: Reviewing themes
At this stage, you take your candidate themes back to the data. Do the coded segments actually support the theme you've built around them? Does the theme still work when you re-read the full transcripts?
As you observe alignment, the themes start to get split, merged, or dropped altogether. The final output is a refined set of themes with clearer boundaries and stronger data support.
Phase 5: Defining and naming themes
Each surviving theme now gets a written description that explains what the theme captures, what it doesn't, and how it relates to adjacent themes. Give it a concise, descriptive name so you can find it easily.
Good theme names carry a lot of analytical weight. "Structural isolation in hybrid teams" tells the reader something specific but "Remote work issues" doesn't. Once you get the labels right, your dataset will resemble a thematic framework: named themes with written definitions and scope boundaries.
Phase 6: Writing up
The final phase turns your thematic framework into a narrative that answers the research question. Each theme becomes a section or argument, supported by representative quotes from the transcripts.
Select quotes carefully. The strongest passages are those where the participant's words illustrate the theme directly and can stand with minimal contextual explanation.
💡Pro tip:
Two or three well-chosen quotes per theme land better than a long string of supporting evidence.
For a more detailed walkthrough of moving from coded transcripts to a written analysis, this guide on analyzing qualitative research interview transcriptions covers the process in depth.
How to code interview transcripts in practice
The six phases give you the conceptual framework. But now it's time to cover the decisions you'll make when you sit down to actually code.
Manual vs. software-assisted coding
Manual coding has stood the test of time and plenty of researchers still use Word or Google Docs with comment threads and color-coded highlighting.
It works for smaller projects with fewer transcripts, but the limitations show up when you try to scale.
Once you start working across 15 or 20 interviews, tracking codes consistently through comments and highlights gets difficult to manage.
CAQDAS tools like NVivo, ATLAS.ti, and MAXQDA are built for that. They offer code hierarchies, cross-transcript queries, and visual mapping. Worth noting that NVivo and ATLAS.ti are now both owned by Lumivero, while MAXQDA remains independent and is often considered more approachable for mixed-methods work.
Outside traditional CAQDAS, Dovetail serves UX and product teams, Delve is popular with dissertation researchers, and Taguette is a free open-source option for unfunded projects.
Regardless of the tool, the coding logic stays the same.
Building your codebook
A codebook is the reference document that keeps your coding consistent across multiple reviews or team members.
If you're working solo on a small study, you can get away with a less formal version, but the discipline still pays off in the long run.
- Start with a small set of codes drawn from your first few transcripts or your research questions
- Write a short definition for each code, with an example passage that shows when it applies
- Note boundary rules for when a code doesn't apply, particularly for codes that overlap
- Refine iteratively as you work through more transcripts, splitting, merging, or dropping codes that aren't working
- Track every change and the reasoning behind it, because that log becomes your audit trail. I’ll expand on this later
Coding with a team
When more than one researcher is coding the same data, inter-coder reliability (ICR) becomes a factor. The standard practice is to have at least two coders independently code a subset of transcripts, then compare agreement using a metric like Cohen's kappa.
Low agreement doesn't necessarily mean someone coded poorly; it might indicate that the codes need clearer definitions in the codebook. Treat disagreements as a signal to revise rather than to override.
AI-assisted coding
AI tools can now auto-suggest codes, surface patterns across large transcript sets, and apply a codebook faster than manual coding. For researchers working with 20+ hours of interview audio, that's hours saved on the repetitive work.
But the tools come with certain limitations. A 2025 study by Jowsey et al. tested Microsoft Copilot on thematic analysis and found minimal overlap between human-generated and AI-generated themes. Copilot's outputs included fabricated quotes in 58% of cases, and it drew themes from only the first few pages of data rather than the full dataset. The study exposed the limitations of Microsoft Copilot AI, but the underlying concern applies to other tools.
The takeaway isn't to avoid AI-assisted coding entirely. AI can handle the initial pass and the pattern-flagging, but interpreting nuance, resolving ambiguity, and making theoretical claims remain the researcher's job.
Preparing your transcripts for thematic analysis
There's a step between "I have a transcript" and "I start coding" that often gets skipped, and it directly affects the quality of everything that follows. Here’s how you should go about it:
- Choose the right transcription type. Verbatim captures every filler word and false start, which suits discourse analysis. For thematic analysis, intelligent verbatim is usually the better fit because it preserves meaning without the noise
- Label your speakers clearly. If your interview transcription doesn't identify who said what, you lose the ability to track how themes distribute across participants or compare experiences between groups
- Align your timestamps. When a coded segment feels ambiguous on the page, you need a way to return to the original audio. Consistent timestamp alignment makes that possible without scrubbing through hours of recordings
- Clean up before you code. AI-generated transcripts will contain errors, especially with technical vocabulary and overlapping dialogue. Correcting mistakes and standardizing formatting before coding prevents bad data from getting baked into your codebook
This is where your choice of transcription platform can save you time. HappyScribe handles the transcript preparation step so that researchers spend less time cleaning and more time coding.
A few HappyScribe features that directly ease your life as a researcher:
AI and human transcription
HappyScribe’s automatic transcription gives you a first draft in minutes. When precision is the ultimate factor, you can request human transcription that delivers 99% accuracy.
Expert linguists are worth considering for studies where exact phrasing is an important part of the process.
Speaker detection
With automatic speaker labeling, transcripts arrive ready for participant-level coding.
This not only saves you time but also directly impacts your analysis. When speakers are correctly identified from the start, you can filter codes by participant and compare how the same theme surfaces differently across interviews.
If you're working with focus groups with speaker overlaps, accurate detection makes attribution easier.
Interactive transcript editor
Editing, searching, and reviewing transcripts become a lot simpler with synced audio playback.
During the familiarization stage, you can read and listen at the same time, which is exactly what Braun and Clarke recommend in Phase 1.
During coding, you can click any segment and hear how the participant delivered it, catching sarcasm, hesitation, or emphasis that reads differently on the page.
Multilingual support
For research projects that span languages, HappyScribe supports 150+ languages and dialects on a single platform, so multilingual transcription and preparation don't require separate tools.
Be it Icelandic, Swiss German, Swahili, or Korean, you can run global research without worrying about misinterpretations.
If you're weighing whether automatic transcription is good practice for qualitative research, that decision depends on the precision your methodology requires. For most thematic analysis projects, starting with AI transcription and reviewing against the audio is a practical middle ground.
Common mistakes in qualitative coding and how to avoid them
1. Using interview questions as themes
If your themes map directly onto your discussion guide, there's a good chance you've organized the data rather than analyzed it.
Themes should emerge from patterns across responses, not mirror the questions you asked.
When you notice this happening, go back to your codes and look for connections that cut across multiple questions.
2. Coding too broadly or too narrowly
A code like "workplace challenges" captures everything and distinguishes nothing. A code like "felt frustrated at 2pm on Tuesday" appears once and cannot be compared to anything.
Test your codes by asking whether each one appears often enough to reveal a pattern but stays specific enough to tell you something useful about the data.
3. Skipping the review phase
First-pass themes are drafts, not findings. Reporting them without checking against the full dataset is how contradictions slip through, and weak themes survive.
Build in at least one full review pass where you read the original transcripts again with your candidate themes in front of you.
4. Ignoring contradictory data
Disconfirming cases are uncomfortable, but they make your analysis stronger when you address them directly.
A theme that only works by sidestepping contradictory data won't survive scrutiny.
Flag contradictions during coding, then decide whether they refine the theme or point to something you missed.
5. Not documenting your coding decisions
If you can't explain why a code exists, why it changed, or why a theme was dropped, your analysis loses credibility.
As I mentioned in the codebook section, maintaining a running log of what changed and why is enough. Reviewers and examiners will look for it.
Reporting thematic analysis in your methodology section
Reviewers and supervisors expect a clear account of how you moved from raw transcripts to final themes. This checklist covers what to include when writing up your methodology:
- The transcription method you used (AI, human, or hybrid) and why you chose it
- The software or tools used for coding
- Whether your coding approach was inductive, deductive, or a combination
- The number of coding passes you completed
- How themes were reviewed and refined, including any themes that were split, merged, or dropped
- How inter-coder reliability was assessed and if more than one researcher coded the data
- Whether member-checking or respondent validation was part of your process
Documenting your transcription method is also increasingly expected in published research.
If you used AI transcription, reviewers will want to know how you verified accuracy. You should include a brief section on your quality-checking process to cover all the bases.
Start with transcripts you can trust
Thematic analysis is only as strong as the transcripts behind it. If the text is inaccurate, the speaker labels are wrong, or the formatting is inconsistent, every coding decision that follows inherits those problems.
HappyScribe gives you research-ready transcripts with accurate speaker labeling, synced audio playback, and support for 150+ languages, so you can move straight into coding with confidence.
FAQs on how to use thematic analysis on qualitative interview transcripts
What is the difference between codes and themes in qualitative research?
Codes are descriptive labels you apply to small data segments. Themes are broader patterns of meaning you construct by grouping codes and interpreting what they reveal about your research objectives. Think of codes as the raw building blocks. You might apply multiple codes to a single transcript passage, and those codes get organized into categories before themes emerge. The key difference is abstraction: codes stay close to the data, while themes require analytical thinking to connect separate ideas into a coherent argument that addresses your research question.
How many interviews do you need for thematic analysis?
There's no fixed number. Data saturation, the point where new interviews stop producing new codes, typically occurs between 12 and 20 interviews for most qualitative research projects. But the right number depends on your research scope, participant diversity, and how complex the topic is. A focused study with a narrow question can reach saturation faster than one exploring a broad phenomenon across diverse groups. Start coding early in the research process rather than waiting until all your data is collected, because that gives you a clearer sense of when new interviews are confirming existing patterns rather than generating fresh insights.
Can you use AI to code qualitative interview data?
Yes, but with limits. AI tools can auto-suggest how different code fit into the narrative analysis, surface broader patterns across large transcript sets, and apply a codebook consistently across focus group transcripts and interview data. That's useful for the repetitive tagging work, especially when you're coding qualitative data across 20+ hours of audio.
Where AI falls short is in interpretive analysis. Identifying themes, resolving ambiguity in how different quotes relate to each other, and making theoretical claims still require a human researcher with a nuanced understanding of the dataset. AI can handle the initial pass, but researchers should handle the final one since they have a comprehensive understanding.
What software is best for coding qualitative interview data?
It depends on your research methods and team size. NVivo, ATLAS.ti, and MAXQDA are established tools for academic qualitative researchers conducting thematic analysis on interview transcripts and open-ended responses. Dovetail serves UX and product research teams, and Delve is popular with dissertation researchers who want a simpler interface.
For the transcription layer that feeds into your data analysis workflow, HappyScribe handles transcript preparation with accurate speaker labeling, synced audio playback, and support for 150+ languages. Getting your transcripts right before you start coding makes thematic analysis considerably smoother, regardless of which coding tool you use.
How long does it take to code interview transcripts?
Manual coding typically takes 2 to 4 hours per hour of interview audio, depending on how dense the data is and how granular your coding scheme gets. A one-hour interview with detailed notes and a fine-grained codebook will take longer than a structured interview coded at a higher level.
AI-assisted tools can cut initial coding time significantly, but reviewing, refining, and developing themes from that first pass remains manual work. Budget your time for the full thematic analysis process, not just the first round of coding.
How do you report thematic analysis in a dissertation?
Lead with the framework you followed. For most qualitative researchers, that's Braun and Clarke's (2006) six-phase model. Then document your transcription method (AI, human, or hybrid), the software you used for coding, whether your approach was inductive or deductive, the number of coding passes you completed, and how you reviewed and refined your preliminary themes until they accurately represent the dataset.
If more than one researcher coded the data, include how you assessed inter-coder reliability. If you used HappyScribe or another AI transcription tool, note how you verified accuracy. Reviewers increasingly expect this level of transparency about the theme development process in published qualitative research.
Biplab Mazumder
Biplab is a content marketer and writer who helps high-growth brands scale content visibility across AI search channels. His works have been published in HubSpot, Freshworks, Atlassian, SurferSEO, etc. When he's not planning content strategy, he's testing AI content workflows and use cases.






