Article Text


What is grounded theory?
  1. Helen Noble1,
  2. Gary Mitchell2
  1. 1School of Nursing and Midwifery, Queens's University Belfast, Belfast, UK
  2. 2Four Seasons Health Care, Belfast, UK
  1. Correspondence to : Dr Helen Noble
    , School of Nursing and Midwifery, Queens's University Belfast, Belfast BT9 7BL, UK; Helen.noble{at}

Statistics from


Grounded theory (GT) is a research method concerned with the generation of theory,1 which is ‘grounded’ in data that has been systematically collected and analysed.2 It is used to uncover such things as social relationships and behaviours of groups, known as social processes.3 It was developed in California, USA by Glaser and Strauss during their study—‘Awareness of Dying’.1 It is a general methodology for developing theory that is grounded in data which is systematically gathered and analysed.

Features of GT

  • Data collection and analysis occur simultaneously.

  • Categories and analytic codes developed from data. Pre-existing conceptualisations not to be used—this is known as theoretical sensitivity (see below).

  • Theoretical sampling used to refine categories.

  • Abstract categories constructed inductively.

  • Social processes discovered in the data.

  • Analytical memos used between coding and writing.

  • Categories integrated into a theoretical framework.4

Carrying out a GT study

First the area of interest is identified. Theoretical preconceptions should be avoided, although it is accepted this is difficult in practice. Analytical procedures and sampling strategies are then used and the study is finished when theoretical sampling reached5 all discussed below. Data collected may be qualitative or quantitative or a combination of both. Data collection methods often include in-depth interviews using open-ended questions. Questions can be adjusted as theory emerges. Observational methods and focus groups may also be used.

Theoretical sampling

Glaser and Strauss (1967) first mentioned theoretical sampling and described a process of generating theory from data which includes collecting the data, then coding and analysing the data.1 Next the researcher makes a conscious decision about what further detail they feel needs exploring as the new theory develops. It usually takes place after some initial key concepts or categories have been identified, for example, you might decide to interview patients about their experience of heart failure. They may talk about systematic errors occurring in the general practice surgery. From this analysis of the data you may decide to approach and interview GPs to explore their views on patients’ comments. Theoretical sampling therefore, is used to produce more data to endorse or refute the categories that have been identified in the previous analysis.6

Theoretical sensitivity

Theoretical sensitivity refers to the insight of the researcher. It concerns the researcher being able to give meaning to data, understand what the data says, and being able to separate out what is relevant and what is not. By being theoretically sensitive and using insight, the researcher is able to develop a theory that is grounded, theoretically dense, and cohesive.7 Sensitivity comes from several sources including (1) literature—in depth reading offers a rich understanding of the phenomena being studied; (2) professional and personal experience—offers an understanding of the events and topics being explored; (3) the analytic process—allows for insight and understanding of the phenomena.8

Analysis of data in GT

There are three stages of data analysis in GT:8

  1. Open coding: this involves line by line coding where concepts and key phrases are identified and highlighted and moved into subcategories, then categories. This breaks the data down into conceptual components and the researcher can start to theorise or reflect on what they are reading and understanding—making sense of the data. The data from each participant will be ‘constantly compared’ for similarities.

  2. Axial coding: at this stage relationships are identified between the categories, and connections identified.

  3. Selective coding: this involves identifying the core category and methodically relating it to other categories. The relationships must be authenticated and categories refined. Categories are then integrated together and a GT identified.

Analytical notes are encouraged. These are notes to oneself to explain thought patterns in relation to the data analysis. Final theory is usually generated from the integration of several analytical memos.

The core category

The core category is the chief phenomena around which the categories are built. Theory is generated around a core category. The core category should account for the variation found in the data, that is, the categories will relate to it in some way. The categories demonstrate how the core category is situated in the lives of those participating in the study.

Example of a GT case study

As illustrated, GT methodologies involve the construction of new theory through the analysis of data. In a study carried out by Beech et al,9 the authors sought to explore patient participant experiences of recovery following surgical intervention for colorectal cancer. Beech et al9 opted to use GT because previous studies had sought to answer this research question by measuring quantifiable biomedical markers, such as symptoms of pain, insomnia or fatigue. According to the authors, there was a paucity of empirical literature around the topic from a holistic perspective, for example social, psychological and cultural aspects of a person's well-being.

Twelve participants were interviewed four times, over a 1-year period. The authors used theoretical sampling to guide the researcher as data were collected. It helped facilitate the development of theory as it emerged, not once data collection was complete.1 Initial participants were selected based on ‘subject area’, as is recommended in theoretical sampling. Each had undergone a surgical procedure to remove a tumour in their bowel or rectum and had not received prescribed chemotherapy or radiotherapy. The authors initially asked patient participants to describe their experiences to date.

Data analysis of the interviews was carried out according to the steps described by Strauss and Corbin.10 The authors began by coding each line of each patient participant transcript. Similar codes were then grouped together to form subcategories and within these subcategories categories were identified. The authors then grouped together the categories to form theory related to patient participant experiences of recovery following surgical intervention for colorectal cancer. The process of data collection continued until each category was saturated and no new data emerged.

Patient participants described their recovery in three phases identified from three categories; disrupting the self, repairing the self and restoring the self. The authors also noted how the process was linear in that all participants went through the stages, for example, phase one began at prediagnosis and ended at the conclusion of surgery; phase two commonly lasted between 3 and 6 months and phase three, from 6 months onwards, was related to a person's fluctuating level of wellness and illness. Notably, these three categories were underpinned by various subcategories, which were generated from initial codes. For example, the second category Beech et al4 identified, ‘disrupting the self’, was made up of the three subcategories; body repair, autonomy and re-establishing personal identity (figure 1).

Figure 1

Grounded theory data analysis.

Importantly, the authors encapsulated the three categories to present a pertinent theory related to patient participant experiences of recovery following surgical intervention for colorectal cancer. They found that recovery is more than physical repair. It is a process of restoring a sense of wellness demonstrated through an awareness and enjoyment of the physical, emotional, social and spiritual aspects of life, in other words, holistic health.9


By using GT and adhering to this as a research method, a theory will be produced that is grounded in your data.10 It is a research method which uses strict procedures for data analysis and will enable you to search for and conceptualise the hidden social and collective patterns and constructions in your area of interest.


View Abstract


  • Competing interests None declared.

  • Provenance and peer review Not commissioned; internally peer reviewed.

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.