One Letter That Changed Everything: The HBB Gene & Sickle Cell Disease (Gene Series : Part 1)
The very first molecular disease ever identified — a single amino acid swap that cripples red blood cells, yet for 7,300 years has protected millions from malaria
On November 25, 1949, a paper appeared in the journal Science that changed everything about how medicine understood disease. Its title — "Sickle Cell Anemia, a Molecular Disease" — contained a phrase that had never before appeared in a medical paper. Written by Linus Pauling, Harvey Itano, Seymour Singer, and Ibert Wells at Caltech, the paper did something no one had done before: it traced a human disease directly to a defect in a specific protein molecule.
The irony is profound. The HBB mutation that causes sickle cell disease is one of the most destructive mutations known — destroying red blood cells, causing agonising pain crises, and shortening lives. Yet this same mutation has persisted in human populations for over 7,000 years and reached frequencies as high as 30% in some African communities because it confers powerful protection against the world's most deadly infectious disease: malaria. The HBB story is simultaneously one of suffering and of evolutionary ingenuity — a single DNA letter that is both a curse and a shield.
What Is Sickle Cell Disease?
Sickle cell disease (SCD) is an autosomal recessive haemoglobinopathy caused by a homozygous or compound heterozygous mutation in the HBB gene encoding the beta-globin subunit of haemoglobin. It is the world's most common serious monogenic disorder, affecting approximately 300,000–400,000 newborns annually. The name derives from the characteristic crescent (sickle) shape that red blood cells assume under low-oxygen conditions in affected individuals.
The Scientists — A Story Across Three Acts
The only person to win two unshared Nobel Prizes (Chemistry 1954, Peace 1962). Pauling turned his attention to haemoglobin in the 1930s and set his graduate student Harvey Itano to compare normal and sickle cell haemoglobin in 1946. His insight that the disease lay in the protein itself — not in the environment — was revolutionary. Pauling coined the term "molecular disease" in the 1949 paper, establishing the entire field of molecular medicine.
Born to Japanese immigrant parents, Itano was incarcerated in a WWII internment camp while earning his chemistry degree. He was released to attend medical school and went on to become the primary experimentalist behind the 1949 paper. Itano developed the electrophoresis method that separated normal from sickle haemoglobin. Despite doing the central experimental work, he was denied a faculty position at Caltech and spent years in obscurity before receiving belated recognition later in life.
A Jewish refugee who escaped Nazi Germany in 1939 and settled in Britain. After his PhD at University of London, he joined Frederick Sanger's lab (where the first protein sequencing was being developed). Ingram used a brilliant two-dimensional technique — protein fingerprinting — to cut haemoglobin into fragments, separate them by charge and size, and compare normal vs. sickle patterns. In 1956–1957 he pinpointed the single amino acid substitution (Glu→Val at position 6) — the first time a disease had been traced to a single amino acid change in a protein.
Independently and simultaneously with E.A. Beet, Neel proved in 1949 that sickle cell disease followed an autosomal recessive inheritance pattern — that carriers (heterozygotes) were clinically healthy while those with two copies were affected. This was published in Science in the same volume as Pauling's paper. Neel also later recognised that sickle cell trait conferred protection against malaria — explaining why the mutation had been maintained at high frequency in malaria-endemic populations despite causing severe disease when homozygous.
The Full Discovery Timeline
Intern Ernest Irons first observes "peculiar elongated and sickle-shaped" red blood cells in the blood of Walter Clement Noel, a dental student from Grenada admitted to the Chicago Presbyterian Hospital in November 1904. His supervisor, James Herrick, publishes the first documented US case in 1910. The disease had been known in West African communities for generations — called "ogbanjes" (children who come and go) in some regions due to high infant mortality. Ghanaian family records trace it to 1670.
Physician Vernon Mason names the condition sickle cell anaemia after examining a large series of cases and documenting its recurring features.
Linus Pauling, a world-renowned physical chemist at Caltech, attends a haematology conference and hears physician William Castle suggest that the sickling phenomenon might be linked to the haemoglobin molecule itself. Electrified, Pauling returns to Caltech and assigns his graduate student Harvey Itano — a physician-trained chemist recently released from a WWII Japanese-American internment camp — to test whether sickle and normal haemoglobin differ in their electrical properties.
Using the Tiselius moving-boundary electrophoresis apparatus, Itano and colleagues demonstrate that sickle haemoglobin (HbS) has a distinctly different electrical charge compared to normal haemoglobin (HbA), and that sickle cell trait individuals (carriers) have a mixture of both. The paper in Science — "Sickle Cell Anemia, a Molecular Disease" — introduces the concept of disease as a molecular defect. At this point they do not know which amino acid is altered, nor where on the protein.
James Neel at Michigan and E.A. Beet independently establish that sickle cell disease is autosomal recessive — two copies cause disease, one copy (trait) confers partial sickling with no anaemia. This definitively establishes the Mendelian inheritance pattern.
Working in Frederick Sanger's Cambridge laboratory (where the first protein sequence — insulin — had just been determined), Ingram develops a two-dimensional "fingerprinting" technique: he digests haemoglobin with the enzyme trypsin, then separates the resulting peptide fragments by electrophoresis in one dimension and chromatography in the second. When he compares the patterns from HbA and HbS, they are identical in all but one spot. Ingram isolates that one peptide and sequences it, finding that position 6 of the beta chain carries glutamic acid in HbA but valine in HbS. One amino acid. This is published in Nature in 1956 and 1957 and stands as the first identification of a disease caused by a single amino acid substitution in a protein.
Using the newly developed tools of molecular biology, researchers map the HBB gene to Chromosome 11. Richard Flavell prepares the first molecular maps of the beta-globin cluster.
First reported cure of SCD — a bone marrow transplant performed in a child to treat acute leukaemia incidentally cures his sickle cell disease. This proof of concept drives decades of BMT research.
FDA approves Casgevy (exagamglogene autotemcel) — the first CRISPR-based gene therapy for any disease — for sickle cell disease. Simultaneously, Lyfgenia (lovotibeglogene autotemcel), a lentiviral gene therapy, is also approved. SCD becomes the first disease for both the first molecular characterisation AND the first gene editing cure.
— Linus Pauling, Harvey Itano, Seymour Singer, Ibert Wells, Science, 1949
The HBB Gene — Molecular Architecture
Gene Architecture
The HBB gene is compact by genomic standards — just 1,600 bp of coding sequence in 3 exons spanning approximately 1.6 kb total — within the broader beta-globin gene cluster on chromosome 11p15.4. This cluster spans ~45 kb and contains five functional globin genes expressed at different developmental stages: ε (embryonic), Gγ and Aγ (foetal, making HbF), δ (minor adult), and β (major adult). Understanding this developmental regulation became critical for therapy.
The beta-globin protein (146 amino acids) pairs with the alpha-globin protein (141 amino acids, encoded on Chromosome 16) to form the α₂β₂ tetramer of haemoglobin A (HbA). Each subunit contains one iron-containing haem group that reversibly binds one oxygen molecule, allowing haemoglobin to carry four O₂ molecules per tetramer with cooperative binding kinetics (the basis of the sigmoidal oxygen-dissociation curve).
The HbS Mutation — Molecular Mechanism of Polymerisation
The c.20A>T transversion converts codon 6 (GAG → GTG), substituting glutamic acid (hydrophilic, charged) with valine (hydrophobic, uncharged) at the sixth position of the mature beta-globin protein. This single substitution creates a hydrophobic surface patch on the exterior of the deoxygenated HbS molecule — a "sticky patch" that is buried in oxygenated HbA but exposed upon deoxygenation.
Deoxygenated HbS molecules undergo a concentration-dependent, nucleation-dependent polymerisation into long, twisted 7-strand fibres (14-strand with central and outer strands). The Val-6 residue on one HbS molecule inserts into a hydrophobic acceptor pocket formed by Phe-85 and Leu-88 on the EF loop of a neighbouring β-chain. This axial contact, combined with lateral contacts, drives fibre growth at rates exceeding 10⁶–10⁷ monomers per second under physiological conditions.
The kinetics follow a double-nucleation mechanism: homogeneous nucleation (slow, concentration-dependent) followed by heterogeneous nucleation on existing fibre surfaces (much faster). This produces a characteristic sigmoidal kinetic curve — a delay period followed by explosive polymerisation. During the delay period (seconds to minutes depending on concentration), the cell is still deformable. If reoxygenation occurs during the delay period (as it typically does in healthy individuals with sickle cell trait), polymerisation is avoided. In homozygous SCD patients, the delay period is sufficiently short that sickling occurs in the microcirculation before reoxygenation.
Polymer formation increases intracellular viscosity by several orders of magnitude, deforms the plasma membrane, and generates reactive oxygen species. Repeated sickling/unsickling cycles cause irreversible membrane damage, creating a subpopulation of dense, dehydrated, irreversibly sickled cells (ISCs) that remain sickled even when reoxygenated.
Pathophysiology — The Cascading Consequences
SCD is not simply a disease of misshaped cells. It is a multi-organ, inflammatory vasculopathy driven by several converging mechanisms:
Vaso-occlusion and pain crises: Sickled cells are rigid, sticky, and adhere to the vascular endothelium via receptors including BCAM/Lu and VLA-4 (on RBCs), P-selectin (on endothelium), and Mac-1 (on neutrophils). This initiates a complex multicellular adhesion cascade — involving RBCs, neutrophils, monocytes, and platelets — that occludes small blood vessels, causing ischaemia and the excruciatingly painful vaso-occlusive crises (VOCs) that define the disease.
Haemolytic anaemia: Sickled RBCs have a lifespan of only 10–20 days (vs. ~120 days normally). The resulting haemolysis releases free haemoglobin and haem into plasma, scavenging nitric oxide (NO). Loss of bioavailable NO causes endothelial dysfunction, platelet activation, and vasoconstriction — contributing to pulmonary hypertension, priapism, and stroke.
Organ damage: Repeated vaso-occlusion progressively destroys the spleen (functional asplenia by age 5 in most patients — explaining heightened susceptibility to encapsulated bacteria), kidneys (sickle cell nephropathy), bone (avascular necrosis), and retina. Stroke occurs in up to 11% of patients by age 20.
Treatments — From Hydroxyurea to the First CRISPR Cure
Hydroxyurea — Reawakening Foetal Haemoglobin
Hydroxyurea, an anti-cancer drug, was approved for SCD in 1998 — the first disease-modifying treatment. It works by inducing the production of foetal haemoglobin (HbF, α₂γ₂). HbF does not participate in the polymerisation process (its γ-chain lacks Val-6 and its presence within the RBC sterically inhibits HbS fibre formation). By raising HbF levels, hydroxyurea reduces the intracellular HbS concentration and delays polymer nucleation, dramatically reducing the frequency of pain crises, acute chest syndrome, and hospitalisation.
Voxelotor and Crizanlizumab — Newer Mechanisms
Voxelotor (2019): Directly binds to HbS at the α-chain and stabilises the oxygenated conformation, inhibiting HbS polymerisation — the first drug to directly target the polymerisation step.
Crizanlizumab (2019): A monoclonal antibody against P-selectin that blocks the multicellular adhesion cascade driving vaso-occlusion, reducing pain crisis frequency.
Bone Marrow Transplantation — The Only Established Cure
Allogeneic haematopoietic stem cell transplantation (HSCT) from a matched sibling donor offers the only established cure, with event-free survival rates of 85–90% in children. However, fewer than 20% of patients have a suitable matched sibling donor, and transplant-related morbidity (graft-versus-host disease) remains significant in adults.
Casgevy (exagamglogene autotemcel, exa-cel) — developed by Vertex Pharmaceuticals and CRISPR Therapeutics — is the first approved CRISPR-based gene therapy for any human disease. It uses CRISPR-Cas9 to disrupt the gene BCL11A in the patient's own haematopoietic stem cells. BCL11A is a repressor that silences the foetal gamma-globin gene (HBG1/HBG2) after birth. Disrupting BCL11A reactivates HbF production in adult RBCs — effectively replacing the role of the HBB mutation. Patients produce sufficient HbF (~30–40% of total haemoglobin) to prevent sickling permanently. In pivotal trials, 29 of 29 patients treated were free of severe vaso-occlusive crises for at least 12 months post-treatment.
Lyfgenia (lovotibeglogene autotemcel) — developed by bluebird bio — uses a lentiviral vector to insert a modified βT87Q-globin gene (engineered to have anti-sickling properties similar to HbF) into the patient's own stem cells.
Both therapies require the patient's own HSCs, gene modification ex vivo, and reinfusion after myeloablative conditioning. Current cost: ~$2.2 million (Casgevy) and ~$3.1 million (Lyfgenia) per patient. Global access remains the defining challenge of the next decade.
Legacy — The First Molecular Disease, the Last to Be Cured
The story of SCD is a story of paradoxes. It was the first disease to be defined at the molecular level — in 1949 — yet patients in sub-Saharan Africa, where 75% of affected births occur, still lack access to basic treatments including hydroxyurea and newborn screening. The disease that proved the concept of molecular medicine and that now boasts the world's first CRISPR cure is simultaneously one of the most poorly resourced diseases in global health. At $2.2 million per treatment, gene therapy is inaccessible to the 90% of the world's SCD patients born in low-income countries.
Harvey Itano, who did the essential experimental work in 1949, was incarcerated as an American of Japanese descent during the war, received no faculty position at Caltech despite his landmark contribution, and waited decades for recognition. Sickle cell disease itself — predominantly affecting people of African descent — was historically underfunded relative to diseases of similar severity affecting predominantly white populations. The 1972 National Sickle Cell Anemia Control Act, championed by the Black Panther Party's community health programmes, was a pivotal moment in forcing political attention to this neglect. The science was always ahead of the politics. And 76 years after Pauling named it the first molecular disease, the politics are still catching up.
Key References: Pauling L et al., Science 1949; Ingram VM, Nature 1956, 1957; Neel JV, Science 1949; Allison AC, BMJ 1954; Frangoul H et al., N Engl J Med 2021 (CRISPR SCD trial); FDA approval of Casgevy and Lyfgenia, December 2023.
Comments
Post a Comment