Skip to main content
🧬Peptide Protocol Wiki

Semaglutide: Molecular Structure

Chemical properties, amino acid sequence, and structural analysis

Reviewed byDr. Research Team(MD (composite credential representing medical review team), PhD in Pharmacology)
📅Updated January 31, 2026
Verified

📌TL;DR

  • Molecular formula: C187H291N45O59
  • Molecular weight: 4113.58 Da
  • Half-life: Approximately 7 days (168 hours)

Amino Acid Sequence

H-Aib-EGTFTSDVSSYLEGQAAKEFIAWLVRGRG with C18 fatty diacid at Lys26

66 amino acids

Formula

C187H291N45O59

Molecular Weight

4113.58 Da

Half-Life

Approximately 7 days (168 hours)

3D molecular structure of Semaglutide
Three-dimensional representation of Semaglutide
Amino acid sequence diagram for Semaglutide
Color-coded amino acid sequence of Semaglutide

Molecular Structure and Properties#

Semaglutide is a 31-amino-acid synthetic peptide analog of human glucagon-like peptide-1 (GLP-1), developed by Novo Nordisk. It has a molecular weight of approximately 4,113.58 Da, with the molecular formula C187H291N45O59 and CAS number 910463-68-2. Semaglutide was designed through iterative optimization of the native GLP-1(7-37) sequence to achieve enzymatic resistance, prolonged duration of action, and enhanced receptor potency.

Amino Acid Sequence#

The primary structure of semaglutide is based on the native human GLP-1(7-37) sequence with two critical modifications:

His-Aib-EGTFTSDVSSYLEGQAAKEFIAWLVRGRG

with a C18 fatty diacid (octadecanedioic acid) moiety conjugated at the lysine residue at position 26 (corresponding to position 34 in the full proglucagon numbering) via a gamma-glutamic acid and mini-PEG linker.

Key residues and their functional significance:

  • Position 1 (Histidine): Retained from native GLP-1; essential for GLP-1 receptor binding and activation. The N-terminal histidine engages the receptor transmembrane domain.
  • Position 2 (Alpha-aminoisobutyric acid, Aib): Replaces native alanine at position 2. This non-natural alpha,alpha-disubstituted amino acid provides steric resistance to dipeptidyl peptidase-4 (DPP-4) cleavage. Native GLP-1 is rapidly inactivated by DPP-4 (half-life approximately 2 minutes), making this substitution essential for therapeutic viability.
  • Position 26 (Lysine): Serves as the conjugation site for the C18 fatty diacid moiety, which is attached through a gamma-glutamic acid spacer and a hydrophilic mini-PEG (aminoethyloxy-acetyl) linker. This position was selected because modifications here do not significantly impair GLP-1 receptor binding.
  • Positions 7-8 (Threonine-Phenylalanine): Critical for receptor binding affinity. The phenylalanine at position 8 (position 16 in proglucagon) contributes to hydrophobic interactions with the receptor extracellular domain.
  • C-terminus (Glycine): The sequence terminates at position 31 with glycine, matching the native GLP-1(7-37) form. Unlike some other GLP-1 analogs, semaglutide does not have C-terminal amidation.
PropertyValueNotes
Sequence length31 amino acidsBased on GLP-1(7-37)
Molecular weight~4,113.58 DaIncluding C18 fatty diacid
Molecular formulaC187H291N45O59Complete molecule
CAS number910463-68-2Registry identifier
Non-natural residueAib at position 2DPP-4 resistance
Lipid modificationC18 fatty diacid at Lys26Via gamma-Glu/mini-PEG linker
C-terminusFree acidGLP-1(7-37) form
Backbone originHuman GLP-1(7-37)94% sequence homology to native

Lipid Modification and Albumin Binding#

The C18 fatty diacid (octadecanedioic acid) conjugated at Lys26 is the defining pharmacokinetic feature of semaglutide. This modification serves several purposes:

  1. Albumin binding: The fatty diacid moiety binds non-covalently to serum albumin with high affinity (>99% bound in plasma), creating a circulating depot that protects the peptide from renal filtration and proteolytic degradation. This extends the half-life from approximately 2 minutes (native GLP-1) to approximately 7 days, enabling once-weekly dosing.

  2. Linker design: The connection between the peptide backbone at Lys26 and the fatty diacid uses a gamma-glutamic acid spacer combined with a mini-PEG (aminoethyloxy-acetyl) element. The mini-PEG component provides hydrophilicity that offsets the lipophilicity of the fatty acid, improving aqueous solubility and reducing the tendency for self-aggregation. This linker design was an improvement over the C16 fatty acid used in liraglutide, which lacks the mini-PEG spacer and achieves a shorter half-life of approximately 13 hours.

  3. Pharmacokinetic optimization: The albumin-bound semaglutide circulates as a slow-release reservoir. Free peptide dissociates from albumin to engage GLP-1 receptors, while the albumin-bound fraction maintains steady-state plasma concentrations with low peak-to-trough variability across the weekly dosing interval.

The C18 chain length was specifically selected through structure-activity relationship studies. Shorter chains (C14, C16) provided insufficient albumin binding affinity, while longer chains reduced aqueous solubility. The C18 diacid represents the optimal balance. This design is conceptually related to but distinct from the C20 fatty diacid used in tirzepatide, which employs a longer chain and different linker to achieve a similar but not identical pharmacokinetic profile (half-life approximately 5 days for tirzepatide versus approximately 7 days for semaglutide).

Physicochemical Properties#

Semaglutide is formulated differently depending on the route of administration:

Injectable formulation (Ozempic/Wegovy): Clear, colorless solution for subcutaneous injection in phosphate buffer at pH 7.4, with disodium phosphate dihydrate, propylene glycol, and phenol as preservative. The formulation is stable under refrigerated conditions (2-8 degrees C).

Oral formulation (Rybelsus): Co-formulated with sodium N-(8-[2-hydroxybenzoyl] amino) caprylate (SNAC), an absorption enhancer that promotes transcellular absorption of semaglutide across the gastric epithelium. SNAC creates a localized increase in pH and lipophilicity at the gastric mucosal surface, facilitating peptide absorption. Oral bioavailability is approximately 0.4-1%, requiring substantially higher doses (7-14 mg daily) compared to the injectable formulation (0.5-2.4 mg weekly).

  • Solubility: Freely soluble in aqueous buffers at physiological pH
  • Stability: Stable at 2-8 degrees C for the shelf life; injectable pens may be stored at room temperature (up to 30 degrees C) for up to 56 days after first use
  • Isoelectric point: Approximately 5.0, reflecting multiple acidic residues (glutamic acid, aspartic acid) and the fatty diacid

Pharmacokinetics#

Semaglutide exhibits predictable, dose-proportional pharmacokinetics across the approved injectable dose range (0.25 mg to 2.4 mg weekly).

Absorption: After subcutaneous injection, semaglutide is absorbed slowly from the injection depot. Time to maximum plasma concentration (Tmax) is approximately 1-3 days post-injection. Absolute bioavailability after subcutaneous administration is approximately 89%. For the oral formulation, bioavailability is approximately 0.4-1%, with Tmax of approximately 1 hour under fasting conditions.

Distribution: The apparent volume of distribution is approximately 12.5 liters, consistent with distribution primarily in the vascular compartment due to extensive albumin binding (>99%). The high degree of protein binding limits extravascular distribution.

Metabolism: Semaglutide is metabolized through proteolytic cleavage of the peptide backbone, beta-oxidation of the C18 fatty diacid, and amide hydrolysis. It does not undergo clinically meaningful CYP-mediated metabolism, and CYP-mediated drug-drug interactions are not expected. The primary metabolite retains an intact fatty acid chain but lacks pharmacological activity.

Elimination: The elimination half-life is approximately 7 days (approximately 168 hours), supporting once-weekly dosing. Clearance is approximately 0.035 L/hr. Semaglutide is eliminated primarily through metabolic degradation; approximately 3% is excreted unchanged in urine. Steady-state concentrations are achieved after 4-5 weeks of once-weekly dosing.

PK ParameterValueNotes
Tmax (SC)1-3 daysSubcutaneous injection
Tmax (oral)~1 hourFasting conditions required
Bioavailability (SC)~89%Subcutaneous administration
Bioavailability (oral)~0.4-1%With SNAC absorption enhancer
Vd (steady state)~12.5 LPrimarily vascular compartment
Protein binding>99%Albumin binding
Half-life~7 days (168 h)Supports weekly dosing
Clearance~0.035 L/hrMetabolic elimination
Time to steady state~4-5 weeksWith weekly dosing
CYP metabolismNot significantCYP-independent

Receptor Pharmacology#

Semaglutide is a highly selective GLP-1 receptor agonist with no clinically meaningful activity at the GIP receptor, glucagon receptor, or other related receptors. This selective profile distinguishes it from newer multi-agonists such as tirzepatide (dual GIP/GLP-1) and retatrutide (triple GIP/GLP-1/glucagon).

  • GLP-1 receptor (GLP-1R): Semaglutide activates GLP-1R with high affinity, comparable to native GLP-1 in potency for cAMP generation. The Aib substitution and fatty acid conjugation do not substantially reduce receptor binding compared to native GLP-1.
  • Biased agonism: Research suggests that semaglutide, like tirzepatide, may exhibit a degree of biased agonism at GLP-1R, with preferential activation of G protein signaling (cAMP pathway) relative to beta-arrestin recruitment. This signaling bias may contribute to sustained receptor responsiveness and a favorable therapeutic window.
  • No GIP receptor activity: Unlike tirzepatide, semaglutide does not activate the GIP receptor, which may partially account for the difference in weight loss efficacy observed in comparative studies.

The design evolution of GLP-1 receptor agonists reflects progressive molecular engineering:

  • vs. Native GLP-1(7-37): Semaglutide retains 94% sequence homology with only the Aib2 substitution. The fatty diacid conjugation is a post-translational modification. Native GLP-1 has a half-life of approximately 2 minutes versus approximately 7 days for semaglutide.
  • vs. Liraglutide: Both are acylated GLP-1 analogs, but liraglutide uses a shorter C16 fatty acid at Lys26 without a mini-PEG spacer, yielding a half-life of only approximately 13 hours (daily dosing required). Semaglutide's C18 diacid with mini-PEG linker provides stronger albumin binding and longer half-life.
  • vs. Tirzepatide: Tirzepatide is a 39-amino-acid peptide based on the GIP sequence backbone (not GLP-1) with modifications for dual GIP/GLP-1 receptor activity. It uses a C20 fatty diacid at Lys20. Semaglutide's selective GLP-1 agonism contrasts with tirzepatide's dual agonist profile.

Frequently Asked Questions About Semaglutide

Explore Further

Disclaimer: For educational purposes only. Not medical advice. Read full disclaimer