
Scaffold S18 — dN/dS classification

Five genes. For each one, you are given the number of non-synonymous substitutions per non-synonymous site (dN) and the number of synonymous substitutions per synonymous site (dS) from a gene-family comparison. Classify the selection regime from the ratio dN/dS: positive, purifying, or neutral.


Running tally — dN vs. dS for each round

What you just did has a name

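dN/dS compares the rate of non-synonymous substitution (which changes the amino acid) to the rate of synonymous substitution (which does not). Synonymous changes are mostly invisible to selection and accumulate at roughly the neutral rate μ, so dS serves as the baseline. Non-synonymous changes are screened by selection, and the ratio reads out the net effect of that screening: dN/dS below 1 means most amino-acid changes are removed (purifying selection), dN/dS near 1 means they accumulate about as fast as synonymous changes (neutral), and dN/dS above 1 means they are fixed faster than the neutral baseline (positive selection).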
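The classification you did in each round can be sketched in a few lines of Python. This is a minimal illustration, not a statistical test: the function name `classify_dn_ds` and the tolerance `tol` (how close to 1 still counts as "neutral") are assumptions made for the example, and real analyses would use a likelihood-based test rather than a fixed band.

```python
def classify_dn_ds(dn: float, ds: float, tol: float = 0.1) -> str:
    """Classify the selection regime from dN and dS.

    tol is a hypothetical half-width of the band around 1 that is
    treated as neutral; it is an illustrative choice, not a standard.
    """
    if ds <= 0:
        # With no synonymous substitutions the ratio is undefined.
        raise ValueError("dS must be positive to compute dN/dS")
    omega = dn / ds
    if omega > 1 + tol:
        return "positive"
    if omega < 1 - tol:
        return "purifying"
    return "neutral"


# Made-up values for three genes, one per regime.
examples = {
    "gene_a": (0.02, 0.20),  # ratio near 0.1
    "gene_b": (0.30, 0.10),  # ratio near 3
    "gene_c": (0.10, 0.10),  # ratio of 1
}
labels = {name: classify_dn_ds(dn, ds) for name, (dn, ds) in examples.items()}
```

The explicit check on dS matters in practice: closely related sequences can have dS near zero, which makes the ratio unstable or undefined.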

Important caveat. dN/dS averaged over a whole gene can hide site-wise variation. Most proteins have highly constrained core residues (low dN/dS) and a few adaptively evolving surface residues (high dN/dS). The average is dominated by the constrained sites, which is why gene-level dN/dS rarely exceeds 1 even for proteins under strong positive selection on a few sites. Site-wise models, as implemented in packages such as PAML and HyPhy, are the standard follow-up when you want to locate the signal.
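A small calculation shows how the averaging works. The sketch below assumes, as a simplification, that the gene-level ratio is the site-fraction-weighted mean of per-class ratios (that is, dS is the same across sites). The function name `gene_level_omega` and the site fractions and ratios are invented for illustration.

```python
def gene_level_omega(site_classes):
    """Weighted mean of per-class dN/dS values.

    site_classes: list of (fraction_of_sites, omega) pairs whose
    fractions sum to 1. Assumes equal dS across classes (a simplification).
    """
    total = sum(fraction for fraction, _ in site_classes)
    if abs(total - 1.0) > 1e-9:
        raise ValueError("site fractions must sum to 1")
    return sum(fraction * omega for fraction, omega in site_classes)


# Hypothetical protein: 95% constrained sites, 5% adaptively evolving sites.
classes = [
    (0.95, 0.1),  # constrained core, strong purifying selection
    (0.05, 4.0),  # surface sites under positive selection
]
average = gene_level_omega(classes)  # about 0.295
```

Even with a twentieth of the sites at a ratio of 4, the gene-level value comes out near 0.3 and would be read as purifying selection.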