Skip to Content

Can You Breed a Champion? What Bloodlines Really Tell Us About Racehorses

Can You Breed a Champion? What Bloodlines Really Tell Us About Racehorses

Last updated: June 2, 2026

By: Miles HenryFact Checked

Thoroughbred genetics influence speed, distance range, and soundness — but they do not guarantee performance. A horse’s DNA shapes its physical tendencies; training, management, and circumstance determine what happens with them. For a racehorse owner, understanding that distinction matters more than memorizing gene names.

Genetics influence probabilities, not outcomes. This guide covers what those probabilities look like in practice — what the MSTN speed gene means for distance placement, how the three foundation sires shaped the breed, what genomic testing is used for at auction, and the growing concern about genetic diversity in an increasingly concentrated gene pool.

Thoroughbred Genetics — Key Points:

  • The MSTN gene is the most studied genetic marker in Thoroughbreds — it helps indicate a horse’s likely distance preference
  • Three foundation sires — Darley Arabian, Godolphin Arabian, and Byerley Turk — are the foundation of nearly all modern Thoroughbreds
  • Genetic testing is now widely used at sales — MSTN genotype often appears in auction catalogs alongside pedigree and conformation notes
  • Northern Dancer’s descendants dominate modern pedigrees — Northern Dancer’s descendants appear in most active Thoroughbred pedigrees — often multiple times within the same five-generation page
  • Genetic diversity is declining — the concentration of a small number of elite sires is a recognized concern among breeding researchers and veterinarians
  • Gene editing is banned in racing — all major registries including The Jockey Club prohibit genetically modified horses from registration

About this guide: This guide reflects what genetics looks like in the sale ring, on the backside, and on the pedigree page, where bloodlines, conformation, and race-day potential all have to be weighed together.

What genetics can and cannot predict in Thoroughbreds
Genetics Can Predict Genetics Cannot Reliably Predict
Optimal racing distance (via MSTN)Future race wins
Muscle fiber type profileCareer earnings
Disease carrier statusClass level reached
General physical tendenciesSoundness throughout career
Some temperament traitsResponse to training
Heritable heart-size tendencyWhich foal becomes the champion

What Genetics Controls in a Thoroughbred

Thoroughbred genetics don’t produce a horse that will definitely win — they produce a horse with a particular set of physical and physiological tendencies that training can develop or fail to develop. The distinction matters because buyers and bettors sometimes treat pedigree as a performance guarantee, which it isn’t.

What genetics does influence meaningfully in a Thoroughbred: muscle fiber composition (how much fast-twitch versus slow-twitch muscle a horse carries), cardiovascular capacity, skeletal structure proportions, and behavioral tendencies like trainability and willingness under pressure. These traits combine to determine whether a horse is built for short sprints or longer routes — and how long it can sustain that work before injury risk climbs.

What genetics influences most directly in Thoroughbreds:

  • Muscle fiber type — the ratio of fast-twitch (explosive, sprint) to slow-twitch (sustained, route) muscle fiber is strongly heritable
  • Heart size — an enlarged heart is a heritable trait; Secretariat’s heart was estimated at roughly 22 pounds compared to the average horse’s 8–9 pounds, and some of his daughters became notable producers of large-hearted offspring
  • Optimal racing distance — the MSTN gene variant is the best-known genetic marker for distance tendency, helping indicate whether a horse’s muscle fiber profile favors sprints, middle distances, or routes
  • Bone density and joint structure — heritable traits that affect injury susceptibility and career longevity
  • Temperament — trainability, gate behavior, and response to stress all have genetic components, though environment plays a significant role

What genetics does not reliably predict: which specific horse will win a specific race, how well a foal will respond to a particular training program, or whether a well-bred horse will stay sound through a full racing campaign. The gap between genetic potential and racing outcome is where management, veterinary care, training, and luck all operate.

The MSTN Speed Gene — What It Actually Means

The myostatin gene (MSTN) is the most studied and most commercially applied genetic marker in Thoroughbred racing. Myostatin is a protein that limits muscle growth — horses with certain MSTN variants produce less myostatin, which allows for greater fast-twitch muscle development. The practical result is a horse better suited to shorter, more explosive efforts.

A 2010 study by Emmeline Hill and colleagues identified three MSTN genotypes in Thoroughbreds, each corresponding to a different racing distance profile. This research, published in PLOS ONE, has been replicated and expanded in subsequent studies and forms the basis of commercial testing products like Plusvital’s Speed Gene Test.

MSTN genotype and racing distance profile — Thoroughbreds
MSTN Genotype Distance Profile Practical Meaning
C/CSprinterBest suited to races under 1 mile; fast-twitch dominant; early speed profile
C/TMilerOptimal around 1–1.2 miles; versatile; most common genotype in the breed
T/TStayer / RouteBest suited to races over 1.2 miles; slow-twitch dominant; Belmont Stakes type

The C/C genotype doesn’t produce a faster horse in absolute terms — it produces a horse whose physical structure favors explosive short efforts. A C/C horse trained as a router will often underperform not because of lack of talent but because of a physiological mismatch. The same applies in reverse: a T/T horse is not built to win 5-furlong sprints regardless of how the horse is trained.

Miles’s Take — Genotype data at auction: The first time I saw an MSTN genotype listed in an auction catalog I wasn’t sure what to make of it. That was maybe 10 years ago and it was unusual. Now it’s routine at the major sales — you’ll see C/T or T/T listed alongside the pedigree page the same way you’d see a hip number or a vet report notation. It hasn’t replaced the physical evaluation for me. A horse with a perfect genotype and bad feet is still a bad investment. But it does add one more data point when I’m deciding whether to go after a horse that looks like a miler in training and the pedigree page is ambiguous. The test can confirm what you’re seeing in the horse’s movement and body type — or it can create a contradiction worth investigating.

MSTN gene influence on Thoroughbred racing distance and speed profile
The MSTN gene variant determines whether a Thoroughbred is built for sprints, middle distances, or routes.

The Three Foundation Sires

Every registered Thoroughbred in the world traces its paternal lineage back to one of three stallions imported to England in the late 17th and early 18th centuries. These three horses — the Darley Arabian, the Godolphin Arabian, and the Byerley Turk — are the foundation of the entire breed. This isn’t a historical footnote; it has direct implications for genetic diversity today.

The three foundation sires — what each contributed:

  • Darley Arabian (imported 1704) — the dominant line; roughly 95% of modern Thoroughbreds trace their direct paternal line back to him through his great-great-grandson Eclipse. The Eclipse line includes Northern Dancer, Secretariat, and the majority of 20th-century champions.
  • Godolphin Arabian (imported ~1729) — less dominant in male-line descendants but contributed significant genetic material through the maternal line; known for stamina characteristics
  • Byerley Turk (imported ~1686) — the oldest of the three; his male-line descendants are now rare, though his genetic contribution persists through maternal lines across the breed

The concentration of Darley Arabian genetics in particular — and within that line, the dominance of Eclipse — means that modern Thoroughbred pedigrees often show the same ancestors appearing on multiple lines of the same horse’s family tree. This is known as inbreeding coefficient, and it’s been rising across the breed for decades. The practical consequence isn’t necessarily a weaker horse, but it does reduce the genetic variation the breed has available to adapt to new pressures.

Thoroughbred genetics DNA — foundation sires and modern breeding
All Thoroughbred genetics trace back to three stallions. That narrow origin shapes everything about how the breed evolves today.

Did Secretariat Really Have a Genetic Advantage? The Large Heart Gene

Secretariat’s heart is one of the most discussed subjects in Thoroughbred genetics. When he died in 1989, necropsy reports estimated his heart at approximately 22 pounds — more than twice the average horse’s 8 to 9 pounds, though this figure comes from those reports rather than formal scientific literature. The question that followed was whether this was a fluke or something heritable, and it led directly to the theory of the “X-factor” gene.

The X-factor theory, developed by Marianna Haun in the 1990s, proposes that an enlarged heart in Thoroughbreds is carried on the X chromosome — meaning fillies inherit it from either parent, while colts can only inherit it from their dam. The theory gained traction because several prominent broodmares known to produce horses with large hearts — Somethingroyal (Secretariat’s dam) being the most cited — did seem to cluster large-hearted offspring. Secretariat’s racing record gave the theory a compelling anchor.

What the research actually shows:

  • Heart size is heritable — studies using echocardiography have confirmed that heart dimensions in Thoroughbreds have a meaningful heritable component
  • The X-factor is not confirmed — the specific mechanism proposed by Haun (X-linked inheritance) has not been validated by genetic research; the gene or genes responsible for enlarged hearts in Thoroughbreds have not been definitively identified
  • Large heart correlates with performance — larger heart volume generally means more blood pumped per stroke, which supports sustained high-speed effort; the correlation between heart score (a proxy for heart size) and racing success is supported by some research, though the relationship is not linear
  • Secretariat’s heart may have been an outlier — a 22-pound heart is roughly 2.5 times normal; most elite racehorses with documented large hearts fall in the 11–14 pound range, significant but not in Secretariat’s range

The practical takeaway for breeders is that heart size appears to be a partially heritable trait worth considering in a broodmare’s profile — particularly her dam line, given the X-chromosome theory’s intuitive appeal even without full confirmation. But the claim that any living horse carries “Secretariat’s heart gene” goes beyond what the research currently supports. What’s documented is that heart size has a heritable component. The specific genetic mechanism remains an open question.

Key milestones in Thoroughbred genetics — from foundation sires to modern genomic testing
Year Milestone
1686Byerley Turk imported to England — oldest of the three foundation sires
1704Darley Arabian imported — sire of the dominant Eclipse line
~1729Godolphin Arabian imported — stamina and maternal line contributions
1764Eclipse born — great-great-grandson of Darley Arabian; all modern Thoroughbreds in his line are unbeaten in 18 career starts
1961Northern Dancer born — becomes most influential sire of the 20th century; changes breed population genetics permanently
1973Secretariat wins Triple Crown — necropsy reveals estimated 22-pound heart; sparks X-factor theory research
2010Hill et al. publish landmark MSTN study in PLOS ONE — identifies three genotypes corresponding to sprint, mile, and route distances
Early 2010sCommercial genomic testing expands — Plusvital Speed Gene Test and Equinome products become standard at major sales
2020Jockey Club caps stallion book sizes at 140 registered foals per season — addresses gene pool concentration concerns

How Genomic Testing Changed the Breeding Business

Commercial genomic testing became widely available to Thoroughbred breeders in the early 2010s and has progressively changed how buying and selling decisions are made at the top levels of the market. The MSTN Speed Gene Test is the most widely used, but it’s now one of a suite of available genetic evaluations that cover disease screening, temperament markers, and broader genomic profiling.

The primary uses in practice are distance optimization, disease screening, and pedigree verification. Distance optimization means matching a horse’s MSTN genotype with the training and racing program most likely to suit its physiology. Disease screening covers a range of heritable conditions that can be identified before a breeding decision is made.

Genetic diseases screened in Thoroughbred and broader equine breeding programs: Not all of these are common in purebred Thoroughbreds — some appear primarily in other breeds or crosses — but responsible breeding programs screen for the full range:

  • Severe Combined Immunodeficiency (SCID) — a fatal immune disorder; carrier testing prevents two carriers being mated
  • Glycogen Branching Enzyme Deficiency (GBED) — causes energy metabolism failure in foals; detected through carrier testing
  • Hereditary Equine Regional Dermal Asthenia (HERDA) — a connective tissue disorder more common in Quarter Horse crosses; DNA tests can prevent producing affected foals
  • Malignant Hyperthermia (MH) and Polysaccharide Storage Myopathy (PSSM) — less common but testable conditions that affect muscle function

At the major sales, genomic data is increasingly used to support catalog valuations. A foal with a favorable MSTN genotype, clean disease screening, and a high-demand pedigree commands a premium over an equivalent foal without testing documentation. This has created a market dynamic where genetic data is a selling tool as much as a management tool — something buyers should account for when evaluating whether a test result adds genuine information or just increases the price.

Miles’s Take — What genomic testing means at a real sale: When I see a MSTN result in a catalog now, I use it as a starting point, not a verdict. What I’m actually doing is checking whether the genotype is consistent with what I’m seeing in the horse’s conformation and movement. A C/C horse that looks like it has the build for a route is a contradiction worth probing — either the test adds useful information about why the horse’s physical appearance is misleading, or something else is going on. The horses I’ve gotten into trouble with are the ones where I trusted the paper too much and didn’t spend enough time watching the horse walk. The genotype tells you what the horse was born with. The physical evaluation tells you what it’s become.

Thoroughbred at auction — genomic testing has changed how breeding decisions are made at sales
At major sales, genomic data now accompanies pedigree pages. Whether it adds value or just price depends on how you read it.

Genetics vs. Pedigree — Why Great Bloodlines Don’t Guarantee Great Racehorses

Pedigree and genetics are related but not the same thing. A pedigree page shows you a horse’s ancestry. Genetics is what that ancestry actually passed down to this specific individual — and those two things can diverge substantially. Full siblings from the same sire and dam regularly have completely different racing careers. The same stallion produces Grade 1 winners and $5,000 claimers from the same crop. Understanding why is the most practical thing a buyer or bettor can take from equine genetics.

Each foal inherits approximately half its DNA from each parent, but which half is largely random. Two foals from the same parents get different combinations of their parents’ genetic variants. One may inherit the parent’s best physical traits — the stamina gene expression, the favorable muscle fiber ratio, the joint structure. The other may inherit a less favorable combination. The pedigree page looks identical for both. The genetic reality is different.

Miles’s Take — The pedigree page that fooled me: Early in my career I paid a premium for a well-bred colt because the pedigree page was impressive on both sides. He had stamina in the family, speed in the family, the right names in the right places. He also had the worst work ethic of any horse I’ve ever owned. Wouldn’t rate, wouldn’t settle, fought the bit every step of every workout. Never won a race. His full brother, sold for a fraction of what I paid, ran competitively for four years. Same parents, same farm, same trainer. Different horse entirely. After that I learned to read pedigrees as probability, not promise — and to spend more time watching the horse in front of me than the paper in my hand.

Genetics determines potential — what a horse is physically capable of at its best. Training, management, and to some degree luck determine expression — what actually happens with that potential. The most genetically gifted horse in a barn that’s poorly conditioned, poorly fed, or poorly handled will not run like its genetics suggest it should. The most ordinary pedigree, managed with skill and patience, can surprise. That’s not a reason to ignore genetics — it’s a reason to keep it in proportion.

Four tools for evaluating a Thoroughbred prospect — what each measures and when it’s most useful
Tool What It Measures Most Useful For
PedigreeAncestry and family performance historyIdentifying distance trends and class tendencies across multiple generations
MSTN Genotype TestDistance aptitude based on muscle fiber geneticsConfirming or questioning the distance profile suggested by pedigree and physical type
Conformation AnalysisPhysical structure, balance, and movementEvaluating soundness potential and whether physical build matches distance genetics
Pre-Purchase Vet ExamCurrent health, joint condition, respiratory functionIdentifying existing problems before committing to purchase price

Into Mischief, Gun Runner, and the Modern Sire Era

The Northern Dancer saturation discussed above isn’t the end of the story — it’s the backdrop against which the current generation of dominant sires is operating. Understanding where Into Mischief and Gun Runner sit in the breed’s genetic landscape helps explain why they’ve attracted so much breeding attention and what they offer to owners thinking about where to take their mares.

Into Mischief is the defining sire of the American Thoroughbred in the 2020s. By Harlan’s Holiday out of Leslie’s Lady, he sits within the Northern Dancer male line through Storm Cat — which means breeding to Into Mischief on a Northern Dancer mare deepens that lineage concentration rather than diversifying away from it. What Into Mischief brings is a consistent speed profile and a particularly strong record of transmitting racing temperament and physical soundness. His sons include Authentic (2020 Kentucky Derby and Breeders’ Cup Classic winner) and Practical Joke, who became a sire in his own right. His line is now appearing on both sides of young pedigrees, which is worth tracking as his influence expands.

Gun Runner represents something genuinely different in the gene pool. By Candy Ride (ARG) — a South American-bred stallion whose male line traces through Candy Stripes to Blushing Groom and Red God rather than through Northern Dancer — Gun Runner brings genetic material that doesn’t duplicate what’s already dominant in most American Thoroughbred pedigrees. His early crop statistics have been strong, and breeders specifically seeking to avoid Northern Dancer saturation in their mares have gravitated toward him. That’s not just fashionable — it’s a reasonable response to the diversity concern.

Into Mischief vs. Gun Runner — sire line comparison for breeding decisions
Sire Male Line Profile Breeding Consideration
Into MischiefHarlan’s Holiday → Storm Cat → Northern DancerSpeed, temperament, consistency; sprint-to-classic rangeDeepens Northern Dancer concentration; best on mares with diverse dam lines
Gun RunnerCandy Ride → Blushing Groom → Red God (non-Northern Dancer)Power, classic stamina, physical substanceProvides genuine genetic diversification away from Northern Dancer dominance
GoldencentsInto Mischief → Harlan’s Holiday → Northern DancerSprint speed, dirt preference, competitive temperamentInto Mischief son; concentrated sprint genetics with proven racetrack record

Miles’s Take — My horses from the Into Mischief line: I have two horses running right now that put this genetics discussion in concrete terms. One is a son of Goldencents — who is by Into Mischief — so that colt carries the Into Mischief influence two generations back through Goldencents. The other is Seamus’s Girl, a filly by Mr. Money, who is himself by Goldencents by Into Mischief. That makes her an Into Mischief great-granddaughter on the sire side.

What I notice in both horses is the consistency that the Into Mischief line is known for — a competitive mentality, a willingness to rate, and a physical type that holds up well to training. Neither horse is going to win a Breeders’ Cup, but they’re the kind of horses that give you honest efforts and don’t fall apart in the middle of a campaign. That’s what the Into Mischief line tends to produce even several generations removed — sound, competitive, honest horses. The genetics discussion I’d encourage any small owner to have is not “which famous sire is on the pedigree page” but “what does this sire line consistently produce at the level I’m racing.” Into Mischief at the claiming and allowance level, through sons like Goldencents and Mr. Money, has been a solid answer to that question for me.

Miles Henry's Goldencents colt — son of Goldencents by Into Mischief, racing at Fair Grounds
My Goldencents colt — Goldencents is by Into Mischief, making this horse an Into Mischief grandson. The Into Mischief line has produced consistently competitive horses at every level of claiming and allowance racing.
Seamus's Girl — daughter of Mr. Money by Goldencents by Into Mischief, racing for Miles Henry
Seamus’s Girl, my filly by Mr. Money — himself by Goldencents by Into Mischief. She’s an Into Mischief great-granddaughter on the sire side, and a good example of how that line transmits competitive temperament several generations removed.

The Inbreeding Problem

The Thoroughbred gene pool has been narrowing for decades, and the pace has accelerated as a small number of dominant sires — particularly in the Northern Dancer male line — have come to account for an increasing proportion of all matings. This isn’t a new observation, but research published in recent years has put more specific numbers to the trend.

Northern Dancer, a Canadian-bred son of Nearctic who won the 1964 Kentucky Derby and Preakness, became the most influential sire of the 20th century. His sons Sadler’s Wells, Danzig, Storm Cat, and Nijinsky collectively produced thousands of stakes winners, and their descendants now appear on multiple lines of most competitive Thoroughbred pedigrees. Northern Dancer’s descendants appear in the pedigrees of most active Thoroughbreds — frequently on both the sire and dam side of the same horse’s pedigree. The precise figure varies depending on how the calculation is defined, but the concentration is widely noted by bloodstock researchers and industry publications including BloodHorse.

What declining genetic diversity means practically:

  • Reduced adaptability — a more genetically uniform population is more vulnerable to novel diseases or environmental changes that a diverse population could absorb
  • Potential health consequences — close inbreeding can concentrate recessive genetic disorders that would otherwise remain unexpressed
  • Narrowed performance range — heavy concentration on sprint and middle-distance genetics (the dominant Northern Dancer profile) may be producing a breed progressively less suited to classic distances and demanding surfaces
  • Injury rates — some researchers have proposed a connection between rising injury rates and the reduction in genetic variation affecting bone density and joint structure, though the evidence is contested

The Jockey Club has addressed one aspect of this — capping stallion book sizes at 140 registered foals per season starting in 2020 — but the long-term genetic concentration from decades before that cap remains embedded in the population. The practical response for small breeders is to seek out stallions whose male-line genetics don’t replicate what’s already dominant in a mare’s pedigree, which requires reading beyond the popular names on the top line and understanding what’s underneath.

FAQs About Thoroughbred Genetics

What is the MSTN speed gene in Thoroughbreds?

MSTN (myostatin) is a gene that regulates muscle growth. Thoroughbreds carry one of three variants — C/C, C/T, or T/T — that correspond to sprinter, miler, and stayer profiles respectively. C/C horses develop more fast-twitch muscle and tend to perform best at distances under a mile; T/T horses are better suited to routes of 1.2 miles or longer. The MSTN Speed Gene Test, developed by Plusvital, is among the most widely used commercial genetic tests in Thoroughbred racing.

Does genetic testing predict which horse will win?

No. Genetic testing predicts physiological tendencies — particularly distance suitability based on muscle fiber type — but not race outcomes. A horse with a favorable MSTN genotype still needs sound conformation, proper training, good health, and competent handling to win races. Genetic data is one input among many, not a performance guarantee.

What are the three foundation sires of the Thoroughbred?

Every registered Thoroughbred traces its paternal line back to one of three stallions imported to England in the late 1600s and early 1700s: the Darley Arabian, the Godolphin Arabian, and the Byerley Turk. The Darley Arabian is by far the most dominant — nearly all modern Thoroughbreds trace their direct sire line back to him through Eclipse — estimates typically range from 90–95%.

How widespread is Northern Dancer genetics in modern Thoroughbreds?

Northern Dancer’s genetic influence is very broad. Industry estimates suggest the majority of active Thoroughbreds carry his genetics, with many carrying it on both the sire and dam side. His male-line sons — Sadler’s Wells, Danzig, Storm Cat, Nijinsky — collectively produced thousands of stakes winners, and their descendants now appear on multiple branches of most competitive pedigrees.

Is gene editing allowed in Thoroughbred racing?

Major registries prohibit gene-edited horses from registration. The Jockey Club and equivalent bodies worldwide do not permit genetically modified horses to be registered as Thoroughbreds. CRISPR and similar technologies are therefore incompatible with racing eligibility in any jurisdiction that follows standard registry rules. Research into detecting gene edits in equine DNA has advanced specifically to support enforcement of these registration requirements.

What genetic diseases are screened for in Thoroughbred breeding?

Common genetic disease screens include Severe Combined Immunodeficiency (SCID), Glycogen Branching Enzyme Deficiency (GBED), Hereditary Equine Regional Dermal Asthenia (HERDA), Malignant Hyperthermia (MH), and Polysaccharide Storage Myopathy (PSSM). Many major breeding operations use disease-carrier testing as part of mating decisions, and some require it before approving stallion and mare pairings.

Key references and sources used in this article: