-
PDF
- Split View
-
Views
-
Cite
Cite
Evie van der Spoel, Nicolien A van Vliet, Rosalinde K E Poortvliet, Robert S Du Puy, Wendy P J den Elzen, Terence J Quinn, David J Stott, Naveed Sattar, Patricia M Kearney, Manuel R Blum, Heba Alwan, Nicolas Rodondi, Tinh-Hai Collet, Rudi G J Westendorp, Bart E Ballieux, J Wouter Jukema, Olaf M Dekkers, Jacobijn Gussekloo, Simon P Mooijaart, Diana van Heemst, Incidence and Determinants of Spontaneous Normalization of Subclinical Hypothyroidism in Older Adults, The Journal of Clinical Endocrinology & Metabolism, Volume 109, Issue 3, March 2024, Pages e1167–e1174, https://doi.org/10.1210/clinem/dgad623
- Share Icon Share
Abstract
With age, the prevalence of subclinical hypothyroidism rises. However, incidence and determinants of spontaneous normalization remain largely unknown.
To investigate incidence and determinants of spontaneous normalization of TSH levels in older adults with subclinical hypothyroidism.
Pooled data were used from the (1) pretrial population and (2) in-trial placebo group from 2 randomized, double-blind, placebo-controlled trials (Thyroid Hormone Replacement for Untreated Older Adults With Subclinical Hypothyroidism Trial and Institute for Evidence-Based Medicine in Old Age thyroid 80-plus thyroid trial).
Community-dwelling 65+ adults with subclinical hypothyroidism from the Netherlands, Switzerland, Ireland, and the United Kingdom.
The pretrial population (N = 2335) consisted of older adults with biochemical subclinical hypothyroidism, defined as ≥1 elevated TSH measurement (≥4.60 mIU/L) and a free T4 within the laboratory-specific reference range. Individuals with persistent subclinical hypothyroidism, defined as ≥2 elevated TSH measurements ≥3 months apart, were randomized to levothyroxine/placebo, of which the in-trial placebo group (N = 361) was included.
Incidence of spontaneous normalization of TSH levels and associations between participant characteristics and normalization.
In the pretrial phase, TSH levels normalized in 60.8% of participants in a median follow-up of 1 year. In the in-trial phase, levels normalized in 39.9% of participants after 1 year of follow-up. Younger age, female sex, lower initial TSH level, higher initial free T4 level, absence of thyroid peroxidase antibodies, and a follow-up measurement in summer were independent determinants for normalization.
Because TSH levels spontaneously normalized in a large proportion of older adults with subclinical hypothyroidism (also after confirmation by repeat measurement), a third measurement may be recommended before considering treatment.
ClinicalTrials.gov, NCT01660126 and Netherlands Trial Register, NTR3851.
With increasing age, circulating levels of TSH generally rise, accompanied by a higher prevalence of subclinical hypothyroidism (1, 2). Subclinical hypothyroidism is defined as an elevated TSH level while the serum free T4 (fT4) concentration is within the normal range (3). Several randomized controlled trials have shown that treatment of mild subclinical hypothyroidism in older adults does not improve clinical outcomes (4-9). Therefore, it has been suggested to reevaluate the reference range for TSH in older adults (10). To this end, we need to further explore the natural course of subclinical hypothyroidism in older adults. Interestingly, several studies in adults have shown that subclinical hypothyroidism can spontaneously normalize (11-18). However, it is not known what the most important determinants are for spontaneous TSH normalization in older adults. Enhanced understanding of the natural history and factors contributing to normalization may help clinical decision making on the follow-up strategy.
In this longitudinal study, we aimed to investigate the incidence of spontaneous normalization of TSH levels and identify determinants of normalization in a large group of adults aged 65 years and older with (persistent) subclinical hypothyroidism. We combined individual participant level data from 2 randomized trials investigating the effect of levothyroxine treatment in older adults with subclinical hypothyroidism: the Thyroid Hormone Replacement for Untreated Older Adults With Subclinical Hypothyroidism Trial (TRUST) and Institute for Evidence-Based Medicine in Old Age (IEMO) trials (4, 6, 19, 20). Because we were interested in spontaneous normalization, we only included the pretrial screening populations and the in-trial placebo groups of the 2 clinical trials.
Materials and Methods
Study Design
The present study pooled data from 2 randomized, double-blind, placebo-controlled parallel-group clinical trials investigating the effect of levothyroxine treatment for older adults with subclinical hypothyroidism: the TRUST and the IEMO 80-plus thyroid trial (4, 6). TRUST included community-dwelling participants aged 65 years and older in the Netherlands, Switzerland, Ireland, and the United Kingdom recruited between April 2013 and May 2015 (4, 19). Participants for the IEMO 80-plus thyroid trial were aged 80 years and older and recruited in the Netherlands and Switzerland between May 2014 and May 2017 (6, 20). Both trials shared a near-identical design and recruitment strategy. Trial protocols were approved by the relevant ethics committees and regulatory authorities in all the countries involved in the trials. The trials were conducted in accordance with the principles of the Declaration of Helsinki and Good Clinical Practice guidelines and participants provided written informed consent.
Inclusion of Study Participants
The timeline and the 2 populations of the present study are presented in Fig. 1 (Fig 1).

Timeline of the study. The pretrial population consisted of older adults with ≥1 elevated TSH measurement (≥4.60 mIU/L) and an fT4 level within the laboratory-specific reference range, during the previous 3 months to 3 years (screening). The second measurement was at the trial baseline to assess whether they had persistent subclinical hypothyroidism or had normalized their TSH levels. The in-trial placebo group consisted of older adults with ≥2 elevated TSH measurements, both at the screening and trial baseline, receiving placebo in the clinical trials. After 12 months, at the (third) follow-up measurement, it was determined whether the in-trial placebo group had normalized their TSH levels.
Inclusion of pretrial population
Participants were identified from clinical laboratory databases and general practice records. Older adults with biochemical subclinical hypothyroidism, defined as an elevated TSH level (4.60-19.99 mIU/L) and an fT4 level within the laboratory-specific reference range, 3 months to 3 years before the trial baseline were invited for a repeated measurement. This repeated measurement was taken to assess whether they had persistent subclinical hypothyroidism and were eligible for trial inclusion. Individuals with 2 TSH measurements (the pretrial screening and the trial baseline) were included in the pretrial population. Participants recruited during the screening period in Switzerland were excluded because Swiss demographic data were only registered centrally for those who were randomized.
Inclusion of in-trial placebo group
Persistent subclinical hypothyroidism was defined as an elevated TSH level (4.60-19.99 mIU/L) and an fT4 level within the laboratory-specific reference range at both the pretrial screening (first measurement) and at the trial baseline (second measurement). Participants with persistent subclinical hypothyroidism who fulfilled the inclusion criteria were randomized to receive levothyroxine treatment or placebo in the clinical trials. For the present study, we only included the in-trial participants assigned to placebo treatment with mock titration. TSH normalization was checked at the follow-up visit after 300 to 400 days (third measurement).
Thyroid Status Measurements
For nearly all participants, TSH and fT4 measurements were performed at the same clinical laboratory or general practice where the initial measurement for identification of subclinical hypothyroidism was performed. A subsample (80.1%) of the in-trial placebo group donated blood for biobanking at randomization (a few weeks after the trial baseline), which was stored centrally at −80° C to be analyzed later. In these serum samples, thyroid peroxidase antibodies (anti-TPO) were measured using the Cobas E602 module from Roche, Almere, the Netherlands, performed in a single batch at the Department of Clinical Chemistry and Laboratory Medicine of the Leiden University Medical Center. The threshold for elevated anti-TPO is > 34 kU/L (detection range, 10-599.99 kU/L), the coefficient of variation was 11.3% to 15.6%.
Participant Characteristics
Pretrial population
Determinants of interest were age, sex, TSH level at first measurement, fT4 level at first measurement, and the interval between the first and second measurements (divided into <6 months, 6-12 months, > 12 months). To investigate whether season of follow-up testing has an influence on TSH normalization, the season of the second measurement (divided into meteorological summer [1 June-31 August], autumn [1 September-30 November], winter [1 December-28/29 February], and spring [1 March-31 May]) was assessed as a determinant.
In-trial placebo group
Determinants of interest for the in-trial placebo group were age, sex, TSH level at trial baseline, fT4 level at trial baseline, and anti-TPO positivity. Because the interval between the trial baseline and follow-up measurements for all participants in the in-trial placebo group was between 300 to 400 days, the interval and season of testing were not included as determinants.
Statistical Analyses
Summary statistics were estimated using median and interquartile range for continuous variables and number and percentage for categorical variables. Logistic regression was performed with TSH normalization as the outcome using the glm function in R. Univariable models were created for the determinants of interest separately and multivariable models were created with all determinants combined. All models were adjusted for country as fixed effect. To visualize the probability of normalization for each unit in TSH level, probability plots were created using logistic regression models with initial TSH level as independent determinant and TSH normalization as outcome, adjusted for country (United Kingdom as reference). For the pretrial phase, a sensitivity analysis was performed excluding the participants who had both measurements in the same season (N = 20%) to investigate the true effect of a follow-up measurement in a certain season compared to another season. Analyses were conducted in R version 4.1.2 (21) and figures were produced using GraphPad, Microsoft Word, and Microsoft PowerPoint.
Results
As presented in Fig. 2 (Fig 2), a total of 2989 older adults were identified from clinical laboratory databases and general practice records and assessed for eligibility for the TRUST or IEMO trials. Of these, 2335 participants were included in the pretrial population to investigate the incidence and determinants of spontaneous TSH normalization in older adults with at least 1 measurement of biochemical subclinical hypothyroidism. For the in-trial placebo group, we included participants who were randomized to placebo in 1 of the 2 trials and had a follow-up in-trial TSH measurement in 300 to 400 days. In total, 361 participants were included in the in-trial placebo group to investigate the incidence and determinants of spontaneous TSH normalization in older adults with persistent subclinical hypothyroidism (defined as at least 2 measurement of elevated TSH levels with normal fT4 levels more than 3 months apart).

Pretrial Population: Older Adults With Biochemical Subclinical Hypothyroidism (N = 2335)
Characteristics of study population
The median (interquartile range[IQR]) age of the pretrial population was 72.9 (68.0-79.3) years and 60.7% of the participants were female (Table 1). Median (IQR) levels of TSH and fT4 at the first measurement (screening) were 5.40 (4.91-6.31) mIU/L and 13.6 (12.3-15.0) pmol/L, respectively. Most participants had their second measurement (trial baseline) in summer (32.7%) or autumn (31.5%) and were included in The Netherlands (50.3%) cohort.
Characteristic . | Pretrial population (N = 2335) . |
---|---|
Age, y | 72.9 (68.0-79.3) |
Age ≥80 y, n (%) | 535 (22.9) |
Female, n (%) | 1418 (60.7) |
TSH, mIU/L | |
First measurement (screening) | 5.40 (4.91-6.31) |
>7 mIU/L, n (%) | 368 (15.8) |
>10 mIU/L, n (%) | 57 (2.4) |
Second measurement (trial baseline) | 4.02 (3.01-5.42) |
fT4, pmol/L | |
First measurement (screening)a | 13.6 (12.3-15.0) |
Second measurement (trial baseline) | 13.0 (12.0-14.6) |
Interval between first and second measurements | 344 (207-594) |
<6 mo, n (%) | 465 (19.9) |
6-12 mo, n (%) | 778 (33.3) |
>12 mo, n (%) | 1092 (46.8) |
Season of second measurement (trial baseline), n (%) | |
Summer | 764 (32.7) |
Autumn | 735 (31.5) |
Winter | 318 (13.6) |
Spring | 518 (22.2) |
Country, n (%) | |
United Kingdom | 494 (21.2) |
Ireland | 667 (28.6) |
The Netherlands | 1174 (50.3) |
Characteristic . | Pretrial population (N = 2335) . |
---|---|
Age, y | 72.9 (68.0-79.3) |
Age ≥80 y, n (%) | 535 (22.9) |
Female, n (%) | 1418 (60.7) |
TSH, mIU/L | |
First measurement (screening) | 5.40 (4.91-6.31) |
>7 mIU/L, n (%) | 368 (15.8) |
>10 mIU/L, n (%) | 57 (2.4) |
Second measurement (trial baseline) | 4.02 (3.01-5.42) |
fT4, pmol/L | |
First measurement (screening)a | 13.6 (12.3-15.0) |
Second measurement (trial baseline) | 13.0 (12.0-14.6) |
Interval between first and second measurements | 344 (207-594) |
<6 mo, n (%) | 465 (19.9) |
6-12 mo, n (%) | 778 (33.3) |
>12 mo, n (%) | 1092 (46.8) |
Season of second measurement (trial baseline), n (%) | |
Summer | 764 (32.7) |
Autumn | 735 (31.5) |
Winter | 318 (13.6) |
Spring | 518 (22.2) |
Country, n (%) | |
United Kingdom | 494 (21.2) |
Ireland | 667 (28.6) |
The Netherlands | 1174 (50.3) |
Values shown are median (interquartile range) unless indicated otherwise.
Abbreviation: fT4, free T4.
aFirst fT4 measurement was missing for n = 10.
Characteristic . | Pretrial population (N = 2335) . |
---|---|
Age, y | 72.9 (68.0-79.3) |
Age ≥80 y, n (%) | 535 (22.9) |
Female, n (%) | 1418 (60.7) |
TSH, mIU/L | |
First measurement (screening) | 5.40 (4.91-6.31) |
>7 mIU/L, n (%) | 368 (15.8) |
>10 mIU/L, n (%) | 57 (2.4) |
Second measurement (trial baseline) | 4.02 (3.01-5.42) |
fT4, pmol/L | |
First measurement (screening)a | 13.6 (12.3-15.0) |
Second measurement (trial baseline) | 13.0 (12.0-14.6) |
Interval between first and second measurements | 344 (207-594) |
<6 mo, n (%) | 465 (19.9) |
6-12 mo, n (%) | 778 (33.3) |
>12 mo, n (%) | 1092 (46.8) |
Season of second measurement (trial baseline), n (%) | |
Summer | 764 (32.7) |
Autumn | 735 (31.5) |
Winter | 318 (13.6) |
Spring | 518 (22.2) |
Country, n (%) | |
United Kingdom | 494 (21.2) |
Ireland | 667 (28.6) |
The Netherlands | 1174 (50.3) |
Characteristic . | Pretrial population (N = 2335) . |
---|---|
Age, y | 72.9 (68.0-79.3) |
Age ≥80 y, n (%) | 535 (22.9) |
Female, n (%) | 1418 (60.7) |
TSH, mIU/L | |
First measurement (screening) | 5.40 (4.91-6.31) |
>7 mIU/L, n (%) | 368 (15.8) |
>10 mIU/L, n (%) | 57 (2.4) |
Second measurement (trial baseline) | 4.02 (3.01-5.42) |
fT4, pmol/L | |
First measurement (screening)a | 13.6 (12.3-15.0) |
Second measurement (trial baseline) | 13.0 (12.0-14.6) |
Interval between first and second measurements | 344 (207-594) |
<6 mo, n (%) | 465 (19.9) |
6-12 mo, n (%) | 778 (33.3) |
>12 mo, n (%) | 1092 (46.8) |
Season of second measurement (trial baseline), n (%) | |
Summer | 764 (32.7) |
Autumn | 735 (31.5) |
Winter | 318 (13.6) |
Spring | 518 (22.2) |
Country, n (%) | |
United Kingdom | 494 (21.2) |
Ireland | 667 (28.6) |
The Netherlands | 1174 (50.3) |
Values shown are median (interquartile range) unless indicated otherwise.
Abbreviation: fT4, free T4.
aFirst fT4 measurement was missing for n = 10.
Normalization of subclinical hypothyroidism
From a total of 2335 participants, TSH levels normalized in 1419 (60.8%) participants in a median follow-up of 344 (IQR, 207-594) days. Lower age (odds ratio [OR], 0.98 [95% CI, .97-.99], P = .007), female sex (OR, 1.39 [95% CI, 1.15-1.69], P < .001), lower screening TSH level (OR, 0.57 [95% CI, .52-.62], P < .001), higher normal screening fT4 level (OR, 1.06 [95% CI, 1.01-1.11], P = .03), and a second measurement in summer (compared with winter: OR, 0.59 [95% CI, .44-.79], P < .001) were independently associated with a higher chance of TSH normalization (Table 2). An interval between measurements of more than 12 months was associated with TSH normalization (OR, 1.40 [95% CI, 1.07-1.82], P = .01), but this was not statistically significant in the multivariable model (Table 2). When restricted to participants having their second measurement in another season than the first (N = 1866), having the second measurement in summer was still independently associated with a higher chance of normalization (compared with winter: OR, 0.55 [95% CI, .39-.78], P < .001), data not shown.
Association between TSH normalization and characteristics in older adults with subclinical hypothyroidism from the pretrial population (N = 2335)
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .98 (.97-.99) | <.001 | .98 (.97-.99) | .007 |
Female sex | 1.39 (1.16-1.66) | <.001 | 1.39 (1.15-1.69) | <.001 |
TSH level (per unit mIU/L) | .56 (.52-.61) | <.001 | .57 (.52-.62) | <.001 |
fT4 level (per unit pmol/L) | 1.08 (1.04-1.13) | <.001 | 1.06 (1.01-1.11) | .03 |
Interval between measurements | ||||
<6 mo | 1 (reference) | 1 (reference) | ||
6-12 mo | 1.05 (.81-1.35) | .73 | 1.02 (.78-1.34) | .87 |
>12 mo | 1.40 (1.07-1.82) | .01 | 1.30 (.97-1.72) | .07 |
Season second measurement | ||||
Summer | 1 (reference) | 1 (reference) | ||
Autumn | .74 (.59-.93) | .008 | .72 (.57-.92) | .008 |
Winter | .62 (.47-.82) | <.001 | .59 (.44-.79) | <.001 |
Spring | .73 (.57-.92) | .009 | .73 (.56-.95) | .02 |
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .98 (.97-.99) | <.001 | .98 (.97-.99) | .007 |
Female sex | 1.39 (1.16-1.66) | <.001 | 1.39 (1.15-1.69) | <.001 |
TSH level (per unit mIU/L) | .56 (.52-.61) | <.001 | .57 (.52-.62) | <.001 |
fT4 level (per unit pmol/L) | 1.08 (1.04-1.13) | <.001 | 1.06 (1.01-1.11) | .03 |
Interval between measurements | ||||
<6 mo | 1 (reference) | 1 (reference) | ||
6-12 mo | 1.05 (.81-1.35) | .73 | 1.02 (.78-1.34) | .87 |
>12 mo | 1.40 (1.07-1.82) | .01 | 1.30 (.97-1.72) | .07 |
Season second measurement | ||||
Summer | 1 (reference) | 1 (reference) | ||
Autumn | .74 (.59-.93) | .008 | .72 (.57-.92) | .008 |
Winter | .62 (.47-.82) | <.001 | .59 (.44-.79) | <.001 |
Spring | .73 (.57-.92) | .009 | .73 (.56-.95) | .02 |
Odds ratios (OR) with 95% CIs resulting from logistic regression analyses. Univariable models were created for each characteristic separately and multivariable models were created with all characteristics combined. All analyses were adjusted for country.
Abbreviation: OR, odds ratio.
Association between TSH normalization and characteristics in older adults with subclinical hypothyroidism from the pretrial population (N = 2335)
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .98 (.97-.99) | <.001 | .98 (.97-.99) | .007 |
Female sex | 1.39 (1.16-1.66) | <.001 | 1.39 (1.15-1.69) | <.001 |
TSH level (per unit mIU/L) | .56 (.52-.61) | <.001 | .57 (.52-.62) | <.001 |
fT4 level (per unit pmol/L) | 1.08 (1.04-1.13) | <.001 | 1.06 (1.01-1.11) | .03 |
Interval between measurements | ||||
<6 mo | 1 (reference) | 1 (reference) | ||
6-12 mo | 1.05 (.81-1.35) | .73 | 1.02 (.78-1.34) | .87 |
>12 mo | 1.40 (1.07-1.82) | .01 | 1.30 (.97-1.72) | .07 |
Season second measurement | ||||
Summer | 1 (reference) | 1 (reference) | ||
Autumn | .74 (.59-.93) | .008 | .72 (.57-.92) | .008 |
Winter | .62 (.47-.82) | <.001 | .59 (.44-.79) | <.001 |
Spring | .73 (.57-.92) | .009 | .73 (.56-.95) | .02 |
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .98 (.97-.99) | <.001 | .98 (.97-.99) | .007 |
Female sex | 1.39 (1.16-1.66) | <.001 | 1.39 (1.15-1.69) | <.001 |
TSH level (per unit mIU/L) | .56 (.52-.61) | <.001 | .57 (.52-.62) | <.001 |
fT4 level (per unit pmol/L) | 1.08 (1.04-1.13) | <.001 | 1.06 (1.01-1.11) | .03 |
Interval between measurements | ||||
<6 mo | 1 (reference) | 1 (reference) | ||
6-12 mo | 1.05 (.81-1.35) | .73 | 1.02 (.78-1.34) | .87 |
>12 mo | 1.40 (1.07-1.82) | .01 | 1.30 (.97-1.72) | .07 |
Season second measurement | ||||
Summer | 1 (reference) | 1 (reference) | ||
Autumn | .74 (.59-.93) | .008 | .72 (.57-.92) | .008 |
Winter | .62 (.47-.82) | <.001 | .59 (.44-.79) | <.001 |
Spring | .73 (.57-.92) | .009 | .73 (.56-.95) | .02 |
Odds ratios (OR) with 95% CIs resulting from logistic regression analyses. Univariable models were created for each characteristic separately and multivariable models were created with all characteristics combined. All analyses were adjusted for country.
Abbreviation: OR, odds ratio.
In-trial Placebo Group: Older Adults With Persistent Subclinical Hypothyroidism (N = 361)
Characteristics of study population
The median (IQR) age of the in-trial placebo group was 75.1 (69.6-81.4) years and 51.8% of the participants were female (Table 3). Median (IQR) levels of TSH and fT4 at the trial baseline measurement were 5.75 (5.10-6.86) mIU/L and 13.4 (12.1-14.7) pmol/L, respectively. One quarter (25.3%) of the participants had antibodies to TPO.
Characteristic . | In-trial placebo group (N = 361) . |
---|---|
Age, y | 75.1 (69.6-81.4) |
Age ≥ 80 y, n (%) | 113 (31.3) |
Female, n (%) | 187 (51.8) |
TSH, mIU/L | |
First measurement (screening) | 5.84 (5.20-7.17) |
Second measurement (trial baseline) | 5.75 (5.10-6.86) |
>7 mIU/L, n (%) | 77 (21.3) |
>10 mIU/L, n (%) | 16 (4.4) |
Third measurement (follow-up) | 4.91 (3.96-6.49) |
fT4 at second measurement (trial baseline), pmol/L | 13.4 (12.1-14.7) |
Anti-TPO positive, n (%)a | 73 (25.3) |
Interval between second and third measurements | 362 (345-370) |
Country, n (%) | |
United Kingdom | 64 (17.8) |
Ireland | 50 (13.9) |
The Netherlands | 149 (41.3) |
Switzerland | 98 (27.1) |
Characteristic . | In-trial placebo group (N = 361) . |
---|---|
Age, y | 75.1 (69.6-81.4) |
Age ≥ 80 y, n (%) | 113 (31.3) |
Female, n (%) | 187 (51.8) |
TSH, mIU/L | |
First measurement (screening) | 5.84 (5.20-7.17) |
Second measurement (trial baseline) | 5.75 (5.10-6.86) |
>7 mIU/L, n (%) | 77 (21.3) |
>10 mIU/L, n (%) | 16 (4.4) |
Third measurement (follow-up) | 4.91 (3.96-6.49) |
fT4 at second measurement (trial baseline), pmol/L | 13.4 (12.1-14.7) |
Anti-TPO positive, n (%)a | 73 (25.3) |
Interval between second and third measurements | 362 (345-370) |
Country, n (%) | |
United Kingdom | 64 (17.8) |
Ireland | 50 (13.9) |
The Netherlands | 149 (41.3) |
Switzerland | 98 (27.1) |
Values shown are median (interquartile range) unless indicated otherwise.
Abbreviations: fT4, free T4; anti-TPO, thyroid peroxidase antibodies.
aInformation on TPO antibodies was missing for n = 72.
Characteristic . | In-trial placebo group (N = 361) . |
---|---|
Age, y | 75.1 (69.6-81.4) |
Age ≥ 80 y, n (%) | 113 (31.3) |
Female, n (%) | 187 (51.8) |
TSH, mIU/L | |
First measurement (screening) | 5.84 (5.20-7.17) |
Second measurement (trial baseline) | 5.75 (5.10-6.86) |
>7 mIU/L, n (%) | 77 (21.3) |
>10 mIU/L, n (%) | 16 (4.4) |
Third measurement (follow-up) | 4.91 (3.96-6.49) |
fT4 at second measurement (trial baseline), pmol/L | 13.4 (12.1-14.7) |
Anti-TPO positive, n (%)a | 73 (25.3) |
Interval between second and third measurements | 362 (345-370) |
Country, n (%) | |
United Kingdom | 64 (17.8) |
Ireland | 50 (13.9) |
The Netherlands | 149 (41.3) |
Switzerland | 98 (27.1) |
Characteristic . | In-trial placebo group (N = 361) . |
---|---|
Age, y | 75.1 (69.6-81.4) |
Age ≥ 80 y, n (%) | 113 (31.3) |
Female, n (%) | 187 (51.8) |
TSH, mIU/L | |
First measurement (screening) | 5.84 (5.20-7.17) |
Second measurement (trial baseline) | 5.75 (5.10-6.86) |
>7 mIU/L, n (%) | 77 (21.3) |
>10 mIU/L, n (%) | 16 (4.4) |
Third measurement (follow-up) | 4.91 (3.96-6.49) |
fT4 at second measurement (trial baseline), pmol/L | 13.4 (12.1-14.7) |
Anti-TPO positive, n (%)a | 73 (25.3) |
Interval between second and third measurements | 362 (345-370) |
Country, n (%) | |
United Kingdom | 64 (17.8) |
Ireland | 50 (13.9) |
The Netherlands | 149 (41.3) |
Switzerland | 98 (27.1) |
Values shown are median (interquartile range) unless indicated otherwise.
Abbreviations: fT4, free T4; anti-TPO, thyroid peroxidase antibodies.
aInformation on TPO antibodies was missing for n = 72.
Normalization of persistent subclinical hypothyroidism
From a total of 361 participants, in 144 (39.9%) participants, TSH levels normalized in a median follow-up of 362 (IQR, 345-370) days. Lower age (OR, 0.96 [95% CI, .92-1.00], P = .05), female sex (OR, 1.80 [95% CI, 1.01-3.23], P = .05), lower trial baseline TSH level (OR, 0.52 [95% CI, .38-.67], P < .001), higher normal trial baseline fT4 levels (OR, 1.22 [95% CI, 1.05-1.44], P = .01), and the absence of TPO antibodies (OR, 0.36 [95% CI, .17-.77], P = .007) were independently associated with a higher chance of normalization (Table 4).
Association between TSH normalization and characteristics in older adults with persistent subclinical hypothyroidism from the in-trial placebo group (N = 361)
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .99 (.96-1.02) | .59 | .96 (.92-1.00) | .05 |
Female sex | 1.23 (.79-1.92) | .36 | 1.80 (1.01-3.23) | .05 |
TSH level at trial baseline (per unit mIU/L) | .51 (.40-.63) | <.001 | .52 (.38-.67) | <.001 |
fT4 level at trial baseline (per unit pmol/L) | 1.32 (1.17-1.50) | <.001 | 1.22 (1.05-1.44) | .01 |
Anti-TPO positivity× | .38 (.19-.72) | .004 | .36 (.17-.77) | .007 |
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .99 (.96-1.02) | .59 | .96 (.92-1.00) | .05 |
Female sex | 1.23 (.79-1.92) | .36 | 1.80 (1.01-3.23) | .05 |
TSH level at trial baseline (per unit mIU/L) | .51 (.40-.63) | <.001 | .52 (.38-.67) | <.001 |
fT4 level at trial baseline (per unit pmol/L) | 1.32 (1.17-1.50) | <.001 | 1.22 (1.05-1.44) | .01 |
Anti-TPO positivity× | .38 (.19-.72) | .004 | .36 (.17-.77) | .007 |
Odds ratios (OR) with 95% CIs resulting from logistic regression analyses. Univariable models were created for each characteristic separately and multivariable models were created with all characteristics combined. All analyses were adjusted for country. ×Information on TPO antibodies was missing for n = 72.
Abbreviation: anti-TPO, thyroid peroxidase antibodies.
Association between TSH normalization and characteristics in older adults with persistent subclinical hypothyroidism from the in-trial placebo group (N = 361)
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .99 (.96-1.02) | .59 | .96 (.92-1.00) | .05 |
Female sex | 1.23 (.79-1.92) | .36 | 1.80 (1.01-3.23) | .05 |
TSH level at trial baseline (per unit mIU/L) | .51 (.40-.63) | <.001 | .52 (.38-.67) | <.001 |
fT4 level at trial baseline (per unit pmol/L) | 1.32 (1.17-1.50) | <.001 | 1.22 (1.05-1.44) | .01 |
Anti-TPO positivity× | .38 (.19-.72) | .004 | .36 (.17-.77) | .007 |
Characteristic . | Univariable model . | Multivariable model . | ||
---|---|---|---|---|
OR (95% CI) . | P . | OR (95% CI) . | P . | |
Age (per year) | .99 (.96-1.02) | .59 | .96 (.92-1.00) | .05 |
Female sex | 1.23 (.79-1.92) | .36 | 1.80 (1.01-3.23) | .05 |
TSH level at trial baseline (per unit mIU/L) | .51 (.40-.63) | <.001 | .52 (.38-.67) | <.001 |
fT4 level at trial baseline (per unit pmol/L) | 1.32 (1.17-1.50) | <.001 | 1.22 (1.05-1.44) | .01 |
Anti-TPO positivity× | .38 (.19-.72) | .004 | .36 (.17-.77) | .007 |
Odds ratios (OR) with 95% CIs resulting from logistic regression analyses. Univariable models were created for each characteristic separately and multivariable models were created with all characteristics combined. All analyses were adjusted for country. ×Information on TPO antibodies was missing for n = 72.
Abbreviation: anti-TPO, thyroid peroxidase antibodies.
TSH Level as Determinant of Normalization of Subclinical Hypothyroidism
Figure 3 visualizes that the probability of TSH normalization decreases for each unit in TSH level from 5 to 10 mIU/L in both populations. In the pretrial population, there was an interaction between TSH screening level and sex (P = .004) showing that the probability of normalization tended to decrease more for men than for women (Fig 3A). As an example, older men with biochemical subclinical hypothyroidism with a screening TSH level of 5 mIU/L have a 76.3% (95% CI, 70.2-82.3) chance of normalization, whereas the chance becomes 7.3% (95% CI, 2.7-11.8) when having a screening TSH level of 10 mIU/L. For older women, the chance of TSH normalization decreases from 73.3% (95% CI, 68.2-78.5) at 5 mIU/L to 20.1% (95% CI, 12.1-28.1) at 10 mIU/L. Older adults who already had 2 elevated TSH measurements more than 3 months apart still have a 64.3% (95% CI, 51.6-77.1) chance to normalize when their second (trial baseline) TSH level is 5 mIU/L (Fig 3B). However, when the second (trial baseline) TSH level is 10 mIU/L in older adults with persistent subclinical hypothyroidism, then the chance of normalization is only 5.9% (95% CI, .0-12.0).

Probability of TSH normalization (95% CI) based on initial TSH level in older adults with (persistent) subclinical hypothyroidism. (A) Probability of normalization stratified for sex in the pretrial population (N = 2335), which includes older adults with subclinical hypothyroidism. (B) Probability of normalization in the in-trial placebo group (N = 361), which includes older adults with persistent subclinical hypothyroidism, defined as having at least 2 measurements of elevated TSH levels ≥3 months apart.
Discussion
In this study, we aimed to investigate the incidence and determinants of spontaneous TSH normalization in a large group of adults aged 65 years and older with (persistent) subclinical hypothyroidism. In 60.8% of the older adults with biochemical subclinical hypothyroidism based on at least 1 elevated TSH measurement, TSH levels had returned to the normal range without intervention after a median follow-up of 1 year. Subsequently, TSH levels had still normalized after 1 year in 39.9% of older adults with persistent subclinical hypothyroidism, defined as at least 2 elevated TSH measurements more than 3 months apart. Younger age, female sex, lower initial TSH level, higher normal initial fT4 level, the absence of TPO antibodies, and a second measurement in summer were independent determinants for TSH normalization.
The large proportions of normalization of TSH levels in older adults with subclinical hypothyroidism after approximately 1 year in the present study are in line with percentages of 35% to 70% during a follow-up of 0.5 to 5 years found in studies in adults aged 55 years and older (11-15, 17). In 3 of those studies, a lower initial TSH levels was found to be the strongest determinant for normalization (11, 13, 15). A lower initial TSH level as a determinant for TSH normalization can be statistically explained by regression to the mean, but it can also be reasoned that mildly elevated TSH levels normalize easier than higher levels, especially considering normal fluctuations in TSH levels (22). These fluctuations within an individual over time are caused by both internal and external factors, such as pulsatile secretion, the biological clock, illness, and medication use (22-24). The association between high normal fT4 levels and TSH normalization was also found by Díez and Iglesias (11), and was expected given the negative feedback of thyroid hormones on TSH secretion. TPO antibodies generally associate with higher TSH levels (25), which might explain anti-TPO negativity as a determinant for normalization as confirmed by 2 studies (11, 15), but not by others (13). Although older women are generally more affected by (subclinical) hypothyroidism and tend to have higher TSH levels than older men (26, 27), we found in our study that women have a higher probability for normalization than men, especially at a higher TSH level. TSH levels generally rise with age (1), so older age coincides with higher TSH levels, which might explain the association found between older age and a lower chance of TSH normalization. However, in 3 studies with older adults with subclinical hypothyroidism without known previous thyroid disease, no relation was observed between sex, age, and TSH normalization (11, 13, 15). A second measurement in summer was associated with normalization, which is in line with a study in individuals aged 18 to 90 years (28). TSH levels are subject to change of season with highest levels in winter (22), which is probably caused by changes in environmental temperature (29).
The present study is a large, multicenter study in which adults aged 65 years and older, enriched for adults aged 80+, with mild subclinical hypothyroidism were included. Strengths of this study are that changes in TSH levels were observed over a long follow-up time ranging from 3 months to 4 years, without intervention. Another strength of this study is that we compared TSH normalization after at least 1 elevated TSH measurement and after at least 2 elevated TSH measurements. Measurements follow real-world practice because all thyroid measurements were performed in routine clinical care, thus preventing potential degradation of TSH by long-term sample storage and/or freeze/thaw cycles. Although biochemical assays differed between clinical centers, baseline and follow-up measurements were performed in the same clinical centers for nearly all participants. Unfortunately, time of blood sampling was not recorded. Selection bias could play a role in the pretrial screening population because individuals who became overt hypothyroid or started with thyroid medication for other reasons were not enrolled in the study. Therefore, we were not able to investigate the incidence and determinants of the progression of subclinical hypothyroidism to overt hypothyroidism in the pretrial screening population. Almost all older subjects had mild subclinical hypothyroidism (TSH < 10 mIU/L) in our study, which limits the generalizability to higher TSH levels. With age, the TSH distribution shift toward a higher level, and the 97.5th percentile of the TSH distribution was shown to be 7.5 mU/L in subjects aged 80 years and older (30).
In this study, we have demonstrated that in a large proportion of older adults with mild subclinical hypothyroidism, TSH levels spontaneously normalized in a median follow-up of 1 year, even after 2 consecutive measurements (≥3 months apart) of elevated TSH levels. These results have clinical relevance. Current guidelines recommend that a single measurement of elevated serum TSH, with fT4 within reference range, should be confirmed by a repeat measurement of both serum TSH and fT4, along with thyroid peroxidase antibodies, after a 2- to 3-month interval (31). However, international guidelines differ in their recommendations on the TSH threshold at which treatment of subclinical hypothyroidism should be considered in older adults (32). Once levothyroxine treatment is initiated, it is often continued lifelong (33). Based on the observation that with age, the TSH distribution shifts toward a higher level, it has been proposed to extend the upper limit of the TSH reference to 7 mU/L for people aged 80 years or older (34). Although increasing the upper reference limit will likely reduce unnecessary levothyroxine prescribing for older people, instead of a binary approach, our results support applying more continuous age-related reference ranges, which may be modified by other factors such as sex and season. Moreover, based on the high incidence of spontaneous normalization of TSH levels in a large proportion of older adults with subclinical hypothyroidism (also after confirmation by repeat measurement), a third measurement may be recommended before considering initiation of treatment. Based on such an approach, the frequency of the diagnosis subclinical hypothyroidism can likely be reduced in older adults. This could potentially contribute to a reduction in health care costs, treatment burden, and risk of overtreatment. These results may also have implications for follow-up studies. Levothyroxine is among the most frequently prescribed drugs. Although general practitioner prescription practices vary widely between countries (35), many older adults have initiated levothyroxine treatment based on a mildly elevated TSH. Moreover, in clinical practice, treatment has often been started after a single measurement of TSH (33). A recent meta-analysis indicated that deprescribing levothyroxine could be successful for carefully selected patients, although the quality of available evidence was considered low (36). This underscores the need for future high-quality studies to assess the effects of safely withholding levothyroxine treatment in older adults who initiated levothyroxine treatment based on a diagnosis of mild subclinical hypothyroidism on relevant patient outcomes (36).
Funding
The TRUST trial was supported by research grant (278148) from the European Commission FP7-HEALTH-2011 program and by grants from the Swiss National Science Foundation (SNSF 320030-150025 to N.D.) and the Swiss Heart Foundation and Velux Stiftung (grant 974a to N.D.). The IEMO trial was supported by research grant (627001001) from ZonMw and by grants from the Swiss National Science Foundation (SNSF 320030-150025 and 320030-172676 to N.D.). N.D.’s research on thyroid is funded by a grant from the Swiss National Science Foundation (32003B_200606). M.R.B.'s work is funded by grants from the Swiss National Science Foundation (P2BEP3-175289, 32003B_205067) and the Swiss Heart Foundation (FF21106). T.-H.C.'s research was funded by a grant from the Swiss National Science Foundation (PZ00P3-167826). Study medication (levothyroxine and matching placebo) was supplied free of charge by Merck KGaA. The research of E.v.d.S., N.A.v.V., and D.v.H. was funded by the European Commission project THYRAGE (Horizon 2020, 666869). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author Contributions
All authors have contributed substantially to the conception or design of the work, or the acquisition, analysis, or interpretation of data; drafting the work or revising it critically for important intellectual content; and final approval of the version published.
Disclosures
All authors have declared that they have no conflicts of interest to disclose.
Data Availability
Restrictions apply to the availability of some or all data generated or analyzed during this study to preserve patient confidentiality or because they were used under license. The corresponding author will on request detail the restrictions and any conditions under which access to some data may be provided.
References
Abbreviations
- fT4
free T4
- IEMO
Institute for Evidence-Based Medicine in Old Age
- IQR
interquartile range
- OR
odds ratio
- TPO
thyroid peroxidase
- TRUST
Thyroid Hormone Replacement for Untreated Older Adults With Subclinical Hypothyroidism Trial