Home | Help | Search/Archive | Feedback | Table of Contents |
BMJ 2004;328:879-883 (10 April),
doi:10.1136/bmj.328.7444.879
Clinical reviewEfficacy and safety of antidepressants for children and adolescentsJon N Jureidini, head1, Christopher J Doecke, associate professor of pharmacy practice2, Peter R Mansfield, research fellow3, Michelle M Haby, senior epidemiologist5, David B Menkes, professor of psychological medicine6, Anne L Tonkin, associate professor41 Department of Psychological Medicine, Women's and Children's Hospital, North Adelaide, 5006 SA, Australia, 2 Quality Use of Medicines and Pharmacy Research Centre, University of South Australia, Adelaide, 5000 SA, 3 Department of General Practice, University of Adelaide, Adelaide, 5005 SA, 4 Department of Clinical and Experimental Pharmacology, University of Adelaide, 5 Health Surveillance and Evaluation Section, Public Health, Department of Human Services, Melbourne, 3000 Vic, Australia, 6 University of Wales Academic Unit, Wrexham LL13 7YP
Correspondence to: J N Jureidini jureidinij{at}wch.sa.gov.au
How safe and effective are antidepressants in children and adolescents? The authors of this review have found disturbing shortcomings in the methods and reporting of trials of newer antidepressants in this patient group
Antidepressants introduced since 1990, especially selective serotonin reuptake inhibitors and venlafaxine, have been used increasingly as first line treatment for depression in children.1 2 The safety of prescribing antidepressants to children (including adolescents) has been the subject of increasing concern in the community and the medical profession, leading to recommendations against their use from government and industry (box 1). In this paper, we review the published literature on the efficacy and safety of newer antidepressants in children.
Having criticised the way in which Keller et al interpreted the results of their study,3 4 we sought to check the quality of methods and reporting of other published trials of newer antidepressants in children (box 2). Of seven published randomised controlled trials of newer antidepressants for depressed children published in refereed journals, six used a placebo control.3 5-9 We analysed each study's methods and the extent to which authors' conclusions were supported by data. The seventh study, which compared a newer antidepressant with a tricyclic antidepressant without finding significant difference,10 was not included in the analysis but appears in the table on bmj.com.
Pharmaceutical companies paid for the trials and otherwise remunerated the authors of at least three of the four larger studies (table).
The table on bmj.com summarises the trials reviewed. A total of 477 patients in the six studies were treated with paroxetine, fluoxetine, sertraline, or venlafaxine ( 23% dropouts), and 464 were treated with placebo ( 25% dropouts). Of 42 reported measures, only 14 showed a statistical advantage for an antidepressant. None of the 10 measures relying on patient reported or parent reported outcomes showed significant advantage for an antidepressant, so that claims for effectiveness were based entirely on ratings by doctors. No study presented data on rates of attempted self harm, presentations to emergency or mental health services, or school attendance. Two small studies found no statistically significant advantage for antidepressants over placebo on any of the outcome measures reported.5 6 Of the remaining four papers, two did7 9 and two did not3 8 show statistically significant advantages for antidepressants over placebo on primary outcome measures. We meta-analysed the five published studies on selective serotonin reuptake inhibitors by using the standardised mean difference (Hedges' g) as the measure of effect.3 5 7-9 We averaged relevant outcome measures within studies and then pooled them across studies by using a random effects model. We included all continuous outcome measures related to depression and health related quality of life. The effect size was small 0.26 (95% confidence interval 0.13 to 0.40). Assuming a standard deviation of scores of 11 to 14 on the revised children's depression rating scale in depressed children, an effect size of 0.26 is equivalent to a very modest 3 to 4 point difference on the scale, which has a range of possible scores from 17 to 113. As regards unpublished studies, we note from a report from the US Food and Drug Administration Center for Drug Evaluation and Research that only one of nine showed a statistical advantage for drug over placebo.11
Because the follow up period for the randomised controlled trials was short, and numbers were relatively small, serious adverse effects were likely to be few. When they do occur, we would therefore expect authors to draw attention to them, along with data available from other sources that suggest that serious adverse effects might occur. Of 93 patients treated with paroxetine by Keller et al, 11 had serious adverse events, compared with 2/87 in the placebo group.3 The authors presented no statistical analysis, but the difference was significant (Pearson's 2 = 6.09, df = 1, P = 0.01). In spite of this striking difference in serious events between paroxetine and placebo, Keller et al concluded that, "paroxetine was generally well tolerated in this adolescent population, and most adverse effects were not serious," even though seven patients were admitted to hospital during treatment with paroxetine.3 Furthermore, despite five of these patients being admitted to hospital with events known to occur with the use of selective serotonin reuptake inhibitors, including suicidality, only one serious event (headache) was judged by the treating investigator to be related to paroxetine treatment. The criteria for determining causation of serious events were not stated. Among 373 patients in the trial by Wagner et al, 9% (17/189) treated with sertraline withdrew because of adverse events, compared with 3% (5/184) in the placebo group.9 These authors also published no statistical analysis of this outcome or details of the adverse effects, but the difference in withdrawal rates was significant (Pearson's 2 = 6.62, df = 1, P = 0.01). Wagner et al reported seven adverse effects that occurred in at least 5% of the sertraline group, at least twice as often as in the placebo. Despite these results they concluded that, "sertraline is an effective, safe, and well tolerated short-term treatment for children and adolescents." Other sources of data support the view that adverse effects might be more frequent than the authors of these studies imply. For example, children and adolescents with obsessive compulsive disorder exhibit a variety of treatment emergent effects of fluoxetine, including an "activation syndrome" affecting up to half of young patients; self injurious ideation or behaviour was seen in 6/42 patients.12 The failure of drug companies to disclose increased suicidal activity secondary to these drugs is also the subject of much debate.13
Withdrawals High rates of withdrawal occurred in all the studies, ranging from 17% to 32% for patients treated with selective serotonin reuptake inhibitors and from 17% to 46% for placebo treated patients. Such high rates of withdrawal over relatively short study periods (typically 8-10 weeks) raise concerns about the possible introduction of bias by the analytical method chosen. Most of these studies used an intention to treat, last observation carried forward approach. The last observation carried forward approach is based on the assumption (unlikely in childhood depression) that the condition of patients who have dropped out would have remained unchanged for the remainder of the study, had they continued in it. In none of these studies were the withdrawn patients assessed at the end of the trial to assess their outcome (a "true" intention to treat approach). The higher the drop-out rate the more likely a last observation carried forward approach is to produce unreliable results. Use of categorical outcomes Unblinding Doubtful clinical implications of statistical superiority to
placebo
In discussing their own data, the authors of all of the four larger studies have exaggerated the benefits, downplayed the harms, or both. This raises the question of whether the journals that published the research reviewed the studies with a sufficient degree of scrutiny, given the importance of the subject. Despite the authors' initial claims, data reported by Keller et al showed no statistically significant advantage of drug over placebo.3 Neither of the two pre-designated primary outcome measures (change from baseline in the score on the Hamilton rating scale (depression) and "response" defined as a fall in score below 8 or by 50%), were significantly different between paroxetine and placebo. Interpretation of these data was confused by an unexplained change in the definition of "response" to "reduction of HAM-D to below 8". Altering this definition enabled the authors to claim significance on a primary endpoint. The authors have subsequently modified that claim to having shown a "signal for efficacy."20 The two studies ultimately published by Emslie and colleagues in 1997 and 2002 (B1Y-MC-X065 and B1YMC-HCJE) were the subject of a "statistical review" by the US Center for Drug Evaluation and Research.21 That document showed that the prespecified primary outcome measure in the first Emslie study was proportion of completing patients who achieved recovery. The original definition of recovery in the study protocol was a score of 28 on the revised children's depression rating scale ("remission") and a clinical global impression-improvement score of 1 or 2. Two things are clear: firstly, this measure did not reach statistical significance (P = 0.339); secondly, when the study was published, new primary outcome measures were used (see table on bmj.com). The authors chose a reduction from baseline of 30% on the revised children's depression rating scale (shown on post hoc analysis of the first study to show favourable advantage to fluoxetine21) as the single primary endpoint for the second study (B1Y-MC-HCJE).8 However, they found no statistical difference between fluoxetine and placebo on this measure. Although Emslie et al did state in the results section that significance was not reached on "response," they did not make it explicit that this meant a failure to show change on their stated primary outcome, and they make much more of the secondary endpoints that did favour fluoxetine. Whenever the failure to show superiority of fluoxetine over placebo in achieving 30% reduction from baseline on the rating scale is reported, mean improvement and "remission" are given equal weight in the published paper, implying that one or both of them was also a primary endpoint. The authors go on to make an unqualified claim of efficacy, even though the drug showed no significant advantage over placebo on the single primary outcome measure. The independent "statistical review," on the other hand, concludes that, "the sponsor did not win on these two pediatric depression studies based on the protocol specified endpoint. The evidence for efficacy based on the pre-specified endpoint is not convincing."21 No information is given that provides insight into why the US Food and Drug Administration ultimately approved fluoxetine for childhood depression. Nor is it clear why the UK Committee on Safety of Medicines exempted fluoxetine from its criticisms through accepting the published versions of these studies, when it did not do so in relation to sertraline.22 Wagner et al described their work as "two randomised controlled trials," but the methods are identical, and they and we treated them as a single trial.9 The trials when combined included a large enough number of participants (364) to have adequate statistical power to detect small differences between treatments. Neither trial showed a statistically significant advantage for sertraline over placebo in terms of the primary endpoint, which in this case was change from baseline in the revised children's depression rating scale score.22 Only when the trials were combined did a statistically significant difference emerge, although this was very small (about 2.7 points on a 113 point scale). Furthermore, we question Wagner et al's inference that because tricyclic antidepressants are no more beneficial than placebo, even a small advantage for newer antidepressants justifies their use. The availability of older interventions that are not beneficial should not lower the threshold for accepting a new intervention, especially given the availability of more effective psychological treatments with no known adverse effects.23
The trials consistently found large improvements in placebo groups, with statistically significant additional benefits for active drug on some measures only. These results make a major benefit from newer antidepressants unlikely, but a small benefit remains possible. Randomised controlled trials usually underestimate the serious adverse effects of drugs.24 The fact that serious adverse effects with newer antidepressants are common enough to be detected in randomised controlled trials raises serious concerns about their potential for harm. The magnitude of benefit is unlikely to be sufficient to justify risking those harms, so confidently recommending these drugs as a treatment option, let alone as first line treatment, would be inappropriate. We are concerned that biased reporting and overconfident recommendations in treatment guidelines may mislead doctors, patients, and families. Many will undervalue non-drug treatments that are probably both safer and more effective. Accurate trial reports are a foundation of good medical care. It is vital that authors, reviewers, and editors ensure that published interpretations of data are more reasonable and balanced than is the case in the industry dominated literature on childhood antidepressants. This is particularly true in the light of the increasing reliance on online abstracts by doctors who lack the time or the skills for detailed analysis of complete trial reports.
Papers p 867
We thank Agnes Vitry, Dianne Campbell, and Brita Pekarsky for helpful comments on the manuscript. Contributors: JNJ and ALT had the original idea for the work. JNJ undertook the primary literature review and drafted themanuscript. CJD and DBM undertook a secondary literature review. MMH conducted the meta-analysis. CJD, MMH, PRM, DBM, and ALT assisted in drafting the manuscript. JNJ is the guarantor. Competing interests: JRJ and PRM are office bearers in Healthy Skepticism.
This article has been cited by other articles:
Rapid Responses:Read all Rapid Responses
Other related articles in BMJ:
|
Home | Help | Search/Archive | Feedback | Table of Contents |