Skip to main content

A randomized pilot and feasibility trial of live and recorded music interventions for management of delirium symptoms in acute geriatric patients

Abstract

Background

Delirium is an acute shift in attention and arousal, usually triggered by acute illness or surgery in older dementia patients. Prognosis is poor, and pharmacological options are limited; non-pharmacological interventions and music show promise.

Methods

This randomised pilot and feasibility trial tested feasibility, acceptability, fidelity, and safety of music interventions (MIs) for delirium patients and assessed preliminary effectiveness and suitability of the selected effect outcomes. Participants from an acute geriatric ward were randomised to Preferred Recorded Music (PRM) and Preferred Live Music (PLM), delivered for 30 min over three consecutive days. Feasibility outcomes included recruitment rate, retention, adherence, deviations, and treatment fidelity. Clinical outcomes were trajectory of delirium symptoms (arousal, attention, cognition), delirium duration, hospital stay length, and medication intake. Post-intervention and between groups changes in delirium symptoms were compared using mixed linear regression models for the repeated measurements. Mann-Whitney test and Fishers exact test were used for length of stay and medication use, respectively.

Results

26 participants (PLM = 14; PRM = 12), median age 87, most with hypoactive delirium were recruited at a rate of three participants per month. Retention rates for PLM and PRM were 64% and 33% respectively and adherence to PLM and PRM intervention protocols were 83% and 58%, respectively. Total adherence to the assessment protocols was 44%. PLM was delivered as intended, (treatment fidelity 93%), and PRM did not satisfy treatment fidelity (83%). All delirium symptoms except arousal improved on day 3 compared to baseline, with statistically significant improvement in attention. No conclusive pre-post or between-group differences were detected for any outcomes; confidence intervals were wide.

Conclusions

Feasibility of recruitment, interventions and assessments was indicated, and greater acceptability, safety and fidelity of the PLM intervention compared with the PRM. Adoption of external assessors is warranted in future trials, to mitigate slow recruitment and low adherence. Wide confidence intervals for most measures and comparisons indicate that the possible effect of the MIs on delirium cannot be excluded. The trial was registered at Clinical Trials, ID: NCT05398211, on 31/05/2022.

Peer Review reports

Background and objectives

Delirium is a clinical syndrome represented by an acute change in mental status [1] and inattention; cognitive dysfunction, disturbed level of arousal, psychotic episodes, emotional and behavioural changes may also be present [1, 2]. Delirium is common in hospital settings, occurring in > 50% of patients [3], with the highest prevalence in older, frail [4,5,6], acutely hospitalized, and mechanically ventilated patients in postsurgical intensive care units (ICUs) [7,8,9]. Delirium develops suddenly, is transient with fluctuating symptoms [7]. Pathophysiology is complex and not completely understood, involving an interplay between the predisposing factors such as advanced age, underlying illness like dementia, and precipitating factors such as acute illness, drugs, or surgery [7].

Prognoses is poor, including prolonged hospitalization [10], need for long term care [11, 12], onset or worsening of cognitive impairment [7, 10], and increased mortality risk [7, 13]. Non-pharmacological, multicomponent approaches might address multifactorial delirium etiology, and are showing promising effects in preventing delirium [3, 13], decreasing agitation [8, 14], and deriving interest, pleasure and general well-being [15]. The evidence of their effectiveness in clinical management of delirium symptoms is still scarce [14, 16].

Music interventions (MIs) have positive effects on behavioural and psychological symptoms of conditions similar to delirium, such as dementia [17,18,19,20,21,22,23,24] and disorders of consciousness [25, 26], and may also be effective in prevention and treatment of delirium in older individuals. Our systematic review [27] showed that research on MIs for delirium is scarce; existing trials show promising results, particularly for delirium prevention. High patient acceptability and enjoyment of MIs, cost-effectiveness, and absence of adverse effects were also commonly reported [27]. Despite the moderate-high risk of bias among the included studies, our meta-analysis indicated that postsurgical, critically ill, and mechanically ventilated patients were 50% less likely to develop delirium after being exposed to music postoperatively [27]. Significant improvements in engagement, mood, and delirium severity were also found in acute geriatric and long-term care patients post-intervention [28,29,30]. However, better designed trials, with standardized interventions, in specific clinical settings and utilizing outcome measures able to capture clinically meaningful changes, are still needed to substantiate existing effectiveness claims. The aim of the current trial was to pilot test and establish feasibility to determine the need for investing in a future randomised controlled trial (RCT) design. The objectives of this study were to establish: (1) the feasibility of recruitment procedures and establish the likely recruitment rate; (2) the feasibility of assessments and follow-up procedures; (3) the success of and fidelity adherence of interventions implemented by the therapist; (4) the interventions’ acceptability; (5) the safety of the interventions; and (6) the sensitivity and suitability of the selected effect-outcomes to assess the efficacy of the music interventions. Clinical objectives were: (1) to estimate preliminary efficacy of live and recorded MIs on severity of delirium symptoms, and (2) to establish preliminary evidence of the specific delirium symptom domains most responsive to the MIs.

Methods

Trial design

We adopted a two-arm randomized repeated measures design to evaluate the feasibility and preliminary effectiveness of Preferred Recorded Music (PRM), and Preferred Live Music (PLM) interventions. Our study protocol detailing the intervention description, theoretical rationale, and statistical analysis plan, was published prior to the completion of data-collection [31]. This trial is reported in accordance with the Consolidated Standards of Reporting Trials (CONSORT) statement’s extended checklist for pilot and feasibility trials (Additional file 8) [32]. The trial was registered at Clinical Trials (NCT05398211) on 31/05/2022.

Participants

Participants were recruited from an acute geriatric ward within the Division of Geriatric Medicine at Oslo University Hospital (OUH), where the prevalence of dementia and delirium is high. The 20 bed ward admits 75-80 older patients per month (≥65 years) for acute medical care [33]. After delirium screening at admission, using 4 AT [34]—a rapid validated tool for delirium detection—patients with the score ≥ 4 were assessed by geriatricians for eligibility. Delirium was diagnosed according to the Diagnostic and Statistical Manual of Mental Disorders, 5 th Edition (DSM-5) criteria [1] applying a recommended diagnostic test battery, previously described in our protocol [2, 31, 35,36,37,38] (Additional file 2). Subtypes were determined using the well validated Delirium Motor Subtyping Scale (DMSS-4) [39].

Patients were eligible if:

  1. (1)

    aged ≥ 65 years

  2. (2)

    diagnosed with delirium or subsyndromal delirium within the last 72 hours and still present

  3. (3)

    informed consent was obtained.

Patients with comorbidities such as dementia, mild cognitive impairment, or those under long-term care were also included, and we did not exclude patients with COVID-19. Patients were excluded if:

  1. (1)

    Previously enrolled in the study and were readmitted to the ward during the study period

  2. (2)

    Presenting with severe hearing impairments

  3. (3)

    Presenting with severe psychiatric conditions other than delirium

  4. (4)

    Admitted due to severe alcohol or substance addiction

  5. (5)

    Their assessed musical preferences included orchestral or other kinds of music impossible to perform live by voice and accompaniment

The 5th criteria was included after the trial commenced, registered at Clinical Trials on the 5th of December, 2022, and included in the published protocol [31].

Interventions

Two interventions tested in this study were 1) Preferred Recorded Music (PRM) and 2) Preferred Live Music (PLM). PLM included live music delivered by voice and a guitar with improvisation elements; the PRM involved pre-recorded music delivered via a loudspeaker.

The potential therapeutic efficacy of preferred music, underlying both interventions, stems from its personal, social and cultural attributes, allowing it to alter emotional responses [40], boost self-awareness [41], and transiently enhance certain cognitive functions like autobiographical memory [42]. The live music component and responsive musical and non-musical interactions between the patient and the intervention facilitator in the PLM intervention were expected to regulate arousal levels and attention. Synthetic, loudspeaker sound and original versions of the preferred music in PRM intervention were expected to stimulate autobiographic memory and moderate attention and arousal. A more detailed description of the interventions and the theoretical rationale is provided in our published protocol [31].

Both interventions were delivered by a certified music therapist (MT) who also designed the interventions.

After the baseline assessments, the MT assessed participants’ music preferences from legal guardians, using an adapted Norwegian version of the Assessment of Personal Music Preference tool (APMP) [43], and after that, directly from the participants in an interactive, 30 min’ session, using a Music Assessment Tool (MAT) [44]. Music for the interventions was selected by the MT using the acquired information. The participants received their allocated interventions once a day, for 30 min (between 8 AM and 5 PM) over three consecutive days, in addition to usual care. Participants were considered to have adhered to the intervention protocol if they completed at least 10 min of the sessions.

MIs were primarily delivered in the participants’ private rooms, except when they shared a room with another patient.

Outcomes

Our published protocol previously described outcomes relevant to our feasibility and clinical objectives and the detailed assessment schedule [31]. The main properties of the assessment tools are provided in Additional files 1–3.

Feasibility outcomes were assessed during and upon the completion of the intervention period and comprised:

  1. (1)

    Recruitment rate: an average number of patients recruited per month.

  2. (2)

    Retention rate: the proportion of participants completing the study as described in the protocol. Withdrawals were defined as withdrawing consent to participate in the study. Dropouts were defined as any discontinuations of the interventions and assessments due to the participants’ health condition, discharge, or an unavailable assessor. Refusals were defined as the patient declining invitations to be involved in assessment or treatment.

  3. (3)

    The proportion of sessions where the MIs and pre-post assessments were completed as planned (adherence to study protocol) and the proportion of sessions with protocol deviations. Deviations were categorized as patient or interventionist-related and further classified as minor, major, or fatal based on their impact on data quality and patient safety, with the fatal category indicating patients’ death [45].

  4. (4)

    The success of treatment fidelity (TF) was determined by observing the video recordings of 20% of randomly selected participants from both intervention groups who had completed the interventions as per protocol. Video recordings were evaluated by an independent rater using a bespoke checklist for each intervention. The six checklist items were scored and calculated (no = 0, yes = 1 point), and the threshold for satisfied treatment fidelity for each participant was ≥ 80% averaged across the three intervention days, including satisfied compulsory items 4-6 for each session. The intervention was considered not to have met fidelity if the compulsory items were not satisfied even if the total score was ≥ 80%.

Secondary clinical outcomes were assessed at baseline, within one hour pre- and post-session, and at discharge by the specially trained geriatricians at the ward and included:

  1. (1)

    Trajectories of delirium symptoms, assessed using DSM-5 diagnostic test-battery [31] comprising: Observational Scale of Level of Arousal (OSLA) [36] and modified Richmond Agitation Sedation Scale (mRASS) [46,47,48] for level of arousal; backwards tests and digit span tests for attention [35, 38].; orientation and short-term memory, using delayed recall tasks and orientation questions from Memorial Delirium Assessment Scale (MDAS) [49] for orientation and short term memory.

  2. (2)

    Duration of delirium: determined by an experienced delirium researcher (BEN) after the discharge from the ward, based on all the previously assessed data.

  3. (3)

    Length of hospital stay (LOS)

  4. (4)

    Use of Pro Re Nata (PRN), non-prescribed psychopharmacological medication (benzodiazepines and antipsychotics).

LOS and use of psychopharmacological medications during the hospitalization were retrieved from the electronic medical journals at discharge (Additional file 1).

Adverse events were recorded after music and assessment sessions and from daily reports by the medical staff at the ward. The events were categorized as critical and non-critical for the patients’ health and well-being and related or unrelated to the interventions.

Sample size

This feasibility trial did not intend to draw conclusive findings on the effects of MIs on delirium symptoms, and did not need to be adequately powered. Initially aimed to recruit 60 participants, 30 in each arm, to obtain sufficient data to examine the main objectives of the study, while allowing for potential dropouts.

Randomization

Eligible, consenting patients were randomized to study arms using permuted block randomization 1:1, and the online randomization software, True Random Numbers (https://www.random.org/). An independent researcher generated randomization blocks of 10 participants, to maintain even number in the two study-arms. The participants were assigned to their respective interventions by the music therapist, after baseline delirium assessments and before assessments of music preferences.

Masking

The therapist and the participants could not be masked to group allocation due to the nature of the interventions; assessors were masked. To increase the success of masking of assessors, staff at the ward were instructed not to reveal what treatment arm participants were assigned to, and the music therapist walked into the participants’ rooms with her guitar and the loud-speaker, regardless of assigned intervention.

Statistical methods

The statistical analysis plan is described in our previously published protocol [31].

Linear mixed models were used to estimate the change in OSLA, mRASS, attention and cognitive status from pre- to post-intervention and from baseline for each assessment day, and for the comparison between the intervention groups. The marginal effects were calculated for each of these comparisons, with adjustments for the participants'baseline scores, and using participants’ ID as a random effect, to incorporate the individual differences into the analysis. The Mann-Whitney test for skewed data was used to calculate the difference between the groups in length of hospital stay. We applied Fisher’s Exact test for small samples to determine group differences in received PRN medication.

Results

Participants flow and recruitment

Potential participants were screened for eligibility between 15th June 2022 and 21st April 2023 (approximately 39 weeks). Of the 809 patients admitted to the acute geriatric ward during the recruitment period, 66 patients were assessed for eligibility (Fig. 1). Of these, 40 were excluded due to uncertain delirium diagnosis, patient receiving end of life care, or being discharged before the sessions could begin, contagion (other than COVID-19), aphasia preventing completion of assessments of delirium, unavailable assessor for the rest of the evaluations, or music therapist unavailability. In total n = 26 patients met all the inclusion criteria and were randomized (PLM, n = 14; PRM, n = 12) (Fig. 1).

Fig. 1
figure 1

CONSORT flow chart

Baseline demographic and clinical characteristic were similar across the intervention groups (Table 1). Participants’ median age was 87, half were female (n = 13), and the majority lived alone (n = 19, 73%,), or with others (n = 4, 15%,) in the community. All had DSM-5 delirium at baseline, 68% hypoactive and 20% had no recognizable subtype. In 73% of cases (n = 19), infection, fracture or a cardiovascular event precipitated delirium. For some participants the trigger was unknown, or hard to ascertain due to underlying dementia or depression. Clinical frailty scale scores were obtained for n = 15 participants (58%), showing scores of ≥ 5 in 14 out of 15 tested patients, and a median score of 7. Along with the median frailty index of 0.40 in these participants, this score indicated severe frailty. The median NEWS2 score of 3 at admission to the AG ward indicated low to moderate acute illness severity (Table 1).

Table 1 Demographic and clinical characteristics at baseline

Feasibility outcomes

The average recruitment rate was 3 participants per month (Table 2). The retention rates for the participants in PLM and PRM groups were 64% and 33% respectively. Most withdrawals were due to worsening of the participants’ health condition, but none withdrew their consent to use the data collected prior to withdrawal.

Table 2 Feasibility outcomes

The total adherence to the study protocol (both interventions and assessments) was 29% in the PLM (n = 4 patients), and 17% in the PRM group (n = 2 patients). Five additional participants in the PLM group met the per-protocol criteria but were excluded because their assessments occurred outside of the assessment window.

The total number of conducted music sessions, including those with deviations, was 55 out of the planned 78 (PLM = 42; PRM = 36). Common reasons for missing sessions were: 1) minimum dosage of 10 min was not met, 2) unavailability of assessor/music therapist to assess/treat, 3) patient became palliative, or 4) patient was discharged. The percentage of adherence to the PLM and PRM protocols were 83% and 58% respectively. Of the 21 completed PRM sessions, 17 were delivered with some deviations (81%), of which the most common were the unwanted interaction between the patient and the therapist, and in some cases changing the order of songs from one intervention day to another. No significant protocol deviations were registered during the PLM, and most sessions were delivered as intended.

The success of TF was 93% for the PLM, including satisfied complementary items 4-6 in the checklists, and 83% for the PRM, with one of the complementary items not satisfied. As planned, music preferences were assessed with input from the legal guardians and participants. Only two participants were not able to contribute to music preference assessments due to their worsened health condition at the time of assessment (Table 2).

Of the planned 78 pre-post delirium assessments in both groups, 34 were successfully completed (44%). Adherence to the assessment procedures in PLM was 62%. Ten pre-post assessment were not completed due to unavailable assessor, patient discharged or becoming palliative: six of the pre-post assessments were out of the assessment window. Adherence to the assessment procedures in PRM was 50%, with 15 pre-post assessments missing due to the unavailable assessor, patient discharged or becoming palliative, and three pre-post assessments were conducted outside of the assessment window.

One serious adverse event (death) occurred during the intervention period but was unrelated to the interventions. Minor, potentially intervention-related events such as patients falling asleep while listening to music, as well as showing signs of boredom and restlessness or wanting to switch faster to the next song, were recorded during the PRM sessions.

Secondary clinical outcomes

In addition to its within-subject focus, this two-arm trial compared the interventions to determine which might be more suitable for a future definitive trial. With no control group, the comparative analysis remained exploratory and complemented the primary feasibility assessments. An intention-to-treat principle was used to analyse efficacy outcomes, such that the analysis included also those participants who were discharged prior to completion of the three music sessions, or were not able to receive the interventions as per protocol for other reasons. We also intended to complete a per-protocol analysis, but with few participants completing the interventions as described we ultimately opted not to undertake this analysis.

Within-subject analyses (pre-post).

Comparing baseline and pre-intervention scores showed that delirium symptoms varied during the first three days of the intervention, with measures of level of arousal, attention, orientation and short-term memory fluctuating for most individuals. On average, the participants’ level of arousal was similar across the three assessment days. However, their performance on attention tests improved on day 3 comparing baseline and pre-intervention scores, and was statistically significant for counting from 20 to 1, weekdays backwards, digit span memory test and SAVEAHAART vigilance tests (Table 3).

Table 3 Daily mean score for clinical delirium outcomes and change from baseline (for the entire sample)

There was no significant change in delirium symptoms pre to post MIs on each day. Analysing the data without considering the specific intervention day or group but still considering that each person had up to seven measures did not show significant change either (Table 4). The individual trajectories showed that some symptoms for given participants did change from pre to post MIs, but these changes were in the negative direction. The residuals for delayed recall score were so skewed that linear modelling of this outcome was not appropriate and we instead calculated the proportion of the participants who could successfully recall at least one word. The total percentage of the participants who could recall at least one word on the delayed recall test was only 19% (95% CI: 8.2–38.9) (Additional file 5).

Table 4 Before and after music intervention each day (for the entire sample)

Group comparison.

There was no evidence of a difference between the participants’ delirium symptoms in PLM or PRM groups on day 3 of the intervention (Additional file 4). Group differences on days 1 and 2 were not examined on the assumption that the potential difference in the effect of the interventions would be most relevant on day 3. The participants’ average length of hospital stays in PLM was 11 days (SD = 8.95) and in PRM 13 days (SD = 8.94); there was no statistically significant group difference between the PLM and PRM (U = 82.500, p = 0.94). The results of Fisher’s exact test did not indicate statistically significant difference between the number of patients receiving PRM medication in the two intervention groups (Additional file 6). There was no sufficient data in the medical journals to estimate changes in delirium duration.

Discussion

Feasibility outcomes

This feasibility study demonstrated that implementing PLM and PRM with vulnerable delirium patients at the AG ward was feasible and that the MT could successfully conduct music preference assessments. Obtaining music preferences from the legal representatives prior to direct, interactive patient assessments provided a familiar reference point for dialogic method and recognizing musical examples, thereby forstering a personalized setting that activated recognition memory and facilitated further retrieval of musical memories [43]. However, interactive assessments may have influenced delirium outcomes before the music interventions (MIs) began, introducing a potential confound. They may also have generated expectations regarding subsequent interventions and affected test performance.

Consistent with prior findings identifying hypoactive delirium as the most prevalent subtype [44, 50], 68% of our sample exhibited hypoactive delirium. Because distinct delirium subtypes present different symptoms and warrant tailored interventions, previous recommendations have encouraged investigating interventions separately for each subtype [51]. This recommendation aligns with our suggestions for further investigation into MIs.

PLM demonstrated robust adherence, high TF, a relatively high retention rate, consistent dosage delivery, and minimal protocol deviations, indicating strong feasibility and acceptance. No adverse events or unusual treatment effects were recorded, and the lack of refusals further supports its safety. Descriptive data from the music therapist’s session notes and checklists also suggest PLM was engaging, with participants singing, moving, or reminiscing in nearly all sessions. Notably, 77% of participants in the PLM arm had hypoactive delirium, highlighting the potential relevance of PLM for this particular subtype.

PRM showed lower adherence, retention, and inconsistent session durations (10–33 min). Discontinuations were mainly due to declining health, palliative status, or discharge—rather than refusals—suggesting these findings may not reflect low acceptability. However, therapist notes indicated that patients were actively engaged in only half of the sessions, often exhibiting restlessness, a desire to skip songs, or requesting to end sessions early. Such responses imply PRM may be less engaging and potentially monotonous, likely due to its non-interactive delivery. This observation aligns with earlier assumptions that prolonged loudspeaker-based music can lead to habituation and boredom in delirium patients [31, 52].

Although PRM achieved an 83% success rate, it did not meet TF criteria due to repeated breaches of the protocol’s prohibition on patient-therapist interaction. These interactions—always patient-initiated regarding confusion, pain, distress, or a desire to discuss the music—were ethically justifiable but diminished treatment consistency. Excluding the music therapist (MT) from the delivery altogether could address this issue, though it raises patient-safety concerns about unsupervised music exposure. Alternatively, replacing the MT with another health practitioner would render any interaction typical of usual care.

Recruitment and assessment procedures proved feasible and accurately identified eligible delirium patients. However, reliance on internal assessors—available only two weekdays during day shifts—limited recruitment due to high workloads and other responsibilities. For future research, employing external assessors who can be present most shifts, seven days a week, is advisable to enhance recruitment efficiency.

The recommended test battery for the pre-post assessments was efficient, accurate in assessing delirium and its features, and suitable for application at the AG ward. However, the completion time varied among the assessors. As there is currently no definitive diagnostic test for delirium, its detection depends on assessing key features, combining observation, cognitive testing, patients’ medical history and clinical interviews [2]. By integrating continuous (symptom-based) and dichotomous (delirium yes/no) measures, our harmonized test battery offers more nuanced insights into the trajectory of delirium severity and contributes to the development of more robust, reliable, and standardized delirium assessments in the future [2]. This test battery was also well received by delirium patients, as refusals were rare and typically attributable to significant clinical deterioration. Protocol deviations were minimal, primarily involving delayed or omitted assessments. The lack of clear post-intervention effects may suggest that the one-hour assessment window was too lengthy, potentially obscuring more immediate impacts. Future studies could address this by employing more flexible external assessors who can align measurement schedules more closely with the interventions’ end.

Although the test battery showed high accuracy, efficiency, and suitability, overall adherence to the three-day, multi-measure follow-up protocol was low. However, this was largely due to assessor unavailability and patients becoming palliative or being discharged, suggesting that low adherence may not reflect the feasibility of the follow-up protocol itself. Some participants exhibited substantially worse post-intervention performance on certain cognitive or attention tasks. While the music interventions might have contributed, repeated testing could also be to blame. Although the three-day protocol generated extensive data, it may have overburdened this vulnerable patient group, potentially causing exhaustion, fatigue, or boredom and thus affecting test outcomes. Consequently, future research should carefully re-evaluate the repeated-measures design and the length of the follow-up period.

Clinical outcomes

Our results showed that the participants’ performance on the attention tests improved significantly on day three, when comparing baseline and pre-interventions scores, while most of their other symptoms were similar to baseline. However, without a control group, the observed changes are difficult to attribute to the MIs delivered the previous day, as delirium usually is usually reversed by treatment of underlying causes [7, 53]. Nevertheless, the summary evaluation of individual DSM-5 criteria showed that most participants still had delirium at the end of the intervention period.

No statistically significant pre-post intervention changes or inter-group changes in delirium symptoms were observed for any of the measures. However, the trial was underpowered to detect preliminary effectiveness properly. Accordingly, the CIs for mean differences were wide for most measures, and we cannot exclude the possibility of changes in delirium symptoms associated with the interventions. The percentage of participants who could recall at least one word on the delayed recall tests was very low. With small samples in addition, testing pre-post intervention and between the groups, change in proportion would have provided no conclusive findings and was omitted. No significant differences in LOS and intake of PRN medication between the groups were detected as the study was underpowered to provide conclusive findings in this regard, and changes in these measures could be correlated with many other factors. The follow-up of delirium duration after the intervention period was unsuccessful due to the transient and fluctuating delirium nature, making it difficult to ascertain whether it had been recovered.

Despite not showing sensitivity to the MIs, clinical outcomes tested in this trial are still highly relevant for detecting changes in delirium progression and severity and should be included in the future. However, to capture the potential effects of the MIs, other complementing outcomes, such as biomarkers, patient-centred outcomes (emotional responses and engagement), or environmental outcomes related to the medical ward and staff, should be considered and explored. Data from the MT’s session notes and checklists indicated that relevant intervention-related changes might also have occurred during the MI sessions, and it is recommended that future trials consider assessing these changes more systematically.

Testing clinical outcomes did not provide sufficient information to discern which of the two MIs could more effectively impact delirium symptoms. The wide CIs are mainly associated with small samples but indicate that the potential effect of PLM and PRM interventions cannot be ruled out.

Strengths

The trial was sufficiently powered to assess feasibility outcomes, addressing a noted gap in music and delirium research, where methodological quality is often low to moderate and feasibility evaluations are frequently overlooked [27]. Key strengths include the use of validated and recommended delirium assessment methods [2, 35, 37, 38], targeted assessor training, and involvement of an experienced delirium researcher to final interpretation of the findings. Subtyping delirium is another strength, given its clinical and prognostic importance [54,55,56]. Furthermore, employing a detailed, theoretically grounded intervention protocol with treatment fidelity assessments enhances the generalizability of future efficacy findings [27]. Lastly, involving a trained music therapist (MT) and conducting music preference assessments allowed interventions to be personalized and potentially safer, in line with existing recommendations [27].

Limitations

Despite being adequately powered to examine feasibility, the trial lacked sufficient power to investigate preliminary efficacy or compare between groups, where a minimum sample size of 30 per group is recommended [57]. No control group was included due to the primary feasibility focus and uncertainties related to delirium diagnosis and recruitment. The reliance on on-site staff during regular working hours slowed recruitment and led to missed assessments, as staff managed competing responsibilities. Although delirium subtyping was incorporated, the limited sample size precluded separate analyses for hypoactive and hyperactive subtypes, which likely require different management approaches. Future research should address these constraints by including a control group, considering distinct delirium subtypes, and improving recruitment strategies, for instance, by engaging dedicated external assessors.

Conclusion

The feasibility of recruitment procedures, music preference assessments, MIs and assessment protocols were indicated and the results showed that PLM intervention is more engaging, better accepted, and potentially more suitable for further testing with acutely ill older patients with delirium. Recommended next steps are to undertake a pilot study with a comparative group, assess preliminary efficacy, estimate the size of the treatment effects, and to explore different intervention dosages and frequency of delivery.

Data availability

The datasets may be provided by the corresponding author on reasonable request.

Abbreviations

AG:

Acute Geriatrics

APMP:

Assessment of Personal Music Preference tool

CAM-ICU:

Confusion Assessment Method for the Intensive Care Unit scale

CI:

Confidence Interval

CONSORT:

Consolidated Standards of Reporting Trials

CSF:

Clinical Frailty Scale

DMSS-4:

Delirium Motor Subtyping Scale

DOWB:

Days of the Week Backwards

DSM-5:

Diagnostic and Statistical Manual of Mental Disorders, 5th Edition

FI:

Frailty Index

ICU:

Intensive Care Unit

IQCODE:

Informant Questionnaire on Cognitive Decline in the Elderly

LOS:

Length of stay

MAT:

Music Assessment Tool

MDAS:

Memorial Delirium Assessment Scale

MI:

Music Interventions

MOYB:

Months of the Year Backwards

mRASS:

Modified Richmond Agitation Sedation Scale

MT:

Music Therapist

NEWS2:

National Early Warning Score 2

OSLA:

Observational Scale of Level of Arousal

OUH:

Oslo University Hospital

PLM:

Preferred Live Music

PRM:

Preferred Recorded Music

PRN:

Pro Re Nata, non-prescribed psychopharmacological medication

SAVEHAART:

Ten-letter vigilance “A” test

TF:

Treatment Fidelity

References

  1. American Psychiatric Association DS, American Psychiatric A. Diagnostic and statistical manual of mental disorders: DSM-5: American psychiatric association Washington, DC; 2013.

  2. Oh ES, Akeju O, Avidan MS, Cunningham C, Hayden KM, Jones RN, et al. A roadmap to advance delirium research: Recommendations from the NIDUS Scientific Think Tank. Alzheimers Dement. 2020;16(5):726–33.

    Article  PubMed  Google Scholar 

  3. León-Salas B, Trujillo-Martín MM, Martínez Del Castillo LP, García-García J, Pérez-Ros P, Rivas-Ruiz F, et al. Multicomponent Interventions for the Prevention of Delirium in Hospitalized Older People: A Meta-Analysis. J Am Geriatr Soc. 2020;68(12):2947–54.

    Article  PubMed  Google Scholar 

  4. Juliebø V, Bjøro K, Krogseth M, Skovlund E, Ranhoff AH, Wyller TB. Risk factors for preoperative and postoperative delirium in elderly patients with hip fracture. J Am Geriatr Soc. 2009;57(8):1354–61.

    Article  PubMed  Google Scholar 

  5. Korevaar JC, van Munster BC, de Rooij SE. Risk factors for delirium in acutely admitted elderly patients: a prospective cohort study. BMC Geriatr. 2005;5(1): 6.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Siddiqi N, House AO, Holmes JD. Occurrence and outcome of delirium in medical in-patients: a systematic literature review. Age Ageing. 2006;35(4):350–64.

    Article  PubMed  Google Scholar 

  7. Wilson JE, Mart MF, Cunningham C, Shehabi Y, Girard TD, MacLullich AM, et al. Delirium. Nat Rev Dis Primers. 2020;6(1):90.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Marcantonio ER. Delirium in hospitalized older adults. N Engl J Med. 2017;377(15):1456–66.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Nasjonalt senter for aldring og helse. n.d. “Delirium – Akutt forvirring.” https://www.aldringoghelse.no/fysisk-helse/vanlige-sykdomsbilder-hos-eldre/delirium-akutt-forvirring/. Accessed 29 Apr 2025.

  10. Gleason LJ, Schmitt EM, Kosar CM, Tabloski P, Saczynski JS, Robinson T, et al. Effect of delirium and other major complications on outcomes after elective surgery in older adults. JAMA Surg. 2015;150(12):1134–40.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Krogseth M, Wyller TB, Engedal K, Juliebø V. Delirium is a risk factor for institutionalization and functional decline in older hip fracture patients. J Psychosom Res. 2013;76(1):68–74.

    Article  PubMed  Google Scholar 

  12. Witlox J, Eurelings LS, de Jonghe JF, Kalisvaart KJ, Eikelenboom P, van Gool WA. Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: a meta-analysis. JAMA. 2010;304(4):443–51.

    Article  CAS  PubMed  Google Scholar 

  13. Asghar A, Siddiqui KM, Ahsan K, Chughtai S. Postoperative delirium after cardiac surgery; incidence, management and prevention. Anaesthesia, Pain & Intensive Care. 2017;21(1):109–12.

    Google Scholar 

  14. Oh ES, Fong TG, Hshieh TT, Inouye SK. Delirium in older persons: advances in diagnosis and treatment. JAMA. 2017;318(12):1161–74.

    Article  PubMed  PubMed Central  Google Scholar 

  15. O’Hanlon S, O’Regan N, Maclullich AM, Cullen W, Dunne C, Exton C, et al. Improving delirium care through early intervention: from bench to bedside to boardroom. J Neurol Neurosurg Psychiatry. 2014;85(2):207–13.

    Article  PubMed  Google Scholar 

  16. Hshieh TT, Inouye SK, Oh ES. Delirium in the elderly. Clin Geriatr Med. 2020;36(2):183–99.

    Article  PubMed  Google Scholar 

  17. Brancatisano O, Baird A, Thompson WF. Why is music therapeutic for neurological disorders? The Therapeutic Music Capacities Model. Neuroscience and biobehavioral reviews. 2020;112:600–15. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.neubiorev.2020.02.008.

    Article  PubMed  Google Scholar 

  18. O’Kelly J, James L, Palaniappan R, Taborin J, Fachner J, Magee WL. Neurophysiological and behavioral responses to music therapy in vegetative and minimally conscious States. Frontiers in human neuroscience. 2013;7:884. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fnhum.2013.00884.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Sihvonen AJ, Särkämö T, Leo V, Tervaniemi M, Altenmüller E, Soinila S. Lancet Neurol. 2017;16(8):648.

    Article  PubMed  Google Scholar 

  20. Bian X, Wang Y, Zhao X, Zhang Z, Ding C. NeuroRehabilitation. 2021;48(4):553.

    PubMed  Google Scholar 

  21. Moreno-Morales C, Calero R, Moreno-Morales P, Pintado C. Music Therapy in the Treatment of Dementia: A Systematic Review and Meta-Analysis. Frontiers in medicine. 2020;7:160. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fmed.2020.00160.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Ridder HM, Stige B, Qvale LG, Gold C. Individual music therapy for agitation in dementia: an exploratory randomized controlled trial. Aging Ment Health. 2013;17(6):667–78.

    Article  PubMed  PubMed Central  Google Scholar 

  23. van der Steen, J. T., Smaling, H. J., van der Wouden, J. C., Bruinsma, M. S., Scholten, R. J., & Vink, A. C. Music-based therapeutic interventions for people with dementia. Cochrane Database Syst Rev. 2018;7(7):CD003477. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/14651858.CD003477.pub4.

  24. Vink AC, Zuidersma M, Boersma F, de Jonge P, Zuidema SU, Slaets JP. Effect of music therapy versus recreational activities on neuropsychiatric symptoms in elderly adults with dementia: an exploratory randomized controlled trial. J Am Geriatr Soc. 2014;62(2):392–3.

    Article  PubMed  Google Scholar 

  25. Grimm T, Kreutz G. Brain Inj. 2018;32(6):704.

    Article  PubMed  Google Scholar 

  26. Li X, Li C, Hu N, Wang T. J Neurosci Nurs. 2020;52(4):146.

    Article  CAS  PubMed  Google Scholar 

  27. Golubovic J, Neerland BE, Aune D, Baker FA. Music interventions and delirium in adults: a systematic literature review and meta-analysis. Brain Sci. 2022;12(5):568.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Browning SG, Watters R, Thomson-Smith C. Impact of therapeutic music listening on intensive care unit patients: a pilot study. Nurs Clin North Am. 2020;55(4):557–69.

    Article  PubMed  Google Scholar 

  29. Cheong CY, Tan JA, Foong YL, Koh HM, Chen DZ, Tan JJ, et al. Creative music therapy in an acute care setting for older patients with delirium and dementia. Dement Geriatr Cogn Dis Extra. 2016;6(2):268–75.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Correâ L, Caparrol AJDS, Martins G, Pavarini SCI, Grataõ ACM. Effects of music on body and facial expressions and psychological and behavioral symptoms of older adults. Braz J Occup Ther. 2020;28(2):539–53.

    Google Scholar 

  31. Golubovic J, Baker FA, Simpson MR, Neerland BE. Live and recorded music interventions for management of delirium symptoms in acute geriatric patients: Protocol for a randomized feasibility trial. Nordic J Music Ther. 2023;33(1):62–83. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/08098131.2023.2192759.

  32. Eldridge SM, Chan CL, Campbell MJ, Bond CM, Hopewell S, Thabane L, et al. Pilot and feasibility. Studies. 2016;2(1):64.

    Google Scholar 

  33. Ahmed M. Akuttgeriatri - pleietung avdeling i teknologitungt sykehus: Hvordan bevare personale, glød og entusiasme? Den Sjuende Norske Kongressen i Geriatri. 2017. null p.

  34. Shenkin SD, Fox C, Godfrey M, Siddiqi N, Goodacre S, Young J, et al. BMC Med. 2019;17(1):138.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Hall RJ, Meagher DJ, MacLullich AMJ. Delirium detection and monitoring outside the ICU. Best Pract Res Clin Anaesthesiol. 2012;26(3):367–83.

    Article  PubMed  Google Scholar 

  36. Hall R, Stíobhairt A, Allerhand M, MacLullich AMJ, Tieges Z. The Observational Scale of Level of Arousal: A brief tool for assessing and monitoring level of arousal in patients with delirium outside the ICU. Int J Geriatr Psychiatry. 2020;35:1021–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/gps.5324.

    Article  PubMed  Google Scholar 

  37. Neerland BE, Hov KR, Bruun Wyller V, Qvigstad E, Skovlund E, MacLullich AM, et al. The protocol of the Oslo Study of Clonidine in Elderly Patients with Delirium; LUCID: a randomised placebo-controlled trial. BMC geriatr. 2015;15:7.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Pollmann CT, Mellingsæter MR, Neerland BE, Straume-Næsheim T, Årøen A, Watne LO. Orthogeriatric co-management reduces incidence of delirium in hip fracture patients. Osteoporos Int. 2021;32(11):2225–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Meagher D, Adamis D, Leonard M, Trzepacz P, Grover S, Jabbar F, et al. Int Psychogeriatr. 2014;26(4):693.

    Article  CAS  PubMed  Google Scholar 

  40. Baird A, Samson S. Music and dementia. Prog Brain Res. 2015;217:207–35. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/bs.pbr.2014.11.028.

  41. Arroyo-Anlló EM, Díaz JP, Gil R. BioMed Research International. 2013;2013(null): 752965.

    PubMed  PubMed Central  Google Scholar 

  42. Thaut M, Hoemberg V. Handbook of neurologic music therapy 2014. null p.

  43. Caruso G, Perri R, Fadda L, Caltagirone C, Carlesimo GA. Recall and recognition in alzheimer’s disease and frontotemporal dementia. J Alzheimers Dis. 2020;77(2):655–66.

    Article  CAS  PubMed  Google Scholar 

  44. Hayhurst CJ, Marra A, Han JH, Patel MB, Brummel NE, Thompson JL, et al. Association of hypoactive and hyperactive delirium with cognitive function after critical illness. Crit Care Med. 2020;48(6):e480–8.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Ghooi RB, Bhosale N, Wadhwani R, Divate P, Divate U. Assessment and classification of protocol deviations. Perspect Clin Res. 2016;7(3):132–6.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Chester JG, Beth Harrington M, Rudolph JL. J Hosp Med. 2012;7(5):450.

    Article  PubMed  Google Scholar 

  47. Han JH, Vasilevskis EE, Schnelle JF, Shintani A, Dittus RS, Wilson A, et al. Acad Emerg Med Off J Soc Acad Emerg Med. 2015;22(7):878.

    Article  Google Scholar 

  48. Ely EW, Truman B, Shintani A, Thomason JWW, Wheeler AP, Gordon S, et al. JAMA. 2003;289(22): 2983.

    Article  PubMed  Google Scholar 

  49. Breitbart W, Rosenfeld B, Roth A, Smith MJ, Cohen K, Passik S. J Pain Symptom Manage. 1997;13(3):128.

    Article  CAS  PubMed  Google Scholar 

  50. Polderman KH, Smit E. Dealing with the delirium dilemma. Crit Care. 2005;9(4):335–6.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Han JH, Brummel NE, Chandrasekhar R, Wilson JE, Liu X, Vasilevskis EE, et al. Exploring Delirium’s Heterogeneity: Association Between Arousal Subtypes at Initial Presentation and 6-Month Mortality in Older Emergency Department Patients. Am J Geriatr Psychiatry. 2017;25(3):233–42.

    Article  PubMed  Google Scholar 

  52. Szpunar KK, Schellenberg EG, Pliner P. Journal of experimental psychology learning. Mem Cognit. 2004;30(2):370.

    Article  Google Scholar 

  53. Bull MJ, Boaz L, Jermé M. Worldviews on Evidence-Based Nursing. 2016;13(3):232.

    Article  PubMed  Google Scholar 

  54. Ghezzi ES, Greaves D, Boord MS, Davis D, Knayfati S, Astley JM, et al. How do predisposing factors differ between delirium motor subtypes? A systematic review and meta-analysis. Age Ageing. 2022;51(9):afac200.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Smit L, Wiegers EJA, Trogrlic Z, Rietdijk WJR, Gommers D, Ista E, et al. Prognostic significance of delirium subtypes in critically ill medical and surgical patients: a secondary analysis of a prospective multicenter study. J Intensive Care. 2022;10(1):54.

    Article  PubMed  PubMed Central  Google Scholar 

  56. O’Keeffe ST, Lavan JN. Clinical significance of delirium subtypes in older people. Age Ageing. 1999;28(2):115–9.

    Article  CAS  PubMed  Google Scholar 

  57. Whitehead AL, Julious SA, Cooper CL, Campbell MJ. Estimating the sample size for a pilot randomised trial to minimise the overall trial sample size for the external pilot and main trial for a continuous outcome variable. Stat Methods Med Res. 2016;25(3):1057–73.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

We wish to thank Oslo University Hospital (OUH) for being the main sponsor and the host of the project and the Norwegian Academy of Music (NMH) for being the collaborating institution and providing funding for JG’s PhD project of which this feasibility study is a part. We also acknowledged the Centre for Research in Music and Health (CREMAH) at NMH and the Director, Prof. Karette Stensæth, for support and valuable discussions during the process. We are particularly grateful to Prof. Torgeir Bruun Wyller, Prof. Leiv Otto Watne, and other members of Oslo Delirium Research for generous and constructive feedback and continuous support, as well as Dr. Marc Ahmed, Dr. Nina Ommundsen, along with all the assessors/geriatricians, patients and their families from the Acute Geriatric Section (OUH).

Funding

This study received no external funding and was financed through JG’s PhD Fellowship.

Author information

Authors and Affiliations

Authors

Contributions

All authors meet the criteria for authorship stated in the Uniform Requirements for Manuscripts Submitted to Biomedical Journals, have read the last draft of the manuscript and agree with the findings presented. J.G., B.E., F.A.B., and M.R.S. developed the protocol. F.B. generated random allocation sequences, and J.G. randomized the participants, performed the music preference assessments and interventions, and recorded videos for treatment fidelity (TF) evaluation. J.G. monitored the data collection process. J.G. assessed and analysed all the feasibility outcomes. J.G. randomly selected 20% of the participants who completed three sessions and for whom videos were available for TF evaluation using a previously generated randomization sequence. K.J. evaluated video recordings and scored treatment delivery checklists, and J.G. calculated percentages of TF success. J.G. plotted pre-post delirium data in SPSS, M.R.S. converted some of the data to STATA files, and B.E.N. collected and plotted the medical variables from the electronic journals. Based on the collected pre-post intervention data, B.E.N. completed summary evaluations of delirium, and J.G. plotted the new variable into the SPSS files. M.R.S. developed a statistical analysis plan and performed the analyses of the pre-post delirium data (mixed linear regression models); J.G. performed the analysis of all the baseline measures and some of the clinical outcomes (length of hospital stay, medication intake). JG wrote the manuscript with F.A.B., B.E.N., M.R.S., and K.J. contributions. All the authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Jelena Golubovic.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was obtained from the Regional Ethics Committee in Norway (REK 457017). The ability to give informed consent was individually evaluated for each participant, and the consents were obtained by the experienced physicians, either directly from the patient, or supplemented or replaced by the consents from the legally assigned representatives (LAR) if the patient’s ability to consent was reduced. Three different consent forms informing of the study objectives personal data collection methods, benefits and potential risks, data management and the right to withdraw from the study were used. Consents in written format were prioritized, but verbal consents via phone-call were also approved by REK in cases where LAR were not physically present, and the written confirmation was acquired at the first possible occasion.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Golubovic, J., Neerland, B.E., Simpson, M.R. et al. A randomized pilot and feasibility trial of live and recorded music interventions for management of delirium symptoms in acute geriatric patients. BMC Geriatr 25, 306 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12877-025-05954-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12877-025-05954-1

Keywords