178 research article Diagnostic accuracy of haemophilia early arthropathy detection with ultrasound (HEAD-US): a comparative magnetic resonance imaging (MRI) study Domen Plut12, Barbara Faganel Kotnik23, Irena Preloznik Zupan24, Damjana Kljucevsek23, Gaj Vidmar25, Ziga Snoj12, Carlo Martinoli6, Vladka Salapura1,2 1 Clinical Radiology Institute, University Medical Centre Ljubljana, Ljubljana, Slovenia 2 Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia 3 Division of Paediatrics, University Medical Centre Ljubljana, Ljubljana, Slovenia 4 Division of Internal Medicine, University Medical Centre Ljubljana, Ljubljana, Slovenia 5 University Rehabilitation Institute Republic of Slovenia, Ljubljana, Slovenia 6 School of Medicine and Pharmacy, University of Genoa, Genoa, Italy Radiol Oncol 2019; 53(2): 178-186. Received 21 February 2019 Accepted 25 April 2019 Correspondence to: Assist. Prof. Vladka Salapura, M.D., Ph.D., Clinical Radiology Institute, University Medical Centre Ljubljana, Zaloska cesta 7, SI-1000 Ljubljana, Slovenia. Phone: + 386 1 522 85 30; Fax: +386 1 522 24 97; E-mail: salapura@siol.net Disclosure: No potential conflicts of interest were disclosed. Background. Repeated haemarthroses affect approximately 90% of patients with severe haemophilia and lead to progressive arthropathy, which is the main cause of morbidity in these patients. Diagnostic imaging can detect even subclinical arthropathy changes and may impact prophylactic treatment. Magnetic resonance imagining (MRI) is generally the gold standard tool for precise evaluation of joints, but it is not easily feasible in regular follow-up of patients with haemophilia. The development of the standardized ultrasound (US) protocol for detection of early changes in haemophilic arthropathy (HEAD-US) opened new perspectives in the use of US in management of these patients. The HEAD-US protocol enables quick evaluation of the six mostly affected joints in a single study. The aim of this prospective study was to determine the diagnostic accuracy of the HEAD-US protocol for the detection and quantification of haemophilic arthropathy in comparison to the MRI. Patients and methods. The study included 30 patients with severe haemophilia. We evaluated their elbows, ankles and knees (overall 168 joints) by US using the HEAD-US protocol and compared the results with the MRI using the International Prophylaxis Study Group (IPSG) MRI score. Results. The results showed that the overall HEAD-US score correlated very highly with the overall IPSG MRI score (r = 0.92). Correlation was very high for the evaluation of the elbows and knees (r = 0.95), and slightly lower for the ankles (r = 0.85). Conclusions. HEAD-US protocol proved to be a quick, reliable and accurate method for the detection and quantification of haemophilic arthropathy. Key words: haemophilia; haemophilic arthropathy; HEAD-US; ultrasound; magnetic resonance imaging Introduction Intra-articular joint bleeds (haemarthroses) affect approximately 90% of patients with severe haemophilia.1 The most frequently involved joints are the ankles, knees, and elbows.2 Repeated episodes of intra-articular bleeding lead to progressive ar- thropathy, which is the main cause of morbidity in these patients.3 The prevention of the occurrence of haemarthrosis is therefore important for the prevention of the arthropathy. Small intra-articular bleeds may be unnoticed at physical examination and the detection of early signs of osteochondral damage is difficult by Radiol Oncol 2019; 53(2): 206-212. doi: 10.2478/raon-2019-0016 Plut D et al. / Haemophilia early arthropathy detection with ultrasound 179 clinical evaluation. It is known that osteochondral damage can be present in the joints that are asymptomatic and in which none or just a few bleeding episodes were previously recognized.45 These subtle articular changes of the subclinical disease can be detected by diagnostic imaging. Consequently, based on the diagnostic findings, appropriate treatment can be introduced or modified to prevent further disease progression and disability.6-11 Magnetic resonance imaging (MRI) is the modality of choice to evaluate the musculoskeletal system because of its excellent spatial and contrast resolution. By MRI, it is possible to detect disease specific findings and give an accurate visualization of early arthropathy changes. However, MRI is a modality of high cost, its time of examining is long, it is usually poorly accessible and as such, it is not suitable for multi-joint screening. Additionally, it requires sedation in young children.12 Ultrasound (US), with the advent of last generation equipment, has excellent spatial resolution for the superficial structures. By US, it is now possible to depict the small, superficial structures of the musculoskeletal system as present in the early stages of haemophilic arthropathy. Contrary to MRI, US has a low cost, the time of examining is short and it is widely accessible. The drawbacks for the use of US in musculoskeletal radiology are poor visualization of inner joint structures and lack of standardized evaluation and reporting. In the field of haemophilic arthropathy, the development of the standardized US protocol for the detection of early changes in haemophilic arthropathy (HEAD-US) by Martinoli et al. in 2013 opened new perspectives in the use of US in management of patients with haemophilia. The HEAD-US protocol and scoring method are rapid to perform and enable full screening of the six joints in a single study.6 The aim of the present study was to determine the diagnostic accuracy of the HEAD-US protocol and scoring method for the detection and quantification of haemophilic arthropathy in patients with haemophilia in comparison to MRI using the International Prophylaxis Study Group (IPSG) MRI scoring scale. Patients and methods Patients All patients were recruited at the Slovenian National Haemophilia Comprehensive Care Centre at the University Medical Centre Ljubljana. The inclusion criteria were age over 16 years, diagnosis of a severe haemophilia A or B and prophylactic treatment with factor concentrates. Exclusion criteria were non-cooperation and contraindications for the MRI examination. Patients with prosthetic joints were allowed to participate in the study, but the prosthetic joint was not evaluated. The study group included a total of 30 patients (age range 16 to 49, mean age 33) who were willing to participate and met the aforementioned criteria. In 23 patients, six joints (elbows, knees and ankles) were systematically examined by US and MRI according to the protocols. One out of six joints was not examined in six patients due to a prosthetic implant. Two joints were excluded from the evaluation in the patient who had left lower limb amputation. The elbows were excluded in two patients because MRI could not be performed due to patient discomfort. Overall, 168 joints were examined in this study: 59 elbows, 53 knees and 56 ankles. The clinical evaluation of the joints according to the hemophilia joint health score HJHS 2.1 was obtained by a trained haemathologist on the day of the imaging examinations. This prospective observational study was performed at a single tertiary center from June 2016 to March 2017. Research was conducted following the Helsinki Declaration. All patients included in the study provided a written informed consent for study participation. The National Medical Ethics Committee approved the study (Project number 70/11/15, approved on 11/21/2015). Diagnostic imaging In each patient, the US and MRI examinations were performed on the same day. Ultrasound US examinations were performed using a 13-5 MHz electronic linear-array transducer on a ProSound F75 scanner (Hitachi Aloka Medical, Ltd. Tokyo, Japan) by an experienced radiologist using the HEAD-US protocol and scoring method described elsewhere.6 The total scanning time per patient for all six joints combined was approximately 20 minutes. A series of images from 10 US examinations were reviewed and scored by another radiologist to determine the inter-rater reliability. This latter reviewer was blinded from the original scores of the examinations. Magnetic resonance imaging MRI was performed on a 3 Tesla unit (Achieva, Philips Healthcare, Eindhoven, The Netherlands). Radiol Oncol 2019; 53(2): 178-186. 180 Plut D et al. / Haemophilia early arthropathy detection with ultrasound 180 TABLE 1. Baseline characteristics of the study population Age: median; range (years) 33; 16-49 Age of start of prophylaxis: mean (years) age group: 0-9 (patient count) age group: 10-19 (patient count) age group: 20+ (patient count) 17.4 7 14 9 Duration of prophylaxis: mean (years) 15.4 Ankles Knees Elbows Right Left Right Left Right Left No. of joints 30 29 25 28 28 28 No. of lifetime joint bleeds: 0-5 (joint count) 4-20 (joint count) > 20 (joint count) Unknown (joint count) 5 5 12 15 14 13 12 11 7 10 4 2 13 12 5 3 9 12 0 1 1 0 1 1 HJHS 2.1 score: mean; max* 3.3; 12 2.6; 11 1.4; 7 1.2; 8 1.9; 9 1.9; 8 * Minimum was 0 for all the scores Statistical analysis Descriptive statistics were obtained to describe characteristics of the study group. We checked the inter-rater reliability of HEAD-US and MRI for the total scores using intra-class correlation (two-way mixed model, ICC(2,1)) and for all the sub-scores using Cohen's kappa statistics (with quadratic weights). We analyzed the agreement between HEAD-US and MRI using the Pearson correlation coefficient (r) for the total score and separately for the hypertrophic synovium, cartilage degradation, and bone changes (we could not use agreement coefficients because all those scores derive from different scales for HEAD-US and MRI). Regarding the agreement between HEAD-US and MRI for the cartilage degradation, we used Cohen's Kappa (with quadratic weights), because both scores are based on the same (0-4) scale. Agreement was illustrated using the concordance bubble plots.14 An 8 elements phased array SENSE knee coil was used for the knee imaging, an 8 elements phased-array SENSE foot-ankle coil for the ankle imaging, and two 2 elements phase-array SENSE flex coils for the elbow imaging. The protocol included 3D T2*-weighted water selective gradient echo sequence (FOV, 160x160x108mm; voxel size, 0.58x0.58x0.50mm; flip angle: 15°; TE, 9.2/6.1ms; TR, 26ms), and 3D proton density (PD) weighted turbo spin echo sequence (FOV, 160x160x161mm; voxel size: 0.52x0.52x0.52mm; TE, 33ms; TR, 1000ms). The total scanning time was approximately 15 minutes per joint. In each patient, all joints were scanned in a single session for a total examination time extending up to two hours. After examining each joint, the patient was encouraged to stretch the body while the coils for the imaging of the next joint were setup. As mentioned, two MRI examinations were incomplete due to patient discomfort. All the MRI examinations were scored according to the International Prophylaxis Study Group (IPSG) MRI scale described elsewhere.13 The scoring was performed by an experienced musculoskeletal radiologist who was blinded regarding the results of the HEAD-US examinations. Additionally, the datasets of 10 MRI examinations were reviewed and scored by another experienced musculoskel-etal radiologist to determine the inter-rater reliability. This latter reader was blinded from the original IPSG scores of the MRI examinations and from the HEAD-US scores. Results Baseline characteristics of the study group are shown in Table 1. In our series, all patients underwent prophylactic treatment for haemophilia: 7 patients started therapy before the age of 10 years, 14 patients between 10 and 19 years, and 9 patients after the age of 20 years. The mean age at which prophylactic treatment was started was 17.4 years and the mean duration of the prophylaxis was 15.4 years. In our series, the disease presentation was quite variable with a mean HJHS 2.1 score of 2.3 (range 0-12). HJHS scores were the highest in the ankle and the lowest in the knee, and correlated well with the lifetime number of joint bleeding episodes. The ankles were the joints with the most often recorded history of prior bleeds: 42% of examined ankles had >20 prior lifetime bleeds recorded. The knees were the least affected joints, with 51% of the examined knees having <5 prior lifetime bleeds recorded. Inter-rater reliability The inter-rater reliability of the interpretation was excellent for the US examinations (ICC values 0.960-0.996 for total score, median k across subscores 1.000) and for the MRI (ICC values 0.9570.990 for total score, median k across sub-scores 0.815). Radiol Oncol 2019; 53(2): 178-186. Plut D et al. / Haemophilia early arthropathy detection with ultrasound 90 Diagnostic accuracy of ultrasound HEAD-US scores were correlated with the IPSG MRI scores; results are shown in Table 2. A high overall correlation was found between the scores (r « 0.92). Correlation for the overall scores at the joint level was nearly perfect in the elbows and knees (r « 0.95) and slightly lower, but still very high in the ankles (r « 0.85). Separate evaluation of each parameter of the joint (synovial hypertrophy, cartilage degradation, bone changes) showed a mediumhigh to high agreement for all the parameters. The correlation between the HEAD-US and MRI scores was the lowest for the evaluation of the synovium hypertrophy and cartilage degradation at the ankle level (r « 0.55). All other parameters showed a high agreement between the methods (r > 0.70). Concordance bubble plot for agreement between the HEAD-US and MRI scores at all three joint levels is shown in Figure 1. The distribution of circles within the plots demonstrates the variable degree of haemophilic arthropathy presentation in our study group at all joint levels. The plots also explicitly demonstrate the high overall correlation between the HEAD-US and MRI scores. The biggest deviation from the perfect line is shown at the ankle level, in the ankles with higher degree of hae-mophilic arthropathy. In our series there were 42 joints with no haemo-philic arthropathy, that are the joints scored with 0 by the IPSG MRI scoring system: 19 elbows, 20 knees and 3 ankles. In 35 of those joints the HEAD-US score was also 0. An example of a perfect concordance between the US and MRI examination for a knee with no haemophilic arthropathy is shown in Figure 2. In 7 joints with the IPSG MRI score 0 the HEAD-US score was 1. These HEAD-US examinations are false positives for the presence of TABLE 2. Correlation between the HEAD-US and IPSG MRI scores Elbows Knees Ankles All joints 0.921 Overall score (r) 0.949 0.941 0.838 Detailed scores: Synovial hypertrophy (r) 0.840 0.710 0.561 Cartilage degradation (r) 0.734 0.812 0.537 Bone changes (r) 0.883 0.741 0.725 Notes: all the reported correlations are statistically significant (p<0.001); the values for the elbows, knees and ankles are the averages over the right and left side values (the differences between them were negligible); the correlations are averaged using Fisher-z transformation. haemophilic arthropathy. The false positive rate was 16.7%, which means specificity of HEAD-US to diagnose haemophilic arthropathy in our study was 83.3%. Detailed evaluation of the false positive examinations reveals that the findings diagnosed by US and not confirmed by MRI were: mild syn-ovium hypertrophy in one elbow and two knees, small cartilage defect in two elbows and one ankle, and a small osteophyte in one knee. Conversely, there were 6 joints that were scored with 0 by HEAD-US and scored positive by the IPSG MRI scoring. These HEAD-US examinations are the false negatives for the presence of haemo-philic arthropathy. The false negative rate was 4.8%, which means the sensitivity of HEAD-US to diagnose haemophilic arthropathy in our study was 95.2%. Detailed evaluation of the false negative examinations reveals that the findings missed by US were: a cartilage defect at the tibial side of the talocrural joint (Figure 3), a small synovium hypertrophy in the posterior recess in another ankle, two small cartilage defects at the ulnar side of the joint in elbows, and two small osteochondral lesions at the ulnar side of the joint in another two 8 ■ S V) ■D 4 <0 01 X 2 h Elbows ° oooo o OOo o o o OOO o o o o o O P ) o o 8 ■ 6 tn i*