| HSS Journal The Musculoskeletal Journal of Hospital for Special Surgery |
| © Hospital for Special Surgery 2007 |
| 10.1007/s11420-007-9066-z |
Alexander Brunner1
, Benjamin Ulmar1
, Heiko Reichel1
and Ralf Decking1 
| (1) | Department of Orthopaedics, University of Ulm, Oberer Eselsberg 45, Ulm, 89081, Germany |
![]() |
Alexander Brunner Email: a-r.brunner@web.de |
![]() |
Benjamin Ulmar Email: benjamin.ulmar@uni-ulm.de |
![]() |
Heiko Reichel Email: heiko.reichel@uniklinik-ulm.de |
![]() |
Ralf Decking (Corresponding author) Email: ralf.decking@uni-ulm.de |
Received: 7 November 2007 Accepted: 12 November 2007 Published online: 18 December 2007
Key words hip - dysplasia - prosthesis - classification
Untreated developmental dysplasia of the hip (DDH) is one of the most common reasons for secondary coxarthrosis in female adult patients [1]. Radiological signs of dysplasia have been described in 25% of all patients being treated by total hip replacement (THR) [2]. The severity of dysplasia may range from slight hypoplasia of the acetabulum with only little lateral displacement of the joint center to an insufficient acetabular roof to contain the femoral head leading to its complete dislocation. In its milder forms with only mild subluxation, the shallow acetabulum commonly has a wide opening and is oval in shape. In cases of a high dislocation of the hip, the femoral head does not articulate with the original acetabulum, which is often rudimentary [3].
Focusing on the femoral side of dysplasia, a coxa valga deformity with posterior translation of the greater trochanter and excessive anteversion of the femoral neck is common in almost all subluxed hip joints. In severe forms of femoral dysplasia, the femur is small with a narrow medullary canal and thin, ecstatic cortical bone [4].
Because of these morphologic abnormalties, THR in the setting of developmental dysplasia of the hip can be technically demanding [4, 5]. When the dysplasia is mild, only minimal modification to the surgical technique of total hip arthroplasty is required [4]. In more sever cases, however, the operation is substantially more complex, and special techniques to restore an anatomic location of the femoral head and to avoid excessive lengthening of the leg and inadvertent sciatic nerve injury are required. Furthermore, with the anatomic femoral abnormalities, special implants may be required to deal with them [4, 6]. A number of studies showed [7–13] that THR in the setting of developmental dysplasia has higher rates of complications and inferior outcomes when compared with THR performed for primary osteoarthritis. Moreover, the results depend on the severity of dysplasia, and patients with lower grades of dysplasia tend to have better results than patients with higher grades of dysplasia [14], so interpretation of long-term results after THR requires detailed information about the degree of dysplasia.
In type A, the acetabulum is slightly elongated and appears dysplastic, accommodating a flattened mushroom-shaped femoral head. Eftekhar described type B and C as intermediate and high dislocations, respectively. The true acetabulum is poorly developed, but the floor is thick and easily identifiable following removal of fibrofatty tissue from the site of the true acetabulum. The lower border of the false acetabulum identifies the roof of the original acetabulum.
Type D describes an old, unreduced dislocation. The head actually has never been in contact with the ilium, so there is no pseudoacetabulum. The original site of the acetabulum can hardly be recognized. It only represents a narrow “isthmus” at the site of the old triradiate cartilage.
A number of studies [16, 18, 20–22] have used the classifications of Eftehkar and Kerboul to describe the preoperative anatomic situation before THR. The use of any radiologic classification system requires sufficient reliability, as measured by inter- and intraobserver correlation. Unfortunately, the measurement of reliability was not part of the original publications of Eftekhar and Kerboul [16, 18], and we have not been able to find any studies evaluating it. This study was performed to measure the inter- and intraobsever reliability of these two radiologic classification systems, to evaluate the comparability of data presented in the literature, and to see if these systems offer an advantage over other classifications used.
From the databases of two orthopedic hospitals (Orthopaedic Department of the University of Ulm, Germany, and the Orthopaedic Department of St. Francis Hospital, Muenster, Germany), a total of 96 patients within three consecutive years were identified with the preoperative diagnosis “coxarthrosis secondary to developmental dysplasia of the hip” who had been scheduled for THR. In 69 of these patients (48 women and 21 men), preoperative anteroposterior pelvis x-ray films could be obtained. Twenty of the these 69 patients showed a bilateral dislocation. We reviewed the preoperative anteroposterior pelvic x-ray films of these patients. In three patients, the x-ray films where underexposed and unsuitable for evaluation, which left 86 dislocated hips on 66 anteroposterior pelvic x-ray films (46 women and 20 men) for review and analysis. The mean age of these patients was 54 years (range: 17–83 years).
To consider the possible effect of the observers’ experience on classifying the hips, we chose three observers with different levels of orthopedic training and experience (one fellowship-trained consultant specializing in joint replacement, one fourth-year resident, and one first-year resident). The three observers made themselves familiar with the original publications of Eftekhar and Kerboul [16, 18]. Prior to the study, the observers were trained on the nuances and subtleties of the classification systems by reviewing ten x-rays as a group. In accordance with HIPPA regulations, all identifying marks were removed from the films, which were then sorted, labeled, and given to the observers in a random order by one of the investigators who was not an observer. All x-rays were graded by each observer in separate rooms and in random order first according to the Eftekhar system and later according to the criteria of Kerboul et al. Each observer reviewed the x-rays again 3 months later. The order of the x-rays was randomized again to prevent possible recall.
Statistical analysis was performed by calculating the weighted kappa correlation coefficient according to Cicchelli and Allison [23], as well as the overall kappa coefficient according to Fleiss [24]. Interpretation of the kappa coefficients was performed using the criteria of Landis and Koch [25]. They defined a kappa coefficient of more than 0.8 as excellent, between 0.6 and 0.8 as good (exceeding chance), between 0.4 and 0.6 as moderate, and less than 0.4 as poor correlation.



In four cases (4.7%), all three raters found it extremely difficult to distinguish between Eftekhar types B and C in all measurements. In three cases (3.5%), observers found it difficult to differentiate between Kerboul types A and B.
These two radiologic classification systems showed sufficient inter- and intraobserver reliability for the use in scientific studies and in daily practice. Overall the Eftekhar classification demonstrated slightly better results than the system according to Kerboul et al.
In addition to reliability, for a classification system to be useful, it needs to be valid. This means that the classification system should provide an accurate measurement of what is actually being measured. In this case, the intraoperative findings would be the gold standard. This study did not measure the concordance between intraoperative findings and the radiographic ratings. This is a major weakness of this study, but it is also a weakness of the other radiographic classifications used today. To this point, the Eftekhar system has some potential downfalls that might limit its validity. It defines an intermediate dislocation of the femoral head as type B and a high dislocation as type C. Some dysplastic hips rated in this study fit neither type B nor type C as proposed by Eftekhar.
Similarly, the Kerboul system uses the anteroposterior position of the femoral head to grade the severity of hip dysplasia. In some cases, the observers found it extremely difficult to locate the anteroposterior displacement of the femoral head by using a standard pelvic x-ray. It might be possible that the “training session” had biased the three observers for an intraobserver-reliability with a mean kappa coefficient of 0.716. Hence, for use in daily practice, a cross-table lateral hip x-ray is necessary. In many hospitals, this is not available for retrospective studies and would furthermore increase the number of x-rays needed when using this classification prospectively. In addition, the anteroposterior location of the femoral head does not tell the surgeon much about the severity of the dysplasia, and may not have much relevance to surgical practice. As reported by several authors [3, 5, 14–17, 26, 27], proximal dislocation of the femur and severity of acetabular dysplasia are important factors that influence preoperative planning, operative procedure, and functional postoperative outcome after THR for DDH. The Kerboul classification gives insufficient information about any of these factors, and we were unable to localize any publication evaluating the influence of the femur’s anteriorposterior position on the severity of acetabular dysplasia or the height of dislocation.
Some authors [28, 29] have suggested that clinical experience might affect the interobserver agreement between two observers with different training levels. In the present study, different levels of clinical experience did not influence the interobserver reliability.
A number of other classification systems [15, 17, 19] have been published to grade osteoarthritis secondary to DDH. Most of the international literature is published using either the system of Crowe et al. [15] or the one of Hartofilakidis et al. [17]. Crowe et al. [15] defined a four-stage system classifying the degree of dislocation in terms of the percentage of proximal displacement of the femoral head in relation to the height of the pelvis, resulting in a calculated coefficient, which translates into one of the four types.
Hartofilakidis et al. [17] used the pathology of the dysplastic acetabulum to distinguish between three different types of dysplasia, discriminating between a primary and a secondary acetablum and the relation of the head in relation to these structures. Both systems have shown a better reproducibility than the systems of Eftekhar and Kerboul et al. when studied with the same protocol [30] used in this study. Interrater kappa-coefficients ranged from 0.75 to 0.88 (Crowe) and 0.68 to 0.84 (Hartofilakidis), and intrarater coefficients between 0.76 and 0.94 for the Crowe and 0.72 and 0.84 for the Hartofilakidis classification.
The methods of Eftekhar and Kerboul showed a sufficient inter- and intraobserver reliability, yet questions remain regarding their validity and regarding the relevance of the information obtained from the Kerboul classification. Furthermore, both of the systems have been used to a far lesser extent than the classifications by Crowe and Hartofilakidis, which also had a slightly better reproducibility. In addition, there are more data on their correlation regarding the outcome of THA according to the severity of dysplasia. With the classification of Crowe being increasingly popular, especially in the English literature, it may be the best choice for a classification system with the data available today.