Quantitative Ultrasound for Measuring Obstructive Severity in Children with Hydronephrosis

Quantitative Ultrasound for Measuring Obstructive Severity in Children with Hydronephrosis

Quantitative Ultrasound for Measuring Obstructive Severity in Children with Hydronephrosis Juan J. Cerrolaza,* Craig A. Peters,* Aaron D. Martin, Emma...

724KB Sizes 12 Downloads 96 Views

Quantitative Ultrasound for Measuring Obstructive Severity in Children with Hydronephrosis Juan J. Cerrolaza,* Craig A. Peters,* Aaron D. Martin, Emmarie Myers, Nabile Safdar and Marius George Linguraru From Sheikh Zayed Institute for Pediatric Surgical Innovation (JJC, CAP, ADM, EM, MGL), and Division of Urology and Pediatrics (CAP, ADM), Children’s National Health System, and Departments of Radiology and Pediatrics, School of Medicine and Health Science, George Washington University (MGL), Washington, D.C., and Department of Radiology and Imaging Sciences, Emory University School of Medicine, Atlanta, Georgia (NS)

Purpose: We define sonographic biomarkers for hydronephrotic renal units that can predict the necessity of diuretic nuclear renography. Materials and Methods: We selected a cohort of 50 consecutive patients with hydronephrosis of varying severity in whom 2-dimensional sonography and diuretic mercaptoacetyltriglycine renography had been performed. A total of 131 morphological parameters were computed using quantitative image analysis algorithms. Machine learning techniques were then applied to identify ultrasound based safety thresholds that agreed with the t1/2 for washout. A best fit model was then derived for each threshold level of t1/2 that would be clinically relevant at 20, 30 and 40 minutes. Receiver operating characteristic curve analysis was performed. Sensitivity, specificity and area under the receiver operating characteristic curve were determined. Improvement obtained by the quantitative imaging method compared to the Society for Fetal Urology grading system and the hydronephrosis index was statistically verified. Results: For the 3 thresholds considered and at 100% sensitivity the specificities of the quantitative imaging method were 94%, 70% and 74%, respectively. Corresponding area under the receiver operating characteristic curve values were 0.98, 0.94 and 0.94, respectively. Improvement obtained by the quantitative imaging method over the Society for Fetal Urology grade and hydronephrosis index was statistically significant (p <0.05 in all cases). Conclusions: Quantitative imaging analysis of renal sonograms in children with hydronephrosis can identify thresholds of clinically significant washout times with 100% sensitivity to decrease the number of diuretic renograms in up to 62% of children. Key Words: biological markers, hydronephrosis, kidney, technetium Tc 99m mertiatide, ultrasonography

ULTRASOUND is the mainstay of imaging for pediatric hydronephrosis yet is limited by its subjectivity and lack of apparent correlation with functional imaging modalities.1 To improve the clinical usefulness of sonography for hydronephrosis, we

Abbreviations and Acronyms D ¼ dimensional DR ¼ diuretic renography HI ¼ hydronephrosis index LOG ¼ logistic regression analysis MAG3 ¼ mercaptoacetyltriglycine QI ¼ quantitative imaging SFU ¼ Society for Fetal Urology SVM ¼ support vector machine US ¼ ultrasound Accepted for publication October 18, 2015. No direct or indirect commercial incentive associated with publishing this article. The corresponding author certifies that, when applicable, a statement(s) has been included in the manuscript documenting institutional review board, ethics committee or ethical review board study approval; principles of Helsinki Declaration were followed in lieu of formal ethics committee approval; institutional animal care and use committee approval; all human subjects provided written informed consent with guarantees of confidentiality; IRB approved protocol number; animal approved project number. Supported by a grant from the government of Abu Dhabi. * Correspondence: Sheikh Zayed Institute for Pediatric Surgical Innovation, Children’s National Health System, 111 Michigan Ave., NW Washington, D.C. 20010 (e-mail: jcerrola@childrensnational. org, [email protected]).

developed a semiautomatic quantitative imaging analysis algorithm to objectively describe hydronephrosis and to define ultrasound based thresholds of obstruction below which diuretic renography can be safely avoided.2 Quantitative imaging

0022-5347/16/1954-1093/0 THE JOURNAL OF UROLOGY® Ó 2016 by AMERICAN UROLOGICAL ASSOCIATION EDUCATION AND RESEARCH, INC.

http://dx.doi.org/10.1016/j.juro.2015.10.173 Vol. 195, 1093-1099, April 2016 Printed in U.S.A.

www.jurology.com

j

1093

1094

QUANTITATIVE ULTRASOUND FOR PEDIATRIC HYDRONEPHROSIS

techniques rely on analysis of digital images and creation of normative models of shape and appearance to which a test case is compared. The test case can then be defined, based on probability, as being within a specific range of values different from the norm. By using machine learning to correlate patterns of hydronephrosis with the diuretic renography parameters, in this case t1/2 washout, the correlation between structure and function can be defined and used to categorize subsequent test cases. With the primary goal being to identify ultrasound derived thresholds of safety for hydronephrotic renal units where diuretic renography can be avoided based on washout times, we applied quantitative imaging techniques to a cohort of pediatric patients with varying degrees of hydronephrosis in whom routine ultrasonography and diuretic renography had been performed. The quantitative imaging algorithm was then assessed as a predictor of the washout parameters of diuretic renography. Currently the most widely used instrument for grading hydronephrosis sonographically is the SFU system,3 although it is not universally accepted as useful.4,5 Shapiro6 and Venkatesan7 et al proposed a quantitative measure of hydronephrosis severity, the HI, defined as the ratio of the collecting system area to the total area of the collecting system and renal parenchyma. However, the practical use of this index remains unclear.8 While different clinicians use differing thresholds for selecting patients in whom to perform DR and for defining clinically significant obstruction, it would seem valuable to perform DR only in those in whom it is likely to reveal clinically relevant parameters for intervention or close followup. We hypothesized that hydronephrosis can be objectively quantified sonographically using modern imaging analysis, and that there is a correlation between degree of hydronephrosis and severity of obstruction as measured by DR. This correlation likely is not linear, but demonstrates a threshold of dilatation above which clinically significant drainage abnormalities are detectable on DR. If this relationship can be defined, then quantitative assessment of degree of hydronephrosis on US could be used to selectively identify which patients have more severe obstruction (by DR parameters) and should undergo DR, and which patients have less severe hydronephrosis and should forgo DR.

MATERIALS AND METHODS Study Population Under an internal review board approved protocol we retrospectively selected 50 consecutive patients (32 males

and 18 females) with hydronephrosis who had undergone concurrent renal 2-D ultrasound and DR (MAG3 scan). Mean patient age was 9.6 months (median 4.0, range 0 to 168). Of the cases 21 were left sided, 29 were right sided and 8 were bilateral. Cases involving vesicoureteral reflux, ureterovesical junction obstruction, ureterocele or hydroureter/megaureter were excluded to avoid confounding factors. Ultrasound acquisition was performed using the PhilipsÒ iU22 (C2-9-D transducer) and GEÒ Logiq E9 (C8-5 transducer) scanners between July 2012 and January 2013. Images were identified that contained the closest to a full coronal view that included the renal pelvis and calyces. MAG3 scan was performed using a standardized method described previously.9,10 All patients underwent DR based on clinician preference, and no study was postoperative.

Quantitative Imaging Analysis Manual segmentation of the renal parenchyma and collecting system was performed to create a cohort for program calibration and algorithm development. This was accomplished by manually drawing the best line around the perimeter of the renal parenchyma and the collecting system (using the ITK-Snap Medical Image Segmentation Tool, http://www.itk-snap.org), excluding the extrarenal pelvis. The intrarenal pelvis was defined as the area falling within a line drawn around the hilar curve of the parenchyma (fig. 1). All segmentations were reconciled with 2 observers (pediatric radiologist and urologist) and submitted for QI analysis. The same cohort underwent grading using the SFU classification by a pediatric radiologist and a pediatric urologist. A total of 131 morphological parameters were computed (fig. 2 and supplementary table, http://jurology. com/). These descriptors can be intuitively grouped into 3 categories, ie size, geometric shape and curvature. Size descriptor parameters describe the size of the collecting system and renal parenchyma, such as relative area (eg HI) and perimeter, depth of calyces, and thickness of parenchyma. Geometric shape descriptor parameters include such factors as circularity ratio, eccentricity of the circumscribed ellipse and convexity. Curvature descriptors are parameters that describe the local curvature of the structures under analysis (ie kidney and collecting system).11 These quantitative shape features were used as potential predictive variables to evaluate the diagnostic performance of 2 different supervised classification models, the LOG12 and SVM with nonlinear kernel.13 Both approaches have been widely used in the field of computer-aided diagnosis with satisfactory results.14 These 2 classification approaches were applied to the test cohort to compute US based specific thresholds to identify patients with a t1/2 above different thresholds, ie 20, 30 and 40 minutes, since not all clinicians agree as to the clinically relevant DR thresholds. These cutoffs represent commonly used clinical thresholds.15 The design parameters of the model were adjusted to maximize the sensitivity of detecting severe cases of hydronephrosis, that is no case with t1/2 above the threshold was misclassified. A best fit model was then derived for each threshold using optimal morphological parameters

QUANTITATIVE ULTRASOUND FOR PEDIATRIC HYDRONEPHROSIS

1095

Figure 1. 2-D ultrasound slice selection and segmentation. A, selected slice containing whole longitudinal section of kidney. B, consensus manual delineation of renal parenchyma and collecting system.

and SVM or LOG to categorize the renal units and severity of obstruction. To compare this QI method to the current SFU grading system in selecting cases where DR was to be performed, we compared SFU grade 3 and above or only SFU grade 4 as the criterion for performing DR. Sensitivity and specificity were determined, as well as ROC curve analysis with AUC calculation. A similar assessment was made using the previously described HI. The proposed framework was implemented in MATLABÒ, using a 64-bit 2.8 GHz processor, with an average execution time of 12 seconds per case.

Statistical Analysis The cohort data were used to assess the performance of the QI algorithm described previously,2 using leaveone-out cross-validation methodology.16 From the 131 morphological features computed for each case an optimal subset of predictive variables was selected for each t1/2 threshold using the feature selection technique proposed by Cai et al.17 The significance of the improvement obtained by the QI method over the SFU grade and HI was statistically verified using the McNemar test.18 Additionally the interobserver variance for SFU grading was assessed using the Fleiss kappa statistic. In the absence of previous studies that related to this topic to provide an estimation of the expected effect we performed a retrospective power analysis using a 2-tailed t-test over the output probability provided by the

classifiers, SVM and LOG (significance level ¼ 0.05). Effect sizes were calculated using the Hedges formulation for means differences.19

RESULTS The retrospective power analysis of the cohort of 50 patients provided a power value of greater than 0.99 in all instances. SFU grades ranged from 1 to 4 (median 3). Median differential uptake on MAG3 scan was 48% (range 14% to 100%). Median t1/2 was 17 minutes (range 3 to greater than 120). Differential uptake was less than 40% in 7 patients and less than 30% in 3. All patients with differential uptake less than 30% would have been identified with a t1/2 threshold of 30 minutes or greater. Two patients with differential uptake less than 40% and greater than 30% had washout times of less than 20 minutes (table 1), suggesting a nonobstructive cause of the reduced uptake. The best performance of the QI method was obtained by means of the SVM classification system for thresholds of 20 and 30 minutes, while LOG provided better performance for a threshold of 40 minutes. Because the design parameters of the model were defined to maximize the sensitivity of detecting cases of hydronephrosis over a specified

Figure 2. Morphological descriptors of renal unit. Sample illustrates morphological descriptors considered as potential predictive features of hydronephrosis severity. ab, eccentricity of circumscribed ellipse. Pmax, maximum parenchymal thickness. Pmin, minimum parenchymal thickness. PKMAmax, kidney medial axis and point of maximum curvature. PKMAleft, left axis of kidney. PKMAright, right axis of kidney. Arrows indicate curvature measures of collecting system and kidney surface. Relative area of collecting system and parenchyma is shown.

QUANTITATIVE ULTRASOUND FOR PEDIATRIC HYDRONEPHROSIS

1096

19 14 8

that obtained using SFU grade when an SFU grade of 3 or 4 was used to refer patients for MAG3 imaging. For the 3 different t1/2 thresholds under study the QI algorithm maximizes the reduction in MAG3 scan within the total cohort, while no critical case with severe obstruction was missed.

14 18 18

DISCUSSION

Table 1. Cases by category No. Cases 1

T /2 threshold: 20 Mins or greater 30 Mins or greater 40 Mins or greater SFU grade: 2 3 4

DR parameter, no case with t1/2 above the threshold was misclassified (100% sensitivity). For the 3 thresholds considered, ie 20, 30 and 40 minutes, and at 100% sensitivity the specificities of the method were 94%, 70% and 74%, respectively. Compared to using the SFU grading system to select patients for MAG3 scan (table 2) or the HI, this approach provides greater specificity while maintaining maximum sensitivity. Figure 3 illustrates the ROC curves for the quantitative imaging, SFU and hydronephrosis index techniques for selecting patients with t1/2 greater than 30 minutes. Using either hydronephrosis index or SFU grading system showed an AUC of 0.78, which was significantly less robust than the 0.94 AUC using the SVM based quantitative imaging method (p <0.001 and p ¼ 0.04, respectively). Similar results were obtained for washout threshold values of 20 and 40 minutes. Quantitative imaging performed significantly better than hydronephrosis index (p <0.001 for either threshold) and significantly better than the SFU grading system (p <0.001 for t1/2 of 20 minutes and p ¼ 0.05 for t1/2 of 40 minutes). The interobserver reliability for SFU grading between a pediatric radiologist and a pediatric urologist revealed a Fleiss kappa statistic of 0.378, indicating fair agreement.20 Table 3 outlines the potential reduction in MAG3 scanning using the QI method, compared to

Using the QI algorithm with machine learning in pediatric patients with hydronephrosis, US can be used to predict which patients will have clinically relevant increased washout times on DR. Applying this technique to children with hydronephrosis, particularly those with prenatally diagnosed disease, could safely decrease the need for invasive and ionizing DR. Clinicians would be able to select thresholds of safety that they find acceptable. These results have several potentially useful implications. These observations demonstrate that renal structure and function, specifically drainage, are related. Being able to use ultrasound images to infer functional data would offer decreased invasiveness, complexity and cost, with no reduction in safety and acceptable reduction in accuracy. This system appears to be a useful clinical tool to determine which patients would benefit from DR over using the SFU scoring system alone. While this approach does not eliminate the need for DR, it permits more specific application of a costly, complex, semi-invasive imaging test with radiation exposure. Application of this modality would be most useful in infants with prenatally detected hydronephrosis, in whom there remains controversy regarding the need for imaging beyond the postnatal US. There are currently no clearly defined guidelines based on objective metrics or clinical outcomes that aid in the selection of patients for DR. Of course, interpretation of the DR remains controversial, as is the decision regarding the need and timing for surgical intervention. This study

Table 2. QI vs SFU grade or HI to identify washout time thresholds in patients with hydronephrosis % Sensitivity (95% CI)

% Specificity (95% CI)

% Pos Predictive Value (95% CI)

% Accuracy (95% CI)

% AUC (95% CI)

100 (82e100) 100 (82e100) 100 (82e100)

94 (79e99) 0 (0e11) 52 (33e70)

90 (70e99) 38 (25e53) 56 (38e73)

96 (86e99) 38 (25e53) 70 (55e82)

98 (92e100) 74 (57e83) 77 (58e86)

100 (77e100) 100 (77e100) 100 (77e100)

70 (52e84) 39 (23e56) 47 (30e64)

56 (35e75) 39 (23e56) 42 (25e61)

78 (64e88) 56 (41e70) 62 (47e75)

94 (83e98) 78 (62e86) 78 (55e88)

100 (63e100) 100 (63e100) 100 (63e100)

74 (58e86) 33 (20e50) 62 (46e76)

42 (20e66) 22 (10e39) 33 (16e55)

78 (64e88) 44 (30e59) 68 (53e80)

94 (81e98) 88 (56e90) 80 (46e93)

1

T /2 greater than 20 mins: QI (SVM) SFU grade 2 HI T1/2 greater than 30 mins: QI (SVM) SFU grade 3 HI T1/2 greater than 40 mins: QI (LOG) SFU grade 3 HI

For each case (t1/2 greater than 20, 30 or 40 minutes) SFU grade that resulted in 100% sensitivity was used. Optimized threshold of HI was used for prediction of t1/2 greater than 20, 30 or 40 minutes. Accuracy equals true positive þ true negative/total positive þ total negative. Due to limited number of cases, exact 95% confidence interval is provided (using binomial distribution).

QUANTITATIVE ULTRASOUND FOR PEDIATRIC HYDRONEPHROSIS

Figure 3. ROC curves for quantitative imaging (SVM and LOG methods), SFU grading and hydronephrosis index methods for selecting patients with t1/2 greater than 30 minutes. Sens., sensitivity. Spec., specificity. Acc., accuracy.

cannot answer those questions, but it can provide more objective criteria from ultrasonographic imaging that could support prospective studies of congenital obstruction. Objective prediction of clinically significant hydronephrosis in children has been an ongoing challenge. Erickson et al reported that SFU grade 3 has never been associated with reduced differential uptake in pediatric kidneys.21 However, the dividing line between SFU grades 3 and 4 is not clearly defined or reproducible.4 A recent report by Dias et al presented the use of renal pelvic dilatation as highly predictive of the need for surgical intervention, based on decreased function on dimercaptosuccinic acid scanning and not DR.22 This approach used fetal and postnatal anteroposterior diameter of the renal pelvis, with thresholds of 18 mm for fetal and 16 mm for postnatal measures. This approach was not statistically Table 3. Decrease in MAG3 scanning possible using QI vs SFU grade Threshold

QI

SFU Grade 3

SFU Grade 4

21 58 0

36 28 5

18 64 42

25 50 0

36 28 0

18 64 29

19 62 0

36 28 0

18 64 13

1

T /2 greater than 20 mins: No. MAG3 scans % Reduction from cohort % Missed critical cases 1 T /2 greater than 30 mins: No. MAG3 scans % Reduction from cohort % Missed critical cases 1 T /2 greater than 40 mins: No. MAG3 scans % Reduction from cohort % Missed critical cases

1097

different from SFU selection using grades 3 and 4, which we have shown performs significantly less well than the QI approach presented in this report (p <0.05 for all t1/2 thresholds). The criterion for the study by Dias et al was need for pyeloplasty based on nonstandard criteria, rather than defining a threshold for more precise imaging.22 Several investigators have measured renal parenchymal area to assess renal status in vesicoureteral reflux and hydronephrosis,23,24 and recently in posterior urethral valves.25 Renal parenchymal volume has been measured using 3-D ultrasound to permit estimation of relative function.26 These approaches have focused primarily on renal function rather than severity of obstruction yet indicate the potential usefulness of ultrasound in estimating relative renal function. To our knowledge, this is the first study to relate renal morphological features from US with clinical parameters of DR. In a previous study proposed by Kim et al a clinical cohort of 46 patients was evaluated.5 However, no study of the power of the proposed HI as a meaningful indicator of renal function was presented. We used a data cohort of 50 patients, which provided a retrospective power value of greater than 0.99. There are several limitations to this pilot study that will be addressed in future research. Despite the satisfactory power value provided by the post hoc power analysis, the number of patients may be relatively small for a condition that is as heterogeneous as hydronephrosis. Even so, the errors noted are small, and if the parameters are conservatively defined, there is less risk of an error of omission (ie missing detection of a patient with an increased t1/2). The current methodology requires hand drawn segmentation. While this is not difficult, it is tedious and requires some training, and represents a potential source of interobserver variability. Ultimately the goal is to develop an automatic segmentation algorithm,27,28 which would permit rapid evaluation of an image to yield a “hydronephrosis score” that would be available to the clinician for further decision making. This study does not assess the potential of using this methodology for clinical followup. It is unknown if the US parameters remain consistent or reflect alterations in either direction as hydronephrosis evolves. Hafez29 and Ross30 et al observed that increasing hydronephrosis is indicative of obstruction. However, not all obstructed kidneys exhibit increasing dilatation on followup, and it is unknown if persisting hydronephrosis is safe. It remains to be seen if QI may be able to more readily identify these patients. That area is the subject of an ongoing study.

1098

QUANTITATIVE ULTRASOUND FOR PEDIATRIC HYDRONEPHROSIS

CONCLUSIONS We have demonstrated the potential clinical usefulness of a semiautomated means of quantitating ultrasonographic images of hydronephrotic kidneys. This approach can decrease the number of diuretic

renograms performed by up to 62%, depending on selection criteria used. While diuretic renography remains the gold standard, the described technique has the potential to simplify and enhance our care of children with hydronephrosis.

REFERENCES 1. Siegel MJ: Pediatric Sonography, 4th ed. Philadelphia: Lippincott Williams & Wilkins 2010.

11. Kovalevsky V: Curvature in digital 2D images. Int J Patt Recogn Artif Intell 2001; 15: 1183.

percentage differential function. J Pediatr Urol 2007; 3: 382.

2. Cerrolaza JJ, Peters CA, Martin AD et al: Ultrasound based computer-aided-diagnosis of kidneys for pediatric hydronephrosis. Proc SPIE 9035, Medical Imaging 2014; 90352T-1.

12. Hosmer DW and Lemeshow S: Applied Logistic Regression, 2nd ed. New York: John Wiley & Sons Inc. 2000.

22. Dias CS, Silva JM, Pereira AK et al: Diagnostic accuracy of renal pelvic dilatation for detecting surgically managed ureteropelvic junction obstruction. J Urol 2013; 190: 661.

3. Fernbach SK, Maizels M and Conway JJ: Ultrasound grading of hydronephrosis: introduction to the system used by the Society for Fetal Urology. Pediatr Radiol 1993; 23: 478.

13. Cristianini N and Shawe-Taylor J: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. New York: Cambridge University Press 2000.

4. Keays MA, Guerra LA, Mihill J et al: Reliability assessment of Society for Fetal Urology ultrasound grading system for hydronephrosis. J Urol, suppl., 2008; 180: 1680.

14. Sun T, Wang J, Li X et al: Comparative evaluation of support vector machines for computer aided diagnosis of lung cancer in CT based on a multi-dimensional data set. Comput Methods Programs Biomed 2013; 111: 519.

5. Kim SY, Kim MJ, Yoon CS et al: Comparison of the reliability of two hydronephrosis grading systems: the Society for Foetal Urology grading system vs. the Onen grading system. Clin Radiol 2013; 68: e484.

15. Eskild-Jensen A, Gordon I, Piepsz A et al: Interpretation of the renogram: problems and pitfalls in hydronephrosis in children. BJU Int 2004; 94: 887.

6. Shapiro SR, Wahl EF, Silberstein MJ et al: Hydronephrosis index: a new method to track patients with hydronephrosis quantitatively. Urology 2008; 72: 536. 7. Venkatesan KJ, Green J, Shapiro SR et al: Correlation of hydronephrosis index to Society of Fetal Urology hydronephrosis scale. Adv Urol 2009; 960490. 8. Rud O, Moersier J, Peter J et al: Prospective evaluation of interobserver variability of the hydronephrosis index and the renal resisitive index as sonographic examination methods for the evaluation of acute hydronephrosis. BJU Int 2012; 110: E350. 9. Shulkin BL, Mandell GA, Cooper JA et al: Procedure guideline for diuretic renography in children 3.0. J Nucl Med Technol 2008; 36: 162. 10. Conway JJ and Maizels M: The “well tempered” diuretic renogram: a standard method to examine the asymptomatic neonate with hydronephrosis or hydroureteronephrosis. A report from combined meetings of the Society for Fetal Urology and members of the Pediatric Nuclear Medicine Councildthe Society of Nuclear Medicine. J Nucl Med 1992; 33: 2047.

23. Cost GA, Merguerian PA, Cheerasam SP et al: Sonographic renal parenchymal and pelvicaliceal areas: new quantitative parameters for renal sonographic followup. J Urol 1996; 156: 725. 24. Rodrıguez LV, Lock J, Kennedy WA et al: Evaluation of sonographic renal parenchymal area in the management of hydronephrosis. J Urol 2001; 165: 548. 25. Pulido JE, Furth SL, Zderic SA et al: Renal parenchymal area and risk of ESRD in boys with posterior urethral valves. Clin J Am Soc Nephrol 2014; 9: 499.

16. Elisseeff A and Pontil M: Leave-one-out error and stability of learning algorithms with applications. In: Advances in Learning Theory: Methods, Models and Applications. Edited by J Suykens, G Horvath, S Basu et al: Amsterdam: IOS Press 2003; chapt 6, pp 111e129.

26. Riccabona M, Fritz GA, Sch€ollnast H et al: Hydronephrotic kidney: pediatric three-dimensional US for relative renal size assessmentdinitial experience. Radiology 2005; 236: 276.

17. Cai D, Zhang C and He X: Unsupervised feature selection for multi-cluster data. Presented at annual conference of ACM SIGKDD, Washington, D.C., July 25-28, 2010.

27. Kang X, Safdar N, Myers E et al: Automatic analysis of pediatric renal ultrasound using shape, anatomical and image acquisition priors. Med Image Comput Comput Assist Interv 2013; 8151: 259.

18. Bostanci B and Bostanci E: An evaluation of classification algorithms using McNemar’s test. Presented at International Conference on BioInspired Computing: Theories and Applications, Gwalior, India, December 14-16, 2012.

28. Cerrolaza JJ, Safdar N, Peters CA et al: Segmentation of kidney in 3D-ultrasound images using Gabor-based appearance models. Presented at IEEE International Symposium on Biomedical Imaging, Beijing, China, April 29-May 2, 2014.

19. Hedges LV: Distribution theory for Glass’s estimator of effect size and related estimators. J Educ Behav Stat 1981; 6: 107.

29. Hafez AT, McLorie G, B€agli D et al: Analysis of trends on serial ultrasound for high grade neonatal hydronephrosis. J Urol 2002; 168: 1518.

20. Landis JR and Koch GG: The measurement of observer agreement for categorical data. Biometrics 1977; 33: 159. 21. Erickson BA, Maizels M, Shore RM et al: Newborn Society of Fetal Urology grade 3 hydronephrosis is equivalent to preserved

30. Ross SS, Kardos S, Krill A et al: Observation of infants with SFU grades 3-4 hydronephrosis: worsening drainage with serial diuresis renography indicates surgical intervention and helps prevent loss of renal function. J Pediatr Urol 2011; 7: 266.

EDITORIAL COMMENT This report is a promising contribution to the never ending discussion among pediatric urologists about the proper diagnosis of congenital

hydronephrosis with regard to a conservative approach, followup and indications for surgical intervention. The semiquantitative approach, based

QUANTITATIVE ULTRASOUND FOR PEDIATRIC HYDRONEPHROSIS

on ultrasonographic images, may lead to more objective and, perhaps most importantly, reproducible results of diagnosis and outcome measures. So far, semiquantitative US has been related to t1/2 on MAG3 renography, a quite controversial parameter, especially in younger children and infants. To some extent, this method is observer dependent but the same is the case in assessment

1099

of renography. Future studies will hopefully give us the potential to assess differential renal function using 3-D quantitative imaging as a noninvasive tool without the exposure to radiation. L. Henning Olsen Department of Urology, Section of Pediatric Urology Aarhus University Hospital Aarhus, Denmark