eISSN: 1896-9151
ISSN: 1734-1922
Archives of Medical Science
Current issue Archive Manuscripts accepted About the journal Special issues Editorial board Abstracting and indexing Subscription Contact Instructions for authors
SCImago Journal & Country Rank
vol. 14
Clinical research

Comparison of ovarian responsiveness tests with outcome of assisted reproductive technology – a retrospective analysis

Selcuk Selcuk, Bulent Emre Bilgic, Cetin Kilicci, Mehmet Kucukbas, Cetin Cam, Huseyin Tayfun Kutlu, Ates Karateke

Arch Med Sci 2018; 14, 4: 851–859
Online publish date: 2016/09/22
Article file
Get citation
JabRef, Mendeley
Papers, Reference Manager, RefWorks, Zotero


The ovarian response to controlled ovarian stimulation with gonadotropins is essential for the outcomes of assisted reproductive technology (ART) cycles [1]. Previous studies have demonstrated various clinical, hormonal, and ultrasonographic markers to estimate the ovarian reserve and the ovarian responsiveness to gonadotropin stimulation. Age, ultrasonographic antral follicle count (AFC), basal levels of estradiol (E2), follicle stimulating hormone (FSH), luteinizing hormone (LH), inhibin B, and anti-Müllerian hormone (AMH) have been defined as ovarian reserve tests, and they have been proposed for the evaluation of ovarian response [2–4].
Moreover, different models with a combination of various markers have been suggested as ways to increase the effectiveness of extrapolating the ovarian response. The ovarian sensitivity index (OSI) and ovarian response prediction index (ORPI) have been shown to be valid and reliable estimation models for ovarian response to gonadotropins in ART cycles [5, 6].
Based on previous studies that have reported strong outcomes for the previously described estimation indices, the issue of which ovarian response model has the best accuracy is still a challenge for managing patients undergoing ART treatment.
Nevertheless, to date, no detailed studies have compared the suggested ovarian response markers and/or a combination of models with ART treatment outcomes in the same patient. Therefore, we conducted a retrospective analysis of five different ovarian response markers (age, AMH, AFC, OSI, and ORPI) to evaluate the ovarian response to gonadotropin stimulation and the chance of ongoing pregnancy after ART treatment.

Material and methods

Study population

In this retrospective study, patients who underwent intracytoplasmic sperm injection (ICSI) treatment at the infertility clinic between 2014 and 2015 were enrolled. The study’s protocol was approved by the institution’s ethics committee. Informed consent was obtained from all the participants included in this study. Inclusion criteria were: ≤ 42 years of age, body mass index ≤ 35 kg/m2, regular menstrual cycles, no presence of endocrine disorders, no history of ovarian surgery, and no severe endometriosis. Demographics, clinical data, and ART treatment outcomes of the patients were abstracted from the hospital’s database. All patients underwent transvaginal ultrasonographic examination with a 7.5 MHz vaginal probe during the follicular phase of their spontaneous menstrual cycle. Total AFC was calculated with the sum of 2–10 mm antral follicles in both ovaries.

Ovarian stimulation protocol

The controlled ovarian stimulation (COS) protocols consisted of a long gonadotropin-releasing hormone (GnRH) agonist or a multi-dose GnRH antagonist protocol individualized for each patient.
The GnRH agonist protocol was performed using a 1 mg/day dose of long GnRH agonist (leuprolide acetate; Lucrine; Abbott, Turkey) during the 14 days of the luteal phase of the previous menstrual cycle for down-regulation. Then, ovarian stimulation was started by daily injection of recombinant follicle stimulating hormone (r-FSH) (Gonal-F, Merck Serono, Istanbul, Turkey) with a starting dose specific for each individual case, according to the patient’s age, body mass index (BMI), ovarian reserve, and AFC.
In the GnRH antagonist protocol, ovarian stimulation was started with subcutaneous injection of gonadotropins, such as r-FSH (Gonal F, Merck Serono, Istanbul, Turkey), and a starting dose was administered based on the patient’s age, BMI, ovarian reserve, and AFC from days 2–4 of the menstrual cycle. According to the fixed antagonist protocol, the GnRH antagonist (Cetrotide; 0.25 mg; Merck Serono, Istanbul, Turkey) injection was started beginning on the sixth day of stimulation.
The number and size of the follicles and the serum estradiol levels of all patients were monitored via ultrasound. The gonadotropin dose was chosen based on the patient’s response. On cycle days 2–3, each patient underwent a transvaginal ultrasound to determine their AFC and to screen for ovarian cysts. Repeat examination was performed on day 6 of the ovarian stimulation and subsequently every 1–3 days, as clinically indicated until the criterion for subcutaneous administration of recombinant chorionic gonadotropin alpha 250 mg (Ovitrelle; Merck-Serono, Istanbul, Turkey) was ≥ 18 mm in diameter. Ovum retrieval was performed 36 h later.
In all cases, the ICSI procedure was performed on the same day (day 0), and fertilization was confirmed 16–18 h later. Embryo transfer was performed on day 3, day 4, or day 5 based on the quality of the embryos. From the day of ovum retrieval, the luteal phase was supported by vaginal progesterone gel (Crinone 8% gel; Serono, Istanbul, Turkey) twice a day. Clinical pregnancy was described as fetal pole and fetal cardiac activity as determined by ultrasonographic examination. Ongoing pregnancy was defined as a viable pregnancy, as confirmed by ultrasonography at 20 weeks.

Embryo quality

The polarization, presence of a cytoplasmic halo, number of pronuclei, and pronuclear appearance were the morphological features used in the zygote scoring system. Embryo quality was described based on the size of the blastomeres and the number, degree of fragmentation, and presence of multinucleated blastomeres. An embryo with 7 or 8 equal sized blastomeres and with less than 10% cytoplasmic fragmentation with no multinucleation was accepted as good quality on day 3 [7, 8].

Ovarian sensitivity index calculation

Ovarian sensitivity index was calculated by dividing the total administered r-FSH dose by the number of oocytes retrieved at the oocyte pick-up [5].

Ovarian response prediction index calculation

The ORPI was calculated using the following formula: ORPI = (AMH × AFC)/patient age [6].

Statistical analysis

Statistical analyses of the data were performed using Statistical Package for the Social Sciences (SPSS) version 11.5 software. Data were given as mean ± standard deviation or percentage. Spearman’s correlation analysis was used to assess the correlation between the parametric variables (total retrieved oocytes, MII oocytes, embryos, good quality embryos on day 1, and good quality embryos on day 3) and the ovarian response tests. The significance of the difference between two correlation coefficients was analyzed by using the z-test. A comparison of the two paired receiver operating characteristic (ROC) curves of the ovarian response tests is appropriate, and the statistical significance of the difference between the areas under the curve (AUC) of those tests was assessed using the z-test, as defined by Hanley et al. The AUC is the combined calculation of sensitivity and specificity, and it is a measure of the overall predictive performance of a predictive test [9]. The overall predictive performances of ovarian response tests can be compared by comparing their AUCs. In the present study, the overall predictive performance of age, AFC, AMH, OSI, and ORPI was calculated for the retrieval of ≥ 4 MII oocytes, ≥ 15 oocytes, and the ongoing pregnancy rate per cycle. Retrieval of < 4 oocytes was accepted as the criterion for poor ovarian response [10, 11], whereas retrieval of ≥ 15 oocytes was accepted as the criterion of excessive response [12, 13]. Data were given as 95% confidential intervals (CI). A p-value ≤ 0.05 was considered statistically significant.


Data of 176 patients who underwent the ICSI procedure with either a GnRH agonist (n = 37) or a GnRH antagonist (n = 139) COS protocol were analyzed. Demographic, clinical, and ART treatment characteristics of the patients are given in Table I. The outcomes of Spearman’s correlation coefficients between the ovarian response tests and the total number of retrieved oocytes, MII oocytes, embryos, and good quality embryos on day 1 and day 3 are presented in Table II. The correlation coefficients were compared separately and are given in Tables III and IV. The correlation coefficients for OSI were significantly higher than the other parameters in terms of total number of retrieved oocytes, MII oocytes, embryos, and good quality embryos on day 1 and day 3.
The ROC curve analyses of age, AFC, AMH, OSI, and ORPI were performed for the retrieval of < 4 oocytes, ≥ 4 MII oocytes, ≥ 15 oocytes, and ongoing pregnancy rate per cycle (Figure 1). Pairwise comparison of the AUCs was performed in order to analyze the relationship between the ovarian response tests and the recorded data for the outcomes. Comparison of the two paired ROC curves of the ovarian response tests are shown in Table V. The level of association between the ovarian response tests and poor ovarian response data was (in descending order): OSI, ORPI, AFC, AMH, and age. The OSI (AUCOSI = 0.976) was found to be the ovarian response test with the highest relationship with poor ovarian response. The overall association of OSI with poor ovarian response was significantly higher than it was for the other parameters. For the retrieval of ≥ 4 MII oocytes, OSI was the parameter with the highest AUC value (0.935), whereas age had the lowest AUC value (0.629) among all the studied parameters. In patients with high ovarian response data, OSI had the highest AUC value (0.984), followed by AFC (0.907) and ORPI (0.887). In addition, the AUC value of OSI had a higher statistical significance than the APC value of ORPI and AFC (p1 = 0.023, p2 = 0.014, respectively). There was no statistically significant difference among the AUC values of the ovarian response tests for the data of patients with ongoing pregnancies.


The present retrospective analysis shows that OSI may have a significant role in the selection of patients with a poor or high ovarian response. Which ovarian response marker is more useful in predicting ovarian response and the chance of pregnancy remains to be answered. Unfortunately, there are limited data to determine the most accurate test or model that can be used to estimate ovarian response and the probability of pregnancy or live birth.
Measuring the ovarian response with markers such as AMH, AFC, and ORPI is influenced by the gonadotropin dosage, and the administered dosage of gonadotropin influences the final number of retrieved oocytes. OSI was first defined by Biasoni et al. to predict the ovarian response for in vitro fertilization (IVF) cycles, and the authors concluded that OSI accurately reflects the AFC and AMH levels of patients who underwent IVF treatment with a long GnRH agonist protocol [5]. In contrast to other markers, OSI is irrespective of gonadotrophin, and it measures the genuine potential of specific ovaries. On the other hand, although the predictive abilities of age, AFC, basal levels of E2, FSH, inhibin B, and AMH were found to be moderate for large study populations, their predictive performance was reported to be quite low for individual cases [14, 15].
There is a lack of studies in the literature evaluating OSI for ovarian response in women on antagonist protocols. To our knowledge, only one study has demonstrated an excellent correlation between the parameters of ovarian response and OSI [16], and our analysis is the second study to signify the role of OSI in an antagonist protocol. Huber et al. demonstrated a positive correlation between pregnancy rate per oocyte pick-up and OSI in patients on a long agonist protocol, and they emphasized the necessity of confirming the OSI for patients on other protocols [15].
The ORPI was first described by Oliveira et al. [6]. In their study, ORPI showed excellent predictive ability for poor ovarian response and a good prediction performance for retrieval of MII oocytes ≥ 4, a high ovarian response, and clinical pregnancies in women on both antagonist and agonist protocols [6]. Brodin et al. evaluated four different ovarian reserve tests (age, AFC, AMH, and combinations of basal levels of FSH and LH and menstrual cycle lengths), and they concluded that the combination of AMH, AFC, and age was the best model for predicting ovarian response; moreover, when compared with age, AMH was superior in the estimation of live birth rates after ART treatment [17].
Our data showed that both OSI and ORPI may have important roles for identifying women with a possible poor ovarian response; however, the association between poor ovarian response and OSI was found to be significantly stronger than the association between poor ovarian response and ORPI. Similarly, while ORPI, AFC, and AMH seemed to be capable, to some degree, of identifying women with a possible excessive ovarian response, OSI showed superiority in these cycles. In addition, OSI was found to be more useful than ORPI for the total number of embryos and good quality embryos on day 1 and day 3.
The possible reasons for the lower predictive performances of AFC, AMH, and ORPI in comparison to OSI for estimating ovarian response might be related to the limitations of AFC and AMH. In terms of inter-observer variation, the quality of the ultrasound and difficulties in visualizing the ovaries because of anatomic abnormalities may impair the quality of assessing the AFC and reduce its predictive ability for ovarian response and pregnancy rates [17]. On the other hand, previous studies have shown that factors such as assay technical issues, sample instability, and inter- and intra-individual variations may affect the AMH measurements and limit AMH’s use as a reliable ovarian responsiveness marker [18–21]. The other important limitation of AMH is the influence of ethnicity on serum levels. A recent review suggested the necessity of ethnicity-specific cut-off points in defining expected poor and high responders based on differences in the nomogram of the AMH levels in different ethnic populations [21]. Given the inherent limitations of AFC and AMH measurements, it is not surprising that the ORPI model, which is calculated with these markers, may be inferior to other models, such as OSI.
Recently, studies have reported conflicting results about the associations between ovarian response tests and the estimation of the number of pregnancies [21, 22]. While some studies have reported that AFC and AMH have strong associations with live-birth rates after ART treatment based on their predictive ability for oocyte quantity and oocyte quality [23, 24], other studies found no significant relationship between AMH and oocyte quality/embryo quality [21]. In our study, the correlation coefficients of OSI showed significantly higher correlations with the total number of retrieved oocytes, MII oocytes, embryos, and good quality embryos on day 1 and day 3 than the other ovarian response tests. Although our data showed marked associations between ovarian response markers and the total number of embryos, none of the analyzed parameters or models demonstrated similar positive correlations with the number of ongoing pregnancies. It should be taken into account that various factors, such as the abnormality of the sperm parameters and endometrial features, could affect the occurrence of pregnancy [25]. In addition, ongoing or live birth rates may be affected by fetal-maternal risk factors.
One of the limitations of our study is its retrospective nature. It was not possible to report reliable predictive data of the analyzed markers and/or models. A prospective methodology would be much more convincing if it deals with the predictive value of putative ovarian responsiveness and pregnancy markers. It is obvious that prospective verification of the correspondence between prediction and real facts will definitely provide higher quality evidence than a retrospective analysis.
Another limitation is the relatively small sample size of the study, especially in the GnRH agonist cycles. A sample size calculation was considered to be unnecessary because the study aimed to collect as much data of the cycles from the hospital’s database as possible to meet our relatively narrow inclusion criteria.
In conclusion, although retrospective, the present study is the first to compare the five most frequently used ovarian responsiveness markers. The retrospective data indicating that OSI might possess superior associations for poor and high ovarian responses to gonadotropin stimulation in ART cycles with GnRH agonist or antagonist protocols than other ovarian responsiveness markers need to be confirmed in larger prospective studies. This may be of particular importance because OSI is calculated without extra effort during an ART cycle, unlike ORPI, in which ultrasonographic assessment and blood sampling are required.

Conflict of interest

The authors declare no conflict of interest.


1. La Marca A, Ferraretti AP, Palermo R, Ubaldi FM. The use of ovarian reserve markers in IVF clinical practice: a national consensus. Gynecol Endocrinol 2016; 32: 1-5.
2. Holte J, Brodin T, Berglund L, Hadziosmanovic N, Olovsson M, Bergh T. Antral follicle counts are strongly associated with live-birth rates after assisted reproduction, with superior treatment outcome in women with polycystic ovaries. Fertil Steril 2011; 96: 594-9.
3. Tsakos E, Tolikas A, Daniilidis A, Asimakopoulos B. Predictive value of anti-Müllerian hormone, follicle-stimulating hormone and antral follicle count on the outcome of ovarian stimulation in women following GnRH-antagonist protocol for IVF/ET. Arch Gynecol Obstet 2014; 290: 1249-53.
4. Oner G, Ulug P, Elmali F. Ovarian reserve markers in unexplained infertility patients treated with clomiphene citrate during intrauterine insemination. Arch Med Sci 2015; 11: 1250-4.
5. Biasoni V, Patriarca A, Dalmasso P, et al. Ovarian sensitivity index is strongly related to circulating AMH and may be used to predict ovarian response to exogenous gonadotropins in IVF. Reprod Biol Endocrinol 2011; 9: 112.
6. Oliveira JB, Baruffi RL, Petersen CG, et al. A new ovarian response prediction index (ORPI): implications for individualised controlled ovarian stimulation. Reprod Biol Endocrinol 2012; 10: 94.
7. Van Royen E, Mangelschots K, De Neubourg D, et al. Characterization of a top quality embryo, a step towards single-embryo transfer. Hum Reprod 1999; 14: 2345-9.
8. Alpha Scientists in Reproductive Medicine and ESHRE Special Interest Group of Embryology. Balaban B, Brison D, Calderon G, et al. The Istanbul consensus workshop on embryo assessment: proceedings of an expert meeting. Hum Reprod 2011; 26: 1270-83.
9. Hanley JA, McNeil BJ. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 1983; 148: 839-43.
10. Younis JS, Jadaon J, Izhaki I, et al. A simple multivariate score could predict ovarian reserve, as well as pregnancy rate, in infertile women. Fertil Steril 2010; 94: 655-61.
11. Ferraretti AP, La Marca A, Fauser BC, Tarlatzis B, Nargund G, Gianaroli L. ESHRE consensus on the definition of ‘poor response’ to ovarian stimulation for in vitro fertilization: the Bologna criteria. Hum Reprod 2011; 26: 1616-24.
12. Broer SL, Dolleman M, Opmeer BC, Fauser BC, Mol BW, Broekmans FJ. AMH and AFC as predictors of excessive response in controlled ovarian hyperstimulation: a meta-analysis. Hum Reprod Update 2011; 17: 46-54.
13. Aflatoonian A, Oskouian H, Ahmadi S, Oskouian L. Prediction of high ovarian response to controlled ovarian hyperstimulation: anti-Mullerian hormone versus small antral follicle count (2–6 mm). J Assist Reprod Genet 2009; 26: 319-25.
14. Jayaprakasan K, Campbell B, Hopkisson J, Johnson I, Raine-Fenning N. A prospective, comparative analysis of anti-Mullerian hormone, inhibin-B, and three-dimensional ultrasound determinants of ovarian reserve in the prediction of poor response to controlled ovarian stimulation. Fertil Steril 2010; 93: 855-64.
15. Huber M, Hadziosmanovic N, Berglund L, Holte J. Using the ovarian sensitivity index to define poor, normal, and high response after controlled ovarian hyperstimulation in the long gonadotropin-releasing hormone-agonist protocol: suggestions for a new principle to solve an old problem. Fertil Steril 2013; 100: 1270-6.
16. Li HW, Lee VC, Ho PC, Ng EH. Ovarian sensitivity index is a better measure of ovarian responsiveness to gonadotrophin stimulation than the number of oocytes during in-vitro fertilization treatment. J Assist Reprod Genet 2014; 31: 199-203.
17. Brodin T, Hadziosmanovic N, Berglund L, Olovsson M, Holte J. Comparing four ovarian reserve markers – associations with ovarian response and live births after assisted reproduction. Acta Obstet Gynecol Scand 2015; 94: 1056-63.
18. Ledger WL. Measurement of antimüllerian hormone: not as straightforward as it seems. Fertil Steril 2014; 101: 339.
19. Rustamov O, Smith A, Roberts SA, et al. Anti-Mullerian hormone: poor assay reproducibility in a large cohort of subjects suggests sample instability. Hum Reprod 2012; 27: 3085-91.
20. Muzii L, Luciano AA, Zupi E, Panici PB. Effect of surgery for endometrioma on ovarian function: a different point of view. J Minim Invasive Gynecol 2014; 21: 531-3.
21. Iliodromiti S, Kelsey TW, Wu O, Anderson RA, Nelson SM. The predictive accuracy of anti-Mullerian hormone for live birth after assisted conception: a systematic review and meta-analysis of the literature. Hum Reprod Update 2014; 20: 560-70.
22. van Loendersloot L, Repping S, Bossuyt PM, van der Veen F, van Wely M. Prediction models in in vitro fertilization; where are we? A mini review. J Adv Res 2014; 5: 295-301.
23. Lehmann P, Vélez MP, Saumet J, et al. Anti-Müllerian hormone (AMH): a reliable biomarker of oocyte quality in IVF. J Assist Reprod Genet 2014; 31: 493-8.
24. Brodin T, Hadziosmanovic N, Berglund L, Olovsson M, Holte J. Antimullerian hormone levels are strongly associated with live-birth rates after assisted reproduction. J Clin Endocrinol Metab 2013; 98: 1107-14.
25. Mao GH, Feng Z, He Y, Huang YR. Comparisons of the effects of long-acting and short-acting GnRH agonists on embryo quality, endometrial thickness and pregnancy rate in human in vitro fertilization. Arch Med Sci 2014; 10: 161-6.
Copyright: © 2016 Termedia & Banach. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License (http://creativecommons.org/licenses/by-nc-sa/4.0/), allowing third parties to copy and redistribute the material in any medium or format and to remix, transform, and build upon the material, provided the original work is properly cited and states its license.
Quick links
© 2019 Termedia Sp. z o.o. All rights reserved.
Developed by Bentus.
PayU - płatności internetowe