HYBRID EVENT: You can participate in person at Orlando, Florida, USA or Virtually from your home or work.

3rd Edition of Global Conference on Gynecology & Women's Health

October 27-29, 2025 | Orlando, Florida, USA

Gynec 2025

Leveraging natural language processing to advance women’s health: Early detection of possible polycystic ovarian syndrome in emergency radiology reports

Speaker at Gynecology Conferences - Raveena Atwal
University of California, United States
Title : Leveraging natural language processing to advance women’s health: Early detection of possible polycystic ovarian syndrome in emergency radiology reports

Abstract:

Polycystic Ovary Syndrome (PCOS) is one of the most common endocrine disorders affecting women of reproductive age, with implications for fertility, metabolic health, and long-term disease risk. Diagnosis requires a combination of imaging and clinical findings such as irregular menstruation and hormone levels. However, in settings like the Emergency Department (ED), radiologists often describe ovarian morphology without explicitly suggesting PCOS or recommending follow-up. This diagnostic ambiguity contributes to patients, especially women, being lost to follow-up, risking delayed diagnosis and missed treatment opportunities. With national trends showing declining fertility and increasing pre-diabetes rates, there is a pressing need for scalable solutions to bridge this care gap.

Pelvic ultrasound reports from non-obstetric (OB) patients who presented through the ED were manually reviewed and classified as “No PCOS” or “Possible PCOS” based on imaging descriptors. A training dataset was created with 83 reports per category, and a separate validation set with 30 “No PCOS” and 15 “Possible PCOS” cases. We used Google Colaboratory (Colab), a cloud-based Jupyter Notebook platform, to train a logistic regression classifier, leveraging libraries such as Scikit-learn and TensorFlow. The model was trained using TF-IDF, which converts ultrasound report text into numerical features for classification.

The trained logistic regression model achieved a validation accuracy of 0.93. For the “No PCOS” class (n=30), precision was 1.00 and recall was 0.90, indicating strong performance with minimal false positives. For the “Possible PCOS” class (n=15), precision was 0.83 and recall was 1.00, suggesting the model successfully identified all true positive cases in this group but misclassified a few non-PCOS reports as possible PCOS. The weighted average F1-score was 0.93, demonstrating overall model reliability across both categories.

This study demonstrates that a logistic regression model trained on radiology reports can reliably identify possible PCOS cases. Such tools could support earlier clinical intervention by automatically identifying patients who may benefit from follow-up or confirmatory testing. This model addresses a critical gap in women’s health by enabling proactive care, even when explicit recommendations are missing, without increasing the workload for radiologists or ED clinicians.

Biography:

Raveena is a second-year medical student at the University of California, Irvine School of Medicine. She is currently conducting research in the Computational Abdominal Radiology (CAR) Lab, focusing the application of artificial intelligence in radiology. Her research interest include women’s health and the role of machine learning in advancing diagnostic imaging. She is passionate about integrating clinical medicine with innovative imaging research to improve patient outcomes.

Watsapp