Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models

Ishwarya S. Mamidi, LSU Health Sciences Center - New OrleansFollow
Michael E. Dunham, LSU Health Sciences Center - New OrleansFollow
Lacey K. Adkins, LSU Health Sciences Center - New OrleansFollow
Andrew J. McWhorter, LSU Health Sciences Center - New OrleansFollow
Zhide Fang, LSU Health Sciences Center - New OrleansFollow
Britney T. Banh, Our Lady of the Lake Regional Medical Center

Document Type

Article

Publication Date

5-16-2024

Publication Title

Annals of Otology, Rhinology and Laryngology

Abstract

Objective: Develop an artificial intelligence assisted computer vision model to screen for laryngeal cancer during flexible laryngoscopy. Methods: Using laryngeal images and flexible laryngoscopy video recordings, we developed computer vision models to classify video frames for usability and cancer screening. A separate model segments any identified lesions on the frames. We used these computer vision models to construct a video stream annotation system. This system classifies findings from flexible laryngoscopy as “potentially malignant” or “probably benign” and segments any detected lesions. Additionally, the model provides a confidence level for each classification. Results: The overall accuracy of the flexible laryngoscopy cancer screening model was 92%. For cancer screening, it achieved a sensitivity of 97.7% and a specificity of 76.9%. The segmentation model attained an average precision at a 0.50 intersection-over-union of 0.595. The confidence level for positive screening results can assist clinicians in counseling patients regarding the findings. Conclusion: Our model is highly sensitive and adequately specific for laryngeal cancer screening. Segmentation helps endoscopists identify and describe potential lesions. Further optimization is required to enable the model’s deployment in clinical settings for real-time annotation during flexible laryngoscopy.

PubMed ID

38755974

Recommended Citation

Mamidi, Ishwarya S.; Dunham, Michael E.; Adkins, Lacey K.; McWhorter, Andrew J.; Fang, Zhide; and Banh, Britney T., "Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models" (2024). School of Medicine Faculty Publications. 2556.
https://digitalscholar.lsuhsc.edu/som_facpubs/2556
10.1177/00034894241253376

Link to Full Text

Find in your library

COinS

DOI

10.1177/00034894241253376

Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models

Document Type

Publication Date

Publication Title

Abstract

PubMed ID

Recommended Citation

DOI

Search

Browse

Author Corner

Links

School of Medicine Faculty Publications

Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models

Authors

Document Type

Publication Date

Publication Title

Abstract

PubMed ID

Recommended Citation

Share

DOI

Search

Browse

Author Corner

Links