Learning Camera Pose from Optical Colonoscopy Frames Through Deep Convolutional Neural Network (CNN)

Mohammad Ali Armin*, Nick Barnes, Jose Alvarez, Hongdong Li, Florian Grimpen, Olivier Salvado

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    7 Citations (Scopus)

    Abstract

    Optical colonoscopy is performed by insertion of a long flexible colonoscope into the colon. Estimating the position of the colonoscope tip with respect to the colon surface is important as it would help localization of cancerous polyps for subsequent surgery and facilitate navigation. Knowing camera pose is also essential for 3D automatic scene reconstruction, which could support clinicians inspecting the whole colon surface thereby reducing missed polyps. This paper presents a method to estimate the pose of the colonoscope camera with six degrees of freedom (DoF) using deep convolutional neural network (CNN). Because obtaining a ground truth to train the CNN for camera pose from actual colonoscopy videos is extremely challenging, we trained the CNN using realistic synthetic videos generated with a colonoscopy simulator, which could generate the exact camera pose parameters. We validated the trained CNN on unseen simulated video datasets and on actual colonoscopy videos from 10 patients. Our results showed that the colonoscopy camera pose could be estimated with higher accuracy and speed than feature based computer vision methods such as the classical structure from motion (SfM) pipeline. This paper demonstrates that transfer learning from surgical simulation to actual endoscopic based surgery is a possible approach for deep learning technologies.

    Original languageEnglish
    Title of host publicationComputer Assisted and Robotic Endoscopy and Clinical Image-Based Procedures - 4th International Workshop, CARE 2017 and 6th International Workshop, CLIP 2017 Held in Conjunction with MICCAI 2017, Proceedings
    EditorsTal Arbel, M. Jorge Cardoso
    PublisherSpringer Verlag
    Pages50-59
    Number of pages10
    ISBN (Print)9783319675428
    DOIs
    Publication statusPublished - 2017
    Event4th International Workshop on Computer Assisted and Robotic Endoscopy, CARE 2017 and 6th International Workshop on Clinical Image-Based Procedures, CLIP 2017 held in Conjunction with 20th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2017 - Quebec City, Canada
    Duration: 14 Sept 201714 Sept 2017

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume10550 LNCS
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    Conference4th International Workshop on Computer Assisted and Robotic Endoscopy, CARE 2017 and 6th International Workshop on Clinical Image-Based Procedures, CLIP 2017 held in Conjunction with 20th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2017
    Country/TerritoryCanada
    CityQuebec City
    Period14/09/1714/09/17

    Fingerprint

    Dive into the research topics of 'Learning Camera Pose from Optical Colonoscopy Frames Through Deep Convolutional Neural Network (CNN)'. Together they form a unique fingerprint.

    Cite this