Skip to main content

Research Repository

See what's under the surface

Clinical Scene Segmentation with Tiny Datasets

Smith, Thomas J; Sharkey, Don; Crowe, John; Valstar, Michel


Thomas J Smith

Don Sharkey

John Crowe

Michel Valstar


Many clinical procedures could benefit from automatic scene segmentation and subsequent action recognition. Using Convolutional Neural Networks to semantically segment meaningful parts of an image or video is still an unsolved problem. This becomes even more apparent when only a small dataset is available. Whilst using RGB as the input is sufficient for a large labelled dataset, achieving high accuracy on a small dataset directly from RGB is difficult. This is because the ratio of free image dimensions to the number of training images is very high, resulting in unavoidable underfitting. We show that the addition of su-perpixels to represent an image in our network improves the semantic segmentation, and that superpixels can be learned to be detected by Convolutional Neural Networks if those superpixels are appropriately represented. Here we present a novel representation for superpixels, multi-channel connected graphs (MCGs). We show how using pre-trained deep learned superpixels used in an end-to-end manner achieve good semantic segmentation results without the need for large quantities of labelled data, by training with only 20 instances for 23 classes.

Start Date Oct 27, 2019
Publication Date Nov 2, 2019
Book Title Proceedings of IEEE 17th International Conference on Computer Vision (ICCV 2019)
APA6 Citation Smith, T. J., Sharkey, D., Crowe, J., & Valstar, M. (2019). Clinical Scene Segmentation with Tiny Datasets. In Proceedings of IEEE 17th International Conference on Computer Vision (ICCV 2019)
Related Public URLs