Abstract

Title

Gradient-based neuroevolution of augmenting topologies for compact, low compute deep ANN search

Authors

Sithin Thulasi Seetha1,2, Kurt Driessens3, Henry Woodruff1, Tiziana Rancati4, Elena Bertocchi4, Ugo Pastorino2, Philippe Lambin1

Authors Affiliations

1GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre, Department of Precision Medicine, Maastricht, The Netherlands; 2IRCCS Foundation National Cancer Institute, Department of Thoracic Oncology, Milan, Italy; 3Maastricht University, Data Science & Knowledge Engineering, Maastricht, The Netherlands; 4IRCCS Foundation National Cancer Institute, Prostate Cancer Unit, Milan, Italy

Purpose or Objective

To propose a novel fast, compact, and low compute hybrid neural architectural search (NAS) algorithm which we call gNEAT for complex deep learning applications; validate the algorithm, and apply it for the task of lung cancer segmentation.

Materials and Methods

gNEAT essentially searches over a space of possible network solutions starting from the most basic architecture (containing only the input and output nodes). During the course of the search, additional hidden nodes and connections are introduced incrementally. Compactness in the context of ANNs is measured based on the minimum number of parameters (nodes and connections) needed to solve a task, in relation to some known optimal solution. Validation of gNEAT's compactness is done by XOR and Autoencoder (AE) tasks (see Fig 1.a). Another important quality of gNEAT is that it can perform complex NAS with incredible speed and low computational cost. This is achieved by reducing the population size while ensuring diversity within the population for sufficient search space exploration. Furthermore, to offset the complexity associated with gradient-descent training, computation saving techniques such as the use of a proxy dataset (using resized dataset) and a blueprint (using a predefined structure and searching its components) are integrated (see Fig 1.c). Once the components are evolved, the resultant model is then applied to the original dataset. gNEAT’s abilities for complex NAS are showcased by evolving a 2D UNet architecture for lung cancer segmentation (see Fig 1.b) on the Medical Segmentation Decathlon (Lung D6) dataset. The results are then compared to the state-of-the-art (STOA) nnUNet pipeline (re-implemented). Even though nnUNet 3D holds the current STOA benchmark on this dataset, we will be limiting all the experiments to nnUNet 2D architecture to reduce complexity. In addition to this, we perform an external validation of nnUNet 2D and the evolved solution on the open-source NSCLC Radiomics (Lung 1) dataset.


Results

The table below and figure 2 illustrates validation and application results. Moreover, gNEAT was able to return a solution within 5.66 ± 2.47 hours, averaged across 5 runs, on a single NVIDIA GTX 1080 Ti GPU using less than 4GB of GPU memory. 

Validation Tasks

Metrics

XOR

Vanilla AE

Domain Adapted AE

avg no. of hidden nodes (v/s Ideal)

1.06±0.24 (1)

3.52±1.05 (3)

2.97±0.33 (3)

avg no. of connections (v/s Ideal)

5.09±0.38 (5)

36.7±4.79 (48)

45.18±5.42 (48)

avg. run time in seconds

36.55±16.51

198.60±131.40

56.83±23

Application Tasks

Models

Lung (D6) Decathlon

Lung1 (NSCLC Radiomics)

nnUNet 2D (published) dice score

0.50

NA

nnUNet 2D (reproduced) dice score

0.51±0.04

0.41

Evolved UNet 2D dice score

0.55±0.03

0.45



Conclusion

gNEAT can replace the expensive, time-consuming, and exhaustive trial-and-error process of manual architectural search. Unlike similar automated NAS algorithms, gNEAT can return domain-specific compact solutions in a matter of hours on a low compute system accessible to the majority of research institutions.