Integrate CPU-friendly auto-segmentation and CT utility models into MHub
Key Investigators
- Suraj Pai (Brigham and Women's Hospital, Boston)
- Leonard Nürnberg (Brigham and Women's Hospital, Boston)
- Andriy Fedorov (Brigham and Women's Hospital, Boston)
- Hugo Aerts (Brigham and Women's Hospital, Boston)
Presenter location: In-person
Project Description
This project aims to integrate two categories of models into mhub.ai:
1. CPU-friendly (whole-body) auto-segmentation models
2. A CT utility model for image QA
Objective
- Working example of a CPU-friendly (whole-body) auto-segmentation model on the mhub.ai platform
- Working example of a QA pipeline using CT utility tools on the mhub.ai platform
Approach and Plan
CPU-friendly auto-seg
Several auto-segmentation models have recently been integrated into 3D Slicer through https://github.com/lassoan/SlicerMONAIAuto3DSeg/releases/tag/ModelsTestResults.
While the quick versions of these models run fast on CPU, the slower full-resolution versions take a couple of minutes. It would be interesting to explore whether CPU-oriented optimizations can increase the speed and reduce the memory footprint of the full-resolution versions while making the quick versions even faster.
Some initial thoughts on optimization techniques:
- Converting models to OpenVINO format for optimized CPU inference (https://docs.openvino.ai/2024/home.html, https://docs.openvino.ai/2024/omz_demos_3d_segmentation_demo_python.html). This could provide faster inference and make the models more lightweight, offering a better user experience as well; a conversion sketch follows this list.
- For a majority of these auto-seg models, the sliding-window inferer configuration causes major differences in memory use (with higher batch sizes) and inference time (with larger overlap ratios). Is there an optimal configuration that saves memory and increases speed? See the sliding-window sketch after this list.
- Another ticket item: memory consumption increases sharply when predicting more output classes in the softmax. Is there a way to address this efficiently as well? Perhaps a more restrictive implementation of the sliding-window inferer with an accuracy-efficiency trade-off.
- Distilling models into smaller ones that run faster (this might take longer than Project Week): https://github.com/VaticanCameos99/knowledge-distillation-for-unet
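To make the OpenVINO bullet concrete, here is a minimal sketch of converting a PyTorch network to OpenVINO IR and compiling it for CPU. The Conv3d stand-in, the model.xml file name, and the (1, 1, 96, 96, 96) patch shape are placeholders, not the actual Auto3DSeg model details.

```python
import openvino as ov
import torch

# Stand-in network: the real weights would come from a MONAI Auto3DSeg bundle
# (the Conv3d here is only a placeholder so the snippet runs end to end).
net = torch.nn.Conv3d(1, 5, kernel_size=3, padding=1).eval()
example = torch.zeros(1, 1, 96, 96, 96)  # single-channel CT patch; ROI size is an assumption

# Trace/convert to OpenVINO IR and save it; compress_to_fp16 halves the file size.
ov_model = ov.convert_model(net, example_input=example)
ov.save_model(ov_model, "model.xml", compress_to_fp16=True)

# Compile for CPU and run one patch.
compiled = ov.Core().compile_model("model.xml", device_name="CPU")
logits = compiled(example.numpy())[compiled.output(0)]  # numpy array, (1, 5, 96, 96, 96)
```

The compiled model can then be wrapped as the predictor for MONAI's sliding-window inferer, as in the next sketch.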
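And for the sliding-window questions, a sketch of the knobs in MONAI's sliding_window_inference that trade memory for speed; the network, dummy volume, and ROI size are again placeholder assumptions.

```python
import torch
from monai.inferers import sliding_window_inference

net = torch.nn.Conv3d(1, 5, kernel_size=3, padding=1).eval()  # placeholder for the real model
ct_volume = torch.zeros(1, 1, 160, 160, 160)  # dummy CT volume, (batch, channels, D, H, W)

with torch.no_grad():
    logits = sliding_window_inference(
        inputs=ct_volume,
        roi_size=(96, 96, 96),       # assumption: the real ROI comes from the model bundle
        sw_batch_size=1,             # >1 runs more windows at once: faster, but more memory
        predictor=net,               # could also be a wrapper around the compiled OpenVINO model
        overlap=0.25,                # lower overlap = fewer windows = faster, at some seam cost
        mode="gaussian",             # smooth blending across window seams
        device=torch.device("cpu"),  # where the stitched multi-class buffer is kept
    )

# The stitched logits buffer is (1, num_classes, D, H, W); with many output classes this
# buffer dominates memory, so collapsing to labels immediately frees it.
labels = logits.argmax(dim=1, keepdim=True)
```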
CT utility models
Implementing CT image inspection utility models, namely body part regression (https://github.com/MIC-DKFZ/BodyPartRegression). This model determines which body part was examined and whether there are anomalies in certain slices of the processed image (NIfTI).
Integrating this into MHub would allow users to perform this QA by providing DICOM inputs directly; a sketch of the DICOM-to-NIfTI step follows.
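A minimal sketch of the DICOM-to-NIfTI conversion such a wrapper would perform, using SimpleITK. The directory and file names are placeholders, and the bpreg_predict command in the final comment is an assumption to be checked against the bpreg README.

```python
import SimpleITK as sitk

# Read a DICOM series from a directory (placeholder path) ...
reader = sitk.ImageSeriesReader()
series_ids = reader.GetGDCMSeriesIDs("dicom_dir")
file_names = reader.GetGDCMSeriesFileNames("dicom_dir", series_ids[0])
reader.SetFileNames(file_names)
image = reader.Execute()

# ... and write the NIfTI file that BodyPartRegression consumes.
sitk.WriteImage(image, "nifti_dir/ct.nii.gz")
# Then run the model (assumed CLI; verify flags in the bpreg README):
#   bpreg_predict -i nifti_dir -o output_dir
```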
Progress and Next Steps
CPU friendly auto-seg
Tested the abdominal-organs-v2.0.0 segmentation, as the reported times on CPU were approximately 6 minutes.
Default CPU version:
OpenVINO-compiled model (most significant gain) + reduced overlap ratio:
Note: the OpenVINO models are FP16-compressed and half the size.
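For anyone reproducing the comparison, a hedged sketch of a per-window timing loop over the compiled model; the IR path and patch shape are assumptions, not the exact benchmark used above.

```python
import time
import numpy as np
import openvino as ov

compiled = ov.Core().compile_model("model.xml", device_name="CPU")  # placeholder IR path
patch = np.zeros((1, 1, 96, 96, 96), dtype=np.float32)

compiled(patch)  # warm-up run so one-time setup cost is not measured
n = 10
t0 = time.perf_counter()
for _ in range(n):
    compiled(patch)
print(f"mean per-window latency: {(time.perf_counter() - t0) / n:.3f} s")
```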
Body part regression
Next Steps
- The conversion to OpenVINO is currently done manually for the abdominal model. It can be automated across models in a script and pushed to GitHub for download.
- Benchmarking so far has been done on an AMD CPU; benchmark on Intel and ARM as well.
- Implement MHub models for CPU-friendly auto-segmentation and interface with Andras' SlicerMONAIAuto3DSeg extension (work on some Slicer extension specifics).
- Complete the contribution workflow for bpreg: the model is ready to go!
Illustrations
BPREG:
Background and References
- https://github.com/lassoan/SlicerMONAIAuto3DSeg
- https://docs.openvino.ai/2024/home.html
- https://github.com/MIC-DKFZ/BodyPartRegression
- https://mhub.ai