Daniel Novak

1.2CVJun 19

Native space based pipelines outperform template space based pipeline in subcortical segmentation

Tomás Lima, Daniel Novák, Eduard Bakštein

Accurate segmentation of subcortical regions is critical for neurosurgical planning and functional research. Most automated methods rely on template space coregistration, which may compromise patient-specific accuracy, particularly in small structures. We identify a need to evaluate whether native space approaches offer a measurable advantage, which we evaluate in the context of movement disorders. We developed two UNet-based segmentation pipelines of the Subthalamic Nucleus (STN) - a common surgical target in Parkinson's Disease - and the neighbouring Red Nucleus (RN) and Substantia Nigra (SN). We collected 7T and 3T MRI data from five public datasets. The pipelines were evaluated in the native-space against manual labels. We further investigated the effect of the template resolution. Motivated by the hypothesis that models may better learn target boundaries in higher field, we tested the transferability of 7T-trained models to 3T clinical images, and whether synthetic 3T training data - generated via a disentangled representation learning method - could help bridging this domain gap. On held-out 7T data, the native pipeline consistently outperformed the template one. For the STN, native-space Dice reached 0.775 +- 0.055 versus 0.713 +- 0.051 (1 mm template), with HD95 of 0.79 +- 0.24 mm versus 1.17 +- 1.10 mm, respectively. Similar advantages were observed for the RN and SN. Increasing template resolution did not improve accuracy. When applied to 3T images, all models showed a considerable performance drop. Adding synthetic 3T data yielded only modest improvements, though without degrading 7T performance. Native-space segmentation is preferable for applications requiring patient specific anatomical fidelity, such as the surgical planning in PD. Bridging the 7T-to-3T domain gap remains an open challenge, motivating future work on domain adaptation tailored to subcortical structures.

1.0CLFeb 15, 2024

Quantized Embedding Vectors for Controllable Diffusion Language Models

Cheng Kang, Xinye Chen, Yong Hu et al.

Improving the controllability, portability, and inference speed of diffusion language models (DLMs) is a key challenge in natural language generation. While recent research has shown significant success in complex text generation with language models, the memory and computational power are still very demanding and fall short of expectations, which naturally results in low portability and instability for the models. To mitigate these issues, numerous well-established methods were proposed for neural network quantization. To further enhance their portability of independent deployment as well as improve their stability evaluated by language perplexity, we propose a novel approach called the Quantized Embedding Controllable Diffusion Language Model (QE-CDLM). QE-CDLM builds upon the recent successful controllable DLMs by remodeling the task-specific embedding space via quantization. This leads to a gradient-based controller for the generation tasks, and more stable intermediate latent variables are obtained, which naturally brings in an accelerated convergence as well as better controllability. Additionally, the adaption fine-tuning method is employed to reduce tunable weights. Experimental results on five challenging fine-grained control tasks demonstrate that QE-CDLM compares favorably to existing methods in terms of quality and feasibility, achieving better perplexity and lightweight fine-tuning.

Daniel Novak

2 Papers