CVAIApr 1, 2024

Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Anthropic Prior Knowledge

arXiv:2404.01013v19 citationsh-index: 6Has CodeCVPR
Originality Synthesis-oriented
AI Analysis

This work addresses a domain-specific problem in dentistry for enhancing diagnostics and treatment planning, but it is incremental as it adapts existing methods to a new dataset.

The paper tackles the problem of teeth instance segmentation in 2D dental images, which is challenging due to subtle shape differences, variations across subjects, and abnormalities, and proposes Teeth-SEG, a ViT-based framework with multi-scale aggregation and anthropic prior knowledge that outperforms state-of-the-art models on the new IO150K dataset.

Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, general instance segmentation frameworks are incompetent due to 1) the subtle differences between some teeth' shapes (e.g., maxillary first premolar and second premolar), 2) the teeth's position and shape variation across subjects, and 3) the presence of abnormalities in the dentition (e.g., caries and edentulism). To address these problems, we propose a ViT-based framework named TeethSEG, which consists of stacked Multi-Scale Aggregation (MSA) blocks and an Anthropic Prior Knowledge (APK) layer. Specifically, to compose the two modules, we design 1) a unique permutation-based upscaler to ensure high efficiency while establishing clear segmentation boundaries with 2) multi-head self/cross-gating layers to emphasize particular semantics meanwhile maintaining the divergence between token embeddings. Besides, we collect 3) the first open-sourced intraoral image dataset IO150K, which comprises over 150k intraoral photos, and all photos are annotated by orthodontists using a human-machine hybrid algorithm. Experiments on IO150K demonstrate that our TeethSEG outperforms the state-of-the-art segmentation models on dental image segmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes