Organ-aware 3D lesion segmentation dataset and pipeline for abdominal CT analysis (ACM Multimedia 2025 candidate)
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Mehran Advand 0488edfb51 Update 'README.md' 4 days ago
Phase I the last commit 1 week ago
Phase II the last commit 1 week ago
Phase III the last commit 1 week ago
assets Upload files to 'assets' 1 week ago
3DLAND_Info.csv the last commit 1 week ago
LICENSE the total commit 1 week ago
README.md Update 'README.md' 4 days ago
deeplesion_Info.csv the last commit 1 week ago

README.md

3DLAND: 3D Lesion Abdominal Anomaly Localization Dataset

This repository accompanies the 3DLAND project β€” a large-scale, organ-aware 3D lesion segmentation benchmark for abdominal CT scans β€” submitted for review to ACM Multimedia 2025.

🌐 Overview

  • 6,000+ contrast-enhanced CT studies
  • 3D lesion masks aligned with 7 abdominal organs
  • Prompt-based annotation and propagation pipeline
  • Applications: anomaly detection, lesion retrieval, organ-aware analysis

🧠 Pipeline

The lesion segmentation pipeline includes:

  1. Organ segmentation via MONAI and lesion-to-organ assignment
  2. 2D mask generation using SAM prompts
  3. 3D mask propagation using MedSAM2

πŸ“¦ Dataset

We curated over 6,000 contrast-enhanced abdominal CT scans from the publicly available DeepLesion dataset, selecting only those studies that include visible lesions or anomalies in abdominal organs.

To transform these raw scans into a structured, organ-aware 3D segmentation benchmark, we developed a multi-stage pipeline with both automated and expert-in-the-loop components:

  1. Organ Segmentation: We used MONAI models trained on TotalSegmentator to segment seven abdominal organs β€” liver, kidneys, pancreas, spleen, stomach, and gallbladder. 1.2. Lesion-to-Organ Assignment: Lesions were matched to the most probable organ based on IoU overlap and 3D proximity, with ambiguous cases reviewed by clinicians.
  2. 2D Lesion Mask Generation: Using MedSAM1, we generated lesion masks from DeepLesion’s bounding boxes. We found that shrinking the box to 70% of its original size, along with a center point prompt, significantly improved segmentation precision.
  3. 3D Mask Propagation: The resulting 2D masks were propagated across slices using MedSAM2, producing dense 3D segmentations with anatomical continuity.

Each lesion in the dataset is:

  • Annotated in 2D on the slice where the lesion is most clearly visible within the CT series
  • Localized in 3D across all slices where the lesion is present and discernible
  • Assigned to a specific abdominal organ
  • Each 3D segmentation mask is saved as a stack of 2D PNG slices, preserving spatial consistency across the volume

The dataset includes:

  • Phase II/2D_lesion_mask:2D lesion masks linked to organs
  • Pahse III/3D_lesion_mask: 3D lesion masks linked to organs
  • deeplesion_Info.csv: CSV file of Our dataset metadata according to DeepLesion metadata

All annotations underwent clinical review on 10–20% of lesions per organ to ensure high-quality ground truth.

πŸ“‘ Metadata CSV Format

Each lesion in the 3DLAND dataset is described in a structured CSV file that includes key metadata for localization and segmentation. This CSV file links the image slices with their corresponding organ labels and bounding boxes.

πŸ“„ Sample Columns

Column Description
series Series ID from the DeepLesion dataset
slice_range Range of axial slices in which the lesion appears (e.g., 44~83)
key_slice Central slice with the most visible view of the lesion
lesion_id Unique ID assigned to each lesion
matched_organs Organ to which the lesion is anatomically linked
File_name PNG file name of the key slice (e.g., 000002_02_01_050.png)
Bounding_boxes Coordinates of the lesion in the key slice: [x_min, y_min, x_max, y_max]

πŸ“„ License

The dataset and outputs are licensed under CC BY 4.0.
See the full license in the LICENSE file or at creativecommons.org/licenses/by/4.0