RSNA Pneumonia Processed Dataset

RSNA Pneumonia Processed Dataset

Datasets

RSNA Pneumonia Processed Dataset

File

RSNA Pneumonia Processed Dataset

Use Case

RSNA Pneumonia Processed Dataset

Description

Discover the RSNA Pneumonia Detection Dataset, featuring pre-processed chest X-ray images, mask annotations, and detailed metadata for AI and machine learning in medical imaging

RSNA Pneumonia Processed Dataset

Description:

The RSNA Pneumonia Detection Dataset is a meticulously pre-processed version of the original RSNA Pneumonia Detection Challenge dataset, designed to simplify and accelerate machine learning workflows in healthcare. This dataset provides high-quality, pre-processed chest X-ray images (PNG format) with detailed annotations, making it ideal for developing AI models focused on pneumonia detection, classification, and medical imaging research.

Download Dataset

The RSNA Pneumonia Detection Dataset has been carefully pre-processed to streamline machine learning workflows, enabling faster and more efficient model training. Originally provided in DICOM format, the images have been converted into PNG format for compatibility and ease of use. The dataset preserves essential bounding box annotations by converting them into binary mask images, ensuring seamless integration into various training pipelines. Additionally, metadata, including patient details and bounding box coordinates, has been meticulously organized into CSV files, simplifying dataset exploration and feature engineering.

Folder Structure for Easy Navigation

The dataset is organized into a structured folder system, designed for straightforward usage:

1. Train

  • Images: Pre-processed PNG images for training.
  • Masks: Binary mask images corresponding to the bounding boxes in the training set.

2. Test

  • Contains pre-processed PNG images designated for testing.

3. Metadata Files

  • Train_metadata.csv: Includes patient IDs, bounding box coordinates, and pneumonia presence labels for the training set.
  • Test_metadata.csv: Contains patient IDs and relevant details for the test set.

Key Data Fields

The dataset provides detailed annotations and metadata to support robust model development:

  • patientId: Unique identifier for each patient or image.
  • x, y, width, height: Bounding box dimensions specifying regions of interest.
  • Target: Binary label indicating pneumonia presence (1 = Pneumonia, 0 = No Pneumonia).
  • class: Categorization of conditions:
    • Lung Opacity
    • Normal
    • No Lung Opacity / Not Normal
  • age: Patient’s age at the time of imaging.
  • sex: Patient’s gender (e.g., Male, Female).
  • modality: Imaging method (e.g., X-ray, CT Scan).
  • position: Patient’s position during imaging (e.g., AP for Anterior-Posterior, PA for Posterior-Anterior).

Dataset Distribution

The dataset includes a diverse range of cases, distributed as follows:

  • No Lung Opacity / Not Normal: 11,821 images
  • Normal: 8,851 images
  • Lung Opacity: 9,555 images

Contact Us

Please enable JavaScript in your browser to complete this form.
Technology

Quality Data Creation

Technology

Guaranteed TAT

Technology

ISO 9001:2015, ISO/IEC 27001:2013 Certified

Technology

HIPAA Compliance

Technology

GDPR Compliance

Technology

Compliance and Security

Let's Discuss your Data collection Requirement With Us

To get a detailed estimation of requirements please reach us.

Scroll to Top

Please provide your details to download the Dataset.