Variables:
- ID: A unique identifier for each individual record.
- Smoking Habit: Classified as Heavy, Moderate, Occasional, or None based on smoking frequency.
- Alcohol Consumption: Categorized as High, Moderate, Low, or None based on alcohol intake.
- Physical Activity (Biking): Measured as High, Medium, or Low based on biking frequency.
- Physical Activity (Walking): Measured as High, Medium, or Low based on walking frequency.
- Physical Activity (Jogging): Measured as High, Medium, or Low based on jogging frequency.
- Dietary Habits: Frequency of consuming a balanced diet (e.g., daily, occasionally, rarely).
- Sleep Quality: Hours of sleep per night, categorized as Optimal (7-8 hrs), Suboptimal (5-6 hrs), or Poor (<5 hrs).
- Workplace Stress Level: Stress at work categorized as High, Medium, or Low.
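Most of the variables above are ordered categories, so a model usually needs them as numbers. The following is a minimal sketch of one way to do that, assuming the column names match the variable names listed above and that the levels are ordered from lowest to highest. The integer codes, the `ORDINAL_LEVELS` mapping, and the `encode_record` helper are illustrative assumptions, not part of the dataset. Dietary Habits is left unencoded because its levels are only given as examples.

```python
# Assumed ordering of levels, lowest to highest, for each ordinal variable.
# The integer codes are an illustrative choice, not defined by the dataset.
ORDINAL_LEVELS = {
    "Smoking Habit": ["None", "Occasional", "Moderate", "Heavy"],
    "Alcohol Consumption": ["None", "Low", "Moderate", "High"],
    "Physical Activity (Biking)": ["Low", "Medium", "High"],
    "Physical Activity (Walking)": ["Low", "Medium", "High"],
    "Physical Activity (Jogging)": ["Low", "Medium", "High"],
    "Sleep Quality": ["Poor", "Suboptimal", "Optimal"],
    "Workplace Stress Level": ["Low", "Medium", "High"],
}


def encode_record(record):
    """Return a copy of `record` with ordinal levels replaced by integer codes.

    Columns without an entry in ORDINAL_LEVELS (for example ID) pass through
    unchanged. An unknown level raises ValueError.
    """
    encoded = {}
    for column, value in record.items():
        levels = ORDINAL_LEVELS.get(column)
        if levels is None:
            encoded[column] = value
        else:
            encoded[column] = levels.index(value)
    return encoded


# A made-up record used only to demonstrate the helper.
example = {
    "ID": 1,
    "Smoking Habit": "Heavy",
    "Alcohol Consumption": "Low",
    "Physical Activity (Walking)": "Medium",
    "Sleep Quality": "Optimal",
    "Workplace Stress Level": "High",
}
encoded_example = encode_record(example)
```

Integer codes preserve the order of the levels but also imply equal spacing between them; one-hot encoding may be a better fit if that assumption is not acceptable.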
Assumptions:
This dataset assumes that lifestyle choices, genetic factors, and environmental factors are related to cancer risk. It simplifies cancer risk probabilities, while recognizing that genetics, environment, and lifestyle interact in complex ways.
Potential Applications:
- Predictive Modeling: Build models to predict cancer likelihood.
- Exploratory Research: Explore lifestyle habits’ role in cancer risk.
- Public Health Research: Inform cancer prevention campaigns.