Categories: Practice Datasets
Tags:

The Seaborn diamonds dataset is a popular sample dataset used for data visualization and statistical modeling. It contains detailed information about 53,940 diamonds, with 10 attributes:

Columns Explained:

  1. carat: Weight of the diamond (numeric, in carats).
  2. cut: Quality of the cut (categorical):
    • Options include: Fair, Good, Very Good, Premium, Ideal
  3. color: Diamond color grade (categorical):
    • Ranges from D (best/least color) to J (worst/more color).
  4. clarity: Measure of diamond clarity (categorical):
    • Levels: IF (Internally Flawless), VVS1, VVS2, VS1, VS2, SI1, SI2, I1
    • Ordered from best to worst clarity.
  5. depth: Total depth percentage = (z / mean(x, y)) * 100 (numeric).
  6. table: Width of the top of the diamond relative to the widest point (numeric).
  7. price: Price in US dollars ($326 to $18,823).
  8. x: Length in mm (numeric).
  9. y: Width in mm (numeric).
  10. z: Depth in mm (numeric).

Dataset Link : https://github.com/mwaskom/seaborn-data/blob/master/diamonds.csv

Sample Rows:

caratcutcolorclaritydepthtablepricexyz
0.23IdealESI261.555.03263.953.982.43
0.21PremiumESI159.861.03263.893.842.31