library(mlbench) data(PimaIndiansDiabetes) y <- PimaIndiansDiabetes$diabetes cbind(freq=table(y), percentage=prop.table(table(y))*100)
freq percentage neg 50065.10417 pos 26834.89583
每个属性的值的分布情况
summary(dataset)
V1 V2 V3 V4 Min. :4.300 Min. :2.000 Min. :1.000 Min. :0.100 1st Qu.:5.1001st Qu.:2.8001st Qu.:1.6001st Qu.:0.300 Median :5.800 Median :3.000 Median :4.350 Median :1.300 Mean :5.843 Mean :3.054 Mean :3.759 Mean :1.199 3rd Qu.:6.4003rd Qu.:3.3003rd Qu.:5.1003rd Qu.:1.800 Max. :7.900 Max. :4.400 Max. :6.900 Max. :2.500 V5 Length:150 Class :character Mode :character