sepal_length sepal_width petal_length petal_width species 0 5.1 3.5 1.4 0.2 setosa 1 4.9 3.0 1.4 0.2 setosa 2 4.7 3.2 1.3 0.2 setosa 3 4.6 3.1 1.5 0.2 setosa 4 5.0 3.6 1.4 0.2 setosa
We can also use the DataFrame.info() function to get information about the DataFrame.
import pandas
df = pandas.read_csv("iris.csv")
print(df.info())
The output of the above program will be like the following:
RangeIndex: 150 entries, 0 to 149 Data columns (total 5 columns): # Column Non-Null Count Dtype --- ------ -------------- ----- 0 sepal_length 150 non-null float64 1 sepal_width 150 non-null float64 2 petal_length 150 non-null float64 3 petal_width 150 non-null float64 4 species 150 non-null object dtypes: float64(4), object(1) memory usage: 6.0+ KB None
And, we can use the DataFrame.describe() function to get a description of the DataFrame.
import pandas
df = pandas.read_csv("iris.csv")
print(df.describe())
The output will be like the following:
sepal_length sepal_width petal_length petal_width
count 150.000000 150.000000 150.000000 150.000000
mean 5.843333 3.057333 3.758000 1.199333
std 0.828066 0.435866 1.765298 0.762238
min 4.300000 2.000000 1.000000 0.100000
25% 5.100000 2.800000 1.600000 0.300000
50% 5.800000 3.000000 4.350000 1.300000
75% 6.400000 3.300000 5.100000 1.800000
max 7.900000 4.400000 6.900000 2.500000
Here, count refers to the number of non-null observations.
Max and min refer to the maximum and minimum values in the object.
Mean and std. Refer to the mean value and the standard deviation of the observations.
25%, 50%, and 75% refer to the 25, 50, and 75 percentile, respectively.








































0 Comments