DS_task-2
pandas, matplotlib.pyplot
Daily Summary
0 Partly cloudy throughout the day.
1 Partly cloudy throughout the day.
2 Partly cloudy throughout the day.
3 Partly cloudy throughout the day.
4 Partly cloudy throughout the day.
print(df.info())
print(df.describe())
       Temperature (C)  Apparent Temperature (C)      Humidity
count     96453.000000              96453.000000  96453.000000
mean         11.932678                 10.855029      0.734899
std           9.551546                 10.696847      0.195473
min         -21.822222                -27.716667      0.000000
25%           4.688889               [illegible]      0.600000
50%          12.000000                 12.000000      0.780000
75%          18.838889                 18.838889      0.890000
max          39.905556                 39.344444      1.000000

       Wind Speed (km/h)  Wind Bearing (degrees)  Visibility (km)  Loud Cover
count       96453.000000            96453.000000     96453.000000     96453.0
mean           10.810640              187.509232        10.347325         0.0
std             6.913571              107.383428         4.192123         0.0
min             0.000000                0.000000         0.000000         0.0
25%             5.828200              116.000000         8.339800         0.0
50%             9.965900              180.000000      [illegible]         0.0
75%            14.135800              290.000000        14.812000         0.0
max            63.852600              359.000000        16.100000         0.0

       Pressure (millibars)
count          96453.000000
mean            1003.235956
std              116.969906
min                0.000000
25%             1011.900000
50%             1016.450000
75%             1021.090000
max             1046.380000
Index(['Formatted Date', 'Summary', 'Precip Type', 'Temperature (C)',
       'Apparent Temperature (C)', 'Humidity', 'Wind Speed (km/h)',
       'Wind Bearing (degrees)', 'Visibility (km)', 'Loud Cover',
       'Pressure (millibars)', 'Daily Summary'],
      dtype='object')
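The describe() output above reports zero for every statistic of Loud Cover, so that column carries no information. A minimal sketch of how such constant columns could be detected and dropped follows. The frame `toy` and its values are hypothetical stand-ins for the real dataset, and the names `constant_cols` and `cleaned` are illustrative only.

```python
import pandas as pd

# Hypothetical stand-in for the weather data; values are illustrative only.
toy = pd.DataFrame({
    "Temperature (C)": [9.5, 9.4, 8.3, 5.9],
    "Humidity": [0.89, 0.86, 0.83, 0.85],
    "Loud Cover": [0.0, 0.0, 0.0, 0.0],
})

# A column with at most one distinct value cannot explain any variation.
constant_cols = [col for col in toy.columns
                 if toy[col].nunique(dropna=False) <= 1]

# Drop the constant columns and keep everything else unchanged.
cleaned = toy.drop(columns=constant_cols)

print(constant_cols)
print(cleaned.columns.tolist())
```

On the real data the same check would be run on df instead of toy. Whether to drop the column is a judgment call, since it may be worth keeping for documentation purposes.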
[Figure: plot with x-axis "Temperature (C)"]
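The cell that produced the Temperature (C) plot did not survive extraction. Assuming it was a histogram of that column, a sketch along these lines would produce a comparable figure. The series `temps`, the bin count, and the figure size are assumptions made for illustration, not values from the original notebook.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical temperatures; the notebook would use df['Temperature (C)'].
temps = pd.Series([-5.0, 0.5, 4.7, 11.9, 12.0, 12.3, 18.8, 25.1, 39.9],
                  name="Temperature (C)")

fig, ax = plt.subplots(figsize=(8, 5))

# hist() returns the per-bin counts and the bin edges along with the patches.
counts, bin_edges, _ = ax.hist(temps, bins=5)

ax.set_xlabel("Temperature (C)")
ax.set_ylabel("Frequency")
ax.set_title("Distribution of Temperature (C)")
plt.close(fig)  # in a notebook, plt.show() would be called instead
```

With five bins there are six bin edges, and the counts sum to the number of observations.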
plt.figure(figsize=(…, 6))
# Calculate the correlation matrix, excluding non-numeric columns
sns.heatmap(df.select_dtypes(include=['number']).corr(), annot=True, cmap='coolwarm')
plt.title('Correlation Matrix')
plt.show()
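To make the select_dtypes step concrete, the sketch below computes the same kind of correlation matrix on a small hypothetical frame and prints it as numbers instead of a heatmap. The frame `toy` and its values are invented for illustration. Only the pandas calls select_dtypes and corr come from the notebook.

```python
import pandas as pd

# Hypothetical data: one text column and three numeric columns.
toy = pd.DataFrame({
    "Summary": ["Clear", "Foggy", "Clear", "Overcast"],
    "Temperature (C)": [10.0, 12.0, 14.0, 16.0],
    "Apparent Temperature (C)": [9.0, 11.0, 13.0, 15.0],
    "Humidity": [0.9, 0.8, 0.7, 0.6],
})

# Keep only numeric columns, so the text column 'Summary' is excluded.
numeric = toy.select_dtypes(include=["number"])

# Pairwise Pearson correlation (the default method of DataFrame.corr).
corr = numeric.corr()
print(corr.round(2))
```

In this toy frame the two temperature columns move together exactly, so their correlation is 1. Humidity falls as temperature rises, giving -1.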
# Identifying patterns and relationships.
print(df.groupby('Summary')['Temperature (C)'].mean())
Summary
Breezy 7.92201
Breezy and Dry 6
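The grouped mean is easier to read when sorted. A minimal sketch on hypothetical data follows. The frame `toy` and its values are invented, and the sort_values step is an addition that does not appear in the original cell.

```python
import pandas as pd

# Hypothetical observations; the notebook groups the full dataset df.
toy = pd.DataFrame({
    "Summary": ["Breezy", "Breezy", "Clear", "Clear", "Foggy"],
    "Temperature (C)": [8.0, 6.0, 12.0, 14.0, 3.0],
})

# Mean temperature per weather summary, warmest category first.
mean_temp = (
    toy.groupby("Summary")["Temperature (C)"]
    .mean()
    .sort_values(ascending=False)
)
print(mean_temp)
```

Sorting puts the warmest and coldest categories at the two ends of the output, which is often the pattern this kind of grouping is meant to expose.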
x: 'Summary'
plt.title('Distribution of Categorical Feature')
plt.show()
[Figure: Distribution of Categorical Feature; x-axis: Summary]
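The bar heights in a plot of a categorical column such as Summary are per-category counts, which pandas can compute directly with value_counts. The sketch below uses a hypothetical series, and restricting the result to the most frequent categories with head is an assumption about what keeps the axis readable, not something taken from the original cell.

```python
import pandas as pd

# Hypothetical category labels; the notebook would use df['Summary'].
summary = pd.Series(
    ["Partly Cloudy", "Clear", "Partly Cloudy", "Foggy", "Partly Cloudy", "Clear"],
    name="Summary",
)

# Number of rows per category, most frequent first.
counts = summary.value_counts()

# Keep only the two most frequent categories.
top = counts.head(2)
print(top)
```

Passing a series like top to a bar plot would give a less crowded x-axis than plotting every category.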
Name: T.LAVANYA
Roll no: 21MQ1A0531
College: Sri Vasavi Institute of Engineering and Technology
Email: lavanyaterli.2003@gmail.com