trend_level: TypeError resolved (issue #54) #64

Tanvi-Jain01 · 2023-07-10T11:29:49Z

@nipunbatra , @patel-zeel
This PR proposes solution for issue #54

BEFORE:

CODE:

file_path = r'C:\..\mydata.csv'

# Read the CSV file into a DataFrame
df = pd.read_csv(file_path)

#df.reset_index(inplace=True)
df['date'] = pd.to_datetime(df['date'])
print(df)

df_2003 = df[df['date'].dt.year == 2003]
print(df_2003)

from vayu.trendLevel import trendLevel
trendLevel(df_2003, 'pm25')

Error:

TypeError                                 Traceback (most recent call last)
Cell In[59], line 2
      1 from vayu.trendLevel import trendLevel
----> 2 trendLevel(df_2003, 'pm25')

File ~\anaconda3\lib\site-packages\vayu\trendLevel.py:45, in trendLevel(df, pollutant, **kwargs)
     34 t = pollutant_series_year.groupby(
     35     [pollutant_series_year.index.month, pollutant_series_year.index.hour]
     36 ).mean()
     37 two_d_array = t.values.reshape(12, 24).T
     38 sns.heatmap(
     39     two_d_array,
     40     cbar=True,
     41     linewidth=0,
     42     cmap="Spectral_r",
     43     vmin=0,
     44     vmax=400,
---> 45     ax=ax[i],
     46 )
     47 ax[i].set_title(year_string)
     48 ax[i].invert_yaxis()

TypeError: 'Axes' object is not subscriptable

IMPROVED CODE:

import matplotlib.pyplot as plt
import seaborn as sns
import pandas as pd
import datetime as dt
import matplotlib as mpl
import numpy as np
from numpy import vstack
from numpy import array

def trend_level(df:pd.DataFrame, pollutant:str, **kwargs):
    """
    Plot that shows the overall pollutant trend for every year in the 
    df. It takes the average hour value of each month and plots a heatmap
    showing what times of the year there is a high concentration of the 
    pollutant.

    Parameters
    ----------
    df: pd.DataFrame
        Data frame of complete data
    pollutant: str
        Name of the data series in df to produce plot.
    """
   

    df.index = pd.to_datetime(df.date)
    pollutant_series = df[pollutant]
    unique_years = np.unique(df.index.year)
    num_unique_years = len(unique_years)
    fig, ax = plt.subplots(nrows=num_unique_years, figsize=(20, 20))

    months = ['Jan', 'Feb', 'Mar', 'Apr', 'May', 'Jun', 'Jul',
              'Aug', 'Sep', 'Oct', 'Nov', 'Dec']


    for i, year in enumerate(unique_years):
       
        year_string = str(year)
        pollutant_series_year = pollutant_series[year_string]
        t = pollutant_series_year.groupby(
            [pollutant_series_year.index.month, pollutant_series_year.index.hour]
        ).mean()
        two_d_array = t.values.reshape(12, 24).T
        heatmap_ax = ax[i] if num_unique_years > 1 else ax
        sns.heatmap(
            two_d_array,
            cbar=True,
            linewidth=0,
            cmap="Spectral_r",
            vmin=0,
            vmax=400,
            ax=heatmap_ax
        )
        heatmap_ax.set_xticklabels(months)
        heatmap_ax.set_ylabel("Hour of the Day")
        heatmap_ax.set_title(year_string)
        heatmap_ax.invert_yaxis()
        
    plt.savefig("TrendLevelPlot.png", bbox_inches="tight",dpi=300)
    print("Your plots has also been saved")
    plt.show()  # Display the plot

USAGE:

 df = pd.read_csv("mydata.csv")
 trend_level(df, 'pm25')

OUTPUT:

Tanvi-Jain01 added 7 commits June 30, 2023 08:59

enhanced code of scatterPlot(refer issue sustainability-lab#43)

03f403a

timplot: modifying plots using plotly and

a2ea7c7

Adding visualization using plotly

d7d2e6b

modifying the code of googleMaps

fd7c45c

modifying googlemaps sustainability-lab#38

5b48d31

code optimize

ccede9c

type error resolved

fbd5293

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trend_level: TypeError resolved (issue #54) #64

trend_level: TypeError resolved (issue #54) #64

Tanvi-Jain01 commented Jul 10, 2023

trend_level: TypeError resolved (issue #54) #64

Are you sure you want to change the base?

trend_level: TypeError resolved (issue #54) #64

Conversation

Tanvi-Jain01 commented Jul 10, 2023

BEFORE:

Error: