agg function in pandas | find aggregate sum in pandas over rows & columns
In this pandas tutorial, we will discuss about:
-
agg function in pandas,
-
aggregate sum in pandas over rows,
-
aggregate sum in pandas for specific column,
-
aggregate sum in pandas for all column
Before learning about aggregate sum function in pandas, using agg sum in pandas lets first create a dataframe.
Pandas DataFrame is an two dimensional data structure that will store data in two dimensional format. One dimension refers to a row and second dimension refers to a column, So It will store the data in rows and columns.
We can able to create this DataFrame using DataFrame() method. But this is available in pandas module, so we have to import pandas module.
Syntax:
pandas.DataFrame(data)
Where, data is the input dataframe , The data can be a dictionary that stores list of values with specified key
Create DataFrame Example
In this pandas example, we will create a dataframe with 4 rows and 3 columns with building data and assign indices through index parameter.
import pandas as pd
#create dataframe from the building data
data= pd.DataFrame({
'length':[5.6,7.8,4.5,5.3],
"breadth":[12.9,4.5,21.5,6.0],
"area":[20,56,43,45]
},index=['one','two','three','four'])
#display the dataframe
print(data)
Output:
length breadth area
one 5.6 12.9 20
two 7.8 4.5 56
three 4.5 21.5 43
four 5.3 6.0 45
Thus we have created our dataframe. Now lets learn about aggregate sum function in pandas.
aggregate sum function in pandas
If we want to get the total sum of the columns in the dataframe, then we have to use aggregate sum function in pandas.
In that, sum is one of the function, which will return the total value (row/column) of the dataframe.
For aggregation, the method used is agg in pandas and for sum, the method used is sum.
There are three scenarios of finding aggregate sum function in pandas using agg function in pandas. Let's discuss one by one.
Scenario 1 : aggregate sum in pandas over the rows
In this aggregate sum in pandas scenario, we will get the sum over each row in the dataframe.
Syntax:
dataframe.agg('sum')
where, dataframe is the input dataframe.
Example: aggregate sum in pandas over the rows
Lets see program on agg function in pandas to get aggregate sum over rows
import pandas as pd
#create dataframe from the college data
data= pd.DataFrame({
'length':[5.6,7.8,4.5,5.3],
"breadth":[12.9,4.5,21.5,6.0],
"area":[20,56,43,45]
},index=['one','two','three','four'])
#sum aggregation over the rows
data.agg('sum')
Output: From the above aggregate sum in pandas example, we will perform sum aggregation over the rows
length 23.2
breadth 44.9
area 164.0
dtype: float64
Lets see scenario two on aggregate sum in pandas using agg in pandas.
Scenario 2 : sum aggregation per column
In this aggregate sum in pandas scenario, we will get the sum for the mentioned columns in the dataframe.
agg in pandas Syntax:
dataframe.agg({'column' : 'sum', ..............})
where, dataframe is the input dataframe and column is the column name to get sum.
Example: aggregate sum in pandas per column
In this aggregate sum in pandas example, we will get sum for length and area columns.
import pandas as pd
#create dataframe from the college data
data= pd.DataFrame({
'length':[5.6,7.8,4.5,5.3],
"breadth":[12.9,4.5,21.5,6.0],
"area":[20,56,43,45]
},index=['one','two','three','four'])
#sum aggregation per column
data.agg({'length' : 'sum', 'area' : 'sum'})
Output: In the above aggregate sum in pandas example, we will perform sum aggregation for length and area columns.
length 23.2
area 164.0
dtype: float64
Lets see third scenario on aggregate sum function in pandas using agg in pandas.
Scenario 3 : sum aggregation over the columns
In this aggregate sum in pandas scenario, we will get the sum over each column in the dataframe.
Syntax:
dataframe.agg("sum", axis="columns")
where, dataframe is the input dataframe.
Example: aggregate sum in pandas over each columns
import pandas as pd
#create dataframe from the college data
data= pd.DataFrame({
'length':[5.6,7.8,4.5,5.3],
"breadth":[12.9,4.5,21.5,6.0],
"area":[20,56,43,45]
},index=['one','two','three','four'])
# aggregation over the columns
data.agg("sum", axis="columns")
Output: From the above aggregate sum in pandas example, we will perform sum aggregation over the columns
one 38.5
two 68.3
three 69.0
four 56.3
dtype: float64
This is how we can use aggregate sum function in pandas using agg in pandas.
Would you like to see your article here on tutorialsinhand.
Join
Write4Us program by tutorialsinhand.com
About the Author
Gottumukkala Sravan Kumar 171FA07058
B.Tech (Hon's) - IT from Vignan's University.
Published 1400+ Technical Articles on Python, R, Swift, Java, C#, LISP, PHP - MySQL and Machine Learning
Page Views :
Published Date :
Apr 22,2022