Cumulative Sum calculation in R

Posted on June 8 By Jim

With the dplyr package, you can calculate the cumulative sum of a column in R using either of the following two approaches.

Approach 1: Calculate Cumulative Sum of One Column

df %>% mutate(cum_sum = cumsum(var1))

Approach 2: Calculate Cumulative Sum by Group

df %>% group_by(var1) %>% mutate(cum_sum = cumsum(var2))
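
Both approaches rely on cumsum(), a base R function that returns a vector of running totals with the same length as its input. A minimal sketch with a made-up vector x (not part of the original examples):

```r
# x is a hypothetical vector used only for illustration
x <- c(1, 2, 3, 4)

# Each element is the sum of all values up to and including that position
running_total <- cumsum(x)
print(running_total)
# [1]  1  3  6 10
```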

The examples below demonstrate how to apply each strategy in practice.

Example 1: Using dplyr, calculate the cumulative sum.

Let’s create the following R data frame:

df <- data.frame(day=c(1, 2, 3, 4, 5, 6, 7, 8),
                 sales=c(57, 42, 50, 99, 59, 51, 58, 45))

Now we can view the dataset

df
  day sales
1   1    57
2   2    42
3   3    50
4   4    99
5   5    59
6   6    51
7   7    58
8   8    45

To create a new column that holds the cumulative sum of the values in the ‘sales’ column, use the following code.

library(dplyr)

Let’s calculate the cumulative sum of sales

df %>% mutate(cum_sales = cumsum(sales))
  day sales cum_sales
1   1    57        57
2   2    42        99
3   3    50       149
4   4    99       248
5   5    59       307
6   6    51       358
7   7    58       416
8   8    45       461
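
Because cumsum() adds values in row order, the running total is only meaningful when the rows are sorted the way you intend. A minimal sketch, assuming a shuffled copy of the same data (the name df_shuffled is hypothetical), that sorts by day with arrange() before computing the cumulative sum:

```r
library(dplyr)

# Same sales data as Example 1, but with the rows deliberately out of order
df_shuffled <- data.frame(day=c(3, 1, 4, 2, 8, 6, 5, 7),
                          sales=c(50, 57, 99, 42, 45, 51, 59, 58))

# Sort by day first so the running total follows the day sequence
result <- df_shuffled %>%
  arrange(day) %>%
  mutate(cum_sales = cumsum(sales))

result
```

After sorting, the cum_sales column should match the values in the output above.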

Example 2: Using dplyr, calculate the Cumulative Sum by Group.

Let’s create the following R data frame, which holds sales for two stores:

df <- data.frame(store=c('X', 'X', 'X', 'X', 'Y', 'Y', 'Y', 'Y'),
                 day=c(1, 2, 3, 4, 1, 2, 3, 4),
                 sales=c(87, 82, 80, 98, 98, 81, 88, 83))

View the dataset now

df
  store day sales
1     X   1    87
2     X   2    82
3     X   3    80
4     X   4    98
5     Y   1    98
6     Y   2    81
7     Y   3    88
8     Y   4    83

To construct a new column that holds the cumulative sum of the values in the ‘sales’ column, grouped by the ‘store’ column, we can use the following code:

library(dplyr)

Now we can calculate the cumulative sum of sales by store.

df %>% group_by(store) %>% mutate(cum_sales = cumsum(sales))
  store   day sales cum_sales
  <chr> <dbl> <dbl>     <dbl>
1 X         1    87        87
2 X         2    82       169
3 X         3    80       249
4 X         4    98       347
5 Y         1    98        98
6 Y         2    81       179
7 Y         3    88       267
8 Y         4    83       350
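
The result of a grouped mutate() is still grouped by store, which can affect later steps. A short sketch, assuming you want a plain (ungrouped) result, that calls ungroup() afterwards and then checks that the last running total in each store equals that store's total sales (the names result and totals are illustrative):

```r
library(dplyr)

df <- data.frame(store=c('X', 'X', 'X', 'X', 'Y', 'Y', 'Y', 'Y'),
                 day=c(1, 2, 3, 4, 1, 2, 3, 4),
                 sales=c(87, 82, 80, 98, 98, 81, 88, 83))

# Compute the running total within each store, then drop the grouping
result <- df %>%
  group_by(store) %>%
  mutate(cum_sales = cumsum(sales)) %>%
  ungroup()

# The last cumulative value per store should equal the store's total sales
totals <- result %>%
  group_by(store) %>%
  summarise(total_sales = sum(sales), last_cum = last(cum_sales))

totals
```

If the two columns in totals agree for every store, the grouped cumulative sum restarted correctly at each store boundary.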