Introduction to Prefix Sums

Resources 1D Prefix Sums Solution - Static Range Sum Problems Quiz

Focus Problem – try your best to solve this problem before continuing!

Resources


IUSACO	11 - Prefix Sums	module is based off this
CPH	9.1 - Sum Queries	rather brief
PAPS	11.2.1 - Prefix Precomputation	also rather brief

1D Prefix Sums

Let's say we have a one-indexed integer array $\texttt{arr}$ of size $N$ and we want to compute the value of

\texttt{arr}[a]+\texttt{arr}[a+1]+\cdots+\texttt{arr}[b]

for $Q$ different pairs $(a,b)$ satisfying $1\le a\le b\le N$ . We'll use the following example with $N = 6$ :

Index $i$	1	2	3	4	5	6
$\texttt{arr}[i]$	1	6	4	2	5	3

Naively, for every query, we can iterate through all entries from index $a$ to index $b$ to add them up. Since we have $Q$ queries and each query requires a maximum of $\mathcal{O}(N)$ operations to calculate the sum, our total time complexity is $\mathcal{O}(NQ)$ . For most problems of this nature, the constraints will be $N, Q \leq 10^5$ , so $NQ$ is on the order of $10^{10}$ . This is not acceptable; it will almost certainly exceed the time limit.

Instead, we can use prefix sums to process these array sum queries. We designate a prefix sum array $\texttt{prefix}$ . First, because we're 1-indexing the array, set $\texttt{prefix}[0]=0$ , then for indices $k$ such that $1 \leq k \leq N$ , define the prefix sum array as follows:

\texttt{prefix}[k]=\sum_{i=1}^{k} \texttt{arr}[i]

Basically, what this means is that the element at index $k$ of the prefix sum array stores the sum of all the elements in the original array from index $1$ up to $k$ . This can be calculated easily in $\mathcal{O}(N)$ by the following formula for each $1\le k\le N$ :

\texttt{prefix}[k]=\texttt{prefix}[k-1]+\texttt{arr}[k]

For the example case, our prefix sum array looks like this:

Index $i$	0	1	2	3	4	5	6
$\texttt{prefix}[i]$	0	1	7	11	13	18	21

Now, when we want to query for the sum of the elements of $\texttt{arr}$ between (1-indexed) indices $a$ and $b$ inclusive, we can use the following formula:

\sum_{i=L}^{R} \texttt{arr}[i] = \sum_{i=1}^{R} \texttt{arr}[i] - \sum_{i=1}^{L-1} \texttt{arr}[i]

Using our definition of the elements in the prefix sum array, we have

\sum_{i=L}^{R} \texttt{arr}[i]= \texttt{prefix}[R]-\texttt{prefix}[L-1]

Since we are only querying two elements in the prefix sum array, we can calculate subarray sums in $\mathcal{O}(1)$ per query, which is much better than the $\mathcal{O}(N)$ per query that we had before. Now, after an $\mathcal{O}(N)$ preprocessing to calculate the prefix sum array, each of the $Q$ queries takes $\mathcal{O}(1)$ time. Thus, our total time complexity is $\mathcal{O}(N+Q)$ , which should now pass the time limit.

Let's do an example query and find the subarray sum between indices $a = 2$ and $b = 5$ , inclusive, in the 1-indexed $\texttt{arr}$ . From looking at the original array, we see that this is

\sum_{i=2}^{5} \texttt{arr}[i] = 6 + 4 + 2 + 5 = 17.

Index $i$	1	2	3	4	5	6
$\texttt{arr}[i]$	1	6	4	2	5	3

Using prefix sums:

\texttt{prefix}[5] - \texttt{prefix}[1] = 18 - 1 = 17.

Index $i$	0	1	2	3	4	5	6
$\texttt{prefix}[i]$	0	1	7	11	13	18	21

These are also known as partial sums.

Solution - Static Range Sum

C++

In C++ we can use std::partial_sum, although it doesn't shorten the code by much.

#include <bits/stdc++.h>
using namespace std;

vector<long long> psum(const vector<int> &arr) {
	vector<long long> psums(arr.size() + 1);
	for (int i = 0; i < arr.size(); i++) { psums[i + 1] = psums[i] + arr[i]; }
	// or partial_sum(begin(a), end(a), begin(psums) + 1);
	return psums;
}

Java

import java.io.*;
import java.util.*;

public class Main {
	static int N, Q;
	public static void main(String[] args) throws IOException {
		BufferedReader reader = new BufferedReader(new InputStreamReader(System.in));
		PrintWriter writer = new PrintWriter(System.out);
		StringTokenizer st = new StringTokenizer(reader.readLine());
		N = Integer.parseInt(st.nextToken());

Python

def psum(arr):
	psums = [0]
	for i in arr:
		psums.append(psum[-1] + i)
	return psums


N, Q = map(int, input().split())
nums = list(map(int, input().split()))
prefix_arr = psum(nums)

for i in range(Q):
	l, r = map(int, input().split())
	print(prefix_arr[r] - prefix_arr[l])

An alternative approach is to use itertools.accumulate. Notice that we need to add a $0$ to the front of the array. If using a newer version, you can use the optional parameter initial=0 as well.

import itertools


def psum(arr):
	return [0] + list(itertools.accumulate(arr))


 Code Snippet: Same code as above (Click to expand)

Problems

Source	Problem Name	Difficulty	Tags
Silver	Breed Counting	Very Easy	Show Tags Prefix Sums
Silver	Subsequences Summing to Sevens	Easy	Show Tags Prefix Sums
Silver	Hoof Paper Scissors	Easy	Show Tags Prefix Sums
CSES	Subarray Sums II	Easy	Show Tags Prefix Sums
CSES	Subarray Divisibility	Easy	Show Tags Prefix Sums
Silver	Why Did the Cow Cross the Road II	Easy	Show Tags Prefix Sums
CF	Good Subarrays	Normal	Show Tags Math, Prefix Sums
AC	GCD on Blackboard	Normal	Show Tags Prefix Sums
CF	Running Miles	Normal	Show Tags Prefix Sums
CF	Irreducible Anagrams	Normal	Show Tags Prefix Sums
Silver	Farmer John's Favorite Operation	Normal	Show Tags Prefix Sums
AC	Multiple of 2019	Hard	Show Tags Prefix Sums

Quiz

What is the optimal time complexity of calculating the prefix sum array of some array of length $n$ ?

Question 1 of 4

Module Progress:

Join the USACO Forum!

Stuck on a problem, or don't understand a module? Join the USACO Forum and get help from other competitive programmers!

Join Forum

Table of Contents