Graph Traversal


Introduction

Resources
		CPH	12 - Graph traversal

Graph traversal algorithms visit all nodes within a graph in a certain order and can compute some information along the way. Two common algorithms for doing this are depth-first search (DFS) and breadth-first search (BFS).

Application: Connected Components

Building Roads

CSES - Easy

Focus Problem – try your best to solve this problem before continuing!

A connected component is a maximal set of connected nodes in an undirected graph. In other words, two nodes are in the same connected component if and only if they can reach each other via edges in the graph.

In the above focus problem, the goal is to add the minimum possible number of edges such that the entire graph forms a single connected component.

Application: Graph Two-Coloring

Building Teams

CSES - Easy

Focus Problem – try your best to solve this problem before continuing!

Graph two-coloring refers to assigning a boolean value to each node of the graph, dictated by the edge configuration. The most common example of a two-colored graph is a bipartite graph, in which each edge connects two nodes of opposite colors.

In the above focus problem, the goal is to assign each node (friend) of the graph to one of two colors (teams), subject to the constraint that edges (friendships) connect two nodes of opposite colors. In other words, we need to check whether the input is a bipartite graph and output a valid coloring if it is.

DFS

Resources
		CSA	Depth First Search	up to but not including "More about DFS"
		CPH	12.1 - DFS	example diagram + code

From the second resource:

Depth-first search (DFS) is a straightforward graph traversal technique. The algorithm begins at a starting node, and proceeds to all other nodes that are reachable from the starting node using the edges of the graph.

Depth-first search always follows a single path in the graph as long as it finds new nodes. After this, it returns to previous nodes and begins to explore other parts of the graph. The algorithm keeps track of visited nodes, so that it processes each node only once.

When implementing DFS, we often use a recursive function to visit the vertices and an array to store whether we've seen a vertex before.
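As a minimal sketch of the idea above, a recursive DFS with a visited array might look like the following in Python. The small edge list and the `order` list are assumptions added purely for illustration; they are not part of the original module.

```python
# Hypothetical 6-node undirected graph, used only for illustration.
n = 6
edges = [(0, 1), (0, 2), (1, 3), (2, 4)]  # node 5 is isolated

adj = [[] for _ in range(n)]
for a, b in edges:
    adj[a].append(b)
    adj[b].append(a)

visited = [False] * n
order = []  # nodes in the order DFS first reaches them


def dfs(node):
    """Visit every node reachable from `node` that has not been seen yet."""
    if visited[node]:
        return
    visited[node] = True
    order.append(node)
    for neighbor in adj[node]:
        dfs(neighbor)


dfs(0)
print(order)    # [0, 1, 3, 2, 4]
print(visited)  # node 5 stays False because it is unreachable from node 0
```

Note how the search follows the path 0, 1, 3 as far as it can before backing up to explore 2 and 4.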

C++

#include <bits/stdc++.h>
using namespace std;

int n = 6;
vector<vector<int>> adj(n);
vector<bool> visited(n);

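void dfs(int current_node) {
	if (visited[current_node]) { return; }
	visited[current_node] = true;

	// recurse into every neighbor; already-visited ones return immediately
	for (int neighbor : adj[current_node]) { dfs(neighbor); }
}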

Java

import java.io.*;
import java.util.*;

public class DFSDemo {
	static List<Integer>[] adj;
	static boolean[] visited;
	static int n = 6;

	public static void main(String[] args) throws IOException {
		visited = new boolean[n];

Python

import sys

sys.setrecursionlimit(10**5)  # Python has a default recursion limit of 1000

n = 6
visited = [False] * n

"""
Define adjacency list and read in problem-specific input here.
"""

BFS


Resources
		CSA	BFS	interactive, implementation
		PAPS	12.1 - BFS	grid, 8-puzzle examples
		cp-algo	BFS	common applications
		KA	BFS and its uses
		YouTube	Breadth First Search Algorithm	If you prefer a video format

In a breadth-first search, we travel through the vertices in order of their distance from the starting vertex.
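To make the "order of distance" property concrete, here is a hedged Python sketch that records each node's distance (in edges) from the start. The helper name `bfs_distances` and the example graph are assumptions for illustration; it relies on `collections.deque`, which is covered in the prerequisite section of this module.

```python
from collections import deque


def bfs_distances(adj, start):
    """Return the distance in edges from `start` to each node (-1 if unreachable)."""
    dist = [-1] * len(adj)
    dist[start] = 0
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in adj[node]:
            if dist[neighbor] == -1:  # first time we reach this neighbor
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


# Hypothetical graph with edges 0-1, 0-2, 1-3, 3-4; node 5 is isolated.
example_adj = [[1, 2], [0, 3], [0], [1, 4], [3], []]
print(bfs_distances(example_adj, 0))  # [0, 1, 1, 2, 3, -1]
```

Because nodes leave the queue in nondecreasing order of distance, the first time a node is reached is also along a shortest path to it.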

Prerequisite - Queues & Deques

Resources
		CPH	4.5 - Queues, Deques
		PAPS	3.2, 6.3 - Queues

Queues

A queue is a First In First Out (FIFO) data structure that supports three operations, all in $\mathcal{O}(1)$ time.

C++

std::queue

push: inserts at the back of the queue
pop: deletes from the front of the queue
front: retrieves the element at the front without removing it

queue<int> q;               // contents are shown front-first
q.push(1);                  // [1]
q.push(3);                  // [1, 3]
q.push(4);                  // [1, 3, 4]
q.pop();                    // [3, 4]
cout << q.front() << endl;  // 3

Java

add: inserts at the back of the queue
poll: deletes from the front of the queue
peek: retrieves the element at the front without removing it

Java doesn't actually have a Queue class; it's only an interface. The most commonly used implementation is the LinkedList, declared as follows:

Queue<Integer> q = new LinkedList<Integer>();  // contents are shown front-first
q.add(1);                      // [1]
q.add(3);                      // [1, 3]
q.add(4);                      // [1, 3, 4]
q.poll();                      // [3, 4]
System.out.println(q.peek());  // 3

Python

Python has a built-in queue module.

Queue.put(n): Inserts an element at the back of the queue.
Queue.get(): Removes and returns the front element. If the queue is empty, this call blocks forever, which results in a TLE verdict.
Queue.queue[n]: Gets the nth element without removing it. Set n to 0 for the front element.

from queue import Queue

q = Queue()  # []
q.put(1)  # [1]
q.put(2)  # [1, 2]
v = q.queue[0]  # v = 1, q = [1, 2]
v = q.get()  # v = 1, q = [2]
v = q.get()  # v = 2, q = []
v = q.get()  # Code waits forever, creating TLE error.

Warning!

Python's queue.Queue() uses locks for thread-safe synchronization, so it's quite slow. To avoid TLE, use collections.deque() instead as a faster queue.
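As a rough sketch of that advice, the three queue operations map onto `collections.deque` as follows. The variable names are illustrative only.

```python
from collections import deque

q = deque()          # the front of the queue is on the left
q.append(1)          # [1]
q.append(3)          # [1, 3]
q.append(4)          # [1, 3, 4]
front = q[0]         # peek at the front without removing it: 1
first = q.popleft()  # removes and returns 1, leaving [3, 4]

# Unlike queue.Queue.get(), popleft() on an empty deque raises IndexError
# right away instead of blocking, so loops are usually guarded with `while q:`.
drained = []
while q:
    drained.append(q.popleft())

print(front, first, drained)  # 1 1 [3, 4]
```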

Deques

A deque (usually pronounced "deck") is a double-ended queue: a combination of a stack and a queue that supports $\mathcal{O}(1)$ insertions and deletions at both the front and the back. Deques are not very common in Bronze / Silver.

C++

std::deque

The four methods for adding and removing are push_back, pop_back, push_front, and pop_front.

deque<int> d;
d.push_front(3);  // [3]
d.push_front(4);  // [4, 3]
d.push_back(7);   // [4, 3, 7]
d.pop_front();    // [3, 7]
d.push_front(1);  // [1, 3, 7]
d.pop_back();     // [1, 3]

You can also access the elements of a deque in constant time, like an array, with the [] operator. For example, to access the $i$th element of a deque $\texttt{dq}$, use $\texttt{dq}[i]$.

Java

In Java, the deque class is called ArrayDeque. The four methods for adding and removing are addFirst, removeFirst, addLast, and removeLast.

ArrayDeque<Integer> deque = new ArrayDeque<Integer>();
deque.addFirst(3);    // [3]
deque.addFirst(4);    // [4, 3]
deque.addLast(7);     // [4, 3, 7]
deque.removeFirst();  // [3, 7]
deque.addFirst(1);    // [1, 3, 7]
deque.removeLast();   // [1, 3]

Python

In Python, collections.deque() is used for a deque data structure. The four methods for adding and removing are appendleft, popleft, append, and pop.

import collections

d = collections.deque()
d.appendleft(3)  # [3]
d.appendleft(4)  # [4, 3]
d.append(7)  # [4, 3, 7]
d.popleft()  # [3, 7]
d.appendleft(1)  # [1, 3, 7]
d.pop()  # [1, 3]

Implementation

When implementing BFS, we often use a queue to track the next vertex to visit. Like DFS, we'll also keep an array to store whether we've seen a vertex before.
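A minimal Python sketch of this queue-plus-visited-array pattern is shown below. The function name `bfs_order` and the adjacency list are assumptions for illustration (only the first two rows echo the sample graph used in this module; the remaining rows are made up).

```python
from collections import deque


def bfs_order(adj, start):
    """Return the nodes in the order BFS removes them from the queue."""
    visited = [False] * len(adj)
    visited[start] = True  # mark on enqueue so no node is queued twice
    queue = deque([start])
    order = []
    while queue:
        node = queue.popleft()
        order.append(node)
        for neighbor in adj[node]:
            if not visited[neighbor]:
                visited[neighbor] = True
                queue.append(neighbor)
    return order


# Hypothetical directed adjacency list with 6 nodes.
sample_adj = [[1, 2, 4], [3, 4], [5], [], [], []]
print(bfs_order(sample_adj, 0))  # [0, 1, 2, 4, 3, 5]
```

Marking a node as visited when it is pushed, rather than when it is popped, keeps the queue from ever holding the same node twice.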

Java

import java.util.*;

public class Main {
	public static void main(String[] args) {
		int n = 6;
		boolean[] visited = new boolean[n];
		List<Integer>[] adj = new ArrayList[n];
		for (int i = 0; i < n; i++) { adj[i] = new ArrayList<>(); }
		adj[0] = new ArrayList<>(Arrays.asList(1, 2, 4));
		adj[1] = new ArrayList<>(Arrays.asList(3, 4));

C++

#include <queue>
#include <vector>

using std::queue;
using std::vector;

int main() {
	int n = 6;
	vector<vector<int>> adj(n);
	vector<bool> visited(n);

Python

from collections import deque

"""
Define adjacency list and read in problem-specific input

In this example, we've provided "dummy input" that's
reflected in the GIF above to help illustrate the
order in which BFS visits the nodes.
"""

Solution - Building Roads

Note that each added edge decreases the number of connected components by either zero or one, so you must add at least $C-1$ edges, where $C$ is the number of connected components in the input graph.

To compute $C$, iterate through each node. If it has not been visited, visit it and all other nodes in its connected component using DFS or BFS. Then $C$ equals the number of times we start a new traversal.

There are many valid ways to pick $C-1$ new roads to build. One way is to choose a single representative from each of the $C$ components and link them together in a line.
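The counting and linking steps above can be sketched in Python as follows. This is an illustrative outline rather than the module's reference solution: the function name `roads_to_build` is hypothetical, and it takes the edge list as an argument instead of reading the judge's input.

```python
from collections import deque


def roads_to_build(n, edges):
    """Return new roads (1-indexed pairs) that connect all n cities."""
    adj = [[] for _ in range(n)]
    for a, b in edges:
        adj[a - 1].append(b - 1)
        adj[b - 1].append(a - 1)

    visited = [False] * n
    representatives = []  # one node per connected component
    for start in range(n):
        if visited[start]:
            continue
        representatives.append(start)
        visited[start] = True
        queue = deque([start])
        while queue:  # BFS over the whole component of `start`
            node = queue.popleft()
            for neighbor in adj[node]:
                if not visited[neighbor]:
                    visited[neighbor] = True
                    queue.append(neighbor)

    # Chain consecutive representatives: C components need C - 1 roads.
    return [(u + 1, v + 1) for u, v in zip(representatives, representatives[1:])]


# Hypothetical input: 4 cities with existing roads 1-2 and 3-4.
print(roads_to_build(4, [(1, 2), (3, 4)]))  # [(1, 3)]
```

The number of roads is `len(representatives) - 1`, which matches the $C-1$ lower bound.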

DFS Solution

C++

#include <deque>
#include <iostream>
#include <vector>

using namespace std;

vector<vector<int>> adj;
vector<bool> visited;

void dfs(int node) {

Java

import java.io.*;
import java.util.*;

public class BuildingRoads {
	static List<Integer>[] adj;
	static boolean[] visited;

	public static void main(String[] args) throws IOException {
		Kattio io = new Kattio();
		int n = io.nextInt();

Python

from collections import deque

n, m = map(int, input().split())

adj = [[] for _ in range(n)]
for _ in range(m):
	a, b = map(int, input().split())
	adj[a - 1].append(b - 1)
	adj[b - 1].append(a - 1)

However, this code causes a runtime error on nearly half the test cases. What could be going wrong?

An Issue With Deep Recursion

If you run the solution code locally on the line graph generated by the following Python code:

n = 100000
print(n, n - 1)
for i in range(1, n):
	print(i, i + 1)

C++

then you might get a segmentation fault even though your code passes on the online judge. This occurs because every recursive call contributes to the size of the call stack, which is limited to a few megabytes by default. To increase the stack size, refer to this module. Short answer: If you would normally compile your code with g++ sol.cpp, then compile it with g++ -Wl,-stack_size,0xF0000000 sol.cpp instead.

Java

then you might get a StackOverflowError even though your code passes on the online judge. This occurs because every recursive call contributes to the size of the call stack, which is limited to less than a megabyte by default. To resolve this, you can pass an option of the form -Xss... to run the code with an increased stack size. For example, java -Xss512m Main will run the code with a stack size limit of 512 megabytes.

Python

then you will observe a RecursionError that looks like this:

Traceback (most recent call last):
  File "input/code.py", line 28, in <module>
	solve(n, adj)
  File "input/code.py", line 14, in solve
	dfs(start, start)
  File "input/code.py", line 9, in dfs
	dfs(start, next)
  File "input/code.py", line 9, in dfs
	dfs(start, next)
  File "input/code.py", line 9, in dfs
	dfs(start, next)
  [Previous line repeated 994 more times]
  File "input/code.py", line 7, in dfs
	if next in unvisited:
RecursionError: maximum recursion depth exceeded in comparison

This will occur for $N>10^3$ since the recursion limit in Python is set to 1000 by default. We can fix this by increasing the recursion limit with sys.setrecursionlimit(10 ** 6), although we still get TLE on two test cases. To resolve this, we can implement a BFS solution, as shown below.
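Another common workaround, which is not the approach this module takes, is to keep a depth-first style of traversal but replace recursion with an explicit stack. A minimal sketch, assuming a 0-indexed adjacency list; `count_reachable` is a hypothetical helper name:

```python
def count_reachable(adj, start):
    """Stack-based traversal: count nodes reachable from `start` without recursion."""
    visited = [False] * len(adj)
    visited[start] = True
    stack = [start]
    count = 0
    while stack:
        node = stack.pop()
        count += 1
        for neighbor in adj[node]:
            if not visited[neighbor]:
                visited[neighbor] = True
                stack.append(neighbor)
    return count


# The same line graph 1-2-...-n as in the generator above, stored 0-indexed.
n = 100000
adj = [[] for _ in range(n)]
for i in range(n - 1):
    adj[i].append(i + 1)
    adj[i + 1].append(i)

print(count_reachable(adj, 0))  # 100000
```

The visiting order can differ from the recursive version, but every reachable node is still processed exactly once and the call stack stays shallow.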

BFS Solution

C++

#include <deque>
#include <iostream>
#include <vector>

using namespace std;

int main() {
	int n;
	int m;
	cin >> n >> m;

Java

import java.io.*;
import java.util.*;

public class BuildingRoads {
	public static void main(String[] args) throws IOException {
		Kattio io = new Kattio();
		int n = io.nextInt();
		int m = io.nextInt();

		List<Integer>[] adj = new ArrayList[n];

Python

from collections import deque

n, m = map(int, input().split())

adj = [[] for _ in range(n)]
for _ in range(m):
	a, b = map(int, input().split())
	adj[a - 1].append(b - 1)
	adj[b - 1].append(a - 1)

Connected Component Problems

Source	Problem Name	Difficulty	Tags
Silver	Closing the Farm	Easy	Connected Components
Silver	Moocast	Easy	BFS, Connected Components, DFS
Silver	Fence Planning	Easy	Connected Components
Kattis	Birthday Party	Easy	Connected Components
ACSL	Rank	Easy	DFS
CSES	Flight Routes Check	Normal	DFS
CSA	BFS-DFS	Normal	BFS, DFS
Gold	Moocast	Normal	Binary Search, Connected Components
Silver	Wormhole Sort	Normal	Binary Search, Connected Components
Silver	Moo Route II	Normal	BFS
Silver	Connecting Two Barns	Normal	2P, Binary Search, Connected Components
Silver	Redistributing Gifts	Normal	DFS
CF	Round Dance	Normal	Connected Components, DFS
CSES	Subarray Sum Constraints	Normal	Connected Components, DFS, Prefix Sums
CF	Connected Components?	Hard	DFS, Sorted Set
Kattis	Lane Switching	Very Hard	Binary Search, Connected Components
Silver	Cereal 2	Very Hard	Constructive, Cycles, Spanning Tree

Solution - Building Teams


Resources
		CPH	12.3 - Bipartiteness check	Brief solution sketch with diagrams.
		IUSACO	10.7 - Bipartite Graphs
		cp-algo	Bipartite Check
		CP2	4.2.6 - Bipartite Check

For each connected component, we can arbitrarily assign a color to one node and then run DFS or BFS from it. Every time we visit a new (unvisited) node, we give it the color opposite to that of the neighbor we came from. When we encounter a previously visited node, we check that its color differs from the current node's color; if the two match, no valid two-coloring exists.
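The check described above might be sketched in Python like this. It uses BFS, takes the edges as an argument rather than reading the judge's input, and returns `None` for the impossible case; the function name `two_color` and these conventions are assumptions for illustration.

```python
from collections import deque


def two_color(n, edges):
    """Return a team (1 or 2) for each node, or None if no valid split exists."""
    adj = [[] for _ in range(n)]
    for a, b in edges:
        adj[a - 1].append(b - 1)
        adj[b - 1].append(a - 1)

    team = [0] * n  # 0 means "not assigned yet"
    for start in range(n):
        if team[start] != 0:
            continue
        team[start] = 1  # arbitrary choice for the first node of a component
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbor in adj[node]:
                if team[neighbor] == 0:
                    team[neighbor] = 3 - team[node]  # the opposite team
                    queue.append(neighbor)
                elif team[neighbor] == team[node]:
                    return None  # an edge joins two nodes on the same team
    return team


print(two_color(5, [(1, 2), (1, 3), (4, 5)]))  # [1, 2, 2, 1, 2]
print(two_color(3, [(1, 2), (2, 3), (3, 1)]))  # None (a triangle is an odd cycle)
```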

DFS Solution

Optional: Adjacency List Without an Array of Vectors

See here.

C++

#include <iostream>
#include <vector>

using namespace std;

vector<int> assigned;
vector<vector<int>> adj;

/** @return true only if it's possible to assign each person to a team */
bool dfs(int node) {

Java

Warning!

Because Java is so slow, an adjacency list using lists/arraylists results in TLE. Instead, the Java sample code uses the edge representation mentioned in the optional block above.

import java.io.*;
import java.util.*;

public class BuildingTeams {
	static List<Integer>[] adj;
	static int[] assigned;

	public static void main(String[] args) throws IOException {
		Kattio io = new Kattio();
		int n = io.nextInt();

Python

Warning!

You have to submit with CPython (not PyPy3) to avoid TLE.

import sys

input = sys.stdin.readline

sys.setrecursionlimit(int(1e9))  # raise the recursion limit far beyond any depth needed here

n, m = map(int, input().strip().split())
adj = [[] for _ in range(n)]
team = [0] * n  # 0: not assigned yet, 1: team 1, 2: team 2

BFS Solution

The specifics of the algorithm are almost exactly the same; it's just that we do them in an iterative rather than recursive fashion.

C++

#include <deque>
#include <iostream>
#include <vector>

using namespace std;

int main() {
	int n;
	int m;
	cin >> n >> m;

Java

import java.io.*;
import java.util.*;

public class BuildingTeams {
	public static void main(String[] args) throws IOException {
		Kattio io = new Kattio();
		int n = io.nextInt();
		int m = io.nextInt();

		List<Integer>[] adj = new ArrayList[n];

Python

from collections import deque

n, m = map(int, input().split())

adj = [[] for _ in range(n)]
for _ in range(m):
	a, b = map(int, input().split())
	adj[a - 1].append(b - 1)
	adj[b - 1].append(a - 1)

Graph Two-Coloring Problems

Source	Problem Name	Difficulty	Tags
CF	Bipartiteness	Easy	Bipartite
Silver	The Great Revegetation	Easy	Bipartite
CF	Cover it!	Easy	Bipartite
Baltic OI	2020 - Graph	Hard	DFS, Median
CC	Among Us	Hard	Bipartite, DFS
CF	Coloring Game	Hard	Bipartite, DFS
CF	Catshock	Hard	Bipartite, Trees
APIO	2011 - Table Coloring	Very Hard	Bipartite

