IS CS-2017S1-02

题目来源：Problem 2 日期：2024-08-09 题目主题: CS-算法-图论-最小生成树

解题思路

这道题目考察了最小生成树算法的实现和复杂度分析。我们需要理解算法 $A$ 的工作原理,分析不同图表示方法下的最佳实现,并证明算法的正确性。关键点包括贪心策略、图的表示方法对算法效率的影响,以及最小生成树的性质。

Solution

1. Completing the algorithm description

The appropriate phrase to fill in (a) is:

“all edges connecting a vertex in $G^{'}$ to a vertex not in $G^{'}$ ”

This choice ensures that we always select an edge that connects the current partial spanning tree to a new vertex, maintaining the tree structure and gradually including all vertices.

2. Implementation for dense graphs

For dense graphs represented by an adjacency matrix:

Initialize a Union-Find set to keep track of connected components. $O (V)$
Initialize an array minCost[1..V] to store the minimum cost edge from $G^{'}$ to each vertex not in $G^{'}$ . $O (V)$
Choose an arbitrary starting vertex and mark it as in $G^{'}$ .
Repeat $V - 1$ times: a) Scan minCost array to find the minimum cost edge to a vertex not in $G^{'}$ . $O (V)$ b) Add this edge to $G^{'}$ and mark the new vertex as in $G^{'}$ . $O (1)$ c) Update minCost for vertices not in $G^{'}$ by checking if there’s a cheaper edge from the newly added vertex. $O (V)$

Time complexity: $O (V^{2})$

Explanation: We perform $V - 1$ iterations, each involving a scan of the minCost array ( $O (V)$ ) and an update of minCost ( $O (V)$ ). For dense graphs, $E \approx V^{2}$ , so this $O (V^{2})$ implementation is optimal as it matches the input size.

3. Implementation for sparse graphs

For sparse graphs represented by adjacency lists:

Initialize a priority queue $Q$ to store edges, keyed by their costs, implemented as a min-heap.
Choose an arbitrary starting vertex and add all its incident edges to $Q$ .
Repeat until $Q$ is empty: a) Extract the minimum cost edge $(u, v)$ from $Q$ . $O (lo g E)$ b) If $v$ is already in $G^{'}$ , discard this edge and continue. c) Otherwise, add $(u, v)$ to $G^{'}$ and add all edges incident to $v$ to $Q$ . $O (d e g (v) * lo g E)$

Time complexity: $O (E lo g E)$

Explanation: The time complexity is dominated by operations on the priority queue $Q$ , which is implemented as a min-heap. Each vertex is added to $G^{'}$ once, and when it’s added, we process all its incident edges. In total, we process at most $2 E$ edges (each edge is encountered from both its endpoints). Each edge operation (insert or extract-min) on the priority queue takes $O (lo g E)$ time as a min-heap. Since the graph is sparse, $E \approx V$ , so the time complexity can also be written as $O (E lo g V)$ .

4. Proof of correctness

To prove that Algorithm $A$ (Kruskal’s Algorithm) produces a minimum spanning tree, we need to establish that the algorithm satisfies the greedy choice property and the optimal substructure property. The proof can be outlined as follows:

Greedy Choice Property: Kruskal’s algorithm always adds the smallest edge that does not form a cycle. We need to prove that this choice is safe and that it does not exclude the possibility of obtaining the MST.
Optimal Substructure Property: A subgraph of a minimum spanning tree is also a minimum spanning tree for its vertices. Therefore, if the MST has been built partially, adding the next smallest edge that does not form a cycle will lead to the global MST.

Detailed Proof

Greedy Choice Property

Let’s consider a step in Kruskal’s algorithm where the edge $e = (u, v)$ is added to the MST. Assume $T$ is the MST being constructed by Kruskal’s algorithm, and $T^{'}$ is some other MST that includes a different edge $e^{'} \neq = e$ .

Case 1: $e^{'}$ is not in $T$ . If we add $e^{'}$ to $T$ , then $T$ will form a cycle. Since $T^{'}$ is also a spanning tree, it must contain another edge $f$ in the cycle that is not in $T^{'}$ . If we replace $f$ with $e$ in $T^{'}$ , the resulting tree $T^{''}$ will still be a spanning tree, and the weight will be less than or equal to the weight of $T^{'}$ because $w (e) \leq w (f)$ . Therefore, the new tree $T^{''}$ is also an MST.
Case 2: $e^{'}$ is already in $T$ . Then $T$ and $T^{'}$ differ only by the edges added after $e^{'}$ . By the inductive hypothesis, adding $e$ will not exclude any edge from $T$ , so $T$ remains an MST.

This shows that adding $e$ does not prevent us from achieving the MST, and hence, the greedy choice made by Kruskal’s algorithm is safe.

Optimal Substructure Property

Assume that $T_{k}$ is the tree formed after the first $k$ steps of Kruskal’s algorithm. We need to show that $T_{k}$ is part of some MST.

When $k = 0$ , $T_{0}$ is empty, which trivially satisfies the property.
Suppose $T_{k}$ is a subgraph of an MST $T^{*}$ . Let $e_{k + 1}$ be the edge added in the $(k + 1)$ -th step by Kruskal’s algorithm.
- If $e_{k + 1}$ is also in $T^{*}$ , then $T_{k + 1} = T_{k} + e_{k + 1}$ is still a subgraph of $T^{*}$ .
- If $e_{k + 1}$ is not in $T^{*}$ , adding $e_{k + 1}$ to $T^{*}$ would create a cycle, and one of the edges $e^{'}$ in the cycle must not be in $T_{k}$ . Since $e^{'}$ is not in $T_{k}$ and $e_{k + 1}$ is the smallest edge that does not form a cycle in $T_{k}$ , we have $w (e_{k + 1}) \leq w (e^{'})$ . By replacing $e^{'}$ with $e_{k + 1}$ in $T^{*}$ , we obtain another MST that contains $T_{k + 1}$ .

This inductive argument ensures that each intermediate step $T_{k}$ is part of some MST, and when the algorithm terminates, $T = T_{∣ V ∣ - 1}$ is an MST.

By satisfying both the greedy choice property and the optimal substructure property, Kruskal’s algorithm is guaranteed to produce a minimum spanning tree. The algorithm’s correctness hinges on the fact that it always selects the smallest available edge that does not form a cycle, which is a necessary and sufficient condition for obtaining the MST.

知识点

最小生成树 Kruskal算法贪心算法图论数据结构复杂度分析

难点思路

理解稀疏图和稠密图对算法实现的影响
分析不同数据结构 (如优先队列) 在算法中的应用
使用归纳法和切割性质证明算法的正确性

解题技巧和信息

在分析图算法时,要考虑图的表示方法 (邻接矩阵 vs 邻接表) 对算法效率的影响
时间复杂度分析中,要注意图的特性 (稀疏 vs 稠密) 对复杂度的影响
证明贪心算法正确性时,可以使用归纳法和反证法
最小生成树问题中,切割性质是一个强有力的工具

常见图算法的时间复杂度:

Kruskal’s MST: $O (E lo g E)$
Prim’s MST (binary heap): $O ((V + E) lo g V)$
Prim’s MST (Fibonacci heap): $O (E + V lo g V)$
Dijkstra’s (binary heap): $O ((V + E) lo g V)$
Dijkstra’s (Fibonacci heap): $O (E + V lo g V)$
Bellman-Ford: $O (V E)$
Floyd-Warshall: $O (V^{3})$

重点词汇

Minimum Spanning Tree (MST) 最小生成树
adjacency matrix 邻接矩阵
adjacency list 邻接表
dense graph 稠密图
sparse graph 稀疏图
greedy algorithm 贪心算法
Cut Property 切割性质
time complexity 时间复杂度
priority queue 优先队列

参考资料

Introduction to Algorithms (CLRS), Chapter 23: Minimum Spanning Trees
Algorithm Design (Kleinberg & Tardos), Chapter 4: Greedy Algorithms
The Algorithm Design Manual (Skiena), Section 6.1: Minimum Spanning Trees

Zephyr's Notes on ISCS & CBMS, UTokyo

Explorer