欢迎来到 Comprehensive Rust 🦀

这是一门由 Android 团队开发的免费 Rust 课程。课程涵盖了 Rust 的全部内容，从基本语法到泛型和错误处理等高级主题。

如需查看课程的最新版本，请访问 https://google.github.io/comprehensive-rust/。如果您在其他地方阅读，请在那里查看更新。

The course is also available as a PDF.

本课程的目标是教会你使用 Rust。假设你对 Rust 一无所知，本课程可以：

帮助你全面理解 Rust 的语法和语言。
让你可以用 Rust 修改现有程序和编写新程序。
为你展示常见的 Rust 习惯用法。

我们将前四天的课程称为“Rust 基础”。

在此基础上，你可以选择深入研究一个或多个专门的主题：

Android：为期半天的课程，介绍如何在 Android 平台开发中使用 Rust（AOSP）。课程内容包括与 C、C++ 和 Java 的互操作性。
Chromium：为期半天的课程，介绍如何在基于 Chromium 的浏览器中使用 Rust。课程内容包括与 C++ 的互操作性以及如何在 Chromium 中加入第三方 crate。
裸机：为期一天的课程，介绍如何使用 Rust 进行裸机（嵌入式）开发。课程内容涵盖微控制器和应用处理器。
并发：为期一天的课程，介绍 Rust 中的并发性。我们将涵盖传统并发（使用线程和互斥锁进行抢占式调度）和 async/await 并发（使用 futures 进行协作式多任务处理）。

非目标

Rust 是一门庞大的语言，短短几天的课程无法覆盖其全部内容。本课程不包括以下内容：

学习如何开发宏：请参阅 Rust Book 的第 19.5 章和 Rust by Examples 对应章节。

学习前提

本课程假设你已经具备编程知识。Rust 是一种静态类型语言，本课程有时会将其与 C 和 C++ 进行比较，以便更好地解释或对比 Rust 的设计。

如果你会使用 Python 或 JavaScript 等动态类型语言编程，那么你也能够很好地跟上进度。

Speaker Notes

这是演讲者备注（Speaker Notes）的示例。页面中使用它来为幻灯片添加备注信息，其内容包括讲师应涉及的要点，以及对课堂上可能出现的典型问题的回答。

授课

本页供课程讲师使用。

以下是有关 Google 内部开展课程的一些相关背景。

上课时间通常是从上午 9:00 到下午 4:00，中间有 1 小时的午餐休息时间。这样上午和下午就各有 3 小时上课时间。上下午上课时间段内都有多次休息时间和学生做练习的时间。

在授课之前，你需要完成以下事项：

熟悉课程资料。页面提供了演讲者注释以突出重点（请帮忙多多贡献演讲者备注！）。演讲时，请确保在弹出窗口中打开演讲者笔记（点击“演讲者笔记”旁边带小箭头的链接）。这样就可以确保屏幕整洁有序，更好地向全班学员展示课程内容。
决定培训日期。由于课程为期四天，建议将时间安排在两周内。课程学员曾表示，他们认为在课程保留一些间隙有助于更好地进行理解。
找一间足以容纳全体线下学员的教室。建议的班级人数为 15-25 人。这样少的人数可以让大家能够更轻松地提问，也可以让仅有一位的讲师有足够时间回答问题。确保教室里有讲师和学生用的桌子，并能够坐下来使用笔记本电脑。特别地，讲师需要进行大量的现场编码工作，因此讲台是不需要的。
课程当天，请提早到教室进行准备。建议直接在笔记本电脑上运行 mdbook serve 来演示课程内容（请参阅安装说明）。这样可以确保在更换页面时不会出现延迟，演示效果更好。使用笔记本运行还可以在发现错别字时及时更正。
让学员自己或以小组为单位解决练习问题。通常在上午和下午各安排 30-45 分钟的练习时间（包括查看解答的时间）。请务必询问学员是否遇到困难，或是否需要任何帮助。如果有多位学员遇到同样的问题，则在班级内进行讲解，并提供相应的解决方案，例如告诉大家在标准库的什么位置可以找到相关信息。

以上就是全部事项，祝你授课顺利！希望你能像我们一样享受其中的乐趣！

欢迎你在课后提供反馈，以帮助我们不断改进课程。我们非常期待了解哪些方面做得不错，哪些方面还需要改进。同时非常欢迎学生们向我们发送反馈！

课程结构

本页供课程讲师使用。

Rust 基础

前四天的内容是 Rust 基础。这几天的课程节奏很快，内容也很丰富！

课程安排：

Day 1 Morning (2 hours and 5 minutes, including breaks)

Segment	Duration
欢迎	5 minutes
Hello, World	15 minutes
类型和值	40 minutes
控制流基础	40 minutes

Day 1 Afternoon (2 hours and 35 minutes, including breaks)

Segment	Duration
元组和数组	35 minutes
引用	55 minutes
用户定义的类型	50 minutes

Day 2 Morning (2 hours and 55 minutes, including breaks)

Segment	Duration
欢迎	3 minutes
模式匹配	1 hour
方法和特征	50 minutes
泛型	40 minutes

Day 2 Afternoon (3 hours and 10 minutes, including breaks)

Segment	Duration
标准库类型	1 hour and 20 minutes
标准库特征	1 hour and 40 minutes

第三天上午（2 小时 20 分钟，含休息时间）

Segment	Duration
欢迎	3 minutes
内存管理	1 hour
智能指针	55 minutes

Day 3 Afternoon (1 hour and 50 minutes, including breaks)

Segment	Duration
借用	50 minutes
结构体生命周期	50 minutes

Day 4 Morning (2 hours and 40 minutes, including breaks)

Segment	Duration
欢迎	3 minutes
迭代器	45 minutes
模块	40 minutes
测试	45 minutes

Day 4 Afternoon (2 hours and 10 minutes, including breaks)

Segment	Duration
错误处理	55 minutes
不安全 Rust	1 hour and 5 minutes

深入探究

除了为期四天的“Rust 基础”课程外，还有一些专业课题提供：

Android 中的 Rust

深入探究 Android 中的 Rust 课程为期半天，旨在介绍如何使用 Rust 进行 Android 平台开发。其中包括与 C、C++ 和 Java 的互操作性。

你需要检出 AOSP。在同一机器上检出课程库，然后将 src/android/ 目录移至所检出的 AOSP 的根目录。这将确保 Android 构建系统能检测到 src/android/ 中的 Android.bp 文件。

确保 adb sync 适用于你的模拟器或实际设备，并使用 src/android/build_all.sh 预构建所有 Android 示例。请阅读脚本，查看它所运行的命令，并确保这些命令能在你手动运行时正确执行。

Chromium 中的 Rust

深入探究 Chromium 中的 Rust 课程为期半天，旨在介绍 Chromium 浏览器中 Rust 的使用。课程内容包括在 Chromium 的 gn 编译系统中使用 Rust，引入第三方 crate，以及与 C++ 的互操作性。

您需要能够构建 Chromium。为了提高速度，建议使用调试、组件构建方式，其他构建方式也可以使用。确保所构建的 Chromium 浏览器可以正常运行。

裸机 Rust

深入探究裸机 Rust 课程为期一天，旨在介绍如何使用 Rust 进行裸机（嵌入式）开发。其中涵盖了微控制器和应用处理器。

对于微控制器部分，需要提前购买 BBC micro:bit v2 开发板。每个人都需要安装多个软件包，具体如欢迎页面中所述。

Rust 中的并发

深入探究 Rust 中的并发课程为期一天，旨在介绍传统并发和 async/await 并发。

你需要设置一个新 crate，下载所需的依赖项，做好课前准备。然后，你可以将示例复制/粘贴到 src/main.rs 中，以便对以下代码进行实验：


cargo init concurrency
cd concurrency
cargo add tokio --features full
cargo run

键盘快捷键

mdBook 中有一些实用键盘快捷键：

Arrow-Left: Navigate to the previous page.
Arrow-Right: Navigate to the next page.
Ctrl + Enter: Execute the code sample that has focus.
s: Activate the search bar.

翻译

一批优秀的志愿者已将本课程翻译成其他语言：

巴西葡萄牙语版本译者：@rastringer、@hugojacob、@joaovicmendes 和 @henrif75。
简体中文版本译者：@suetfei、@wnghl、@anlunx、@kongy、@noahdragon、@superwhd、@SketchK 和 @nodmp。
繁体中文版本译者：@hueich、@victorhsieh、@mingyc、@kuanhungchen 和 @johnathan79717。
Korean by @keispace, @jiyongp, @jooyunghan, and @namhyung.
西班牙语版本译者：@deavid。

使用右上角的语言选择器切换语言。

未完成的翻译

还有很多语言版本仍在翻译中。以下是最近更新的翻译版本的链接：

如果你想协助翻译，请参阅翻译说明，了解如何开始翻译工作。翻译工作可通过此议题追踪。

使用 Cargo

开始了解 Rust 后，你很快就会遇到 Cargo，这是 Rust 生态系统中用于构建和运行 Rust 应用的标准工具。在这里，我们想简要介绍一下什么是 Cargo、它如何融入更广泛的生态系统，以及我们如何在本培训中合理利用 Cargo。

安装

请按照 https://rustup.rs/ 上的说明操作。

这将为你提供 Cargo 构建工具 (cargo)和 Rust 编译器 (rustc)。你还将获得 rustup，这是一个命令行实用程序,你可以用它来安装不同的编译器版本。

安装 Rust 之后，你应当配置你的编辑器或 IDE 以开始使用 Rust。大多数编辑器使用了 rust-analyzer。它为 VS Code、Emacs、Vim/Neovim 及其他许多编辑器提供了自动补全及定义跳转的功能。同样也可以使用 RustRover IDE。

Speaker Notes

在 Debian/Ubuntu 上，你也可以通过 apt 安装 Cargo、Rust 源代码和 Rust 格式化工具。但是，这样会得到一个过时的 Rust 版本，这可能会导致意外的行为。命令如下:
```
sudo apt install cargo rust-src rustfmt
```

Rust 生态系统

Rust 生态系统由许多工具组成，主要包括：

rustc：Rust 编译器，可将 .rs 文件转换为二进制文件和其他中间格式。
cargo：Rust 依赖项管理器和构建工具。Cargo 知道如何下载托管在 https://crates.io 上的依赖项,并在构建项目时将它们传递给 rustc。Cargo 还附带一个内置的测试运行程序，用于执行单元测试。
rustup：Rust 工具链安装和更新工具。当 Rust 发布新版本时，此工具用于安装并更新 rustc 和 cargo。此外，rustup 还可以下载标准库的文档。可以同时安装多个版本的 Rust，rustup 会根据需要让你在这些版本之间切换。

Speaker Notes

关键点：

Rust 有一个快速发布时间表，每六周就会发布一次新版本。新版本保持与旧版本的向后兼容性，并添加新功能。
共有三个发布阶段：“稳定版（stable）”、“测试版（beta）”和“夜间版（nightly）”。
新功能会先在“夜间版”上测试，“测试版”会每六周转为“稳定版”。
依赖关系也可以通过其他 registry、git 及文件夹等解析。
Rust 区分版本（edition）：当前版本是 Rust 2021。之前的版本是 Rust 2015 和 Rust 2018。
- 这些版本支持对语言进行向后不兼容的更改。
- 为防止破坏代码，版本是可选的：通过 Cargo.toml 文件为 crate 选择合适的版本。
- 为免分割生态系统，Rust 编译器可以混合使用为不同版本编写的代码。
- 请注意，不借助 cargo 直接使用编译器的情况相当少见（大多数用户从不这样做）。
- 值得一提的是，Cargo 本身就是一个功能强大且全面的工具。它能够实现许多高级功能，包括但不限于：
  - 项目/软件包结构
  - 工作区
  - 开发依赖和运行时依赖管理/缓存
  - 构建脚本
  - 全局安装
  - 它还可以使用子命令插件（例如 cargo clippy）进行扩展。
- 详情请参阅官方 Cargo Book

本培训中的代码示例

在本培训中，我们将主要通过示例探索 Rust 语言，这些示例可以通过浏览器执行。这能大大简化配置过程，并确保所有人都能获得一致的体验。

我们仍然建议你安装 Cargo：它有助于你更轻松地完成练习。在最后一天，我们要做一个综合的练习，向你展示如何使用依赖项，因此你需要安装 Cargo。

本课程中的代码块是完全交互式的：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

You can use Ctrl + Enter to execute the code when focus is in the text box.

Speaker Notes

如上所示，大多数代码示例都可修改。少数代码示例可能会因以下原因而不可修改：

嵌入的 Playground 无法执行单元测试。将代码复制并粘贴到实际 Playground 中，以演示单元测试。
嵌入的 Playground 会在离开页面后丢失编辑状态！因此，学员应使用本地安装的 Rust 或通过 Playground 解题。

使用 Cargo 在本地运行代码

如果你想在自己的系统上进行代码实验，则需要先安装 Rust。为此，请按照 Rust Book 中的说明操作。这应会为你提供一个有效的 rustc 和 cargo。在撰写本文时，最新的 Rust 稳定版是以下的版本号：


% rustc --version
rustc 1.69.0 (84c898d65 2023-04-16)
% cargo --version
cargo 1.69.0 (6e9a83356 2023-04-12)

你也可以使用任何更高版本，因为 Rust 保持向后兼容性。

了解这些信息后，请按照以下步骤从本培训中的一个示例中构建 Rust 二进制文件：

在你要复制的示例上点击“复制到剪贴板（Copy to clipboard）”按钮。

使用 cargo new exercise 为代码新建一个 exercise/ 目录：


$ cargo new exercise
     Created binary (application) `exercise` package

转到 exercise/ 并使用 cargo run 构建并运行二进制文件：


$ cd exercise
$ cargo run
   Compiling exercise v0.1.0 (/home/mgeisler/tmp/exercise)
    Finished dev [unoptimized + debuginfo] target(s) in 0.75s
     Running `target/debug/exercise`
Hello, world!

将 src/main.rs 中的样板代码替换为自己的代码。例如，使用上一页中的示例，将 src/main.rs 改为：


fn main() {
    println!("Edit me!");
}

使用 cargo run 构建并运行更新后的二进制文件：


$ cargo run
   Compiling exercise v0.1.0 (/home/mgeisler/tmp/exercise)
    Finished dev [unoptimized + debuginfo] target(s) in 0.24s
     Running `target/debug/exercise`
Edit me!

使用 cargo check 快速检查项目是否存在错误；使用 cargo build 只进行编译，而不运行。你可以在 target/debug/ 中找到常规调试 build 的输出。使用 cargo build --release 在 target/release/ 中生成经过优化的发布 build。
可以通过修改 Cargo.toml 为项目添加依赖项。当运行 cargo 命令时，系统会自动下载和编译缺失的依赖项。

Speaker Notes

尽量鼓励全班学员安装 Cargo 并使用本地编辑器。这能使他们拥有常规的开发环境，让工作变得更加轻松。

欢迎来到第一天

今天是学习 Comprehensive Rust 的第一天。我们会涉及很多内容：

Rust 基本语法：变量、标量（scalar）和复合（compound）类型、枚举（Enum）、结构体（struct）、引用、函数和方法。
Types and type inference.
控制流结构：循环、条件语句等。
用户定义的类型：结构体和枚举。
模式匹配：解构枚举、结构体和数组（array）。

时间表

Including 10 minute breaks, this session should take about 2 hours and 5 minutes. It contains:

Segment	Duration
欢迎	5 minutes
Hello, World	15 minutes
类型和值	40 minutes
控制流基础	40 minutes

Speaker Notes

This slide should take about 5 minutes.

请提醒学生：

他们可以随时提问，不需要留到最后。
这个课程本应该是互动的，我们鼓励大家积极讨论。
- As an instructor, you should try to keep the discussions relevant, i.e., keep the discussions related to how Rust does things vs some other language. It can be hard to find the right balance, but err on the side of allowing discussions since they engage people much more than one-way communication.
有些问题会导致我们提前谈到后面的内容。
- 这完全没有问题！重复是学习的一个重要方法。请记住，这些幻灯片只是一种辅助，你可以选择性地跳过。

第一天的目标是展示 Rust 中与其他编程语言有明显相似之处的 “基础”内容。Rust 中更高级复杂的内容将在后续几天中逐步介绍。

如果您是在教室里授课，则这是一个好时机，可对课程时间表进行讲解。请注意，每节课结束之后会有练习内容，然后是休息时间。安排在休息结束后讲解练习内容的解答。为了确保课程按时间表进行，此处所列时间仅供参考。请按需进行灵活调整！

Hello, World

This segment should take about 15 minutes. It contains:

Slide	Duration
什么是 Rust？	10 minutes
Rust 的优势	3 minutes
Playground	2 minutes

什么是 Rust？

Rust 是一门新的编程语言，它的1.0 版本于 2015 年发布：

Rust 是一门静态编译语言，其功能定位与 C++ 相似
- rustc 使用 LLVM 作为它的后端。
Rust 支持多种平台和架构:
- x86、ARM、WebAssembly……
- Linux、Mac、Windows……
Rust 被广泛用于各种设备中：
- 固件和引导程序，
- 智能显示器，
- 手机，
- 桌面，
- 服务器。

Speaker Notes

This slide should take about 10 minutes.

Rust 和 C++ 适用于类似的场景：

极高的灵活性。
高度的控制能力。
能够在资源匮乏的设备（如手机）上运行。
没有运行时和垃圾收集。
关注程序可靠性和安全性，而不会牺牲任何性能。

Rust 的优势

Rust 有一些独特的卖点：

编译时内存安全：在编译时可防止所有类内存 bug
- 不存在未初始化的变量。
- 不存在“双重释放”。
- 不存在“释放后使用”。
- 不存在 NULL 指针。
- 不存在被遗忘的互斥锁。
- 不存在线程之间的数据竞争。
- 不存在迭代器失效。
没有未定义的运行时行为：每个 Rust 语句的行为都有明确定义
- 数组访问有边界检查。
- 整数溢出有明确定义（panic 或回绕）。
现代语言功能：具有与高级语言一样丰富且人性化的表达能力
- 枚举和模式匹配。
- 泛型。
- 无额外开销的外部函数接口（FFI）。
- 零成本抽象。
- 强大的编译器错误提示。
- 内置依赖管理器。
- 对测试的内置支持。
- 优秀的语言服务协议（Language Server Protocol）支持。

Speaker Notes

This slide should take about 3 minutes.

不用在这里占用过多时间。所有这些要点均会在后面进行详细讲解。

应该问问学生们都使用过哪些语言。根据答案侧重讲解 Rust 的不同特性：

使用过 C 或 C++：Rust 利用借用检查消除了一类 运行时错误 。你可以达到堪比 C 和 C++ 的性能，而没有内存不安全的问题。并且你还可以得到些现代的语言构造，比如模式匹配和内置依赖管理。
使用过 Java、Go、Python、JavaScript……：你可以得到和这些语言相同的内存安全特性，并拥有类似的使用高级语言的感受。同时你可以得到类似 C 和 C++ 的高速且可预测的执行性能（无垃圾回收机制），以及在需要时对底层硬件的访问。

Playground

The Rust Playground provides an easy way to run short Rust programs, and is the basis for the examples and exercises in this course. Try running the “hello-world” program it starts with. It comes with a few handy features:

在 “Tools” 下，使用 rustfmt 选项以 “standard” 方式设置代码格式。
Rust 具有两个用于生成代码的主要 “配置文件”：调试（进行额外运行时检查、较少优化）和发布（较少运行时检查，进行大量优化）。可以在顶部的 “Debug” 下访问这些内容。
如果感兴趣，您可以使用 “…” 下的 “ASM” 查看生成的汇编代码。

Speaker Notes

This slide should take about 2 minutes.

As students head into the break, encourage them to open up the playground and experiment a little. Encourage them to keep the tab open and try things out during the rest of the course. This is particularly helpful for advanced students who want to know more about Rust’s optimizations or generated assembly.

类型和值

This segment should take about 40 minutes. It contains:

Slide	Duration
Hello, World	5 minutes
变量	5 minutes
值	5 minutes
算术	3 minutes
类型推导	3 minutes
练习：Fibonacci	15 minutes

Hello, World

让我们进入最简单的 Rust 程序，一个经典的 Hello World 程序：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

你看到的：

函数以 fn 开头。
像 C 和 C++ 一样，块由花括号分隔。
main 函数是程序的入口点。
Rust 有卫生宏 (hygienic macros)，println! 就是一个例子。
Rust 字符串是 UTF-8 编码的，可以包含任何 Unicode 字符。

Speaker Notes

This slide should take about 5 minutes.

This slide tries to make the students comfortable with Rust code. They will see a ton of it over the next four days so we start small with something familiar.

关键点：

Rust is very much like other languages in the C/C++/Java tradition. It is imperative and it doesn’t try to reinvent things unless absolutely necessary.
Rust 是一门现代编程语言，它完全支持 Unicode 等特性。
Rust uses macros for situations where you want to have a variable number of arguments (no function overloading).
宏是“卫生的”，这意味着它们不会意外地捕获它们所在作用域中的标识符。实际上，Rust 的宏只是部分卫生。
Rust 是多范式编程语言。例如，它具有强大的面向对象的编程功能，虽然它不是函数式语言，但包括一系列的函数概念。

变量

Rust provides type safety via static typing. Variable bindings are made with let:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

取消备注 x = 20，以证明变量默认是不可变的。添加 mut 关键字以允许进行更改。
这里的 i32 是变量的类型。编译时必须已知类型，但在很多情况下，由于具有类型推理功能（稍后介绍），程序员可以忽略这一点。

值

以下是一些基本的内置类型以及每种类型的字面量值的语法。

	类型	字面量
有符号整数	`i8`、`i16`、`i32`、`i64`、`i128`、`isize`	`-10`、`0`、`1_000`、`123_i64`
无符号整数	`u8`、`u16`、`u32`、`u64`、`u128`、`usize`	`0`、`123`、`10_u16`
浮点数	`f32`、`f64`	`3.14`、`-10.0e20`、`2_f32`
Unicode 标量类型	`char`	`'a'`、`'α'`、`'∞'`
布尔值	`bool`	`true`、`false`

各类型占用的空间为：

iN, uN 和 fN 占用 N 位，
isize 和 usize 占用一个指针大小的空间，
char 占用 32 位空间，
bool 占用 8 位空间。

Speaker Notes

This slide should take about 5 minutes.

上表中还有一些未提及的语法：

数字中的所有下划线均可忽略，它们只是为了方便辨识。因此，1_000 可以写为 1000（或 10_00），而 123_i64 可以写为 123i64。

算术


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

这是我们第一次看到除 main 之外的函数，不过其含义应该很明确：它接受三个整数，然后返回一个整数。稍后会对这些函数进行详细介绍。

算术和优先级均与其他语言极为相似。

整数溢出是什么样的？在 C 和 C++ 中，有符号 整数溢出实际上是未定义的，可能会在不同平台或编译器上执行不同的操作。在 Rust 中，整数溢出具有明确定义。

Change the i32’s to i16 to see an integer overflow, which panics (checked) in a debug build and wraps in a release build. There are other options, such as overflowing, saturating, and carrying. These are accessed with method syntax, e.g., (a * b).saturating_add(b * c).saturating_add(c * a).

事实上，编译器会检测常量表达式的溢出情况，这便是为何该示例需要单独的函数。

类型推导

Rust 会根据变量的使用来确定其类型：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

这张幻灯片演示了 Rust 编译器是如何根据变量声明和用法来推导其类型的。

需要重点强调的是这样声明的变量并非像那种动态类型语言中可以持有任何数据的“任何类型”。这种声明所生成的机器码与明确类型声明完全相同。编译器进行类型推导能够让我们编写更简略的代码。

当整数字面量的类型不受限制时，Rust 默认为 i32。这在错误消息中有时显示为 {integer}。同样，浮点字面量默认为 f64。


fn main() {
    let x = 3.14;
    let y = 20;
    assert_eq!(x, y);
    // ERROR: no implementation for `{float} == {integer}`
}

练习：Fibonacci

The first and second Fibonacci numbers are both 1. For n>2, the n’th Fibonacci number is calculated recursively as the sum of the n-1’th and n-2’th Fibonacci numbers.

Write a function fib(n) that calculates the n’th Fibonacci number. When will this function panic?


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

控制流基础

This segment should take about 40 minutes. It contains:

Slide	Duration
if 表达式	4 minutes
循环控制	5 minutes
break 和 continue	4 minutes
代码块和作用域	5 minutes
函数	3 minutes
宏	2 minutes
练习：考拉兹序列	15 minutes

`if` 表达式

if 表达式的用法与其他语言中的 if 语句完全一样。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

此外，你还可以将 if 用作一个表达式。每个块的最后一个表达式将成为 if 表达式的值：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 4 minutes.

Because if is an expression and must have a particular type, both of its branch blocks must have the same type. Show what happens if you add ; after "small" in the second example.

如果在表达式中使用 if，则表达式中必须包含 ; 来将其与下一个语句分隔开。移除 println! 前面的 ; 即可查看编译器错误。

循环控制

Rust 中有三个循环关键字：while、loop 和 for：

`while`

The while keyword works much like in other languages, executing the loop body as long as the condition is true.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`for`

The for loop iterates over ranges of values or the items in a collection:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

Under the hood for loops use a concept called “iterators” to handle iterating over different kinds of ranges/collections. Iterators will be discussed in more detail later.
请注意，for 循环只迭代到 4。现在展示使用 1..=5 语法表示一个包含边界的范围。

`loop`

The loop statement just loops forever, until a break.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`break` 和 `continue`

如果需要立即启动下一次迭代，请使用 continue。

If you want to exit any kind of loop early, use break. For loop, this can take an optional expression that becomes the value of the loop expression.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Labels

continue 和 break 都可以选择接受一个标签参数，用来终止嵌套循环：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

请注意，loop 是唯一返回有意义的值的循环结构。这是因为它保证至少被输入一次（与 while 和 for 循环不同）。

代码块和作用域

块

A block in Rust contains a sequence of expressions, enclosed by braces {}. Each block has a value and a type, which are those of the last expression of the block:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

If the last expression ends with ;, then the resulting value and type is ().

Speaker Notes

This slide and its sub-slides should take about 5 minutes.

你可以通过更改块的最后一行，来展示块值的变化情况。例如，添加/移除分号或使用 return。

作用域和遮蔽（Shadowing）

变量的作用域仅限于封闭代码块内。

你可以隐藏变量，位于外部作用域的变量和相同作用域的变量都可以：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

Show that a variable’s scope is limited by adding a b in the inner block in the last example, and then trying to access it outside that block.
Shadowing is different from mutation, because after shadowing both variable’s memory locations exist at the same time. Both are available under the same name, depending where you use it in the code.
A shadowing variable can have a different type.
隐藏起初看起来会有些晦涩，但是它很便于存 .unwrap() 之后的得到的值。

函数


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

类型跟随在声明的参数后（与某些编程语言相反），然后是返回类型。
The last expression in a function body (or any block) becomes the return value. Simply omit the ; at the end of the expression. The return keyword can be used for early return, but the “bare value” form is idiomatic at the end of a function (refactor gcd to use a return).
有些函数没有返回值，会返回“单元类型（unit type）”()。如果省略了-> ()的返回类型，编译器将会自动推断。
Overloading is not supported – each function has a single implementation.
- 始终采用固定数量的参数。不支持默认参数。宏可用于支持可变函数。
- Always takes a single set of parameter types. These types can be generic, which will be covered later.

宏

宏在编译过程中会扩展为 Rust 代码，并且可以接受可变数量的参数。它们以 ! 结尾来进行区分。Rust 标准库包含各种有用的宏。

println!(format, ..) prints a line to standard output, applying formatting described in std::fmt.
format!(format, ..) 的用法与 println! 类似，但它以字符串形式返回结果。
dbg!(expression) 会记录表达式的值并返回该值。
todo!() 用于标记尚未实现的代码段。如果执行该代码段，则会触发 panic。
unreachable!() 用于标记无法访问的代码段。如果执行该代码段，则会触发 panic。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 2 minutes.

这一节的要点是介绍这些常见的便捷功能以及如何使用它们。而为何将它们定义为宏以及它们可以扩展为什么内容，并不是特别关键。

本课程不会介绍如何定义宏，但在后续部分会介绍派生宏的用法。

练习：考拉兹序列

The Collatz Sequence is defined as follows, for an arbitrary n₁ greater than zero:

If n_i is 1, then the sequence terminates at n_i.
If n_i is even, then n_i+1 = n_i / 2.
If n_i is odd, then n_i+1 = 3 * n_i + 1.

For example, beginning with n₁ = 3:

3 is odd, so n₂ = 3 * 3 + 1 = 10;
10 is even, so n₃ = 10 / 2 = 5;
5 is odd, so n₄ = 3 * 5 + 1 = 16;
16 is even, so n₅ = 16 / 2 = 8;
8 is even, so n₆ = 8 / 2 = 4;
4 is even, so n₇ = 4 / 2 = 2;
2 is even, so n₈ = 1; and
序列终止。

编写一个函数，用于计算给定初始 n 的考拉兹序列的长度。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Welcome Back

Including 10 minute breaks, this session should take about 2 hours and 35 minutes. It contains:

Segment	Duration
元组和数组	35 minutes
引用	55 minutes
用户定义的类型	50 minutes

元组和数组

This segment should take about 35 minutes. It contains:

Slide	Duration
数组（Arrays）	5 minutes
元组（Tuples）	5 minutes
数组迭代	3 minutes
解构	5 minutes
练习：嵌套数组	15 minutes

数组（Arrays）


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

A value of the array type [T; N] holds N (a compile-time constant) elements of the same type T. Note that the length of the array is part of its type, which means that [u8; 3] and [u8; 4] are considered two different types. Slices, which have a size determined at runtime, are covered later.
请尝试访问出界数组元素。系统会在运行时检查数组访问。Rust 通常会通过优化消除这些检查，以及使用不安全的 Rust 来避免这些检查。
我们可以使用字面量来为数组赋值。
The println! macro asks for the debug implementation with the ? format parameter: {} gives the default output, {:?} gives the debug output. Types such as integers and strings implement the default output, but arrays only implement the debug output. This means that we must use debug output here.
添加 #，比如 {a:#?}，会输出“美观打印（pretty printing）” 格式，这种格式可能会更加易读。

元组（Tuples）


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

和数组一样，元组也具有固定的长度。
元组将不同类型的值组成一个复合类型。
元组中的字段可以通过英文句号加上值的下标进行访问比如：t.0, t.1。
The empty tuple () is referred to as the “unit type” and signifies absence of a return value, akin to void in other languages.

数组迭代

for 语句支持对数组进行迭代（但不支持元组）。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

此功能使用了 IntoIterator trait，但我们还没有讲到它。

这里新增了 assert_ne! 宏。此外，还有 assert_eq! 和 assert! 宏。系统始终会对这些宏进行检查，而像 debug_assert! 这样的仅调试变体在发布 build 中不会编译成任何代码。

解构

When working with tuples and other structured values it’s common to want to extract the inner values into local variables. This can be done manually by directly accessing the inner values:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

However, Rust also supports using pattern matching to destructure a larger value into its constituent parts:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

The patterns used here are “irrefutable”, meaning that the compiler can statically verify that the value on the right of = has the same structure as the pattern.
A variable name is an irrefutable pattern that always matches any value, hence why we can also use let to declare a single variable.
Rust also supports using patterns in conditionals, allowing for equality comparison and destructuring to happen at the same time. This form of pattern matching will be discussed in more detail later.
Edit the examples above to show the compiler error when the pattern doesn’t match the value being matched on.

练习：嵌套数组

数组可以包含其他数组：


#![allow(unused)]
fn main() {
let array = [[1, 2, 3], [4, 5, 6], [7, 8, 9]];
}

What is the type of this variable?

Use an array such as the above to write a function transpose which will transpose a matrix (turn rows into columns):

硬编码这两个函数，让它们处理 3 × 3 的矩阵。

将下面的代码复制到 https://play.rust-lang.org/ 并实现上述函数：


// TODO: remove this when you're done with your implementation.
#![allow(unused_variables, dead_code)]

fn transpose(matrix: [[i32; 3]; 3]) -> [[i32; 3]; 3] {
    unimplemented!()
}

#[test]
fn test_transpose() {
    let matrix = [
        [101, 102, 103], //
        [201, 202, 203],
        [301, 302, 303],
    ];
    let transposed = transpose(matrix);
    assert_eq!(
        transposed,
        [
            [101, 201, 301], //
            [102, 202, 302],
            [103, 203, 303],
        ]
    );
}

fn main() {
    let matrix = [
        [101, 102, 103], // <-- the comment makes rustfmt add a newline
        [201, 202, 203],
        [301, 302, 303],
    ];

    println!("matrix: {:#?}", matrix);
    let transposed = transpose(matrix);
    println!("transposed: {:#?}", transposed);
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

引用

This segment should take about 55 minutes. It contains:

Slide	Duration
共享引用	10 minutes
独占引用	10 minutes
Slices: &[T]	10 minutes
字符串	10 minutes
练习：几何图形	15 minutes

共享引用

引用提供了一种可以访问另一个值但无需对该值负责的方式，也被称为 “借用”。共享引用处于只读状态，且引用的数据无法更改。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

对类型 T 的共享引用表示为 &T。可以使用 & 运算符创建引用值。* 运算符会 “解引用”某个引用，并得到该引用值。

Rust 会静态禁止悬垂引用：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

引用被称为 “借用”了其所引用的值，这对于不熟悉指针的学生来说是一个很好的模型：代码可以通过引用来访问值，但原始变量仍然保有对该值的 “所有权”。本课程会在第 3 天详细介绍所有权。
引用是以指针的形式实现的，其关键优势在于它们可以比其所指的内容小得多。熟悉 C 或 C++ 的学生会将引用视为指针。本课程的后续部分将介绍 Rust 如何防止因使用原始指针而导致的内存安全 bug。
Rust 不会自动为您创建引用，必须始终使用 & 符号。
Rust will auto-dereference in some cases, in particular when invoking methods (try r.is_ascii()). There is no need for an -> operator like in C++.
在本例中，r 是可变的，因此可以为其重新赋值 (r = &b)。请注意，这会重新绑定 r，使其引用其他内容。这与 C++ 不同，在 C++ 中为引用赋值会更改引用的值。
共享引用不允许修改其所引用的值，即使该值是可变的。请尝试 *r = 'X'。
Rust 会跟踪所有引用的生命周期，以确保它们存在足够长的时间。在安全的 Rust 中不会出现悬空引用。x_axis 会返回对 point 的引用，但 point 会在该函数返回时取消分配，因此不会进行编译。
我们会在讲到所有权（ownership）时详细讨论借用（borrow）。

独占引用

独占引用（也称为可变引用）允许更改其所引用的值。它们的类型为 &mut T。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

关键点：

“独占模式”表示只有此引用可用于访问该值。在独占引用存在期间，不允许同时存在其他引用（无论是共享引用还是独占引用），并且无法访问引用的值。请尝试在 x_coord 处于活动状态时创建 &point.0 或更改 point.0。
Be sure to note the difference between let mut x_coord: &i32 and let x_coord: &mut i32. The first one represents a shared reference which can be bound to different values, while the second represents an exclusive reference to a mutable value.

切片

切片 (slice) 的作用是提供对集合 (collection) 的视图 (view):


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

切片从被切片的类型中借用 (borrow) 数据。
Question: What happens if you modify a[3] right before printing s?

Speaker Notes

This slide should take about 10 minutes.

We create a slice by borrowing a and specifying the starting and ending indexes in brackets.
If the slice starts at index 0, Rust’s range syntax allows us to drop the starting index, meaning that &a[0..a.len()] and &a[..a.len()] are identical.
The same is true for the last index, so &a[2..a.len()] and &a[2..] are identical.
To easily create a slice of the full array, we can therefore use &a[..].
s is a reference to a slice of i32s. Notice that the type of s (&[i32]) no longer mentions the array length. This allows us to perform computation on slices of different sizes.
Slices always borrow from another object. In this example, a has to remain ‘alive’ (in scope) for at least as long as our slice.
The question about modifying a[3] can spark an interesting discussion, but the answer is that for memory safety reasons you cannot do it through a at this point in the execution, but you can read the data from both a and s safely. It works before you created the slice, and again after the println, when the slice is no longer used.

字符串

现在我们就可以理解 Rust 中的两种字符串类型：

&str is a slice of UTF-8 encoded bytes, similar to &[u8].
String is an owned, heap-allocated buffer of UTF-8 bytes.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

&str introduces a string slice, which is an immutable reference to UTF-8 encoded string data stored in a block of memory. String literals ("Hello"), are stored in the program’s binary.
Rust’s String type is a wrapper around a vector of bytes. As with a Vec<T>, it is owned.
As with many other types String::from() creates a string from a string literal; String::new() creates a new empty string, to which string data can be added using the push() and push_str() methods.
The format!() macro is a convenient way to generate an owned string from dynamic values. It accepts the same format specification as println!().
You can borrow &str slices from String via & and optionally range selection. If you select a byte range that is not aligned to character boundaries, the expression will panic. The chars iterator iterates over characters and is preferred over trying to get character boundaries right.
For C++ programmers: think of &str as std::string_view from C++, but the one that always points to a valid string in memory. Rust String is a rough equivalent of std::string from C++ (main difference: it can only contain UTF-8 encoded bytes and will never use a small-string optimization).

Byte strings literals allow you to create a &[u8] value directly:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

原始字符串可在创建 &str 时禁用转义：r"\n" == "\\n"。可以在外层引号两侧添加相同数量的 #，以在字符串中嵌入双引号：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

练习：几何图形

我们将为三维几何图形创建几个实用函数，将点表示为 [f64;3]。函数签名由您自行确定。


// Calculate the magnitude of a vector by summing the squares of its coordinates
// and taking the square root. Use the `sqrt()` method to calculate the square
// root, like `v.sqrt()`.


fn magnitude(...) -> f64 {
    todo!()
}

// Normalize a vector by calculating its magnitude and dividing all of its
// coordinates by that magnitude.


fn normalize(...) {
    todo!()
}

// Use the following `main` to test your work.

fn main() {
    println!("Magnitude of a unit vector: {}", magnitude(&[0.0, 1.0, 0.0]));

    let mut v = [1.0, 2.0, 9.0];
    println!("Magnitude of {v:?}: {}", magnitude(&v));
    normalize(&mut v);
    println!("Magnitude of {v:?} after normalization: {}", magnitude(&v));
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

用户定义的类型

This segment should take about 50 minutes. It contains:

Slide	Duration
结构体	10 minutes
元组结构体	10 minutes
枚举	5 minutes
static	5 minutes
类型别名	2 minutes
练习：电梯事件	15 minutes

结构体

与 C 和 C++ 一样，Rust 支持自定义结构体：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

关键点：

结构体的运作方式与使用 C 或 C++ 时类似。
- 不需要 typedef 即可定义类型，这与使用 C++ 类似，但与使用 C 不同。
- 与使用 C++ 不同的是，结构体之间没有继承关系。
This may be a good time to let people know there are different types of structs.
- Zero-sized structs (e.g. struct Foo;) might be used when implementing a trait on some type but don’t have any data that you want to store in the value itself.
- 下一张幻灯片将介绍元组结构体，当字段名称不重要时使用。
If you already have variables with the right names, then you can create the struct using a shorthand.
The syntax ..avery allows us to copy the majority of the fields from the old struct without having to explicitly type it all out. It must always be the last element.

元组结构体

如果字段名称不重要，您可以使用元组结构体：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

这通常用于单字段封装容器（称为 newtype）：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

如需对基元类型中的值的额外信息进行编码，使用 newtype 是一种非常好的方式，例如：
- 数字会以某些单位来衡量：上方示例中为 Newtons。
- The value passed some validation when it was created, so you no longer have to validate it again at every use: PhoneNumber(String) or OddNumber(u32).
展示如何通过访问 newtype 中的单个字段，将 f64 值添加到 Newtons 类型。
- Rust 通常不喜欢不明确的内容，例如自动解封或将布尔值用作整数。
- 运算符过载在第 3 天（泛型）讨论。
此示例巧妙地引用了火星气候探测者号的失败事故。

枚举

enum 关键字允许创建具有几个不同变体的类型：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

关键点：

Enumerations allow you to collect a set of values under one type.
Direction is a type with variants. There are two values of Direction: Direction::Left and Direction::Right.
PlayerMove is a type with three variants. In addition to the payloads, Rust will store a discriminant so that it knows at runtime which variant is in a PlayerMove value.
This might be a good time to compare structs and enums:
- In both, you can have a simple version without fields (unit struct) or one with different types of fields (variant payloads).
- You could even implement the different variants of an enum with separate structs but then they wouldn’t be the same type as they would if they were all defined in an enum.

Rust 使用最小的空间来存储判标识。

如有必要，它会存储所需最小大小的整数
如果允许的变体值未涵盖所有位模式，则它将使用无效的位模式对判别标识进行编码（“小众优化”）。例如，Option<&u8> 存储的要么是指向整数的指针，要么是 None 变体的 NULL 值。

You can control the discriminant if needed (e.g., for compatibility with C):


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Without repr, the discriminant type takes 2 bytes, because 10001 fits 2 bytes.

探索更多

Rust 具有多种优化措施，可以减少枚举占用的空间。

Null pointer optimization: For some types, Rust guarantees that size_of::<T>() equals size_of::<Option<T>>().

Example code if you want to show how the bitwise representation may look like in practice. It’s important to note that the compiler provides no guarantees regarding this representation, therefore this is totally unsafe.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`static`

静态变量在程序的整个执行过程中始终有效，因此不会移动：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

As noted in the Rust RFC Book, these are not inlined upon use and have an actual associated memory location. This is useful for unsafe and embedded code, and the variable lives through the entirety of the program execution. When a globally-scoped value does not have a reason to need object identity, const is generally preferred.

Speaker Notes

This slide should take about 5 minutes.

static is similar to mutable global variables in C++.
static provides object identity: an address in memory and state as required by types with interior mutability such as Mutex<T>.

探索更多

Because static variables are accessible from any thread, they must be Sync. Interior mutability is possible through a Mutex, atomic or similar.

Thread-local data can be created with the macro std::thread_local.

`const`

Constants are evaluated at compile time and their values are inlined wherever they are used:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

根据 Rust RFC Book 这些变量在使用时是内联 (inlined) 的。

在编译时只能调用标记为“const”的函数以生成“const”值。不过，可在运行时调用“const”函数。

Speaker Notes

Mention that const behaves semantically similar to C++’s constexpr
虽然需要使用在运行中求值的常量的情况并不是很常见，但是它是有帮助的，而且比使用静态变量更安全。

类型别名

类型别名为另一种类型创建名称。这两种类型可以互换使用。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 2 minutes.

C 语言程序员会认为这类似于 typedef。

练习：电梯事件

我们将创建一个数据结构来表示电梯控制系统中的事件。您可以自行定义用于构造各种事件的类型和函数。使用 #[derive(Debug)] 以允许通过 {:?} 设置类型格式。

This exercise only requires creating and populating data structures so that main runs without errors. The next part of the course will cover getting data out of these structures.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

欢迎来到第二天

Now that we have seen a fair amount of Rust, today will focus on Rust’s type system:

Pattern matching: extracting data from structures.
方法：将函数与类型相关联。
特征：多个类型所共有的行为。
泛型：对其他类型进行类型参数化。
标准库类型和特征：探索 Rust 丰富的标准库。

时间表

Including 10 minute breaks, this session should take about 2 hours and 55 minutes. It contains:

Segment	Duration
欢迎	3 minutes
模式匹配	1 hour
方法和特征	50 minutes
泛型	40 minutes

模式匹配

This segment should take about 1 hour. It contains:

Slide	Duration
Matching Values	10 minutes
解构	10 minutes
let 控制流	10 minutes
练习：表达式求值	30 minutes

Matching Values

The match keyword lets you match a value against one or more patterns. The comparisons are done from top to bottom and the first match wins.

模式（pattern）可以是简单的值，其用法类似于 C 与 C++ 中的 switch 。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

The _ pattern is a wildcard pattern which matches any value. The expressions must be exhaustive, meaning that it covers every possibility, so _ is often used as the final catch-all case.

Match can be used as an expression. Just like if, each match arm must have the same type. The type is the last expression of the block, if any. In the example above, the type is ().

模式中的变量（本例中为 key）将创建一个可在匹配分支内使用的绑定。

只有当条件为真时，保护语句才能使分支匹配成功。

Speaker Notes

This slide should take about 10 minutes.

关键点：

You might point out how some specific characters are being used when in a pattern
- | as an or
- .. can expand as much as it needs to be
- 1..=5 represents an inclusive range
- “_”是通配符
有些想法比模式本身所允许的程度更加复杂，如果我们希望简要地表达这些想法，就必须把匹配守卫视为独立的语法功能。
它们与匹配分支中的单独“if”表达式不同。选择匹配分支后，分支块内（在“=>”之后）会出现“if”表达式。如果该分支块内的“if”条件失败，系统不会考虑原始“match”表达式的其他分支。
只要表达式在包含“|”的模式中，就会适用守卫定义的条件。

解构

与元组一样，结构体和枚举也可以通过匹配方式进行解构：

结构体


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

枚举

模式还可用于将变量绑定到值的某些部分。这是您检查类型结构的方式。我们先从简单的“enum”类型开始：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

在这里，我们使用了分支来解构“Result”值。在第一个分支中，“half”被绑定到“Ok”变体中的值。在第二个分支中，“msg”被绑定到错误消息。

Speaker Notes

This slide should take about 8 minutes.

结构体

更改“foo”中的字面量值以与其他模式相匹配。
向“Foo”添加一个新字段，并根据需要更改模式。
捕获和常量表达式之间的区别可能很难发现。尝试将第二个分支中的“2”更改为一个变量，可以看到它几乎无法运作了。将它更改为“const”，可以看到它又正常运作了。

枚举

关键点：

“if”/“else”表达式将返回一个枚举，该枚举之后会使用“match”进行解封装。
您可以尝试在枚举定义中添加第三个变体，并在运行代码时显示错误。指出代码现在有哪些地方还不详尽，并说明编译器会如何尝试给予提示。
The values in the enum variants can only be accessed after being pattern matched.
Demonstrate what happens when the search is inexhaustive. Note the advantage the Rust compiler provides by confirming when all cases are handled.
Save the result of divide_in_two in the result variable and match it in a loop. That won’t compile because msg is consumed when matched. To fix it, match &result instead of result. That will make msg a reference so it won’t be consumed. This “match ergonomics” appeared in Rust 2018. If you want to support older Rust, replace msg with ref msg in the pattern.

let 控制流

Rust 有几个与其他语言不同的控制流结构。它们用于模式匹配：

if let 表达式
while let expressions
match 表达式

`if let` 表达式

if let 表达式能让你根据某个值是否与模式相匹配来执行不同的代码：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`let else` expressions

如需了解匹配模式并从函数返回的常见情况，请使用 let else。“else” 分支必须执行不同的结束方式（例如，return、break 或 panic，但不能直接执行到代码块的末尾）。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

与 if let 一样，while let 变体会针对一个模式重复测试一个值：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Here String::pop returns Some(c) until the string is empty, after which it will return None. The while let lets us keep iterating through all items.

Speaker Notes

This slide should take about 10 minutes.

if-let

Unlike match, if let does not have to cover all branches. This can make it more concise than match.
使用 Option 时，常见的做法是处理 Some 值。
与 match 不同的是，if let 不支持模式匹配的 guard 子句。

let-else

if-lets can pile up, as shown. The let-else construct supports flattening this nested code. Rewrite the awkward version for students, so they can see the transformation.

重写后的版本为：


#![allow(unused)]
fn main() {
fn hex_or_die_trying(maybe_string: Option<String>) -> Result<u32, String> {
    let Some(s) = maybe_string else {
        return Err(String::from("got None"));
    };

    let Some(first_byte_char) = s.chars().next() else {
        return Err(String::from("got empty string"));
    };

    let Some(digit) = first_byte_char.to_digit(16) else {
        return Err(String::from("not a hex digit"));
    };

    return Ok(digit);
}
}

while-let

指出只要值与模式匹配，while let 循环就会一直进行下去。
You could rewrite the while let loop as an infinite loop with an if statement that breaks when there is no value to unwrap for name.pop(). The while let provides syntactic sugar for the above scenario.

练习：表达式求值

Let’s write a simple recursive evaluator for arithmetic expressions.

The Box type here is a smart pointer, and will be covered in detail later in the course. An expression can be “boxed” with Box::new as seen in the tests. To evaluate a boxed expression, use the deref operator (*) to “unbox” it: eval(*boxed_expr).

Some expressions cannot be evaluated and will return an error. The standard Result<Value, String> type is an enum that represents either a successful value (Ok(Value)) or an error (Err(String)). We will cover this type in detail later.

将代码复制粘贴到 Rust Playground，然后开始实现 eval。最终结果应能通过测试。使用 todo!() 并使测试逐个通过可能会很有帮助。您还可以使用 #[ignore] 暂时跳过测试：


#[test]
#[ignore]
fn test_value() { .. }

If you finish early, try writing a test that results in division by zero or integer overflow. How could you handle this with Result instead of a panic?


#![allow(unused)]
fn main() {
/// An operation to perform on two subexpressions.
#[derive(Debug)]
enum Operation {
    Add,
    Sub,
    Mul,
    Div,
}

/// An expression, in tree form.
#[derive(Debug)]
enum Expression {
    /// An operation on two subexpressions.
    Op { op: Operation, left: Box<Expression>, right: Box<Expression> },

    /// A literal value
    Value(i64),
}

fn eval(e: Expression) -> Result<i64, String> {
    todo!()
}

#[test]
fn test_value() {
    assert_eq!(eval(Expression::Value(19)), Ok(19));
}

#[test]
fn test_sum() {
    assert_eq!(
        eval(Expression::Op {
            op: Operation::Add,
            left: Box::new(Expression::Value(10)),
            right: Box::new(Expression::Value(20)),
        }),
        Ok(30)
    );
}

#[test]
fn test_recursion() {
    let term1 = Expression::Op {
        op: Operation::Mul,
        left: Box::new(Expression::Value(10)),
        right: Box::new(Expression::Value(9)),
    };
    let term2 = Expression::Op {
        op: Operation::Mul,
        left: Box::new(Expression::Op {
            op: Operation::Sub,
            left: Box::new(Expression::Value(3)),
            right: Box::new(Expression::Value(4)),
        }),
        right: Box::new(Expression::Value(5)),
    };
    assert_eq!(
        eval(Expression::Op {
            op: Operation::Add,
            left: Box::new(term1),
            right: Box::new(term2),
        }),
        Ok(85)
    );
}

#[test]
fn test_error() {
    assert_eq!(
        eval(Expression::Op {
            op: Operation::Div,
            left: Box::new(Expression::Value(99)),
            right: Box::new(Expression::Value(0)),
        }),
        Err(String::from("division by zero"))
    );
}
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

方法和特征

This segment should take about 50 minutes. It contains:

Slide	Duration
方法	10 minutes
特征（Trait）	15 minutes
派生特征	3 minutes
练习：通用日志	20 minutes

方法

Rust 允许您将函数与新类型相关联。您可以使用“impl”块来执行此操作：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

The self arguments specify the “receiver” - the object the method acts on. There are several common receivers for a method:

“&self”：使用不可变的共享引用从调用方借用对象。之后可以再次使用该对象。
“&mut self”：使用唯一的可变引用从调用方借用对象。之后可以再次使用该对象。
“self”：获取对象的所有权并将其从调用方移出。该方法会成为对象的所有者。除非明确转移对象的所有权，否则在该方法返回时，对象将被丢弃（取消分配）。具备完全所有权，不自动等同于具备可变性。
mut self: same as above, but the method can mutate the object.
无接收器：这将变为结构体上的静态方法。通常用于创建构造函数，按惯例被称为“new”。

Speaker Notes

This slide should take about 8 minutes.

关键点：

引入方法时，将方法与函数进行比较会很有帮助。
- 在某种类型（例如结构体或枚举）的实例上调用方法，第一个参数将该实例表示为“self”。
- 开发者可能会选择使用方法，以便利用方法接收器语法并让方法更有条理。通过使用方法，我们可以将所有实现代码保存在一个可预测的位置。
指出关键字“self”的用法，它是一种方法接收器。
- 显示它是“self: Self”的缩写术语，或许要显示结构体名称的可能用法。
- 说明“Self”是“impl”块所属类型的类型别名，可以在块中的其他位置使用。
- 指出“self”的使用方式与其他结构体一样，并且可以使用点表示法来指代各个字段。
- This might be a good time to demonstrate how the &self differs from self by trying to run finish twice.
- Beyond variants on self, there are also special wrapper types allowed to be receiver types, such as Box<Self>.

特征（Trait）

Rust 让您可以依据特征对类型进行抽象化处理。特征与接口类似：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide and its sub-slides should take about 15 minutes.

trait 定义了类型实现该 trait 所必须具备的一些方法。
In the “Generics” segment, next, we will see how to build functionality that is generic over all types implementing a trait.

实现 Unsafe Trait


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

To implement Trait for Type, you use an impl Trait for Type { .. } block.
Unlike Go interfaces, just having matching methods is not enough: a Cat type with a talk() method would not automatically satisfy Pet unless it is in an impl Pet block.
Traits may provide default implementations of some methods. Default implementations can rely on all the methods of the trait. In this case, greet is provided, and relies on talk.

共享类型

Associated types are placeholder types which are supplied by the trait implementation.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

Associated types are sometimes also called “output types”. The key observation is that the implementer, not the caller, chooses this type.
Many standard library traits have associated types, including arithmetic operators and Iterator.

派生特征

系统可以自动为您的自定义类型实现支持的 trait，如下所示：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

派生功能是通过宏实现的，并且许多 crate 提供有用的派生宏，以添加实用功能。例如，serde 可以使用 #[derive(Serialize)] 为结构体派生序列化支持。

Exercise: Logger Trait

Let’s design a simple logging utility, using a trait Logger with a log method. Code which might log its progress can then take an &impl Logger. In testing, this might put messages in the test logfile, while in a production build it would send messages to a log server.

However, the StderrLogger given below logs all messages, regardless of verbosity. Your task is to write a VerbosityFilter type that will ignore messages above a maximum verbosity.

This is a common pattern: a struct wrapping a trait implementation and implementing that same trait, adding behavior in the process. What other kinds of wrappers might be useful in a logging utility?


use std::fmt::Display;

pub trait Logger {
    /// Log a message at the given verbosity level.
    fn log(&self, verbosity: u8, message: impl Display);
}

struct StderrLogger;

impl Logger for StderrLogger {
    fn log(&self, verbosity: u8, message: impl Display) {
        eprintln!("verbosity={verbosity}: {message}");
    }
}

fn do_things(logger: &impl Logger) {
    logger.log(5, "FYI");
    logger.log(2, "Uhoh");
}

// TODO: Define and implement `VerbosityFilter`.

fn main() {
    let l = VerbosityFilter { max_verbosity: 3, inner: StderrLogger };
    do_things(&l);
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

泛型

This segment should take about 40 minutes. It contains:

Slide	Duration
泛型函数	5 minutes
泛型类型	10 minutes
特征边界	10 minutes
impl Trait	5 minutes
练习：通用 min 函数	10 minutes

泛型函数

Rust supports generics, which lets you abstract algorithms or data structures (such as sorting or a binary tree) over the types used or stored.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

Rust 会根据参数类型和返回值推理出 T 的类型。
这与 C++ 模板类似，但 Rust 会立即编译部分通用函数，因此该函数必须对所有符合约束条件的类型都有效。例如，请尝试修改 pick 函数，如果 n == 0，则返回 even + odd。即使仅使用带有整数的“pick”实例化，Rust 仍会将其视为无效。C++ 可让您做到这一点。
Generic code is turned into non-generic code based on the call sites. This is a zero-cost abstraction: you get exactly the same result as if you had hand-coded the data structures without the abstraction.

泛型类型

您可以使用泛型对具体字段类型进行抽象化处理：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

*问：*为什么 T 在 impl<T> Point<T> {} 中指定了两次？这不是多余的吗？
- 这是因为它是泛型类型的泛型实现部分。它们是独立的泛型内容。
- 这意味着这些方法是针对所有 T 定义的。
- It is possible to write impl Point<u32> { .. }.
  - Point 依然是一个泛型，并且您可以使用 Point<f64>，但此块中的方法将仅适用于 Point<u32>。
请尝试声明一个新变量 let p = Point { x: 5, y: 10.0 };。通过使用两种类型变量（例如 T 和 U），更新代码以允许具有不同类型元素的点。

泛型

Traits can also be generic, just like types and functions. A trait’s parameters get concrete types when it is used.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

The From trait will be covered later in the course, but its definition in the std docs is simple.
Implementations of the trait do not need to cover all possible type parameters. Here, Foo::From("hello") would not compile because there is no From<&str> implementation for Foo.
Generic traits take types as “input”, while associated types are a kind of “output” type. A trait can have multiple implementations for different input types.
In fact, Rust requires that at most one implementation of a trait match for any type T. Unlike some other languages, Rust has no heuristic for choosing the “most specific” match. There is work on adding this support, called specialization.

特征边界

使用泛型时，您通常会想要利用类型来实现某些特性，这样才能调用此特征的方法。

您可以使用 T: Trait 或 impl Trait 执行此操作：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 8 minutes.

请尝试创建一个 NonClonable 对象，并将其传递给 duplicate 函数。
当需要多个 trait 时，请使用 + 将它们连接起来。
显示 where 子句，学员在阅读代码时会看到它。
```
fn duplicate<T>(a: T) -> (T, T)
where
    T: Clone,
{
    (a.clone(), a.clone())
}
```
- 它会在您有多个形参的情况下整理函数签名。
- 它具有额外功能，因此也更强大。
  - 如果有人提问，便阐明额外功能是指“:”左侧的类别可为任意值，例如 Option<T>。
请注意，Rust 尚不支持专精领域认证。例如，根据原始 duplicate 函数，添加专精领域认证的 Duplicate(a: u32) 是无效的。

`impl Trait`

与特征边界类似，impl Trait 语法可以在函数形参和返回值中使用：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

impl Trait allows you to work with types which you cannot name. The meaning of impl Trait is a bit different in the different positions.

对形参来说，impl Trait 就像是具有特征边界的匿名泛型形参。
对返回值类型来说，它则意味着返回值类型就是实现该特征的某具体类型，无需为该类型命名。如果您不想在公共 API 中公开该具体类型，便可使用此方法。

在返回位置处进行推断有一定难度。会返回 impl Foo 的函数会挑选自身返回的具体类型，而不必在来源中写出此信息。会返回泛型类型（例如 collect<B>() -> B）的函数则可返回符合 B 的任何类型，而调用方可能需要选择一个类型，例如使用 let x: Vec<_> = foo.collect() 或使用以下 Turbofish：foo.collect::<Vec<_>>()。

debuggable 是什么类型？尝试输入 let debuggable: () = ..，查看会显示什么错误消息。

练习：通用 `min` 函数

In this short exercise, you will implement a generic min function that determines the minimum of two values, using the Ord trait.


use std::cmp::Ordering;

// TODO: implement the `min` function used in `main`.

fn main() {
    assert_eq!(min(0, 10), 0);
    assert_eq!(min(500, 123), 123);

    assert_eq!(min('a', 'z'), 'a');
    assert_eq!(min('7', '1'), '1');

    assert_eq!(min("hello", "goodbye"), "goodbye");
    assert_eq!(min("bat", "armadillo"), "armadillo");
}

Speaker Notes

This slide and its sub-slides should take about 10 minutes.

Show students the Ord trait and Ordering enum.

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Welcome Back

Including 10 minute breaks, this session should take about 3 hours and 10 minutes. It contains:

Segment	Duration
标准库类型	1 hour and 20 minutes
标准库特征	1 hour and 40 minutes

标准库类型

This segment should take about 1 hour and 20 minutes. It contains:

Slide	Duration
标准库	3 minutes
文档	5 minutes
Option	10 minutes
Result	10 minutes
String	10 minutes
Vec	10 minutes
HashMap	10 minutes
练习：计数器	20 minutes

Speaker Notes

对于本部分的每张幻灯片，请花些时间仔细阅读文档页面，重点了解一些较为常用的方法。

标准库

Rust comes with a standard library which helps establish a set of common types used by Rust libraries and programs. This way, two libraries can work together smoothly because they both use the same String type.

In fact, Rust contains several layers of the Standard Library: core, alloc and std.

core includes the most basic types and functions that don’t depend on libc, allocator or even the presence of an operating system.
alloc 包括需要全局堆分配器的类型，例如 Vec、Box 和 Arc。
嵌入式 Rust 应用通常只使用 core，偶尔会使用 alloc。

文档

Rust comes with extensive documentation. For example:

All of the details about loops.
‘u8’ 等基元类型。
Standard library types like Option or BinaryHeap.

事实上，您可以为自己的代码编写文档：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

The contents are treated as Markdown. All published Rust library crates are automatically documented at docs.rs using the rustdoc tool. It is idiomatic to document all public items in an API using this pattern.

如需从项内（例如在模块内）为项编写文档，请使用 //! 或 /*! .. */，这称为“内部文档注释”：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

Show students the generated docs for the rand crate at https://docs.rs/rand.

Option

我们已经了解了 Option<T> 的一些用法。它可以存储“T”类型的值，或者不存储任何值。例如，‘String::find’ 会返回 Option<usize>。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

Option is widely used, not just in the standard library.
unwrap 会返回 Option 或 panic 中的值。expect 方法与此类似，但其使用错误消息。
- 出现 None 时您或许会恐慌，但不能 “无意中”忘记检查是否为 None 的情况。
- 在草拟阶段的编程中，频繁使用 unwrap/expect 进行处理十分常见，但在正式版代码时，通常以更为妥当的方式处理 None 的情况。
小众优化意味着 Option<T> 在内存中的大小通常与 T 相同。

Result

Result 与 Option 相似，但表示操作成功或失败，且每个操作的类型不同。这类似于表达式练习中定义的 Res，但是一个泛型：Result<T, E>，其中 T 用于 Ok 变体，E 出现在 Err 变体中。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

与 Option 方法相同，成功值位于 Result 方法内部，开发者必须显示提取成功值。因此，建议进行错误检查。在绝不应出现错误的情况下，可以调用 unwrap() 或 expect() 方法，这也是一种开发者意向信号。
Result documentation is a recommended read. Not during the course, but it is worth mentioning. It contains a lot of convenience methods and functions that help functional-style programming.
Result is the standard type to implement error handling as we will see on Day 4.

String

String 是标准堆分配的可扩容 UTF-8 字符串缓冲区：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

String 会实现 Deref<Target = str>，这意味着您可以对 String 调用所有 str 方法。

Speaker Notes

This slide should take about 10 minutes.

“String::new”会返回一个新的空字符串，如果您知道自己想要推送到字符串的数据量，请使用“String::with_capacity”。
“String::len”会返回“String”的大小（以字节为单位，可能不同于以字符为单位的长度）。
“String::chars”会针对实际字符返回一个迭代器。请注意，由于字素簇，“char”可能与人们所认为的“字符”有所不同。
当人们提到字符串时，可能是指“&str”或“String”。
当某个类型实现“Deref<Target = T>”时，编译器会让您以公开透明方式从“T”调用方法。
- 我们尚未讨论过 Deref trait，所以本部分目前主要介绍文档中边栏的结构。
- “String”会实现“Deref<Target = str>”，后者可公开透明地授予其访问“str”方法的权限。
- Write and compare let s3 = s1.deref(); and let s3 = &*s1;.
“String”是作为字节矢量的封装容器实现的，矢量上支持的许多操作在“String”上也受支持，但有一些额外保证。
比较将“String”编入索引的不同方式：
- 使用“s3.chars().nth(i).unwrap()”转换为字符，其中“i”代表是否出界。
- 通过使用“s3[0..4]”转换为子字符串，其中该 Slice 在或不在字符边界上。
Many types can be converted to a string with the to_string method. This trait is automatically implemented for all types that implement Display, so anything that can be formatted can also be converted to a string.

`Vec`

Vec 是标准的可调整大小堆分配缓冲区：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Vec 会实现 Deref<Target = [T]>，这意味着您可以对 Vec 调用 slice 方法。

Speaker Notes

This slide should take about 10 minutes.

Vec is a type of collection, along with String and HashMap. The data it contains is stored on the heap. This means the amount of data doesn’t need to be known at compile time. It can grow or shrink at runtime.
Notice how Vec<T> is a generic type too, but you don’t have to specify T explicitly. As always with Rust type inference, the T was established during the first push call.
“vec![…]”是用来代替“Vec::new()”的规范化宏，它支持向矢量添加初始元素。
如需将矢量编入索引，您可以使用“[’ ‘]”方法，但如果超出边界，矢量将会 panic。此外，使用“get”将返回“Option”。“pop”函数会移除最后一个元素。
Slices are covered on day 3. For now, students only need to know that a value of type Vec gives access to all of the documented slice methods, too.

`HashMap`

标准的哈希映射，内含针对 HashDoS 攻击的保护措施：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

“HashMap”未在序言中定义，因此需要纳入范围中。
请尝试使用以下代码行。第一行将查看图书是否在 hashmap 中；如果不在，则返回替代值。如果未找到图书，第二行会在 hashmap 中插入替代值。
```
let pc1 = page_counts
    .get("Harry Potter and the Sorcerer's Stone")
    .unwrap_or(&336);
let pc2 = page_counts
    .entry("The Hunger Games".to_string())
    .or_insert(374);
```
遗憾的是，与“vec!”不同，不存在标准的“hashmap!”宏。
- 不过，从 Rust 1.56 开始，HashMap 实现了“From<[(K, V); N]>”，让我们能够轻松地从字面量数组初始化哈希映射：
```
let page_counts = HashMap::from([
  ("Harry Potter and the Sorcerer's Stone".to_string(), 336),
  ("The Hunger Games".to_string(), 374),
]);
```
或者，HashMap 也可以基于任何可生成键-值元组的“Iterator”进行构建。
我们要展示“HashMap<String, i32>”，避免将“&str”用作键，以便简化示例。当然，可以在集合中使用引用，但可能会导致借用检查器出现复杂问题。
- 尝试从上述示例中移除“to_string()”，看看它是否仍可编译。您认为我们可能会在哪些方面遇到问题？
此类型具有几种特定于方法的返回值类型，例如“std::collections::hash_map::Keys”。这些类型通常会出现在 Rust 文档的搜索结果中。向学员展示此类型的文档，以及指向“keys”方法的实用链接。

练习：计数器

在本练习中，您将学习一个非常简单的数据结构，并将其变成泛型的。该结构使用 std::collections::HashMap 来跟踪已经出现过的值以及每个值出现的次数。

Counter 的初始版本经过硬编码，仅适用于 u32 值。使结构体及其方法可用于所跟踪的值类型，以便 Counter 能够跟踪任何类型的值。

如果提前完成操作，请尝试使用 entry 方法将哈希查找次数减半，从而实现 count 方法。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

标准库特征

This segment should take about 1 hour and 40 minutes. It contains:

Slide	Duration
比较	10 minutes
运算符	10 minutes
From 和 Into	10 minutes
类型转换	5 minutes
Read 和 Write	10 minutes
Default，结构体更新语法	5 minutes
闭包	20 minutes
练习：ROT13	30 minutes

Speaker Notes

与标准库类型一样，请花些时间仔细阅读每个 trait 的文档。

此部分内容较长。中途可休息一下。

比较

这些 trait 支持在值之间进行比较。对于包含实现这些 trait 的字段，可以派生所有这些 trait。

`PartialEq` and `Eq`

PartialEq 指部分等价关系，其中包含必需的方法 eq 和提供的方法 ne。== 和 != 运算符会调用这些方法。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Eq is a full equivalence relation (reflexive, symmetric, and transitive) and implies PartialEq. Functions that require full equivalence will use Eq as a trait bound.

`PartialOrd` and `Ord`

PartialOrd 定义了使用 partial_cmp 方法的部分排序。它用于实现 <、<=、>= 和 > 运算符。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Ord 是总排序，其中 cmp 返回 Ordering。

Speaker Notes

This slide should take about 10 minutes.

PartialEq 可以在不同类型之间实现，但 Eq 不能，因为它具有自反性：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

在实践中，派生这些 trait 很常见，但很少会实现它们。

运算符

运算符重载是通过 std::ops 中的特征实现的：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

讨论点：

You could implement Add for &Point. In which situations is that useful?
- 回答：Add:add 会耗用 self。如果您的运算符重载对象（即类型 T）不是 Copy，建议您也为 &T 重载运算符。这可避免调用点上存在不必要的克隆任务。
为什么 Output 是关联类型？可将它用作该方法的类型形参吗？
- Short answer: Function type parameters are controlled by the caller, but associated types (like Output) are controlled by the implementer of a trait.
您可以针对两种不同类型实现 Add，例如， impl Add<(i32, i32)> for Point 会向 Point 中添加元组。

`From` 和 `Into`

类型会实现 From 和 Into 以加快类型转换：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

实现 From 后，系统会自动实现 Into：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

这就是为什么通常只需实现 From，因为您的类型也会实现 Into。
若要声明某个函数实参输入类型（例如“任何可转换成 String 的类型”），规则便会相反，此时应使用 Into。您的函数会接受可实现 From 的类型，以及那些仅实现 Into 的类型。

类型转换

Rust 没有隐式类型转换，但支持使用 as 进行显式转换。转换写法通常和 C 语言的写法相一致。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

使用 as 的结果在 Rust 中始终定义明确，并且在不同平台上保持一致。这可能和您对于更改符号或转换为更小类型的直观理解不一样。为清晰起见，请查看文档和注释。

使用 as 进行类型转换是一种快捷好用但容易出错的方法，也在未来进行代码更新时，比如改变了类型或类型范围时会导致潜伏的 Bug。类型转换最好是用于明确是要进行无条件截取（比如截取低位的 u64 为 u32，忽略高位的数值）。

对于绝对不会出错的转换（比如 u32 转 u64），相比 as，更推荐用 From 或 Into 以肯定该转换是不会出错的。对于可能出错的转换，如果你想对这些不能成功转换的情况有不同处理方案时，可以考虑 TryFrom 和 TryInto。

Speaker Notes

This slide should take about 5 minutes.

请在这张幻灯片之后休息一下。

as 类似于 C++ 静态类型转换。通常不建议在可能丢失数据的情况下使用 as，或者至少应该添加说明性注释。

会经常遇到的是，将整数类型转换为 usize 以用作索引。

`Read` 和 `Write`

您可以使用 Read 和 BufRead 对 u8 来源进行抽象化处理：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

您同样可使用 Write 对 u8 接收器进行抽象化处理：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`Default` 特征

Default 特征会为类型生成默认值。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

系统可以直接实现它，也可以通过 #[derive(Default)] 派生出它。
A derived implementation will produce a value where all fields are set to their default values.
- 这意味着，该结构体中的所有类型也都必须实现 Default。
标准的 Rust 类型通常会以合理的值（例如 0“” `等）实现Default`。
The partial struct initialization works nicely with default.
The Rust standard library is aware that types can implement Default and provides convenience methods that use it.
The .. syntax is called struct update syntax.

闭包

闭包或 lambda 表达式具有无法命名的类型。不过，它们会实现特殊的 Fn， FnMut 和 FnOnce 特征：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 20 minutes.

Fn（例如 add_3）既不会耗用也不会修改捕获的值，或许也不会捕获任何值。它可被并发调用多次。

FnMut（例如 accumulate）可能会改变捕获的值。您可以多次调用它，但不能并发调用它。

如果您使用 FnOnce（例如 multiply_sum），或许只能调用它一次。它可能会耗用所捕获的值。

FnMut 是 FnOnce 的子类型。Fn 是 FnMut 和 FnOnce 的子类型。也就是说，您可以在任何需要调用 FnOnce 的地方使用 FnMut，还可在任何需要调用 FnMut 或 FnOnce 的地方使用 Fn。

When you define a function that takes a closure, you should take FnOnce if you can (i.e. you call it once), or FnMut else, and last Fn. This allows the most flexibility for the caller.

In contrast, when you have a closure, the most flexible you can have is Fn (it can be passed everywhere), then FnMut, and lastly FnOnce.

编译器也会推断 Copy（例如针对 add_3）和 Clone（例如 multiply_sum），具体取决于闭包捕获的数据。

默认情况下，闭包会依据引用来捕获数据（如果可以的话）。move 关键字则可让闭包依据值来捕获数据。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

练习：ROT13

在此示例中，您将实现经典的 “ROT13”加密。将此代码复制到 Playground，并实现缺失的位。请仅旋转 ASCII 字母字符，以确保结果仍为有效的 UTF-8 编码字符。


use std::io::Read;

struct RotDecoder<R: Read> {
    input: R,
    rot: u8,
}

// Implement the `Read` trait for `RotDecoder`.

fn main() {
    let mut rot =
        RotDecoder { input: "Gb trg gb gur bgure fvqr!".as_bytes(), rot: 13 };
    let mut result = String::new();
    rot.read_to_string(&mut result).unwrap();
    println!("{}", result);
}

#[cfg(test)]
mod test {
    use super::*;

    #[test]
    fn joke() {
        let mut rot =
            RotDecoder { input: "Gb trg gb gur bgure fvqr!".as_bytes(), rot: 13 };
        let mut result = String::new();
        rot.read_to_string(&mut result).unwrap();
        assert_eq!(&result, "To get to the other side!");
    }

    #[test]
    fn binary() {
        let input: Vec<u8> = (0..=255u8).collect();
        let mut rot = RotDecoder::<&[u8]> { input: input.as_ref(), rot: 13 };
        let mut buf = [0u8; 256];
        assert_eq!(rot.read(&mut buf).unwrap(), 256);
        for i in 0..=255 {
            if input[i] != buf[i] {
                assert!(input[i].is_ascii_alphabetic());
                assert!(buf[i].is_ascii_alphabetic());
            }
        }
    }
}

如果将两个 RotDecoder 实例链接在一起，每个实例旋转 13 个字符，会发生什么情况？

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

欢迎参加第 3 天的课程

今日内容：

内存管理、生命周期和借用检查器：Rust 如何确保内存安全。
智能指针：标准库指针类型。

时间表

Including 10 minute breaks, this session should take about 2 hours and 20 minutes. It contains:

Segment	Duration
欢迎	3 minutes
内存管理	1 hour
智能指针	55 minutes

内存管理

This segment should take about 1 hour. It contains:

Slide	Duration
回顾：程序的内存分配	5 minutes
内存管理方法	10 minutes
所有权	5 minutes
移动语义	5 minutes
Clone	2 minutes
复合类型	5 minutes
Drop	10 minutes
练习：构建器类型	20 minutes

回顾：程序的内存分配

程序通过以下两种方式分配内存：

栈：局部变量的连续内存区域。
- 值在编译时具有已知的固定大小。
- 速度极快：只需移动一个栈指针。
- 易于管理：遵循函数调用规则。
- 优秀的内存局部性。
堆：函数调用之外的值的存储。
- 值具有动态大小，具体大小需在运行时确定。
- 比栈稍慢：需要向系统申请空间。
- 不保证内存局部性。

示例

Creating a String puts fixed-sized metadata on the stack and dynamically sized data, the actual string, on the heap:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

指出 String 底层由 Vec 实现，因此它具有容量和长度，如果值可变，则可以通过在堆上重新分配存储空间进行增长。
如果学员提出相关问题，你可以提及我们不仅能使用[系统分配器]在堆上分配底层内存，还能使用 Allocator API 实现自定义分配器

探索更多

We can inspect the memory layout with unsafe Rust. However, you should point out that this is rightfully unsafe!


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

内存管理方法

传统上，语言分为两大类：

通过手动内存管理实现完全控制：C、C++、Pascal…
- 程序员决定何时分配或释放堆内存。
- 程序员必须确定指针是否仍指向有效内存。
- 研究表明，程序员难免会犯错。
运行时通过自动内存管理实现完全安全：Java、Python、Go、Haskell…
- 运行时系统可确保在内存无法被引用之前，不会释放该内存。
- 通常通过引用计数、垃圾回收或 RAII 实现。

Rust 提供了一个全新的组合：

通过编译时强制执行正确的内存>管理来实现完全控制与安全。

它通过一个明确的所有权（ownership）概念来实现此目的。

Speaker Notes

This slide should take about 10 minutes.

本幻灯片旨在帮助学习其他语言的学生更好地了解 Rust。

C 语言必须使用 malloc 和 free 函数手动管理堆。常见错误包括忘记调用 free、针对同一指针多次调用它，或在释放某指针所指向的内存后解引用它。
C++ 具有智能指针（unique_ptr、shared_ptr）等工具，可以利用与调用析构函数相关的语言保证来确保在函数返回时释放内存。这些工具仍然很容易被滥用并导致与 C 语言类似的 bug。
Java、Go 和 Python 依赖垃圾回收器来识别无法再访问的内存并将其舍弃。这保证可对所有指针进行解引用操作，从而消除了释放后使用等各类 bug。但是，垃圾回收 (GC) 会产生运行时成本，并且很难进行适当调优。

在许多情况下，Rust 的所有权和借用模型可以实现 C 语言的性能，能够精确地在所需位置执行分配和释放操作，且为零成本。它还提供类似于 C++ 智能指针的工具。必要时，它还提供引用计数等其他选项，甚至还有第三方 crate 可以支持运行时垃圾回收（本课程中不作介绍）。

所有权

所有变量绑定都有一个有效的“作用域”，使用超出其作用域的变量是错误的：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

We say that the variable owns the value. Every Rust value has precisely one owner at all times.

At the end of the scope, the variable is dropped and the data is freed. A destructor can run here to free up resources.

Speaker Notes

This slide should take about 5 minutes.

熟悉垃圾回收实现的学生知道，垃圾回收器从一组 “根”开始查找所有可访问内存。Rust 的 “单一所有者”原则与此类似。

移动语义

An assignment will transfer ownership between variables:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

将 s1 赋值给 s2，即转移了所有权。
When s1 goes out of scope, nothing happens: it does not own anything.
当 s2 离开作用域时，字符串数据被释放。

移动到 s2 中之前：

移动到 s2 中之后：

你将值传递给函数时，该值会被赋给函数参数。这就转移了所有权：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

指出这与 C++ 中的默认值相反。除非你使用 std::move（并已定义 move 构造函数！），否则 C++ 中的默认值是按值复制的。
只有所有权发生了转移。是否会生成任何机器码来操控数据本身是一个优化方面的问题，系统会主动优化此类副本。
简单的值（例如整数）可以标记为“Copy”（请看后续幻灯片）。
在 Rust 中，克隆是显式的（通过使用 clone）。

在 say_hello 示例中：

首次调用 say_hello 时，main 便放弃了 name 的所有权。此后，main 中不能再使用 name。
在 say_hello 函数结束时，系统会释放为 name 分配的堆内存。
如果 main 将 name 作为引用 (&name) 传递过去，且 say_hello 接受作为参数的引用，则可保留所有权。
此外，main 也可以在首次调用时传递 name 的克隆 (name.clone())。
相较于 C++，Rust 通过将移动语义设为默认值，并强制程序员进行显式克隆，更难以无意中创建副本。

探索更多

Defensive Copies in Modern C++

现代 C++ 以不同的方式解决此问题：


std::string s1 = "Cpp";
std::string s2 = s1;  // Duplicate the data in s1.

s1 中的堆数据被复制，s2 获得自己的独立副本。
当 s1 和 s2 离开作用域时，它们会各自释放自己的内存。

复制-赋值之前：

复制-赋值之后：

关键点：

C++ 做出了与 Rust 略有不同的选择。由于“=”会复制数据，因此必须克隆字符串数据。否则，当任一字符串超出范围时，便会出现二次释放。
C++ 还包含“std::move”，它用于指示何时可以移动某个值。如果示例为“s2 = std::move(s1)”，则不会发生堆分配。移动后，“s1”将处于有效但未指定的状态。与 Rust 不同，程序员可以继续使用“s1”。
与 Rust 不同，使用 C++ 时，“=”可以运行任意代码，具体取决于要复制或移动的类型。

Clone

有时，如需复制某个值。Clone 特征可以完成此操作。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 2 minutes.

Clone 的设计理念是让您轻松发现堆分配的位置。查找 .clone() 和其他一些内容，例如 Vec::new 或 Box::new。

通常的做法是，先使用 “克隆操作”解决借用检查器问题，在后续通过优化消除这些克隆操作。

复合类型

虽然移动语义是默认的，但默认情况下会复制某些类型：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

这些类型实现了 Copy trait。

你可以选择自己的类型来使用复制语义：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

赋值之后，p1 和 p2 都拥有自己的数据。
我们还可以使用 p1.clone() 显式复制数据。

Speaker Notes

This slide should take about 5 minutes.

复制和克隆是两码事：

复制是指内存区域的按位复制，不适用于任意对象。
复制不允许自定义逻辑（不同于 C++ 中的复制构造函数）。
克隆是一种更通用的操作，也允许通过实现 Clone trait 来自定义行为。
复制不适用于实现 Drop trait 的类型。

在上述示例中，请尝试以下操作：

在 struct Point 中添加 String 字段。由于 String 不属于 Copy 类型，因此无法编译。
Remove Copy from the derive attribute. The compiler error is now in the println! for p1.
指出如果你改为克隆 p1，则可按预期运行。

`Drop` 特征

用于实现 Drop 的值可以指定在超出范围时运行的代码：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 8 minutes.

请注意，std::mem::drop 与 std::ops::Drop::drop 不同。
当值超出范围时，系统会自动将其删除。
丢弃某个值时，如果该值实现了 std::ops::Drop，则会调用其 Drop::drop 实现。
然后，该值所有字段也会被丢弃，无论其是否实现了 Drop。
std::mem::drop 只是一个采用任何值的空函数。重要的是它获得了值的所有权，因此在其作用域结束时便会被丢弃。如此您可以轻松提前明确地丢弃值，而不必等到值超过范围的时候。
- 这对于通过 drop 执行任务的对象来说非常有用，例如释放锁、关闭文件等。

讨论点：

为什么 Drop::drop 不使用 self？
- 简答：如果这样的话，系统会在代码块结尾调用 std::mem::drop，进而引发再一次调用 Drop::drop，并引发堆栈溢出！
尝试用 a.drop() 替换 drop(a)。

练习：构建器类型

在此示例中，我们将实现一个拥有全部数据所有权的复杂数据类型。我们将使用 “构建器模式”来支持逐步构建新值，通过便捷函数来实现。

填补缺失的内容。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

智能指针

This segment should take about 55 minutes. It contains:

Slide	Duration
Box	10 minutes
Rc	5 minutes
特征对象	10 minutes
练习：二叉树	30 minutes

`Box<T>`

Box 是指向堆上数据的自有指针：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Box<T> 会实现 Deref<Target = T>，这意味着您可以直接在 Box<T> 上通过 T 调用相应方法。

递归数据类型或具有动态大小的数据类型需要使用 Box：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 8 minutes.

Box is like std::unique_ptr in C++, except that it’s guaranteed to be not null.
在以下情况下，Box 可能会很实用：
- 在编译时间遇到无法知晓大小的类型，但 Rust 编译器需要知道确切大小。
- 想要转让大量数据的所有权。为避免在堆栈上复制大量数据，请改为将数据存储在 Box 中的堆上，以便仅移动指针。
If Box was not used and we attempted to embed a List directly into the List, the compiler would not be able to compute a fixed size for the struct in memory (the List would be of infinite size).
Box 大小与一般指针相同，并且只会指向堆中的下一个 List 元素，因此可以解决这个问题。
Remove the Box in the List definition and show the compiler error. We get the message “recursive without indirection”, because for data recursion, we have to use indirection, a Box or reference of some kind, instead of storing the value directly.

探索更多

小众优化

Though Box looks like std::unique_ptr in C++, it cannot be empty/null. This makes Box one of the types that allow the compiler to optimize storage of some enums.

For example, Option<Box<T>> has the same size, as just Box<T>, because compiler uses NULL-value to discriminate variants instead of using explicit tag (“Null Pointer Optimization”):


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`Rc`

Rc 是引用计数的共享指针。如果您需要从多个位置引用相同的数据，请使用此指针：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

See Arc and Mutex if you are in a multi-threaded context.
您可以将共享指针_降级_为 Weak 指针，以便创建之后会被舍弃的循环引用。

Speaker Notes

This slide should take about 5 minutes.

Rc 的计数可确保只要有引用，内含的值就会保持有效。
Rust 中的“Rc”与 C++ 中的“std::shared_ptr”类似。
Rc::clone 的成本很低：这个做法会创建指向相同分配的指针，并增加引用计数，而不会产生深层的克隆，排查代码性能问题时通常可以忽略。
make_mut 实际上会在必要时克隆内部值（“clone-on-write”），并返回可变的引用。
使用 Rc::strong_count 可查看引用计数。
Rc::downgrade gives you a weakly reference-counted object to create cycles that will be dropped properly (likely in combination with RefCell).

特征对象

特征（Trait）对象可接受不同类型的值，举例来说，在集合中会是这样：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

以下是分配 pets 后的内存布局：

Speaker Notes

This slide should take about 10 minutes.

实现给定 trait 的类型可能大小不同。因此，上例中不可能具有像 Vec<dyn Pet> 这样的项。
可通过“dyn Pet”这个方法向编译器告知实现“Pet”的动态大小类型。
在本例中，pets 在栈上分配内存，矢量数据存储在堆上。这两个矢量元素是 胖指针：
- 胖指针属于全角指针。它包含两个部分：指向实际对象的指针，以及指向该特定对象的 Pet 实现的虚拟方法表 (vtable) 的指针。
- Dog（名为 Fido）类型的数据是 name 和 age 字段。Cat 类型包含一个 lives 字段。

比较上述示例中的这些输出：


println!("{} {}", std::mem::size_of::<Dog>(), std::mem::size_of::<Cat>());
println!("{} {}", std::mem::size_of::<&Dog>(), std::mem::size_of::<&Cat>());
println!("{}", std::mem::size_of::<&dyn Pet>());
println!("{}", std::mem::size_of::<Box<dyn Pet>>());

练习：二叉树

二元树是一种树型数据结构，其中每个节点都有两个子节点（左侧和右侧）。我们将创建一个树状结构，其中每个节点存储一个值。对于给定的节点 N，N 的左侧子树中的所有节点都包含较小的值，而 N 的右侧子树中的所有节点都将包含较大的值。

实现以下类型，以便通过指定的测试。

额外提示：对按顺序返回值的二元树实现迭代器。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Welcome Back

Including 10 minute breaks, this session should take about 1 hour and 50 minutes. It contains:

Segment	Duration
借用	50 minutes
结构体生命周期	50 minutes

借用

This segment should take about 50 minutes. It contains:

Slide	Duration
借用值	10 minutes
借用检查	10 minutes
内部可变性	10 minutes
练习：健康统计	20 minutes

借用值

As we saw before, instead of transferring ownership when calling a function, you can let a function borrow the value:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

add 函数“借用”两个点并返回一个新点。
调用方会保留输入的所有权。

Speaker Notes

This slide should take about 10 minutes.

此幻灯片是对第 1 天引用材料的回顾，并稍作了扩展，添加了函数参数和返回值。

探索更多

关于栈返回的说明：

Demonstrate that the return from add is cheap because the compiler can eliminate the copy operation. Change the above code to print stack addresses and run it on the Playground or look at the assembly in Godbolt. In the “DEBUG” optimization level, the addresses should change, while they stay the same when changing to the “RELEASE” setting:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Rust 编译器能够执行返回值优化 (RVO)。
In C++, copy elision has to be defined in the language specification because constructors can have side effects. In Rust, this is not an issue at all. If RVO did not happen, Rust will always perform a simple and efficient memcpy copy.

借用检查

Rust’s borrow checker puts constraints on the ways you can borrow values. For a given value, at any time:

You can have one or more shared references to the value, or
You can have exactly one exclusive reference to the value.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

请注意，要求是相冲突的引用不能 同时存在。而引用的解引用位置无关紧要。
上述代码无法编译，因为 a 同时作为可变值（通过 c）和不可变值（通过 b）被借用。
将b 的 println! 语句移到引入 c 的作用域之前，这段代码就可以编译。
这样更改后，编译器会发现 b 只在通过 c 对 a 进行新可变借用之前使用过。这是借用检查器的一个功能，名为“非词法作用域生命周期”。
The exclusive reference constraint is quite strong. Rust uses it to ensure that data races do not occur. Rust also relies on this constraint to optimize code. For example, a value behind a shared reference can be safely cached in a register for the lifetime of that reference.
借用检查器专用于处理许多常见模式，例如同时对结构体中的不同字段进行独占引用。但在某些情况下，它并不能完全 “领会”您的意图，这往往会导致 “与借用检查器进行一番斗争”。

内部可变性

In some situations, it’s necessary to modify data behind a shared (read-only) reference. For example, a shared data structure might have an internal cache, and wish to update that cache from read-only methods.

The “interior mutability” pattern allows exclusive (mutable) access behind a shared reference. The standard library provides several ways to do this, all while still ensuring safety, typically by performing a runtime check.

`RefCell`


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

`Cell`

Cell wraps a value and allows getting or setting the value, even with a shared reference to the Cell. However, it does not allow any references to the value. Since there are no references, borrowing rules cannot be broken.

Speaker Notes

This slide should take about 10 minutes.

The main thing to take away from this slide is that Rust provides safe ways to modify data behind a shared reference. There are a variety of ways to ensure that safety, and RefCell and Cell are two of them.

RefCell enforces Rust’s usual borrowing rules (either multiple shared references or a single exclusive reference) with a runtime check. In this case, all borrows are very short and never overlap, so the checks always succeed.
Rc only allows shared (read-only) access to its contents, since its purpose is to allow (and count) many references. But we want to modify the value, so we need interior mutability.
Cell is a simpler means to ensure safety: it has a set method that takes &self. This needs no runtime check, but requires moving values, which can have its own cost.
Demonstrate that reference loops can be created by adding root to subtree.children.
为了演示运行时 panic，请添加一个会递增“self.value”并以相同方法调用其子项的“fn inc(&mut self)”。如果存在引用循环，就会 panic，并且“thread”“main”会因“already borrowed: BorrowMutError”而 panic。

练习：健康统计

你正在实现一个健康监控系统。作为其中的一部分，你需要对用户的健康统计数据进行追踪。

You’ll start with a stubbed function in an impl block as well as a User struct definition. Your goal is to implement the stubbed out method on the User struct defined in the impl block.

Copy the code below to https://play.rust-lang.org/ and fill in the missing method:


// TODO: remove this when you're done with your implementation.
#![allow(unused_variables, dead_code)]


#![allow(dead_code)]
pub struct User {
    name: String,
    age: u32,
    height: f32,
    visit_count: usize,
    last_blood_pressure: Option<(u32, u32)>,
}

pub struct Measurements {
    height: f32,
    blood_pressure: (u32, u32),
}

pub struct HealthReport<'a> {
    patient_name: &'a str,
    visit_count: u32,
    height_change: f32,
    blood_pressure_change: Option<(i32, i32)>,
}

impl User {
    pub fn new(name: String, age: u32, height: f32) -> Self {
        Self { name, age, height, visit_count: 0, last_blood_pressure: None }
    }

    pub fn visit_doctor(&mut self, measurements: Measurements) -> HealthReport {
        todo!("Update a user's statistics based on measurements from a visit to the doctor")
    }
}

fn main() {
    let bob = User::new(String::from("Bob"), 32, 155.2);
    println!("I'm {} and my age is {}", bob.name, bob.age);
}

#[test]
fn test_visit() {
    let mut bob = User::new(String::from("Bob"), 32, 155.2);
    assert_eq!(bob.visit_count, 0);
    let report =
        bob.visit_doctor(Measurements { height: 156.1, blood_pressure: (120, 80) });
    assert_eq!(report.patient_name, "Bob");
    assert_eq!(report.visit_count, 1);
    assert_eq!(report.blood_pressure_change, None);

    let report =
        bob.visit_doctor(Measurements { height: 156.1, blood_pressure: (115, 76) });

    assert_eq!(report.visit_count, 2);
    assert_eq!(report.blood_pressure_change, Some((-5, -4)));
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

结构体生命周期

This segment should take about 50 minutes. It contains:

Slide	Duration
生命周期注解	10 minutes
生命周期省略	5 minutes
结构体生命周期	5 minutes
练习：Protobuf 解析	30 minutes

生命周期注解

A reference has a lifetime, which must not “outlive” the value it refers to. This is verified by the borrow checker.

The lifetime can be implicit - this is what we have seen so far. Lifetimes can also be explicit: &'a Point, &'document str. Lifetimes start with ' and 'a is a typical default name. Read &'a Point as “a borrowed Point which is valid for at least the lifetime a”.

Lifetimes are always inferred by the compiler: you cannot assign a lifetime yourself. Explicit lifetime annotations create constraints where there is ambiguity; the compiler verifies that there is a valid solution.

当考虑向函数传递值和从函数返回值时，生命周期会变得更加复杂。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

In this example, the compiler does not know what lifetime to infer for p3. Looking inside the function body shows that it can only safely assume that p3’s lifetime is the shorter of p1 and p2. But just like types, Rust requires explicit annotations of lifetimes on function arguments and return values.

将 'a 适当添加到 left_most 中：


fn left_most<'a>(p1: &'a Point, p2: &'a Point) -> &'a Point {

这表示.，“假设 p1 和 p2 的存在时间都比 'a 更长，则返回值至少在 'a 内有效”。

在一般情况下，可以省略生命周期，如下一张幻灯片中所述。

函数调用中的生命周期

Lifetimes for function arguments and return values must be fully specified, but Rust allows lifetimes to be elided in most cases with a few simple rules. This is not inference – it is just a syntactic shorthand.

每个没有生命周期注解的参数都会添加一个生命周期注解。
如果只有一个参数生命周期，则将其赋予所有未加注解的返回值。
如果有多个参数生命周期，但第一个是用于 self 的，则将该生命周期赋予所有未加注解的返回值。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

在此示例中，cab_distance 被轻易省略掉了。

nearest 函数提供了另一个函数示例，该函数的参数中包含多个引用，需要显式注解。

请尝试将签名调整为 “谎报”了返回的生命周期：


fn nearest<'a, 'q>(points: &'a [Point], query: &'q Point) -> Option<&'q Point> {

This won’t compile, demonstrating that the annotations are checked for validity by the compiler. Note that this is not the case for raw pointers (unsafe), and this is a common source of errors with unsafe Rust.

Students may ask when to use lifetimes. Rust borrows always have lifetimes. Most of the time, elision and type inference mean these don’t need to be written out. In more complicated cases, lifetime annotations can help resolve ambiguity. Often, especially when prototyping, it’s easier to just work with owned data by cloning values where necessary.

数据结构中的生命周期

如果数据类型存储了借用的数据，则必须对其添加生命周期注释：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

在上述示例中，Highlight 注释会强制包含 &str 的底层数据的生命周期至少与使用该数据的任何 Highlight 实例一样长。
如果 text 在 fox（或 dog）的生命周期结束前被消耗，借用检查器将抛出一个错误。
借用数据的类型会迫使用户保留原始数据。这对于创建轻量级视图很有用，但通常会使它们更难使用。
如有可能，让数据结构直接拥有自己的数据。
一些包含多个引用的结构可以有多个生命周期注释。除了结构体本身的生命周期之外，如果需要描述引用之间的生命周期关系，则可能需要这样做。这些都是非常高级的用例。

练习：Protobuf 解析

在本练习中，您将为 protobuf 二进制编码构建一个解析器。别担心，其实非常简单！这展示了一种常见的解析模式，即传递数据 slice。底层数据本身永远不会被复制。

如要完整解析 protobuf 消息，需要知道字段的类型（按字段编号编入索引）。这通常会在 proto 文件中提供。在本练习中，我们将把这些信息编码成处理每个字段所调用的函数中的 match 语句。

我们将使用以下 proto：


message PhoneNumber {
  optional string number = 1;
  optional string type = 2;
}

message Person {
  optional string name = 1;
  optional int32 id = 2;
  repeated PhoneNumber phones = 3;
}

proto 消息被编码为连续的一系列字段。每个字段都通过 “标签”后面紧跟值的形式来实现。标签包含一个字段编号（例如Person 消息的 id 字段的值为 2）和线型（用于定义应如何从字节流确定载荷）。

整数（包括标签）使用名为 VARINT 的可变长度编码表示。幸运的是，下面为您提供了 parse_varint 的定义。该指定代码还定义了一些回调，用于处理 Person 和 PhoneNumber 字段，并将消息解析为对这些回调的一系列调用。

What remains for you is to implement the parse_field function and the ProtoMessage trait for Person and PhoneNumber.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Welcome to Day 4

Today we will cover topics relating to building large-scale software in Rust:

迭代器：深入了解 Iterator 特征。
模块和可见性。
Testing.
错误处理：panic、“Result”和 try 运算符“?”。
不安全 Rust：当无法用安全 Rust 表达您的意图时，则可将其作为应急方法。

时间表

Including 10 minute breaks, this session should take about 2 hours and 40 minutes. It contains:

Segment	Duration
欢迎	3 minutes
迭代器	45 minutes
模块	40 minutes
测试	45 minutes

迭代器

This segment should take about 45 minutes. It contains:

Slide	Duration
Iterator	5 minutes
IntoIterator	5 minutes
FromIterator	5 minutes
练习：迭代器方法链	30 minutes

`Iterator`

‘Iterator’ trait 支持迭代集合中的值。它需要用到 next 方法，并提供很多方法。许多标准库类型均能实现 Iterator，您也可以自行实现：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

The Iterator trait implements many common functional programming operations over collections (e.g. map, filter, reduce, etc). This is the trait where you can find all the documentation about them. In Rust these functions should produce the code as efficient as equivalent imperative implementations.
IntoIterator 是迫使 for 循环运作的特征。此特征由集合类型（例如 Vec<T>）和相关引用（例如 &Vec<T> 和 &[T]）而实现。此外，范围也会实现这项特征。因此，您可以使用 for i in some_vec { .. } 来遍历某矢量，但 some_vec.next() 不存在。

`IntoIterator`

The Iterator trait tells you how to iterate once you have created an iterator. The related trait IntoIterator defines how to create an iterator for a type. It is used automatically by the for loop.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

Click through to the docs for IntoIterator. Every implementation of IntoIterator must declare two types:

Item: the type to iterate over, such as i8,
“IntoIter”：“into_iter”方法返回的“Iterator”类型。

Note that IntoIter and Item are linked: the iterator must have the same Item type, which means that it returns Option<Item>

此示例对 x 坐标和 y 坐标的所有组合进行了迭代。

请尝试在 main 中对网格进行两次迭代。为什么会失败？请注意，IntoIterator::into_iter 获得了 self 的所有权。

如要解决此问题，请为 &Grid 实现 IntoIterator，并在 GridIter 中存储对 Grid 的引用。

对于标准库类型，可能会出现同样的问题：for e in some_vector 将获得 some_vector 的所有权，并迭代该矢量中的自有元素。请改用 for e in &some_vector 来迭代 some_vector 的元素的引用。

FromIterator

FromIterator 让您可通过 Iterator 构建一个集合。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

Iterator implements


fn collect<B>(self) -> B
where
    B: FromIterator<Self::Item>,
    Self: Sized

可以通过两种方式为此方法指定 B：

With the “turbofish”: some_iterator.collect::<COLLECTION_TYPE>(), as shown. The _ shorthand used here lets Rust infer the type of the Vec elements.
使用类型推理功能时：let prime_squares: Vec<_> = some_iterator.collect()。将示例重写成使用这种形式。

There are basic implementations of FromIterator for Vec, HashMap, etc. There are also more specialized implementations which let you do cool things like convert an Iterator<Item = Result<V, E>> into a Result<Vec<V>, E>.

练习：迭代器方法链

In this exercise, you will need to find and use some of the provided methods in the Iterator trait to implement a complex calculation.

Copy the following code to https://play.rust-lang.org/ and make the tests pass. Use an iterator expression and collect the result to construct the return value.


#![allow(unused)]
fn main() {
/// Calculate the differences between elements of `values` offset by `offset`,
/// wrapping around from the end of `values` to the beginning.
///
/// Element `n` of the result is `values[(n+offset)%len] - values[n]`.
fn offset_differences<N>(offset: usize, values: Vec<N>) -> Vec<N>
where
    N: Copy + std::ops::Sub<Output = N>,
{
    unimplemented!()
}

#[test]
fn test_offset_one() {
    assert_eq!(offset_differences(1, vec![1, 3, 5, 7]), vec![2, 2, 2, -6]);
    assert_eq!(offset_differences(1, vec![1, 3, 5]), vec![2, 2, -4]);
    assert_eq!(offset_differences(1, vec![1, 3]), vec![2, -2]);
}

#[test]
fn test_larger_offsets() {
    assert_eq!(offset_differences(2, vec![1, 3, 5, 7]), vec![4, 4, -4, -4]);
    assert_eq!(offset_differences(3, vec![1, 3, 5, 7]), vec![6, -2, -2, -2]);
    assert_eq!(offset_differences(4, vec![1, 3, 5, 7]), vec![0, 0, 0, 0]);
    assert_eq!(offset_differences(5, vec![1, 3, 5, 7]), vec![2, 2, 2, -6]);
}

#[test]
fn test_custom_type() {
    assert_eq!(
        offset_differences(1, vec![1.0, 11.0, 5.0, 0.0]),
        vec![10.0, -6.0, -5.0, 1.0]
    );
}

#[test]
fn test_degenerate_cases() {
    assert_eq!(offset_differences(1, vec![0]), vec![0]);
    assert_eq!(offset_differences(1, vec![1]), vec![0]);
    let empty: Vec<i32> = vec![];
    assert_eq!(offset_differences(1, empty), vec![]);
}
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

模块

This segment should take about 40 minutes. It contains:

Slide	Duration
模块	3 minutes
文件系统层级结构	5 minutes
可见性	5 minutes
use、super、self	10 minutes
练习：面向 GUI 库的模块	15 minutes

模块

我们已看了“impl”块如何让我们将函数的命名空间建为一种类型。

同样，“mod”让我们可为类型和函数建立命名空间：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

包提供功能，并包含一个描述如何构建包含 1 个以上 crate 的捆绑包的“Cargo.toml”文件。
crate 是一种模块树，其中的二进制 crate 会创建一个可执行文件，而库 crate 会编译为库。
模块定义了组织和范围，并且是本部分的重点。

文件系统层级结构

如果省略模块内容，则会指示 Rust 在另一个文件中查找：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

这会告知 Rust 可以在“src/garden.rs”中找到“garden”模块内容。同样，您可以在“src/garden/vegetables.rs”中找到“garden::vegetables”模块。

“crate”根目录位于：

“src/lib.rs”（对于库 crate）
“src/main.rs”（对于二进制文件 crate）

也可以使用“内部文档注释”对文件中定义的模块进行记录。这些用于记录包含它们的项（在本例中为模块）。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

在 Rust 2018 之前的版本中，模块需要位于“module/mod.rs”而非“module.rs”中，对于 2018 年之后的版本而言，这仍是有效的替代方案。
引入“filename.rs”来替代“filename/mod.rs”的主要原因是，许多名为“mod.rs”的文件在 IDE 中可能难以区分。

即使主模块是文件，更深层的嵌套也可以使用文件夹：


src/
├── main.rs
├── top_module.rs
└── top_module/
    └── sub_module.rs

Rust 寻找模块的位置可通过编译器指令更改：
```
#[path = "some/path.rs"]
mod some_module;
```
例如，如果您想将某个模块的测试放在名为“some_module_test.rs”的文件中（类似于 Go 中的惯例），这样做很有用。

可见性

模块是一种隐私边界：

默认情况下，模块项是私有的（隐藏实现详情）。
父项和同级子项始终可见。
换言之，如果某个项在模块“foo”中可见，那么该项在“foo”的所有后代中均可见。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

使用“pub”关键字将模块设为公开。

此外，您还可以使用高级“pub(…)”说明符来限制公开可见的范围。

请参阅 Rust 参考。
配置“pub(crate)”可见性是一种常见模式。
您可以为特定路径授予可见性，这种情况不太常见。
在任何情况下，都必须向祖先模块（及其所有后代）授予可见性。

use、super、self

一个模块可以使用“use”将另一个模块的符号全部纳入。您通常在每个模块的顶部会看到如下内容：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

路径

路径解析如下：

作为相对路径：
- foo 或 self::foo 是指当前模块中的 foo，
- “super::foo”是指父模块中的“foo”。
作为绝对路径：
- crate::foo 是指当前 crate 的根中的 foo，
- “bar::foo”是指“bar”crate 中的“foo”。

Speaker Notes

This slide should take about 8 minutes.

通常使用较短的路径来 “重新导出”符号。例如，crate 中的顶层 lib.rs 文件可能会
```
mod storage;

pub use storage::disk::DiskStorage;
pub use storage::network::NetworkStorage;
```
通过便捷的短路径，使得 DiskStorage 和 NetworkStorage 可供其他 crate 使用。
在大多数情况下，只有模块中显示的项才需通过 use 引入。不过，即使实现该 trait 的类型已处于作用域内，如要调用该 trait 的任何方法，仍需将该 trait 引入到作用域内。例如，如需对实现 Read trait 的类型使用 read_to_string 方法，您需要使用 use std::io::Read 引入。
use 语句可以包含通配符：use std::io::*。但不推荐这种做法，因为不清楚导入了哪些项，并且这些内容可能会随时间而变化。

练习：面向 GUI 库的模块

In this exercise, you will reorganize a small GUI Library implementation. This library defines a Widget trait and a few implementations of that trait, as well as a main function.

It is typical to put each type or set of closely-related types into its own module, so each widget type should get its own module.

Cargo Setup

Rust Playground 仅支持一个文件，因此您需要在本地文件系统上创建一个 Cargo 项目：


cargo init gui-modules
cd gui-modules
cargo run

Edit the resulting src/main.rs to add mod statements, and add additional files in the src directory.

Source

Here’s the single-module implementation of the GUI library:


pub trait Widget {
    /// Natural width of `self`.
    fn width(&self) -> usize;

    /// Draw the widget into a buffer.
    fn draw_into(&self, buffer: &mut dyn std::fmt::Write);

    /// Draw the widget on standard output.
    fn draw(&self) {
        let mut buffer = String::new();
        self.draw_into(&mut buffer);
        println!("{buffer}");
    }
}

pub struct Label {
    label: String,
}

impl Label {
    fn new(label: &str) -> Label {
        Label { label: label.to_owned() }
    }
}

pub struct Button {
    label: Label,
}

impl Button {
    fn new(label: &str) -> Button {
        Button { label: Label::new(label) }
    }
}

pub struct Window {
    title: String,
    widgets: Vec<Box<dyn Widget>>,
}

impl Window {
    fn new(title: &str) -> Window {
        Window { title: title.to_owned(), widgets: Vec::new() }
    }

    fn add_widget(&mut self, widget: Box<dyn Widget>) {
        self.widgets.push(widget);
    }

    fn inner_width(&self) -> usize {
        std::cmp::max(
            self.title.chars().count(),
            self.widgets.iter().map(|w| w.width()).max().unwrap_or(0),
        )
    }
}

impl Widget for Window {
    fn width(&self) -> usize {
        // Add 4 paddings for borders
        self.inner_width() + 4
    }

    fn draw_into(&self, buffer: &mut dyn std::fmt::Write) {
        let mut inner = String::new();
        for widget in &self.widgets {
            widget.draw_into(&mut inner);
        }

        let inner_width = self.inner_width();

        // TODO: Change draw_into to return Result<(), std::fmt::Error>. Then use the
        // ?-operator here instead of .unwrap().
        writeln!(buffer, "+-{:-<inner_width$}-+", "").unwrap();
        writeln!(buffer, "| {:^inner_width$} |", &self.title).unwrap();
        writeln!(buffer, "+={:=<inner_width$}=+", "").unwrap();
        for line in inner.lines() {
            writeln!(buffer, "| {:inner_width$} |", line).unwrap();
        }
        writeln!(buffer, "+-{:-<inner_width$}-+", "").unwrap();
    }
}

impl Widget for Button {
    fn width(&self) -> usize {
        self.label.width() + 8 // add a bit of padding
    }

    fn draw_into(&self, buffer: &mut dyn std::fmt::Write) {
        let width = self.width();
        let mut label = String::new();
        self.label.draw_into(&mut label);

        writeln!(buffer, "+{:-<width$}+", "").unwrap();
        for line in label.lines() {
            writeln!(buffer, "|{:^width$}|", &line).unwrap();
        }
        writeln!(buffer, "+{:-<width$}+", "").unwrap();
    }
}

impl Widget for Label {
    fn width(&self) -> usize {
        self.label.lines().map(|line| line.chars().count()).max().unwrap_or(0)
    }

    fn draw_into(&self, buffer: &mut dyn std::fmt::Write) {
        writeln!(buffer, "{}", &self.label).unwrap();
    }
}

fn main() {
    let mut window = Window::new("Rust GUI Demo 1.23");
    window.add_widget(Box::new(Label::new("This is a small text GUI demo.")));
    window.add_widget(Box::new(Button::new("Click me!")));
    window.draw();
}

Speaker Notes

This slide and its sub-slides should take about 15 minutes.

鼓励学生按照自己认为合适的方式划分代码，并熟悉必需的 mod、use 和 pub 声明。之后，讨论哪些组织方式最符合惯例。

解答


src
├── main.rs
├── widgets
│   ├── button.rs
│   ├── label.rs
│   └── window.rs
└── widgets.rs


// ---- src/widgets.rs ----
mod button;
mod label;
mod window;

pub trait Widget {
    /// Natural width of `self`.
    fn width(&self) -> usize;

    /// Draw the widget into a buffer.
    fn draw_into(&self, buffer: &mut dyn std::fmt::Write);

    /// Draw the widget on standard output.
    fn draw(&self) {
        let mut buffer = String::new();
        self.draw_into(&mut buffer);
        println!("{buffer}");
    }
}

pub use button::Button;
pub use label::Label;
pub use window::Window;


// ---- src/widgets/label.rs ----
use super::Widget;

pub struct Label {
    label: String,
}

impl Label {
    pub fn new(label: &str) -> Label {
        Label { label: label.to_owned() }
    }
}

impl Widget for Label {
    fn width(&self) -> usize {
        // ANCHOR_END: Label-width
        self.label.lines().map(|line| line.chars().count()).max().unwrap_or(0)
    }

    // ANCHOR: Label-draw_into
    fn draw_into(&self, buffer: &mut dyn std::fmt::Write) {
        // ANCHOR_END: Label-draw_into
        writeln!(buffer, "{}", &self.label).unwrap();
    }
}


// ---- src/widgets/button.rs ----
use super::{Label, Widget};

pub struct Button {
    label: Label,
}

impl Button {
    pub fn new(label: &str) -> Button {
        Button { label: Label::new(label) }
    }
}

impl Widget for Button {
    fn width(&self) -> usize {
        // ANCHOR_END: Button-width
        self.label.width() + 8 // add a bit of padding
    }

    // ANCHOR: Button-draw_into
    fn draw_into(&self, buffer: &mut dyn std::fmt::Write) {
        // ANCHOR_END: Button-draw_into
        let width = self.width();
        let mut label = String::new();
        self.label.draw_into(&mut label);

        writeln!(buffer, "+{:-<width$}+", "").unwrap();
        for line in label.lines() {
            writeln!(buffer, "|{:^width$}|", &line).unwrap();
        }
        writeln!(buffer, "+{:-<width$}+", "").unwrap();
    }
}


// ---- src/widgets/window.rs ----
use super::Widget;

pub struct Window {
    title: String,
    widgets: Vec<Box<dyn Widget>>,
}

impl Window {
    pub fn new(title: &str) -> Window {
        Window { title: title.to_owned(), widgets: Vec::new() }
    }

    pub fn add_widget(&mut self, widget: Box<dyn Widget>) {
        self.widgets.push(widget);
    }

    fn inner_width(&self) -> usize {
        std::cmp::max(
            self.title.chars().count(),
            self.widgets.iter().map(|w| w.width()).max().unwrap_or(0),
        )
    }
}

impl Widget for Window {
    fn width(&self) -> usize {
        // ANCHOR_END: Window-width
        // Add 4 paddings for borders
        self.inner_width() + 4
    }

    // ANCHOR: Window-draw_into
    fn draw_into(&self, buffer: &mut dyn std::fmt::Write) {
        // ANCHOR_END: Window-draw_into
        let mut inner = String::new();
        for widget in &self.widgets {
            widget.draw_into(&mut inner);
        }

        let inner_width = self.inner_width();

        // TODO: after learning about error handling, you can change
        // draw_into to return Result<(), std::fmt::Error>. Then use
        // the ?-operator here instead of .unwrap().
        writeln!(buffer, "+-{:-<inner_width$}-+", "").unwrap();
        writeln!(buffer, "| {:^inner_width$} |", &self.title).unwrap();
        writeln!(buffer, "+={:=<inner_width$}=+", "").unwrap();
        for line in inner.lines() {
            writeln!(buffer, "| {:inner_width$} |", line).unwrap();
        }
        writeln!(buffer, "+-{:-<inner_width$}-+", "").unwrap();
    }
}


// ---- src/main.rs ----
mod widgets;

use widgets::Widget;

fn main() {
    let mut window = widgets::Window::new("Rust GUI Demo 1.23");
    window
        .add_widget(Box::new(widgets::Label::new("This is a small text GUI demo.")));
    window.add_widget(Box::new(widgets::Button::new("Click me!")));
    window.draw();
}

测试

This segment should take about 45 minutes. It contains:

Slide	Duration
测试模块	5 minutes
其他类型的测试	5 minutes
编译器 Lint 和 Clippy	3 minutes
练习：卢恩算法	30 minutes

单元测试

Rust 和 Cargo 随附了一个简单的单元测试框架：

单元测试在您的整个代码中都受支持。
您可以通过 tests/ 目录来支持集成测试。

Tests are marked with #[test]. Unit tests are often put in a nested tests module, using #[cfg(test)] to conditionally compile them only when building tests.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

这样一来，您可以对专用帮助程序进行单元测试。
仅当您运行 cargo test 时，#[cfg(test)] 属性才有效。

Speaker Notes

This slide should take about 5 minutes.

在 Playground 中运行测试显示测试结果。

其他类型的测试

集成测试

如果您想要以客户的身份测试您的库，请使用集成测试。

在 tests/ 下方创建一个 .rs 文件：


// tests/my_library.rs
use my_library::init;

#[test]
fn test_init() {
    assert!(init().is_ok());
}

这些测试只能使用您的 crate 的公共 API。

文档测试

Rust 本身就支持文档测试：


#![allow(unused)]
fn main() {
/// Shortens a string to the given length.
///
/// ```
/// # use playground::shorten_string;
/// assert_eq!(shorten_string("Hello World", 5), "Hello");
/// assert_eq!(shorten_string("Hello World", 20), "Hello World");
/// ```
pub fn shorten_string(s: &str, length: usize) -> &str {
    &s[..std::cmp::min(length, s.len())]
}
}

/// 注释中的代码块会自动被视为 Rust 代码。
代码会作为 cargo test 的一部分进行编译和执行。
Adding # in the code will hide it from the docs, but will still compile/run it.
在 Rust Playground 上测试上述代码。

编译器 Lint 和 Clippy

Rust 编译器会生成出色的错误消息，并提供实用的内置 lint 功能。Clippy 提供了更多 lint 功能，采用按组分类方式，并可按项目灵活启用。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 3 minutes.

运行代码示例并检查错误消息。此处还会显示一些 lint，但是一旦完成代码编译，就不会再显示这些 lint。切换到 Playground 网站以显示这些 lint。

解析完 lint 之后，请在 Playground 网站上运行 clippy，以显示 clippy 警告。Clippy 提供了大量的 lint 文档，并且在不断添加新的 lint（包括默认拒绝 lint）。

请注意，带有 help: ... 的错误或警告可以通过 cargo Fix 或编辑器进行修复。

练习：卢恩算法

卢恩算法

卢恩算法用于验证信用卡号。该算法将字符串作为输入内容，并执行以下操作来验证信用卡号：

Ignore all spaces. Reject number with fewer than two digits.
从右到左，将偶数位的数字乘二。对于数字“1234”，我们将“3”和“1”乘二；对于数字“98765”，将“6”和“8”乘二。
将一个数字乘二后，如果结果大于 9，则将每位数字相加。因此，将“7”乘二得“14”，然后“1 + 4 = 5”。
将所有未乘二和已乘二的数字相加。
如果总和以“0”结尾，则信用卡号有效。

提供的代码提供了一个有缺陷的 Luhn 算法实现，附带两个基本单元测试，用于验证大部分算法是否正确实现。

Copy the code below to https://play.rust-lang.org/ and write additional tests to uncover bugs in the provided implementation, fixing any bugs you find.


#![allow(unused)]
fn main() {
pub fn luhn(cc_number: &str) -> bool {
    let mut sum = 0;
    let mut double = false;

    for c in cc_number.chars().rev() {
        if let Some(digit) = c.to_digit(10) {
            if double {
                let double_digit = digit * 2;
                sum +=
                    if double_digit > 9 { double_digit - 9 } else { double_digit };
            } else {
                sum += digit;
            }
            double = !double;
        } else {
            continue;
        }
    }

    sum % 10 == 0
}

#[cfg(test)]
mod test {
    use super::*;

    #[test]
    fn test_valid_cc_number() {
        assert!(luhn("4263 9826 4026 9299"));
        assert!(luhn("4539 3195 0343 6467"));
        assert!(luhn("7992 7398 713"));
    }

    #[test]
    fn test_invalid_cc_number() {
        assert!(!luhn("4223 9826 4026 9299"));
        assert!(!luhn("4539 3195 0343 6476"));
        assert!(!luhn("8273 1232 7352 0569"));
    }
}
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Welcome Back

Including 10 minute breaks, this session should take about 2 hours and 10 minutes. It contains:

Segment	Duration
错误处理	55 minutes
不安全 Rust	1 hour and 5 minutes

错误处理

This segment should take about 55 minutes. It contains:

Slide	Duration
Panics	3 minutes
尝试运算符	5 minutes
尝试转换	5 minutes
Error 特征	5 minutes
thiserror 和 anyhow	5 minutes
练习：使用 Result 进行重写	30 minutes

Panics

Rust 通过 “panic”机制处理严重错误。

如果运行时发生严重错误，Rust 会触发 panic：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Panic 用于指示不可恢复的意外错误。
- Panic反映了程序中的 bug 问题。
- 运行时失败（例如边界检查失败）可能会触发 panic
- 断言（例如 assert!）在失败时会触发 panic
- 针对特定用途的 panic 可以使用 panic! 宏。
使用 panic 会 “展开”堆栈，并丢弃对应的值，就像函数已经返回一样。
如果崩溃不可接受，请使用不会触发 panic 的 API（例如 Vec::get）。

Speaker Notes

This slide should take about 3 minutes.

默认情况下，panic 会导致堆栈展开。您可以捕获展开信息：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

捕获异常；请勿尝试使用 catch_unwind 实现异常！
如果服务器需要持续运行（即使是在请求发生崩溃的情况下），此方法十分有用。
如果您在 Cargo.toml 中设置了 panic = 'abort'，此方法不会生效。

尝试运算符

Runtime errors like connection-refused or file-not-found are handled with the Result type, but matching this type on every call can be cumbersome. The try-operator ? is used to return errors to the caller. It lets you turn the common


match some_expression {
    Ok(value) => value,
    Err(err) => return Err(err),
}

转换成更简单的命令


some_expression?

We can use this to simplify our error handling code:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

简化 read_username 函数以使用 ?。

关键点：

username 变量可以是 Ok(string) 或 Err(error)。
可以使用 fs::write 调用来测试不同的场景：没有文件、空文件、包含用户名的文件。
Note that main can return a Result<(), E> as long as it implements std::process::Termination. In practice, this means that E implements Debug. The executable will print the Err variant and return a nonzero exit status on error.

尝试转换

? 的有效展开比前面介绍的内容略微复杂一些：


expression?

效果等同于


match expression {
    Ok(value) => value,
    Err(err)  => return Err(From::from(err)),
}

The From::from call here means we attempt to convert the error type to the type returned by the function. This makes it easy to encapsulate errors into higher-level errors.

示例


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

The ? operator must return a value compatible with the return type of the function. For Result, it means that the error types have to be compatible. A function that returns Result<T, ErrorOuter> can only use ? on a value of type Result<U, ErrorInner> if ErrorOuter and ErrorInner are the same type or if ErrorOuter implements From<ErrorInner>.

From 实现的常见替代方案是 Result::map_err，尤其是只在一个位置进行转换时。

There is no compatibility requirement for Option. A function returning Option<T> can use the ? operator on Option<U> for arbitrary T and U types.

A function that returns Result cannot use ? on Option and vice versa. However, Option::ok_or converts Option to Result whereas Result::ok turns Result into Option.

动态错误类型

Sometimes we want to allow any type of error to be returned without writing our own enum covering all the different possibilities. The std::error::Error trait makes it easy to create a trait object that can contain any error.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

read_count 函数可以返回 std::io::Error（通过文件操作）或 std::num::ParseIntError（通过 String::parse）。

Boxing errors saves on code, but gives up the ability to cleanly handle different error cases differently in the program. As such it’s generally not a good idea to use Box<dyn Error> in the public API of a library, but it can be a good option in a program where you just want to display the error message somewhere.

Make sure to implement the std::error::Error trait when defining a custom error type so it can be boxed. But if you need to support the no_std attribute, keep in mind that the std::error::Error trait is currently compatible with no_std in nightly only.

`thiserror` 和 `anyhow`

The thiserror and anyhow crates are widely used to simplify error handling.

thiserror is often used in libraries to create custom error types that implement From<T>.
anyhow is often used by applications to help with error handling in functions, including adding contextual information to your errors.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

`thiserror`

The Error derive macro is provided by thiserror, and has lots of useful attributes to help define error types in a compact way.
The std::error::Error trait is derived automatically.
The message from #[error] is used to derive the Display trait.

`anyhow`

anyhow::Error is essentially a wrapper around Box<dyn Error>. As such it’s again generally not a good choice for the public API of a library, but is widely used in applications.
anyhow::Result<V> is a type alias for Result<V, anyhow::Error>.
Actual error type inside of it can be extracted for examination if necessary.
Functionality provided by anyhow::Result<T> may be familiar to Go developers, as it provides similar usage patterns and ergonomics to (T, error) from Go.
anyhow::Context is a trait implemented for the standard Result and Option types. use anyhow::Context is necessary to enable .context() and .with_context() on those types.

练习：使用 Result 进行重写

以下代码实现了一个非常简单的表达式语言解析器。不过，它通过 panic 机制来处理错误。请重写该代码，改用惯用的错误处理方式，并将错误传播到 main 函数的返回值。您可以随意使用 thiserror 和 anyhow。

提示：请先修复 parse 函数中的错误处理问题。该部分正常运行后，请更新 Tokenizer 以实现 Iterator<Item=Result<Token, TokenizerError>>，并在解析器中进行相应处理。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

不安全 Rust

This segment should take about 1 hour and 5 minutes. It contains:

Slide	Duration
Unsafe	5 minutes
解引用裸指针	10 minutes
可变的静态变量	5 minutes
联合体	5 minutes
Unsafe 函数	5 minutes
Unsafe 特征	5 minutes
练习：FFI 封装容器	30 minutes

不安全 Rust

Rust 语言包含两个部分：

**安全 Rust：**内存安全，没有潜在的未定义行为。
**不安全 Rust：**如果违反了前提条件，可能会触发未定义的行为。

We saw mostly safe Rust in this course, but it’s important to know what Unsafe Rust is.

不安全的代码通常内容很少而且与其他代码隔离，其正确性也应得到仔细记录。这类代码通常封装在安全的抽象层中。

不安全 Rust 提供了五种新功能：

解引用原始指针。
访问或修改可变的静态变量。
访问 union 字段。
调用 unsafe 函数，包括 extern 函数。
实现 unsafe trait。

下面，我们将简要介绍这些不安全功能。如需了解完整详情，请参阅《Rust 手册》第 19.1 章和 Rustonomicon。

Speaker Notes

This slide should take about 5 minutes.

Unsafe Rust does not mean the code is incorrect. It means that developers have turned off some compiler safety features and have to write correct code by themselves. It means the compiler no longer enforces Rust’s memory-safety rules.

解引用裸指针

创建指针是安全的操作，但解引用指针需要使用 unsafe 方法：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 10 minutes.

我们建议（而且 Android Rust 样式指南要求）为每个 unsafe 代码块编写一条注释，说明该代码块中的代码如何满足其所执行的不安全操作的安全要求。

对于指针解除引用，这意味着指针必须为 valid，即：

指针必须为非 null。
指针必须是 dereferenceable（在单个已分配对象的边界内）。
对象不得已取消分配。
不得并发访问相同位置。
如果通过转换引用类型来获取指针，则底层对象必须处于活跃状态，而且不得使用任何引用来访问内存。

在大多数情况下，指针还必须正确对齐。

The “NOT SAFE” section gives an example of a common kind of UB bug: *r1 has the 'static lifetime, so r3 has type &'static String, and thus outlives s. Creating a reference from a pointer requires great care.

可变的静态变量

读取不可变的静态变量是安全的操作：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

但是，读取和写入可变的静态变量是不安全的，因为这可能会造成数据争用：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

此处的程序是安全的，因为它是单线程的。不过，Rust 编译器比较保守，会做出最坏的假设。请尝试移除 unsafe，看看编译器如何解释从多个线程中修改静态变量是一种未定义的行为。
通常，我们不建议使用可变的静态变量，但在某些情况下，在低层级 no_std 代码中可能需要这样做，例如实现堆分配器或使用某些 C API。

联合体

联合体与枚举类似，但您需要自行跟踪活跃字段：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

在 Rust 中很少需要用到联合体，因为您通常可以使用枚举。联合体只是偶尔用于与 C 库 API 进行交互。

如果您只是想将字节重新解释为其他类型，则可能需要使用 std::mem::transmute 或安全的封装容器，例如 zerocopy crate。

Unsafe 函数

调用 Unsafe 函数

如果函数或方法具有额外的前提条件，您必须遵守这些前提条件来避免未定义的行为，则可以将该函数或方法标记为 unsafe：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

编写 Unsafe 函数

如果您自己编写的函数需要满足特定条件以避免未定义的行为，您可以将这些函数标记为 unsafe。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

调用 Unsafe 函数

get_unchecked, like most _unchecked functions, is unsafe, because it can create UB if the range is incorrect. abs is incorrect for a different reason: it is an external function (FFI). Calling external functions is usually only a problem when those functions do things with pointers which might violate Rust’s memory model, but in general any C function might have undefined behaviour under any arbitrary circumstances.

本例中的“C”是 ABI；也可以使用其他 ABI。

编写 Unsafe 函数

We wouldn’t actually use pointers for a swap function - it can be done safely with references.

Note that unsafe code is allowed within an unsafe function without an unsafe block. We can prohibit this with #[deny(unsafe_op_in_unsafe_fn)]. Try adding it and see what happens. This will likely change in a future Rust edition.

实现 Unsafe Trait

与函数一样，如果您在实现某个 trait 时必须保证特定条件来避免未定义的行为，您也可以将该 trait 标记为 unsafe。

例如，zerocopy crate 包含一个不安全的 trait，大致内容是这样的：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

This slide should take about 5 minutes.

在 Rustdoc 中有关 trait 的章节下，有一个标题为 # 安全 的部分介绍了安全实现 trait 的要求。

实际上，与 AsBytes 相关的安全说明远比这里展示的更详尽、更复杂。

内置的 Send 和 Sync trait 都是不安全的。

安全 FFI 封装容器

Rust 为通过 外部函数接口 (FFI) 调用函数提供了出色的支持。我们将使用它为 libc 函数构建一个安全封装容器，用于从 C 代码中读取目录中的文件名称。

建议您参考以下手册页面：

您还需要浏览“std::ffi”模块。在下方，您会发现完成这个练习所需的多种字符串类型：

类型	编码	使用
“str”和“String”	UTF-8	用 Rust 进行文本处理
“CStr”和“CString”	以空字符结尾	与 C 函数通信
“OsStr”和“OsString”	特定于操作系统	与操作系统通信

您将在以下所有类型之间进行转换：

将 &str 转换为 CString：您需要为尾随 \0 字符分配空格，
将 CString 转换为 \*const i8 ：您需要一个指针来调用 C 函数，
将 \*const i8 转换为 &CStr ：您需要一些能够找到尾随 \0 字符的内容，
&CStr to &[u8]: a slice of bytes is the universal interface for “some unknown data”,
将 &\[u8\] 转换为 &OsStr ：&OsStr 是向 OsString 迈进的一步，请使用OsStrExt来创建它，
将“&OsStr”转换为“OsString”：您需要克隆“&OsStr”中的数据，以便能够返回它并再次调用“readdir”。

秘典中也有一个关于 FFI 的非常实用的章节。

将以下代码复制到 https://play.rust-lang.org/，并填入缺少的函数和方法：


// TODO: remove this when you're done with your implementation.
#![allow(unused_imports, unused_variables, dead_code)]

mod ffi {
    use std::os::raw::{c_char, c_int};
    #[cfg(not(target_os = "macos"))]
    use std::os::raw::{c_long, c_uchar, c_ulong, c_ushort};

    // Opaque type. See https://doc.rust-lang.org/nomicon/ffi.html.
    #[repr(C)]
    pub struct DIR {
        _data: [u8; 0],
        _marker: core::marker::PhantomData<(*mut u8, core::marker::PhantomPinned)>,
    }

    // Layout according to the Linux man page for readdir(3), where ino_t and
    // off_t are resolved according to the definitions in
    // /usr/include/x86_64-linux-gnu/{sys/types.h, bits/typesizes.h}.
    #[cfg(not(target_os = "macos"))]
    #[repr(C)]
    pub struct dirent {
        pub d_ino: c_ulong,
        pub d_off: c_long,
        pub d_reclen: c_ushort,
        pub d_type: c_uchar,
        pub d_name: [c_char; 256],
    }

    // Layout according to the macOS man page for dir(5).
    #[cfg(all(target_os = "macos"))]
    #[repr(C)]
    pub struct dirent {
        pub d_fileno: u64,
        pub d_seekoff: u64,
        pub d_reclen: u16,
        pub d_namlen: u16,
        pub d_type: u8,
        pub d_name: [c_char; 1024],
    }

    extern "C" {
        pub fn opendir(s: *const c_char) -> *mut DIR;

        #[cfg(not(all(target_os = "macos", target_arch = "x86_64")))]
        pub fn readdir(s: *mut DIR) -> *const dirent;

        // See https://github.com/rust-lang/libc/issues/414 and the section on
        // _DARWIN_FEATURE_64_BIT_INODE in the macOS man page for stat(2).
        //
        // "Platforms that existed before these updates were available" refers
        // to macOS (as opposed to iOS / wearOS / etc.) on Intel and PowerPC.
        #[cfg(all(target_os = "macos", target_arch = "x86_64"))]
        #[link_name = "readdir$INODE64"]
        pub fn readdir(s: *mut DIR) -> *const dirent;

        pub fn closedir(s: *mut DIR) -> c_int;
    }
}

use std::ffi::{CStr, CString, OsStr, OsString};
use std::os::unix::ffi::OsStrExt;

#[derive(Debug)]
struct DirectoryIterator {
    path: CString,
    dir: *mut ffi::DIR,
}

impl DirectoryIterator {
    fn new(path: &str) -> Result<DirectoryIterator, String> {
        // Call opendir and return a Ok value if that worked,
        // otherwise return Err with a message.
        unimplemented!()
    }
}

impl Iterator for DirectoryIterator {
    type Item = OsString;
    fn next(&mut self) -> Option<OsString> {
        // Keep calling readdir until we get a NULL pointer back.
        unimplemented!()
    }
}

impl Drop for DirectoryIterator {
    fn drop(&mut self) {
        // Call closedir as needed.
        unimplemented!()
    }
}

fn main() -> Result<(), String> {
    let iter = DirectoryIterator::new(".")?;
    println!("files: {:#?}", iter.collect::<Vec<_>>());
    Ok(())
}

解答


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

欢迎来到Android 中的Rust

Rust is supported for system software on Android. This means that you can write new services, libraries, drivers or even firmware in Rust (or improve existing code as needed).

今天我们会尝试在你自己的项目中调用Rust。所以试着在你的代码中找一小段来改成Rust。代码中越少依赖(dependencies)，越少“独特”的类型，越好。比如一段解析原始字符的代码就很理想。

Speaker Notes

鉴于 Android 中越来越多地使用 Rust，演讲者可能会提到以下任何一项：

服务示例：DNS-over-HTTP
库：Rutabaga 虚拟图形接口
内核驱动程序：Binder
固件：pKVM 固件

设置

We will be using a Cuttlefish Android Virtual Device to test our code. Make sure you have access to one or create a new one with:


source build/envsetup.sh
lunch aosp_cf_x86_64_phone-trunk_staging-userdebug
acloud create

更多细节请参考 Android Developer Codelab.

Speaker Notes

关键点：

Cuttlefish is a reference Android device designed to work on generic Linux desktops. MacOS support is also planned.
Cuttlefish 系统映像会保持媲美真实设备的高保真度，是运行许多 Rust 用例的理想模拟器。

构建规则

Android 构建系统（Soong）通过一系列模块来支持Rust：

Module Type	描述
`rust_binary`	Produces a Rust binary.
`rust_library`	生成一个 Rust 库，并提供 `rlib` 和 `dylib` 两种变体。
`rust_ffi`	生成一个可由 `cc` 模块使用的 Rust C 库，并提供静态和共享两种变体。
`rust_proc_macro`	生成“proc-macro”Rust 库。这些宏与编译器插件类似。
`rust_test`	生成使用标准 Rust 测试框架的 Rust 测试二进制文件。
`rust_fuzz`	生成使用 `libfuzzer` 的 Rust 模糊测试二进制文件。
`rust_protobuf`	生成源代码并生成为特定 protobuf 提供接口的 Rust 库。
`rust_bindgen`	生成源代码并生成包含 Rust 绑定到 C 库的 Rust 库。

下面我们来看看 rust_binary 和 rust_library。

Speaker Notes

演讲者可能会提及其他内容：

Cargo 未针对多语言代码库进行优化，并且从互联网下载软件包。
For compliance and performance, Android must have crates in-tree. It must also interop with C/C++/Java code. Soong fills that gap.
Soong 与 Bazel 有许多相似之处，后者是 Blaze 的开源变体（在 google3 中使用）。
我们计划逐渐在 Android，ChromeOS 和 Fuchsia 中采用 Bazel 进行开发。
对所有 Rust 操作系统开发者而言，了解类似 Bazel 的构建规都很有用。
趣味小知识：《星际迷航》中的数据是 Soong 类型的 Android。

Rust 二进制文件

让我们从一个简单的应用程序开始。在 AOSP 签出的根目录下，创建以下文件：

hello_rust/Android.bp:


rust_binary {
    name: "hello_rust",
    crate_name: "hello_rust",
    srcs: ["src/main.rs"],
}

hello_rust/src/main.rs:


//! Rust demo.

/// Prints a greeting to standard output.
fn main() {
    println!("Hello from Rust!");
}

你现在可以构建、推送和运行二进制文件：


m hello_rust
adb push "$ANDROID_PRODUCT_OUT/system/bin/hello_rust" /data/local/tmp
adb shell /data/local/tmp/hello_rust


Hello from Rust!

Rust 库

您可以使用 rust_library 为 Android 创建一个新的 Rust 库。

在这里，我们声明了对两个库的依赖：

libgreeting, 我们在下面进行了定义，
libtextwrap, 一个已经在 external/rust/crates/ 中提供的 crate。

hello_rust/Android.bp:


rust_binary {
    name: "hello_rust_with_dep",
    crate_name: "hello_rust_with_dep",
    srcs: ["src/main.rs"],
    rustlibs: [
        "libgreetings",
        "libtextwrap",
    ],
    prefer_rlib: true, // Need this to avoid dynamic link error.
}

rust_library {
    name: "libgreetings",
    crate_name: "greetings",
    srcs: ["src/lib.rs"],
}

hello_rust/src/main.rs:


//! Rust demo.

use greetings::greeting;
use textwrap::fill;

/// Prints a greeting to standard output.
fn main() {
    println!("{}", fill(&greeting("Bob"), 24));
}

hello_rust/src/lib.rs:


//! Greeting library.

/// Greet `name`.
pub fn greeting(name: &str) -> String {
    format!("Hello {name}, it is very nice to meet you!")
}

您可以像之前一样构建、推送和运行二进制文件：


m hello_rust_with_dep
adb push "$ANDROID_PRODUCT_OUT/system/bin/hello_rust_with_dep" /data/local/tmp
adb shell /data/local/tmp/hello_rust_with_dep


Hello Bob, it is very
nice to meet you!

AIDL

Rust 支持 Android 接口定义语言 (AIDL)：

Rust 代码可以调用现有的 AIDL 服务器，
您可以在 Rust 中创建新的 AIDL 服务器。

/** Birthday service interface. */

To illustrate how to use Rust with Binder, we’re going to walk through the process of creating a Binder interface. We’re then going to both implement the described service and write client code that talks to that service.

AIDL 接口

您可以使用 AIDL 接口声明您的服务的 API：

birthday_service/aidl/com/example/birthdayservice/IBirthdayService.aidl:


/** Birthday service interface. */
interface IBirthdayService {
    /** Generate a Happy Birthday message. */
    String wishHappyBirthday(String name, int years);
}

birthday_service/aidl/Android.bp:


aidl_interface {
    name: "com.example.birthdayservice",
    srcs: ["com/example/birthdayservice/*.aidl"],
    unstable: true,
    backend: {
        rust: { // Rust is not enabled by default
            enabled: true,
        },
    },
}

Speaker Notes

Note that the directory structure under the aidl/ directory needs to match the package name used in the AIDL file, i.e. the package is com.example.birthdayservice and the file is at aidl/com/example/IBirthdayService.aidl.

Generated Service API

Binder generates a trait corresponding to the interface definition. trait to talk to the service.

birthday_service/aidl/com/example/birthdayservice/IBirthdayService.aidl:


/** Birthday service interface. */
interface IBirthdayService {
    /** Generate a Happy Birthday message. */
    String wishHappyBirthday(String name, int years);
}

Generated trait:


trait IBirthdayService {
    fn wishHappyBirthday(&self, name: &str, years: i32) -> binder::Result<String>;
}

Your service will need to implement this trait, and your client will use this trait to talk to the service.

Speaker Notes

The generated bindings can be found at out/soong/.intermediates/<path to module>/.
Point out how the generated function signature, specifically the argument and return types, correspond the interface definition.
- String for an argument results in a different Rust type than String as a return type.

服务实现

我们现在可以实现AIDL服务：

birthday_service/src/lib.rs:


use com_example_birthdayservice::aidl::com::example::birthdayservice::IBirthdayService::IBirthdayService;
use com_example_birthdayservice::binder;

/// The `IBirthdayService` implementation.
pub struct BirthdayService;

impl binder::Interface for BirthdayService {}

impl IBirthdayService for BirthdayService {
    fn wishHappyBirthday(&self, name: &str, years: i32) -> binder::Result<String> {
        Ok(format!("Happy Birthday {name}, congratulations with the {years} years!"))
    }
}

birthday_service/Android.bp:


rust_library {
    name: "libbirthdayservice",
    srcs: ["src/lib.rs"],
    crate_name: "birthdayservice",
    rustlibs: [
        "com.example.birthdayservice-rust",
        "libbinder_rs",
    ],
}

Speaker Notes

Point out the path to the generated IBirthdayService trait, and explain why each of the segments is necessary.
TODO: What does the binder::Interface trait do? Are there methods to override? Where source?

AIDL 服务器

最后，我们可以创建一个暴露服务的服务器：

birthday_service/src/server.rs:


//! Birthday service.
use birthdayservice::BirthdayService;
use com_example_birthdayservice::aidl::com::example::birthdayservice::IBirthdayService::BnBirthdayService;
use com_example_birthdayservice::binder;

const SERVICE_IDENTIFIER: &str = "birthdayservice";

/// Entry point for birthday service.
fn main() {
    let birthday_service = BirthdayService;
    let birthday_service_binder = BnBirthdayService::new_binder(
        birthday_service,
        binder::BinderFeatures::default(),
    );
    binder::add_service(SERVICE_IDENTIFIER, birthday_service_binder.as_binder())
        .expect("Failed to register service");
    binder::ProcessState::join_thread_pool()
}

birthday_service/Android.bp:


rust_binary {
    name: "birthday_server",
    crate_name: "birthday_server",
    srcs: ["src/server.rs"],
    rustlibs: [
        "com.example.birthdayservice-rust",
        "libbinder_rs",
        "libbirthdayservice",
    ],
    prefer_rlib: true, // To avoid dynamic link error.
}

Speaker Notes

The process for taking a user-defined service implementation (in this case the BirthdayService type, which implements the IBirthdayService) and starting it as a Binder service has multiple steps, and may appear more complicated than students are used to if they’ve used Binder from C++ or another language. Explain to students why each step is necessary.

Create an instance of your service type (BirthdayService).
Wrap the service object in corresponding Bn* type (BnBirthdayService in this case). This type is generated by Binder and provides the common Binder functionality that would be provided by the BnBinder base class in C++. We don’t have inheritance in Rust, so instead we use composition, putting our BirthdayService within the generated BnBinderService.
Call add_service, giving it a service identifier and your service object (the BnBirthdayService object in the example).
Call join_thread_pool to add the current thread to Binder’s thread pool and start listening for connections.

部署

我们现在可以构建、推送和启动服务：


m birthday_server
adb push "$ANDROID_PRODUCT_OUT/system/bin/birthday_server" /data/local/tmp
adb root
adb shell /data/local/tmp/birthday_server

在另一个终端中，检查该服务是否正在运行：


adb shell service check birthdayservice


Service birthdayservice: found

您还可以使用 service call 命令调用该服务：


adb shell service call birthdayservice 1 s16 Bob i32 24


Result: Parcel(
  0x00000000: 00000000 00000036 00610048 00700070 '....6...H.a.p.p.'
  0x00000010: 00200079 00690042 00740072 00640068 'y. .B.i.r.t.h.d.'
  0x00000020: 00790061 00420020 0062006f 0020002c 'a.y. .B.o.b.,. .'
  0x00000030: 006f0063 0067006e 00610072 00750074 'c.o.n.g.r.a.t.u.'
  0x00000040: 0061006c 00690074 006e006f 00200073 'l.a.t.i.o.n.s. .'
  0x00000050: 00690077 00680074 00740020 00650068 'w.i.t.h. .t.h.e.'
  0x00000060: 00320020 00200034 00650079 00720061 ' .2.4. .y.e.a.r.'
  0x00000070: 00210073 00000000                   's.!.....        ')

AIDL 客户端

最后，我们可以为我们的新服务创建一个 Rust 客户端。

birthday_service/src/client.rs:


use com_example_birthdayservice::aidl::com::example::birthdayservice::IBirthdayService::IBirthdayService;
use com_example_birthdayservice::binder;

const SERVICE_IDENTIFIER: &str = "birthdayservice";

/// Call the birthday service.
fn main() -> Result<(), Box<dyn Error>> {
    let name = std::env::args().nth(1).unwrap_or_else(|| String::from("Bob"));
    let years = std::env::args()
        .nth(2)
        .and_then(|arg| arg.parse::<i32>().ok())
        .unwrap_or(42);

    binder::ProcessState::start_thread_pool();
    let service = binder::get_interface::<dyn IBirthdayService>(SERVICE_IDENTIFIER)
        .map_err(|_| "Failed to connect to BirthdayService")?;

    // Call the service.
    let msg = service.wishHappyBirthday(&name, years)?;
    println!("{msg}");
}

birthday_service/Android.bp:


rust_binary {
    name: "birthday_client",
    crate_name: "birthday_client",
    srcs: ["src/client.rs"],
    rustlibs: [
        "com.example.birthdayservice-rust",
        "libbinder_rs",
    ],
    prefer_rlib: true, // To avoid dynamic link error.
}

请注意，客户端不依赖于 libbirthdayservice。

在您的设备上构建、推送并运行客户端：


m birthday_client
adb push "$ANDROID_PRODUCT_OUT/system/bin/birthday_client" /data/local/tmp
adb shell /data/local/tmp/birthday_client Charlie 60


Happy Birthday Charlie, congratulations with the 60 years!

Speaker Notes

Strong<dyn IBirthdayService> is the trait object representing the service that the client has connected to.
- Strong is a custom smart pointer type for Binder. It handles both an in-process ref count for the service trait object, and the global Binder ref count that tracks how many processes have a reference to the object.
- Note that the trait object that the client uses to talk to the service uses the exact same trait that the server implements. For a given Binder interface, there is a single Rust trait generated that both client and server use.
Use the same service identifier used when registering the service. This should ideally be defined in a common crate that both the client and server can depend on.

更改 API

让我们扩展API以提供更多功能：我们希望允许客户端指定生日贺卡的行列表：


package com.example.birthdayservice;

/** Birthday service interface. */
interface IBirthdayService {
    /** Generate a Happy Birthday message. */
    String wishHappyBirthday(String name, int years, in String[] text);
}

This results in an updated trait definition for IBirthdayService:


trait IBirthdayService {
    fn wishHappyBirthday(
        &self,
        name: &str,
        years: i32,
        text: &[String],
    ) -> binder::Result<String>;
}

Speaker Notes

Note how the String[] in the AIDL definition is translated as a &[String] in Rust, i.e. that idiomatic Rust types are used in the generated bindings wherever possible:
- in array arguments are translated to slices.
- out and inout args are translated to &mut Vec<T>.
- Return values are translated to returning a Vec<T>.

Updating Client and Service

Update the client and server code to account for the new API.

birthday_service/src/lib.rs:


impl IBirthdayService for BirthdayService {
    fn wishHappyBirthday(
        &self,
        name: &str,
        years: i32,
        text: &[String],
    ) -> binder::Result<String> {
        let mut msg = format!(
            "Happy Birthday {name}, congratulations with the {years} years!",
        );

        for line in text {
            msg.push('\n');
            msg.push_str(line);
        }

        Ok(msg)
    }
}

birthday_service/src/client.rs:


let msg = service.wishHappyBirthday(
    &name,
    years,
    &[
        String::from("Habby birfday to yuuuuu"),
        String::from("And also: many more"),
    ],
)?;

Speaker Notes

TODO: Move code snippets into project files where they’ll actually be built?

Working With AIDL Types

AIDL types translate into the appropriate idiomatic Rust type:

Primitive types map (mostly) to idiomatic Rust types.
Collection types like slices, Vecs and string types are supported.
References to AIDL objects and file handles can be sent between clients and services.
File handles and parcelables are fully supported.

Primitive Types

Primitive types map (mostly) idiomatically:

AIDL Type	Rust Type	Note
`boolean`	`bool`
`byte`	`i8`	Note that bytes are signed.
`char`	`u16`	Note the usage of `u16`, NOT `u32`.
`int`	`i32`
`long`	`i64`
`float`	`f32`
`double`	`f64`
String	String

数组（Arrays）

The array types (T[], byte[], and List<T>) get translated to the appropriate Rust array type depending on how they are used in the function signature:

Position	Rust Type
`in` argument	`&[T]`
`out`/`inout` argument	`&mut Vec<T>`
Return	`Vec<T>`

Speaker Notes

In Android 13 or higher, fixed-size arrays are supported, i.e. T[N] becomes [T; N]. Fixed-size arrays can have multiple dimensions (e.g. int[3][4]). In the Java backend, fixed-size arrays are represented as array types.
Arrays in parcelable fields always get translated to Vec<T>.

特征对象

AIDL objects can be sent either as a concrete AIDL type or as the type-erased IBinder interface:

birthday_service/aidl/com/example/birthdayservice/IBirthdayInfoProvider.aidl:


package com.example.birthdayservice;

interface IBirthdayInfoProvider {
    String name();
    int years();
}

birthday_service/aidl/com/example/birthdayservice/IBirthdayService.aidl:


import com.example.birthdayservice.IBirthdayInfoProvider;

interface IBirthdayService {
    /** The same thing, but using a binder object. */
    String wishWithProvider(IBirthdayInfoProvider provider);

    /** The same thing, but using `IBinder`. */
    String wishWithErasedProvider(IBinder provider);
}

birthday_service/src/client.rs:


/// Rust struct implementing the `IBirthdayInfoProvider` interface.
struct InfoProvider {
    name: String,
    age: u8,
}

impl binder::Interface for InfoProvider {}

impl IBirthdayInfoProvider for InfoProvider {
    fn name(&self) -> binder::Result<String> {
        Ok(self.name.clone())
    }

    fn years(&self) -> binder::Result<i32> {
        Ok(self.age as i32)
    }
}

fn main() {
    binder::ProcessState::start_thread_pool();
    let service = connect().expect("Failed to connect to BirthdayService");

    // Create a binder object for the `IBirthdayInfoProvider` interface.
    let provider = BnBirthdayInfoProvider::new_binder(
        InfoProvider { name: name.clone(), age: years as u8 },
        BinderFeatures::default(),
    );

    // Send the binder object to the service.
    service.wishWithProvider(&provider)?;

    // Perform the same operation but passing the provider as an `SpIBinder`.
    service.wishWithErasedProvider(&provider.as_binder())?;
}

Speaker Notes

Note the usage of BnBirthdayInfoProvider. This serves the same purpose as BnBirthdayService that we saw previously.

变量

Binder for Rust supports sending parcelables directly:

birthday_service/aidl/com/example/birthdayservice/BirthdayInfo.aidl:


package com.example.birthdayservice;

parcelable BirthdayInfo {
    String name;
    int years;
}

birthday_service/aidl/com/example/birthdayservice/IBirthdayService.aidl:


import com.example.birthdayservice.BirthdayInfo;

interface IBirthdayService {
    /** The same thing, but with a parcelable. */
    String wishWithInfo(in BirthdayInfo info);
}

birthday_service/src/client.rs:


fn main() {
    binder::ProcessState::start_thread_pool();
    let service = connect().expect("Failed to connect to BirthdayService");

    service.wishWithInfo(&BirthdayInfo { name: name.clone(), years })?;
}

Sending Files

Files can be sent between Binder clients/servers using the ParcelFileDescriptor type:

birthday_service/aidl/com/example/birthdayservice/IBirthdayService.aidl:


interface IBirthdayService {
    /** The same thing, but loads info from a file. */
    String wishFromFile(in ParcelFileDescriptor infoFile);
}

birthday_service/src/client.rs:


fn main() {
    binder::ProcessState::start_thread_pool();
    let service = connect().expect("Failed to connect to BirthdayService");

    // Open a file and put the birthday info in it.
    let mut file = File::create("/data/local/tmp/birthday.info").unwrap();
    writeln!(file, "{name}")?;
    writeln!(file, "{years}")?;

    // Create a `ParcelFileDescriptor` from the file and send it.
    let file = ParcelFileDescriptor::new(file);
    service.wishFromFile(&file)?;
}

birthday_service/src/lib.rs:


impl IBirthdayService for BirthdayService {
    fn wishFromFile(
        &self,
        info_file: &ParcelFileDescriptor,
    ) -> binder::Result<String> {
        // Convert the file descriptor to a `File`. `ParcelFileDescriptor` wraps
        // an `OwnedFd`, which can be cloned and then used to create a `File`
        // object.
        let mut info_file = info_file
            .as_ref()
            .try_clone()
            .map(File::from)
            .expect("Invalid file handle");

        let mut contents = String::new();
        info_file.read_to_string(&mut contents).unwrap();

        let mut lines = contents.lines();
        let name = lines.next().unwrap();
        let years: i32 = lines.next().unwrap().parse().unwrap();

        Ok(format!("Happy Birthday {name}, congratulations with the {years} years!"))
    }
}

Speaker Notes

ParcelFileDescriptor wraps an OwnedFd, and so can be created from a File (or any other type that wraps an OwnedFd), and can be used to create a new File handle on the other side.
Other types of file descriptors can be wrapped and sent, e.g. TCP, UDP, and UNIX sockets.

Testing in Android

Building on Testing, we will now look at how unit tests work in AOSP. Use the rust_test module for your unit tests:

testing/Android.bp:


rust_library {
    name: "libleftpad",
    crate_name: "leftpad",
    srcs: ["src/lib.rs"],
}

rust_test {
    name: "libleftpad_test",
    crate_name: "leftpad_test",
    srcs: ["src/lib.rs"],
    host_supported: true,
    test_suites: ["general-tests"],
}

testing/src/lib.rs:


#![allow(unused)]
fn main() {
//! Left-padding library.

/// Left-pad `s` to `width`.
pub fn leftpad(s: &str, width: usize) -> String {
    format!("{s:>width$}")
}

#[cfg(test)]
mod tests {
    use super::*;

    #[test]
    fn short_string() {
        assert_eq!(leftpad("foo", 5), "  foo");
    }

    #[test]
    fn long_string() {
        assert_eq!(leftpad("foobar", 6), "foobar");
    }
}
}

You can now run the test with


atest --host libleftpad_test

The output looks like this:


INFO: Elapsed time: 2.666s, Critical Path: 2.40s
INFO: 3 processes: 2 internal, 1 linux-sandbox.
INFO: Build completed successfully, 3 total actions
//comprehensive-rust-android/testing:libleftpad_test_host            PASSED in 2.3s
    PASSED  libleftpad_test.tests::long_string (0.0s)
    PASSED  libleftpad_test.tests::short_string (0.0s)
Test cases: finished with 2 passing and 0 failing out of 2 test cases

Notice how you only mention the root of the library crate. Tests are found recursively in nested modules.

GoogleTest

The GoogleTest crate allows for flexible test assertions using matchers:


use googletest::prelude::*;

#[googletest::test]
fn test_elements_are() {
    let value = vec!["foo", "bar", "baz"];
    expect_that!(value, elements_are!(eq("foo"), lt("xyz"), starts_with("b")));
}

如果我们将最后一个元素更改为 "!"，测试将失败，并会提供详细的错误消息来指出错误的位置：


---- test_elements_are stdout ----
Value of: value
Expected: has elements:
  0. is equal to "foo"
  1. is less than "xyz"
  2. starts with prefix "!"
Actual: ["foo", "bar", "baz"],
  where element #2 is "baz", which does not start with "!"
  at src/testing/googletest.rs:6:5
Error: See failure output above

Speaker Notes

This slide should take about 5 minutes.

GoogleTest 不是 Rust Playground 的一部分，因此您需要在本地环境中运行此示例。使用 cargo add googletest 快速将其添加到现有 Cargo 项目中。
use googletest::prelude::*; 行会导入一些常用的宏和类型。
这只是冰山一角，还有很多内置匹配器。
A particularly nice feature is that mismatches in multi-line strings are shown as a diff:


#[test]
fn test_multiline_string_diff() {
    let haiku = "Memory safety found,\n\
                 Rust's strong typing guides the way,\n\
                 Secure code you'll write.";
    assert_that!(
        haiku,
        eq("Memory safety found,\n\
            Rust's silly humor guides the way,\n\
            Secure code you'll write.")
    );
}

显示用颜色标识的差异（此处未显示颜色）：


    Value of: haiku
Expected: is equal to "Memory safety found,\nRust's silly humor guides the way,\nSecure code you'll write."
Actual: "Memory safety found,\nRust's strong typing guides the way,\nSecure code you'll write.",
  which isn't equal to "Memory safety found,\nRust's silly humor guides the way,\nSecure code you'll write."
Difference(-actual / +expected):
 Memory safety found,
-Rust's strong typing guides the way,
+Rust's silly humor guides the way,
 Secure code you'll write.
  at src/testing/googletest.rs:17:5

crate 是适用于 C++ 的 GoogleTest 的 Rust 移植版。

模拟

对于模拟，Mockall 是一个广泛使用的库。您需要重构代码才能使用 trait，然后便可很快地对其进行模拟：


use std::time::Duration;

#[mockall::automock]
pub trait Pet {
    fn is_hungry(&self, since_last_meal: Duration) -> bool;
}

#[test]
fn test_robot_dog() {
    let mut mock_dog = MockPet::new();
    mock_dog.expect_is_hungry().return_const(true);
    assert_eq!(mock_dog.is_hungry(Duration::from_secs(10)), true);
}

Speaker Notes

This slide should take about 5 minutes.

Mockall is the recommended mocking library in Android (AOSP). There are other mocking libraries available on crates.io, in particular in the area of mocking HTTP services. The other mocking libraries work in a similar fashion as Mockall, meaning that they make it easy to get a mock implementation of a given trait.
请注意，模拟在某种程度上具有 争议性：借助模拟，您可以将测试与其依赖项完全隔离。最立竿见影的是，测试作业会更快且更稳定。另一方面，模拟对象的配置可能出现错误，并返回与真实依赖项不同的输出。

建议您尽可能使用真实依赖项。例如，许多数据库都支持您配置内存后端。这意味着，您可以在测试中获得正确的功能行为，而且测试速度会很快并会自动清理。

同样，许多 Web 框架都支持您启动进程内服务器，该服务器会绑定到 localhost 上的随机端口。相比模拟框架，请始终优先选择这种方式，因为这有助于您在真实环境中测试代码。
Mockall 不是 Rust Playground 的一部分，因此您需要在本地环境中运行此示例。使用 cargo add mockall 快速将 Mockall 添加到现有 Cargo 项目中。
Mockall 具有更多功能。具体而言，您可以设置基于传递参数的预期值。在这里，我们使用该功能来模拟一只猫，它在上次被喂食的 3 小时后会感到饥饿：


#[test]
fn test_robot_cat() {
    let mut mock_cat = MockPet::new();
    mock_cat
        .expect_is_hungry()
        .with(mockall::predicate::gt(Duration::from_secs(3 * 3600)))
        .return_const(true);
    mock_cat.expect_is_hungry().return_const(false);
    assert_eq!(mock_cat.is_hungry(Duration::from_secs(1 * 3600)), false);
    assert_eq!(mock_cat.is_hungry(Duration::from_secs(5 * 3600)), true);
}

您可以使用 .times(n) 将调用模拟方法的次数限制为 n，如果不满足此条件，模拟对象被释放时会自动 panic。

日志记录

你应该使用 log crate 来自动记录日志到 logcat （设备上）或 stdout（主机上）：

hello_rust_logs/Android.bp:


rust_binary {
    name: "hello_rust_logs",
    crate_name: "hello_rust_logs",
    srcs: ["src/main.rs"],
    rustlibs: [
        "liblog_rust",
        "liblogger",
    ],
    host_supported: true,
}

hello_rust_logs/src/main.rs:


//! Rust logging demo.

use log::{debug, error, info};

/// Logs a greeting.
fn main() {
    logger::init(
        logger::Config::default()
            .with_tag_on_device("rust")
            .with_min_level(log::Level::Trace),
    );
    debug!("Starting program.");
    info!("Things are going fine.");
    error!("Something went wrong!");
}

在你的设备上构建，推送，并运行二进制文件：


m hello_rust_logs
adb push "$ANDROID_PRODUCT_OUT/system/bin/hello_rust_logs" /data/local/tmp
adb shell /data/local/tmp/hello_rust_logs

日志将会在 adb logcat 中显示：


adb logcat -s rust


09-08 08:38:32.454  2420  2420 D rust: hello_rust_logs: Starting program.
09-08 08:38:32.454  2420  2420 I rust: hello_rust_logs: Things are going fine.
09-08 08:38:32.454  2420  2420 E rust: hello_rust_logs: Something went wrong!

互操作性

Rust 对于与其他编程语言的互操作性有着出色的支持。这意味着您可以：

从其他语言调用 Rust 函数。
从 Rust 调用用其他语言编写的函数。

当您从外部语言调用函数时，我们称之为使用 外部函数接口（Foreign Function Interface， FFI）。

与 C 的互操作性

Rust 对使用 C 调用约定链接目标文件提供了完整的支持。同样地，你可以导出 Rust 函数并从 C 中调用它们。

如果你愿意的话，你可以手工完成它：


extern "C" {
    fn abs(x: i32) -> i32;
}

fn main() {
    let x = -42;
    // SAFETY: `abs` doesn't have any safety requirements.
    let abs_x = unsafe { abs(x) };
    println!("{x}, {abs_x}");
}

我们已经在安全 FFI 封装容器练习中看到了这个例子。

这假设对目标平台拥有充分的了解，不建议用于生产环境。

接下来我们将探讨更好的选择。

使用 Bindgen

bindgen 工具可以自动生成 C 头文件的绑定代码。

首先创建一个小型的 C 语言库：

interoperability/bindgen/libbirthday.h:


typedef struct card {
  const char* name;
  int years;
} card;

void print_card(const card* card);

interoperability/bindgen/libbirthday.c:


#include <stdio.h>
#include "libbirthday.h"

void print_card(const card* card) {
  printf("+--------------\n");
  printf("| Happy Birthday %s!\n", card->name);
  printf("| Congratulations with the %i years!\n", card->years);
  printf("+--------------\n");
}

将该库添加到你的 Android.bp 文件中：

interoperability/bindgen/Android.bp:


cc_library {
    name: "libbirthday",
    srcs: ["libbirthday.c"],
}

为该库创建一个包装头文件（在此示例中不是必需的）：

interoperability/bindgen/libbirthday_wrapper.h:


#include "libbirthday.h"

您现在可以自动生成绑定代码：

interoperability/bindgen/Android.bp:


rust_bindgen {
    name: "libbirthday_bindgen",
    crate_name: "birthday_bindgen",
    wrapper_src: "libbirthday_wrapper.h",
    source_stem: "bindings",
    static_libs: ["libbirthday"],
}

最后，我们可以在 Rust 程序中使用这些绑定：

interoperability/bindgen/Android.bp:


rust_binary {
    name: "print_birthday_card",
    srcs: ["main.rs"],
    rustlibs: ["libbirthday_bindgen"],
}

interoperability/bindgen/main.rs:


//! Bindgen demo.

use birthday_bindgen::{card, print_card};

fn main() {
    let name = std::ffi::CString::new("Peter").unwrap();
    let card = card { name: name.as_ptr(), years: 42 };
    // SAFETY: The pointer we pass is valid because it came from a Rust
    // reference, and the `name` it contains refers to `name` above which also
    // remains valid. `print_card` doesn't store either pointer to use later
    // after it returns.
    unsafe {
        print_card(&card as *const card);
    }
}

在你的设备上构建，推送，并运行二进制文件：


m print_birthday_card
adb push "$ANDROID_PRODUCT_OUT/system/bin/print_birthday_card" /data/local/tmp
adb shell /data/local/tmp/print_birthday_card

最后，我们可以运行自动生成的测试来确保绑定代码正常工作：

interoperability/bindgen/Android.bp:


rust_test {
    name: "libbirthday_bindgen_test",
    srcs: [":libbirthday_bindgen"],
    crate_name: "libbirthday_bindgen_test",
    test_suites: ["general-tests"],
    auto_gen_config: true,
    clippy_lints: "none", // Generated file, skip linting
    lints: "none",
}


atest libbirthday_bindgen_test

调用 Rust

将 Rust 函数和类型导出到 C 很简单：

interoperability/rust/libanalyze/analyze.rs


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

interoperability/rust/libanalyze/analyze.h


#ifndef ANALYSE_H
#define ANALYSE_H

extern "C" {
void analyze_numbers(int x, int y);
}

#endif

interoperability/rust/libanalyze/Android.bp


rust_ffi {
    name: "libanalyze_ffi",
    crate_name: "analyze_ffi",
    srcs: ["analyze.rs"],
    include_dirs: ["."],
}

我们现在可以从一个 C 二进制文件中调用它：

interoperability/rust/analyze/main.c


#include "analyze.h"

int main() {
  analyze_numbers(10, 20);
  analyze_numbers(123, 123);
  return 0;
}

interoperability/rust/analyze/Android.bp


cc_binary {
    name: "analyze_numbers",
    srcs: ["main.c"],
    static_libs: ["libanalyze_ffi"],
}

在你的设备上构建，推送，并运行二进制文件：


m analyze_numbers
adb push "$ANDROID_PRODUCT_OUT/system/bin/analyze_numbers" /data/local/tmp
adb shell /data/local/tmp/analyze_numbers

Speaker Notes

#[no_mangle] 禁用了 Rust 通常的名称重整，因此导出的符号将仅为函数的名称。你还可以使用 #[export_name = "some_name"] 来指定任意你想要的名称。

与 C++ 交互

CXX crate 使得在 Rust 和 C++ 之间进行安全的互操作成为可能。

整体的方法如下：

桥接模块

CXX依赖于提供的函数签名说明，这些签名会在不用语言之间进行交互使用。您可以在带有 #[cxx::bridge] 属性宏注解的 Rust 模块中使用 extern 代码块提供此说明。


#[allow(unsafe_op_in_unsafe_fn)]
#[cxx::bridge(namespace = "org::blobstore")]
mod ffi {
    // Shared structs with fields visible to both languages.
    struct BlobMetadata {
        size: usize,
        tags: Vec<String>,
    }

    // Rust types and signatures exposed to C++.
    extern "Rust" {
        type MultiBuf;

        fn next_chunk(buf: &mut MultiBuf) -> &[u8];
    }

    // C++ types and signatures exposed to Rust.
    unsafe extern "C++" {
        include!("include/blobstore.h");

        type BlobstoreClient;

        fn new_blobstore_client() -> UniquePtr<BlobstoreClient>;
        fn put(self: Pin<&mut BlobstoreClient>, parts: &mut MultiBuf) -> u64;
        fn tag(self: Pin<&mut BlobstoreClient>, blobid: u64, tag: &str);
        fn metadata(&self, blobid: u64) -> BlobMetadata;
    }
}

Speaker Notes

桥接通常在您的 crate 内的 ffi 模块中声明。
根据在桥接模块中进行的声明，CXX 将生成匹配的 Rust 和 C++ 类型/函数定义，以便将这些内容公开给这两种语言。
如需查看生成的 Rust 代码，请使用 cargo-expand 查看展开后的 proc 宏。对于大多数示例，您可以使用 cargo expand ::ffi 来仅展开 ffi 模块（但这不适用于 Android 项目）。
如需查看生成的 C++ 代码，请在 target/cxxbridge 中查找。

Rust Bridge Declarations


#[cxx::bridge]
mod ffi {
    extern "Rust" {
        type MyType; // Opaque type
        fn foo(&self); // Method on `MyType`
        fn bar() -> Box<MyType>; // Free function
    }
}

struct MyType(i32);

impl MyType {
    fn foo(&self) {
        println!("{}", self.0);
    }
}

fn bar() -> Box<MyType> {
    Box::new(MyType(123))
}

Speaker Notes

extern "Rust" 中声明的内容引用了父级模块中作用域内的内容。
CXX 代码生成器使用 extern "Rust" 部分生成包含相应 C++ 声明的 C++ 头文件。生成的头文件与包含桥接的 Rust 源文件的路径相同，但文件扩展名为 .rs.h。

生成的 C++ 代码


#[cxx::bridge]
mod ffi {
    // Rust types and signatures exposed to C++.
    extern "Rust" {
        type MultiBuf;

        fn next_chunk(buf: &mut MultiBuf) -> &[u8];
    }
}

大致生成以下 C++：


struct MultiBuf final : public ::rust::Opaque {
  ~MultiBuf() = delete;

private:
  friend ::rust::layout;
  struct layout {
    static ::std::size_t size() noexcept;
    static ::std::size_t align() noexcept;
  };
};

::rust::Slice<::std::uint8_t const> next_chunk(::org::blobstore::MultiBuf &buf) noexcept;

C++ 桥接声明


#[cxx::bridge]
mod ffi {
    // C++ types and signatures exposed to Rust.
    unsafe extern "C++" {
        include!("include/blobstore.h");

        type BlobstoreClient;

        fn new_blobstore_client() -> UniquePtr<BlobstoreClient>;
        fn put(self: Pin<&mut BlobstoreClient>, parts: &mut MultiBuf) -> u64;
        fn tag(self: Pin<&mut BlobstoreClient>, blobid: u64, tag: &str);
        fn metadata(&self, blobid: u64) -> BlobMetadata;
    }
}

大致生成以下 Rust：


#[repr(C)]
pub struct BlobstoreClient {
    _private: ::cxx::private::Opaque,
}

pub fn new_blobstore_client() -> ::cxx::UniquePtr<BlobstoreClient> {
    extern "C" {
        #[link_name = "org$blobstore$cxxbridge1$new_blobstore_client"]
        fn __new_blobstore_client() -> *mut BlobstoreClient;
    }
    unsafe { ::cxx::UniquePtr::from_raw(__new_blobstore_client()) }
}

impl BlobstoreClient {
    pub fn put(&self, parts: &mut MultiBuf) -> u64 {
        extern "C" {
            #[link_name = "org$blobstore$cxxbridge1$BlobstoreClient$put"]
            fn __put(
                _: &BlobstoreClient,
                parts: *mut ::cxx::core::ffi::c_void,
            ) -> u64;
        }
        unsafe {
            __put(self, parts as *mut MultiBuf as *mut ::cxx::core::ffi::c_void)
        }
    }
}

// ...

Speaker Notes

程序员无需承诺他们输入的签名准确无误。CXX 会执行静态断言，确认签名与 C++ 中声明的内容完全一致。
借助 unsafe extern 代码块，您可以声明可从 Rust 安全调用的 C++ 函数。

共享类型


#[cxx::bridge]
mod ffi {
    #[derive(Clone, Debug, Hash)]
    struct PlayingCard {
        suit: Suit,
        value: u8,  // A=1, J=11, Q=12, K=13
    }

    enum Suit {
        Clubs,
        Diamonds,
        Hearts,
        Spades,
    }
}

Speaker Notes

仅支持类似 C 函数（单元）的枚举。
共享类型的 #[derive()] 支持有限数量的 trait。系统还会针对 C++ 代码生成相应的功能，例如，如果您派生了 Hash，还会为相应的 C++ 类型生成 std::hash 实现。

共享枚举


#[cxx::bridge]
mod ffi {
    enum Suit {
        Clubs,
        Diamonds,
        Hearts,
        Spades,
    }
}

Generated Rust:


#![allow(unused)]
fn main() {
#[derive(Copy, Clone, PartialEq, Eq)]
#[repr(transparent)]
pub struct Suit {
    pub repr: u8,
}

#[allow(non_upper_case_globals)]
impl Suit {
    pub const Clubs: Self = Suit { repr: 0 };
    pub const Diamonds: Self = Suit { repr: 1 };
    pub const Hearts: Self = Suit { repr: 2 };
    pub const Spades: Self = Suit { repr: 3 };
}
}

Generated C++:


enum class Suit : uint8_t {
  Clubs = 0,
  Diamonds = 1,
  Hearts = 2,
  Spades = 3,
};

Speaker Notes

在 Rust 端，为共享枚举生成的代码实际上是封装数值的结构体。这是因为在 C++ 中，枚举类存储与所有已列变体不同的值不属于 UB，而 Rust 表示法需要具有相同的行为。

Rust 错误处理


#[cxx::bridge]
mod ffi {
    extern "Rust" {
        fn fallible(depth: usize) -> Result<String>;
    }
}

fn fallible(depth: usize) -> anyhow::Result<String> {
    if depth == 0 {
        return Err(anyhow::Error::msg("fallible1 requires depth > 0"));
    }

    Ok("Success!".into())
}

Speaker Notes

在 C++ 方面，返回 Result 的 Rust 函数会被翻译为异常。
抛出的异常始终是 rust::Error 类型，该类型主要用于提供获取错误消息字符串的方法。错误消息将由错误类型的 Display impl 提供。
当 panic 从 Rust 展开到 C++ 时，会始终导致进程立即终止。

C++ 错误处理


#[cxx::bridge]
mod ffi {
    unsafe extern "C++" {
        include!("example/include/example.h");
        fn fallible(depth: usize) -> Result<String>;
    }
}

fn main() {
    if let Err(err) = ffi::fallible(99) {
        eprintln!("Error: {}", err);
        process::exit(1);
    }
}

Speaker Notes

声明用于返回 Result 的 C++ 函数将捕获 C++ 端抛出的任何异常，并将其作为 Err 值返回给调用 Rust 函数。
如果外部 “C++”函数抛出l了异常，但 CXX 桥接中未声明该函数用于返回 Result，则程序会调用 C++ 的 std::terminate。此行为等同于通过 C++ 函数 nowithout 抛出了相同的异常。

其他类型

Rust Type	C++ Type
String	`rust::String`
`&str`	`rust::Str`
`CxxString`	`std::string`
`&[T]`/`&mut [T]`	`rust::Slice`
`Box<T>`	`rust::Box<T>`
`UniquePtr<T>`	`std::unique_ptr<T>`
`Vec<T>`	`rust::Vec<T>`
`CxxVector<T>`	`std::vector<T>`

Speaker Notes

这些类型可用于共享结构体的字段以及外部函数的参数和返回结果。
请注意，Rust 的 String 不会直接映射到 std::string。导致这种情况的原因有以下几种：
- std::string 不遵循 String 所需的 UTF-8 不变性。
- 这两种类型的内存布局不同，因此无法直接在语言之间进行传递。
- std::string 需要与 Rust 的移动语义不匹配的 move 构造函数，因此 std::string 无法按值传递给 Rust。

Building in Android

创建 cc_library_static 以构建 C++ 库，包括 CXX 生成的头文件和源文件。


cc_library_static {
    name: "libcxx_test_cpp",
    srcs: ["cxx_test.cpp"],
    generated_headers: [
        "cxx-bridge-header",
        "libcxx_test_bridge_header"
    ],
    generated_sources: ["libcxx_test_bridge_code"],
}

Speaker Notes

指出 libcxx_test_bridge_header 和 libcxx_test_bridge_code 是 CXX 生成的 C++ 绑定的依赖项。我们将在下一张幻灯片中介绍具体的设置方法。
请注意，您还需要依靠 cxx-bridge-header 库才能提取常见的 CXX 定义。
如需了解如何在 Android 中使用 CXX 的完整文档，请参阅 Android 文档。建议您与全班同学分享该链接，以便学生知道日后可以在哪里找到这些说明。

Building in Android

创建两个 genrule：一个用于生成 CXX 头文件，另一个用于生成 CXX 源文件。然后，这些内容会被用作 cc_library_static 的输入。


// Generate a C++ header containing the C++ bindings
// to the Rust exported functions in lib.rs.
genrule {
    name: "libcxx_test_bridge_header",
    tools: ["cxxbridge"],
    cmd: "$(location cxxbridge) $(in) --header > $(out)",
    srcs: ["lib.rs"],
    out: ["lib.rs.h"],
}

// Generate the C++ code that Rust calls into.
genrule {
    name: "libcxx_test_bridge_code",
    tools: ["cxxbridge"],
    cmd: "$(location cxxbridge) $(in) > $(out)",
    srcs: ["lib.rs"],
    out: ["lib.rs.cc"],
}

Speaker Notes

cxxbridge 工具是一款独立工具，用于生成桥接模块的 C++ 端。它包含在 Android 中，并作为 Soong 工具提供。
按照惯例，如果您的 Rust 源文件是 lib.rs，则头文件将命名为 lib.rs.h，源文件将命名为 lib.rs.cc。不过，系统并不强制执行此命名惯例。

Building in Android

创建一个依赖于 libcxx 和 cc_library_static 的 rust_binary。


rust_binary {
    name: "cxx_test",
    srcs: ["lib.rs"],
    rustlibs: ["libcxx"],
    static_libs: ["libcxx_test_cpp"],
}

与 Java 的互操作性

Java可以通过 Java 本地接口 (JNI) 加载共享对象。jni crate 允许您创建一个兼容的库。

首先，我们创建一个可以导出到 Java 的 Rust 函数：

interoperability/java/src/lib.rs:


#![allow(unused)]
fn main() {
//! Rust <-> Java FFI demo.

use jni::objects::{JClass, JString};
use jni::sys::jstring;
use jni::JNIEnv;

/// HelloWorld::hello method implementation.
#[no_mangle]
pub extern "system" fn Java_HelloWorld_hello(
    env: JNIEnv,
    _class: JClass,
    name: JString,
) -> jstring {
    let input: String = env.get_string(name).unwrap().into();
    let greeting = format!("Hello, {input}!");
    let output = env.new_string(greeting).unwrap();
    output.into_inner()
}
}

interoperability/java/Android.bp:


rust_ffi_shared {
    name: "libhello_jni",
    crate_name: "hello_jni",
    srcs: ["src/lib.rs"],
    rustlibs: ["libjni"],
}

We then call this function from Java:

interoperability/java/HelloWorld.java:


class HelloWorld {
    private static native String hello(String name);

    static {
        System.loadLibrary("hello_jni");
    }

    public static void main(String[] args) {
        String output = HelloWorld.hello("Alice");
        System.out.println(output);
    }
}

interoperability/java/Android.bp:


java_binary {
    name: "helloworld_jni",
    srcs: ["HelloWorld.java"],
    main_class: "HelloWorld",
    required: ["libhello_jni"],
}

最后，您可以构建、同步和运行二进制文件：


m helloworld_jni
adb sync  # requires adb root && adb remount
adb shell /system/bin/helloworld_jni

习题

这是一个小组练习：我们将查看你们正在处理的项目之一，并尝试将一些 Rust 代码集成进去。以下是一些建议：

使用 Rust 编写的客户端调用你的 AIDL 服务。
将你项目中的某个函数迁移到 Rust 中并调用它。

Speaker Notes

此处没有提供解决方案，因为这是开放式的：它依赖于班级中是否有人有一段您可以即时转换成 Rust 的代码。

Welcome to Rust in Chromium

Chromium 中的第三方库支持 Rust，并使用第一方粘合代码连接 Rust 和现有 Chromium C++ 代码。

今天，我们将调用 Rust 对字符串进行一些有趣的操作。如果您的代码中某个部分是用于向用户展示 UTF8 字符串，那么可以在代码库中的对应部分按照这个步骤来操作，而不一定要在我们所讨论的确切部分。

设置

请确保您可以构建并运行 Chromium。只要您的代码较新（提交位置始于 1223636，对应于 2023 年 11 月），则可在任何平台和任何一组 build 标志下运行。


gn gen out/Debug
autoninja -C out/Debug chrome
out/Debug/chrome # or on Mac, out/Debug/Chromium.app/Contents/MacOS/Chromium

（建议使用调试 build 组件，以缩短迭代时间。这是默认值！）

如果您不具备这点，请参阅如何构建 Chromium。注意：设置 build Chromium 需要花些时间。

此外，我们还建议您安装 Visual Studio 代码。

About the exercises

本课程的这一部分包含一系列练习，它们之间是相辅相成的。我们将在整个课程中进行这些练习，而不仅仅是在最后阶段完成。如果您没有时间完成某个部分，也无需担心：可以在下一阶段赶上进度。

Chromium 和 Cargo 的生态对比

The Rust community typically uses cargo and libraries from crates.io. Chromium is built using gn and ninja and a curated set of dependencies.

使用 Rust 编写代码时，您可以选择：

借助 //build/rust/*.gni 模板（例如 rust_static_library，我们稍后会介绍）使用 gn 和 ninja。该操作会使用经过审核的 Chromium 工具链和 crate。
使用 cargo，但仅限于经过审核的 Chromium 工具链和 crate
使用 cargo，信任工具链和/或从互联网下载的 crate

From here on we’ll be focusing on gn and ninja, because this is how Rust code can be built into the Chromium browser. At the same time, Cargo is an important part of the Rust ecosystem and you should keep it in your toolbox.

Mini exercise

分成各个小组开展以下活动：

思考 cargo 在哪些场景下具有优势，并评估这些场景的风险状况。
讨论在使用 gn 和 ninja 以及离线 cargo 等时，需要信任哪些工具、库和人群。

Speaker Notes

Ask students to avoid peeking at the speaker notes before completing the exercise. Assuming folks taking the course are physically together, ask them to discuss in small groups of 3-4 people.

与第一部分练习相关的备注/提示（“Cargo 可能具有优势的场景”）：

It’s fantastic that when writing a tool, or prototyping a part of Chromium, one has access to the rich ecosystem of crates.io libraries. There is a crate for almost anything and they are usually quite pleasant to use. (clap for command-line parsing, serde for serializing/deserializing to/from various formats, itertools for working with iterators, etc.).
- 借助 cargo，您便可以轻松试用库（只需向 `Cargo.toml`` 添加一行代码然后开始编写即可）
- 不妨比较下 CPAN 是如何帮助 perl 成为热门之选的。或者与 python + pip 进行比较。
Development experience is made really nice not only by core Rust tools (e.g. using rustup to switch to a different rustc version when testing a crate that needs to work on nightly, current stable, and older stable) but also by an ecosystem of third-party tools (e.g. Mozilla provides cargo vet for streamlining and sharing security audits; criterion crate gives a streamlined way to run benchmarks).
- 借助 cargo，您可通过 cargo install --locked cargo-vet 轻松添加工具。
- 不妨与 Chrome 扩展程序或 VScode 扩展程序进行比较。
以下是一些适合选用 cargo，较为宽泛的通用项目示例：
- Perhaps surprisingly, Rust is becoming increasingly popular in the industry for writing command line tools. The breadth and ergonomics of libraries is comparable to Python, while being more robust (thanks to the rich typesystem) and running faster (as a compiled, rather than interpreted language).
- 如要加入 Rust 生态系统，必须使用 Cargo 等标准 Rust 工具。如果希望库获得外部贡献，并且能够用于除 Chromium 之外（例如用于 Bazel 或 Android/Soong 构建环境）的项目，则应该使用 Cargo。
基于 cargo\ 的 Chromium 相关项目示例：
- serde_json_lenient（已在 Google 的其他部分进行了实验，结果是能使 PR 性能得到提升）
- 字体库（例如 font-types）
- gnrt 工具（我们将在本课程稍后的部分中加以介绍）使用 clap 进行命令行解析，使用 toml 处理配置文件。
  - Disclaimer: a unique reason for using cargo was unavailability of gn when building and bootstrapping Rust standard library when building Rust toolchain.
  - run_gnrt.py uses Chromium’s copy of cargo and rustc. gnrt depends on third-party libraries downloaded from the internet, but run_gnrt.py asks cargo that only --locked content is allowed via Cargo.lock.)

学生可将以下内容认定为隐式信任或明确信任：

rustc（Rust 编译器）则依赖于 LLVM 库、Clang 编译器、“rustc”源代码（从 GitHub 获取，并由 Rust 编译器团队审核）、为引导而下载的二进制 Rust 编译器
rustup（值得注意的是，rustup 是在 https://github.com/rust-lang/ 保护下组织开发的，与 rustc 相同。）
cargo、rustfmt 等
各种内部基础架构（用于构建 rustc 的聊天机器人、用于将预构建工具链分发给 Chromium 工程师的系统等。）
cargo Audit、cargo vet 等 Cargo 工具
包含到 //third_party/rust 的 Rust 库（由 security@chromium.org 进行审核）
其他 Rust 库（一些小众但很受欢迎又常用的库）

Chromium Rust 政策

Chromium 尚不支持使用第一方 Rust，除非是经过区域技术主管批准的特殊情况。

有关 Chromium 的第三方库政策，请参阅此处。根据该这些政策，在很多情况下允许将 Rust 作为第三方库使用，包括无论是在性能还是安全方面，它们都是理想之选。

鲜少有 Rust 库会直接公开 C/C++ API，这意味着几乎所有此类库都需要使用少量的第一方粘合代码。

特定第三方 crate 的第一方 Rust 粘合代码通常应放在 third_party/rust/<crate>/<version>/wrapper 目录中。

因此，今天的课程将重点介绍以下内容：

引入第三方 Rust 库 (“crates”)
编写粘合代码，以便能够从 Chromium C++ 中使用这些 crate。

如果此政策随时间而发生变化，则本课程也会随之更新。

Build rules

Rust 代码通常使用 cargo 构建而成。为提高效率，Chromium 使用 gn 和 ninja 进行构建，其静态规则支持实现最大程度的并行处理。Rust 也不例外。

Adding Rust code to Chromium

在一些现有的 Chromium BUILD.gn 文件中，声明 rust_static_library：


import("//build/rust/rust_static_library.gni")

rust_static_library("my_rust_lib") {
  crate_root = "lib.rs"
  sources = [ "lib.rs" ]
}

您还可以在其他 Rust 目标上添加 deps。稍后，我们通过该操作来使用第三方代码。

Speaker Notes

您必须同时指定 crate 根目录和完整的源代码列表。crate_root 是供 Rust 编译器使用的文件，表示编译单元的根文件，通常为 lib.rs。sources 是所有源文件的完整列表，ninja 需要用它来确定何时该进行重新构建。

（在 Rust 中，并不存在所谓的 Rust source_set，因为整个 crate 就是一个编译单元。static_library 是最小的单元。）

学生可能会疑惑为何我们需要 gn 模板，而不使用 gn 内置的 Rust 静态库支持进行操作。原因是此模板支持 CXX 互操作性、各项 Rust 功能以及单元测试，我们稍后便会用到其中的一些功能。

Including `unsafe` Rust Code

默认情况下，禁止在 rust_static_library 中使用不安全的 Rust 代码，因为此类代码无法编译。如需使用不安全的 Rust 代码，请将 allow_unsafe = true 添加到 gn 目标中。（在本课程的稍后部分，我们将介绍在哪些情形下必须这样做。）


import("//build/rust/rust_static_library.gni")

rust_static_library("my_rust_lib") {
  crate_root = "lib.rs"
  sources = [
    "lib.rs",
    "hippopotamus.rs"
  ]
  allow_unsafe = true
}

在 Chromium C++ 中导入 Rust 代码

只需将上述目标添加到某些 Chromium C++ 目标的 deps 中即可。


import("//build/rust/rust_static_library.gni")

rust_static_library("my_rust_lib") {
  crate_root = "lib.rs"
  sources = [ "lib.rs" ]
}

# or source_set, static_library etc.
component("preexisting_cpp") {
  deps = [ ":my_rust_lib" ]
}

Speaker Notes

We'll see that this relationship only works if the Rust code exposes plain C APIs which can be called from C++, or if we use a C++/Rust interop tool.

Visual Studio Code

在 Rust 代码中，通常会省略类型，这使得拥有强大的 IDE 甚至比 C++ 更为有用。Visual Studio Code 非常适用于在 Chromium 中处理 Rust 代码。如需使用此功能，

请确保您的 VSCode 安装了 rust-analyzer 扩展程序，而不是较早版本的 Rust 支持插件。
gn gen out/Debug --export-rust-project（或输出目录的等效项）
In -s out/Debug/rust-project.json rust-project.json

Speaker Notes

如果受众群体对 IDE 持有怀疑态度，不妨演示下 rust-analyzer 的部分代码注解和探索功能，或许能让他们改观。

以下步骤可能会对演示有所帮助（但您也可以选用一段最为熟悉的 Chromium 相关 Rust 代码）：

打开 components/qr_code_generator/qr_code_generator_ffi_glue.rs
将光标置于 `qr_code_generator_ffi_glue.rs 中的 QrCode::new 调用（大约第 26 行）上
Demo show documentation (typical bindings: vscode = ctrl k i; vim/CoC = K).
Demo go to definition (typical bindings: vscode = F12; vim/CoC = g d). (This will take you to //third_party/rust/.../qr_code-.../src/lib.rs.)
演示 outline 并前往 QrCode::with_bits 方法（大约第 164 行；大纲位于 vscode 中的文件资源管理器窗格中；典型绑定：vim/CoC = space o）
演示 type annotations（QrCode::with_bits 方法中提供了一些很好的示例）

值得注意的是，修改 BUILD.gn 文件后，需要重新运行 gn gen ... --export-rust-project（我们将在本课程的练习中多次执行该操作）。

Build rules exercise

在 Chromium build 中，向 //ui/base/BUILD.gn 添加新的 Rust 目标，其中包含：


#![allow(unused)]
fn main() {
#[no_mangle]
pub extern "C" fn hello_from_rust() {
    println!("Hello from Rust!")
}
}

Important: note that no_mangle here is considered a type of unsafety by the Rust compiler, so you’ll need to allow unsafe code in your gn target.

将这个新的 Rust 目标添加为 //ui/base:base 的依赖项。在 ui/base/resource/resource_bundle.cc 顶部声明此函数（稍后，我们将介绍如何通过绑定生成工具来自动执行此操作）：


extern "C" void hello_from_rust();

从 ui/base/resource/resource_bundle.cc 中的某个位置调用此函数，我们建议在从 ResourceBundle::MaybeMangleLocalizedString 的顶部调用此函数。构建并运行 Chromium，并确保多次显示 “Hello from Rust!”。

如果您使用 VSCode，现在就请设置 Rust，以便其能在 VSCode 中正常运行。这对后续练习会很有帮助。如果操作成功，则可使用右键点击 println! 上的 “Go to definition”。

如何获取帮助

适用于 [rust_static_library gn 模板] 的选项 (https://source.chromium.org/chromium/chromium/src/+/main:build/rust/rust_static_library.gni;l=16)
关于 [#[no_mangle]] 的详细信息 (https://doc.rust-lang.org/beta/reference/abi.html#the-no_mangle-attribute)
关于 [extern "C"] 的详细信息 (https://doc.rust-lang.org/std/keyword.extern.html)
关于 gn 的 [--export-rust-project] 开关的详细信息 (https://gn.googlesource.com/gn/+/main/docs/reference.md#compilation-database)
如何在 VSCode 中安装 rust-analyzer

Speaker Notes

It's really important that students get this running, because future exercises will build on it.

此示例很独特，因为其归根结底是最通用的互操作语言，即 C 语言。C++ 和 Rust 本身都可以声明和调用 C ABI 函数。在本课程的稍后部分，我们会直接将 C++ 和 Rust 关联起来。

此处需要使用 allow_unsafe = true，因为 #[no_mangle] 可能会支持 Rust 生成两个同名函数，而 Rust 无法保证会调用正确的函数。

如果需要纯 Rust 可执行文件，也可以使用 rust_executable gn 模板执行此操作。

测试

Rust community typically authors unit tests in a module placed in the same source file as the code being tested. This was covered earlier in the course and looks like this:


#![allow(unused)]
fn main() {
#[cfg(test)]
mod tests {
    #[test]
    fn my_test() {
        todo!()
    }
}
}

In Chromium we place unit tests in a separate source file and we continue to follow this practice for Rust — this makes tests consistently discoverable and helps to avoid rebuilding .rs files a second time (in the test configuration).

This results in the following options for testing Rust code in Chromium:

Native Rust tests (i.e. #[test]). Discouraged outside of //third_party/rust.
gtest tests authored in C++ and exercising Rust via FFI calls. Sufficient when Rust code is just a thin FFI layer and the existing unit tests provide sufficient coverage for the feature.
gtest tests authored in Rust and using the crate under test through its public API (using pub mod for_testing { ... } if needed). This is the subject of the next few slides.

Speaker Notes

Mention that native Rust tests of third-party crates should eventually be exercised by Chromium bots. (Such testing is needed rarely — only after adding or updating third-party crates.)

Some examples may help illustrate when C++ gtest vs Rust gtest should be used:

QR has very little functionality in the first-party Rust layer (it’s just a thin FFI glue) and therefore uses the existing C++ unit tests for testing both the C++ and the Rust implementation (parameterizing the tests so they enable or disable Rust using a ScopedFeatureList).
Hypothetical/WIP PNG integration may need to implement memory-safe implementation of pixel transformations that are provided by libpng but missing in the png crate - e.g. RGBA => BGRA, or gamma correction. Such functionality may benefit from separate tests authored in Rust.

`rust_gtest_interop` 库

The rust_gtest_interop library provides a way to:

Use a Rust function as a gtest testcase (using the #[gtest(...)] attribute)
Use expect_eq! and similar macros (similar to assert_eq! but not panicking and not terminating the test when the assertion fails).

Example:


use rust_gtest_interop::prelude::*;

#[gtest(MyRustTestSuite, MyAdditionTest)]
fn test_addition() {
    expect_eq!(2 + 2, 4);
}

Rust 测试的 GN 规则

The simplest way to build Rust gtest tests is to add them to an existing test binary that already contains tests authored in C++. For example:


test("ui_base_unittests") {
  ...
  sources += [ "my_rust_lib_unittest.rs" ]
  deps += [ ":my_rust_lib" ]
}

Authoring Rust tests in a separate static_library also works, but requires manually declaring the dependency on the support libraries:


rust_static_library("my_rust_lib_unittests") {
  testonly = true
  is_gtest_unittests = true
  crate_root = "my_rust_lib_unittest.rs"
  sources = [ "my_rust_lib_unittest.rs" ]
  deps = [
    ":my_rust_lib",
    "//testing/rust_gtest_interop",
  ]
}

test("ui_base_unittests") {
  ...
  deps += [ ":my_rust_lib_unittests" ]
}

`chromium::import!` 宏

After adding :my_rust_lib to GN deps, we still need to learn how to import and use my_rust_lib from my_rust_lib_unittest.rs. We haven’t provided an explicit crate_name for my_rust_lib so its crate name is computed based on the full target path and name. Fortunately we can avoid working with such an unwieldy name by using the chromium::import! macro from the automatically-imported chromium crate:


chromium::import! {
    "//ui/base:my_rust_lib";
}

use my_rust_lib::my_function_under_test;

Under the covers the macro expands to something similar to:


extern crate ui_sbase_cmy_urust_ulib as my_rust_lib;

use my_rust_lib::my_function_under_test;

More information can be found in the doc comment of the chromium::import macro.

Speaker Notes

rust_static_library supports specifying an explicit name via crate_name property, but doing this is discouraged. And it is discouraged because the crate name has to be globally unique. crates.io guarantees uniqueness of its crate names so cargo_crate GN targets (generated by the gnrt tool covered in a later section) use short crate names.

Testing exercise

Time for another exercise!

In your Chromium build:

Add a testable function next to hello_from_rust. Some suggestions: adding two integers received as arguments, computing the nth Fibonacci number, summing integers in a slice, etc.
Add a separate ..._unittest.rs file with a test for the new function.
Add the new tests to BUILD.gn.
Build the tests, run them, and verify that the new test works.

与 C++ 的互操作性

Rust 社区提供了多种 C++/Rust 互操作选项，并且一直在不断开发新工具。目前，Chromium 使用一种名为 CXX 的工具。

您可以使用接口定义语言（与 Rust 极为相似）描述整个语言边界，然后 CXX 工具会据此生成 Rust 和 C++ 函数及类型的声明。

Overview diagram of cxx, showing that the same interface definition is used to create both C++ and Rust side code which then communicate via a lowest common denominator C API

See the CXX tutorial for a full example of using this.

Speaker Notes

请仔细研究这个图表。解释背后的原理和您之前所做的完全相同。说明自动执行这一流程具有以下好处：

使用该工具可保证 C++ 端和 Rust 端相匹配（例如，如果 #[cxx::bridge] 与实际的 C++ 或 Rust 定义不匹配，则会出现编译错误。但使用不同步的手动绑定，可能会导致未定义行为）
该工具还可自动生成 FFI thunk（即小型但可兼容 C-ABI 的自由函数），以便适应非 C 语言特性（例如，启用对 Rust 或 C++ 方法的 FFI 调用；而手动实现绑定一般需要自行编写这种顶级的自由函数）
该工具和库可以处理一系列核心类型，例如：
- &[T] 可以跨 FFI 边界进行传递，即使它无法保证任何特定的 ABI 或内存布局一致无误。进行手动绑定时，必须手动解构 std::span<T> / &[T]，并根据指针和长度进行重新构建，但这很容易出错，因为每种语言对于空 slice 的表示方式略有不同
- 系统对 std::unique_ptr<T>、std::shared_ptr<T> 和/或 Box 等智能指针提供原生支持。如果使用手动绑定，则必须传递可兼容 C-ABI 的原始指针，这会增加生命周期和内存安全风险。
- rust::String 和 CxxString 类型能够识别并处理不同语言之间在字符串表示方面的差异（例如，rust::String::lossy 可以通过非 UTF8 输入构建 Rust 字符串；rust::String::c_str 可以为字符串加上 NUL 终止符）。

绑定示例

CXX requires that the whole C++/Rust boundary is declared in cxx::bridge modules inside .rs source code.


#[cxx::bridge]
mod ffi {
    extern "Rust" {
        type MultiBuf;

        fn next_chunk(buf: &mut MultiBuf) -> &[u8];
    }

    unsafe extern "C++" {
        include!("example/include/blobstore.h");

        type BlobstoreClient;

        fn new_blobstore_client() -> UniquePtr<BlobstoreClient>;
        fn put(self: &BlobstoreClient, buf: &mut MultiBuf) -> Result<u64>;
    }
}

// Definitions of Rust types and functions go here

Speaker Notes

指出：

Although this looks like a regular Rust mod, the #[cxx::bridge] procedural macro does complex things to it. The generated code is quite a bit more sophisticated - though this does still result in a mod called ffi in your code.
Native support for C++’s std::unique_ptr in Rust
Native support for Rust slices in C++
从 C++ 调用 Rust，并使用 Rust 类型（顶部位置）
从 Rust 调用 C++，并使用 C++ 类型（底部位置）

常见误解：这看似 Rust 在解析 C++ 头文件，其实具有误导性。Rust 不会对此头文件进行解释，只是在生成的 C++ 代码中添加 #include，以便于 C++ 编译器使用。

CXX 的局限性

By far the most useful page when using CXX is the type reference.

CXX 基本适用于以下情况：

您的 Rust-C++ 接口非常简单，可以声明其中的全部内容。
目前，您只使用了 CXX 提供原生支持的类型，例如 std::unique_ptr、std::string、&[u8] 等。

这样具有很多局限性，例如缺少对 Rust 的 Option 类型的支持。

由于这些局限，我们在 Chromium 中只能将 Rust 用于隔离紧密的 “叶节点”，而无法用于任意的 Rust-C++ 互操作。当您打算在 Chromium 中探索 Rust 的应用场景时，推荐先从拟定针对语言边界的 CXX 绑定入手，以查看该语言边界是否足够简单明了。

Speaker Notes

In addition, right now, Rust code in one component cannot depend on Rust code in another, due to linking details in our component build. That's another reason to restrict Rust to use in leaf nodes.

此外，还应讨论使用 CXX 时的其他一些困难之处，例如：

其根据 C++ 异常来进行错误处理（下一张幻灯片中将加以介绍）
函数指针使用起来很不方便。

CXX 错误处理

CXX’s support for Result<T,E> relies on C++ exceptions, so we can’t use that in Chromium. Alternatives:

The T part of Result<T, E> can be:
- Returned via out parameters (e.g. via &mut T). This requires that T can be passed across the FFI boundary - for example T has to be:
  - A primitive type (like u32 or usize)
  - A type natively supported by cxx (like UniquePtr<T>) that has a suitable default value to use in a failure case (unlike Box<T>).
- Retained on the Rust side, and exposed via reference. This may be needed when T is a Rust type, which cannot be passed across the FFI boundary, and cannot be stored in UniquePtr<T>.
The E part of Result<T, E> can be:
- Returned as a boolean (e.g. true representing success, and false representing failure)
- Preserving error details is in theory possible, but so far hasn’t been needed in practice.

CXX Error Handling: QR Example

The QR code generator is an example where a boolean is used to communicate success vs failure, and where the successful result can be passed across the FFI boundary:


#[cxx::bridge(namespace = "qr_code_generator")]
mod ffi {
    extern "Rust" {
        fn generate_qr_code_using_rust(
            data: &[u8],
            min_version: i16,
            out_pixels: Pin<&mut CxxVector<u8>>,
            out_qr_size: &mut usize,
        ) -> bool;
    }
}

Speaker Notes

Students may be curious about the semantics of the out_qr_size output. This is not the size of the vector, but the size of the QR code (and admittedly it is a bit redundant - this is the square root of the size of the vector).

It may be worth pointing out the importance of initializing out_qr_size before calling into the Rust function. Creation of a Rust reference that points to uninitialized memory results in Undefined Behavior (unlike in C++, when only the act of dereferencing such memory results in UB).

If students ask about Pin, then explain why CXX needs it for mutable references to C++ data: the answer is that C++ data can’t be moved around like Rust data, because it may contain self-referential pointers.

CXX Error Handling: PNG Example

A prototype of a PNG decoder illustrates what can be done when the successful result cannot be passed across the FFI boundary:


#[cxx::bridge(namespace = "gfx::rust_bindings")]
mod ffi {
    extern "Rust" {
        /// This returns an FFI-friendly equivalent of `Result<PngReader<'a>,
        /// ()>`.
        fn new_png_reader<'a>(input: &'a [u8]) -> Box<ResultOfPngReader<'a>>;

        /// C++ bindings for the `crate::png::ResultOfPngReader` type.
        type ResultOfPngReader<'a>;
        fn is_err(self: &ResultOfPngReader) -> bool;
        fn unwrap_as_mut<'a, 'b>(
            self: &'b mut ResultOfPngReader<'a>,
        ) -> &'b mut PngReader<'a>;

        /// C++ bindings for the `crate::png::PngReader` type.
        type PngReader<'a>;
        fn height(self: &PngReader) -> u32;
        fn width(self: &PngReader) -> u32;
        fn read_rgba8(self: &mut PngReader, output: &mut [u8]) -> bool;
    }
}

Speaker Notes

PngReader and ResultOfPngReader are Rust types — objects of these types cannot cross the FFI boundary without indirection of a Box<T>. We can’t have an out_parameter: &mut PngReader, because CXX doesn’t allow C++ to store Rust objects by value.

This example illustrates that even though CXX doesn’t support arbitrary generics nor templates, we can still pass them across the FFI boundary by manually specializing / monomorphizing them into a non-generic type. In the example ResultOfPngReader is a non-generic type that forwards into appropriate methods of Result<T, E> (e.g. into is_err, unwrap, and/or as_mut).

Using cxx in Chromium

在 Chromium 中，针对每个需要使用 Rust 的叶节点，我们定义独立的 #[cxx::bridge] mod。通常，每个 rust_static_library 都有对应的定义。只需将


cxx_bindings = [ "my_rust_file.rs" ]
   # list of files containing #[cxx::bridge], not all source files
allow_unsafe = true

添加到您现有的 rust_static_library 以及 crate_root 和 sources 的目标中。

C++ 头文件会在合理的位置生成，因此您只需


#include "ui/base/my_rust_file.rs.h"

您会发现，//base 中提供了一些实用函数，可将 Chromium C++ 类型与 CXX Rust 类型相互转换，例如 SpanToRustSlice。

Speaker Notes

学生可能会问：为什么我们仍然需要 allow_unsafe = true？

总的来说，按照常规 Rust 标准，没有任何 C/C++ 代码是 “安全”的。在 Rust 中来回调用 C/C++ 可能会对内存执行任意操作，并危及 Rust 自身数据布局的安全性。如果 C/C++ 互操作性中出现过多的 unsafe 关键字，可能会损害此类关键字的信噪比，并且存在争议。但严格地说，将任何外部代码引入 Rust 二进制文件可能会导致 Rust 中出现意外行为。

The narrow answer lies in the diagram at the top of this page — behind the scenes, CXX generates Rust unsafe and extern "C" functions just like we did manually in the previous section.

Exercise: Interoperability with C++

第一部分

在您之前创建的 Rust 文件中，添加 #[cxx::bridge] 来指定一个将从 C++ 调用的函数（名为 hello_from_rust），该函数不接受任何参数也不返回任何值。
修改之前的 hello_from_rust 函数，移除 extern "C" 和 #[no_mangle]。现在，这只是一个标准的 Rust 函数。
请修改 gn 目标以构建这些绑定。
在 C++ 代码中，移除 hello_from_rust 的正向声明，然后添加生成的头文件。
Build and run!

第二部分

建议尝试操作一下 CXX。这有助于您更好地理解 Rust 在Chromium 中的灵活性。

Some things to try:

从 Rust 回调到 C++。您需要执行以下操作：
- 创建一个附加头文件，且您可以从 cxx::bridge 对其进行 include! 操作。您需要在这个新的头文件中声明要调用的 C++ 函数。
- 创建一个 unsafe 代码块，用于调用此类函数，也可以在 #[cxx::bridge] 中指定 unsafe 关键字，如此处所述。
- 您可能还需要添加 #include "third_party/rust/cxx/v1/crate/include/cxx.h"
将 C++ 字符串从 C++ 传递到 Rust。
将对 C++ 对象的引用传递到 Rust。
刻意让 Rust 函数签名与 #[cxx::bridge] 不匹配，并逐渐熟悉所看到的错误信息。
刻意让 C++ 函数签名与 #[cxx::bridge] 不匹配，并适应您看到的错误。
将某个类型的 std::unique_ptr 从 C++ 传递到 Rust，以便 Rust 拥有某个 C++ 对象的所有权。
创建一个 Rust 对象并将其传递到 C++，以便 C++ 拥有它的所有权。（提示：您需要使用 Box）。
声明调用某个 C++ 类型的方法。从 Rust 调用它们。
声明调用某个 Rust 类型的方法。从 C++ 调用它们。

第三部分

现在，您已经了解了 CXX 互操作性的优势和局限，请思考几个 Rust 在 Chromium 中的应用场景，其中接口要足够简单构思该如何定义该接口。

如何获取帮助

Speaker Notes

As students explore Part Two, they're bound to have lots of questions about how to achieve these things, and also how CXX works behind the scenes.

您可能会遇到以下问题：

当我用类型 Y 初始化类型 X 的变量时，出现了初始化问题，其中 X 和 Y 都是函数类型。这是因为您的 C++ 函数实现与 cxx::bridge 中的声明并不完全一致。
我好像能随意将 C++ 引用转换为 Rust 引用。这样不会导致 UB 风险吗？对于 CXX 的_不透明_类型，答案为否，因为它们的大小为零。对于 CXX 的基本类型，确实_有可能_导致 UB，但鉴于 CXX 的设计策略，要构建能导致这种情况的示例颇为困难。

添加第三方 Crate

Rust 库被称为 crate，可在 crates.io 中找到。Rust 的 crate 之间非常容易相互依赖。事实证明，他们确实如此！

属性	C++ library	Rust crate
Build system	很多	保持一致：`Cargo.toml`
典型库的大小	大	小
传递依赖项	很少	很多

对于 Chromium 工程师来说，这种依赖关系具有以下利弊：

所有 crate 都使用共同的构建系统，这样我们就可以自动将其收录到 Chromium 中…
…但是，crate 通常具有传递依赖项，因此可能需要引入多个库。

我们将讨论以下内容：

如何将 crate 添加到 Chromium 源代码树中
如何为其制定 gn 构建规则
如何审核其源代码以确保足够的安全性。

Speaker Notes

All of the things in the table on this slide are generalizations, and counter-examples can be found. But in general it's important for students to understand that most Rust code depends on other Rust libraries, because it's easy to do so, and that this has both benefits and costs.

配置 `Cargo.tom` 文件以添加 crate

Chromium 具有一组集中管理的直接 crate 依赖项。这些依赖项通过单独的 Cargo.toml 文件进行管理：


[dependencies]
bitflags = "1"
cfg-if = "1"
cxx = "1"
# lots more...

与任何其他 Cargo.toml 一样，您可以指定有关依赖项的更多详细信息。最常见的是，您需要指定要在 crate 中启用的 features。

向 Chromium 中添加 crate 时，通常需要在附加文件 gnrt_config.toml 中提供一些额外的信息，我们将在下文中加以介绍。

配置 `gnrt_config.toml`

与 Cargo.toml 一起使用的是 gnrt_config.toml。此文件包含 Chromium 专用扩展程序，可用于处理 crate。

如果添加新的 crate，至少要明确指定 group。可以为以下选项之一：


#   'safe': The library satisfies the rule-of-2 and can be used in any process.
#   'sandbox': The library does not satisfy the rule-of-2 and must be used in
#              a sandboxed process such as the renderer or a utility process.
#   'test': The library is only used in tests.

例如：


[crate.my-new-crate]
group = 'test' # only used in test code

根据 crate 源代码布局，您可能还需要使用此文件指定其 LICENSE 文件的所在位置。

稍后，我们将介绍需要在此文件中配置的其他内容，以便能够解决问题。

下载 Crate

有一款名为 gnrt 的工具，具有下载 crate 以及生成 BUILD.gn 规则的功能。

首先，按如下所示下载所需的 crate：


cd chromium/src
vpython3 tools/crates/run_gnrt.py -- vendor

虽然 gnrt 工具是 Chromium 源代码的一部分，但通过运行此命令，您可以从 crates.io 下载并运行其依赖项。有关该安全决策的讨论，请参阅前面的部分。

运行此 vendor 命令可能会下载以下内容：

Your crate
直接依赖项和传递依赖项
cargo 要求的其他 crate 的新版本，用于解析 Chromium 所需的全部 crate。

Chromium 会修复一些 crate 的补丁，并将其保存在 //third_party/rust/chromium_crates_io/patches 中。系统会自动重新应用这些补丁，但如果补丁应用失败，您可能需要进行手动操作。

生成 `gn` 构建规则

下载 crate 后，按如下方式生成 BUILD.gn 文件：


vpython3 tools/crates/run_gnrt.py -- gen

现在，运行 git status。您应该会看到：

third_party/rust/chromium_crates_io/vendor 中至少包含一个新的 crate 源代码
third_party/rust/<crate name>/v<major semver version> 中至少包含一个新的 BUILD.gn
相应的 README.chromium

The “major semver version” is a Rust “semver” version number.

请仔细观察，尤其是 third_party/rust 中生成的内容。

Speaker Notes

浅谈下 semver，特别是在 Chromium 中，它支持使用多个不兼容的 crate 版本。虽然在 Cargo 生态系统中不鼓励这种方式，但在某些情况下却是必要的。

解决问题

如果构建失败，可能是 build.rs 文件所致：这些程序在构建过程中执行了任意操作。这与 gn 和 ninja 的设计完全不相符，它们旨在实现静态、确定性的构建规则，以最大限度地提高构建的并行性和可重复性。

系统支持自动进行某些 build.rs 操作；而有些需要进行额外的处理：

构建脚本效果	我们的 gn 模板均支持	您需要完成的工作
检查 rustc 版本以配置启用和停用功能	是	无
检查平台或 CPU 以配置启用和停用功能	是	无
Generating code	是	是的，在 `gnrt_config.toml` 中指定
构建 C/C++	否	进行补丁修复
Arbitrary other actions	否	进行补丁修复

幸运的是，大多数 crate 不包含构建脚本，而且大多数的构建脚本只执行前两项操作。

构建用于生成代码的脚本

如果 ninja 提示有文件缺失，请检查 build.rs，确认其是否写入了源代码文件。

如果是，请修改 gnrt_config.toml，将 build-script-outputs 添加到 crate。如果这是一个传递依赖项（即 Chromium 代码不应直接依赖的依赖项），还要添加 allow-first-party-usage=false。该文件中已经提供了若干示例：


[crate.unicode-linebreak]
allow-first-party-usage = false
build-script-outputs = ["tables.rs"]

现在，请重新运行 gnrt.py -- gen，重新生成 BUILD.gn 文件，以通知 ninja 此特定输出文件将被用作后续构建步骤的输入。

构建用于构建 C++ 或执行任意操作的脚本

有些 crate 使用 cc crate 来构建和关联 C/C++ 库。其他 crate 会在其构建脚本中使用 bindgen 解析 C/C++。Chromium 环境中不支持进行这些操作，因为我们的 gn、ninja 和 LLVM 构建系统在表达构建操作之间的关系方面具有非常严格具体的要求。

因此，您可以选择：

避开这些 crate
对 crate 应用补丁。

补丁应保存在 third_party/rust/chromium_crates_io/patches/<crate> 中，请参阅面向 cxx crate 的补丁中的示例。每当 gnrt 升级该 crate 时，将会自动应用补丁文件。

依赖于 Crate

添加第三方 crate 并生成构建规则后，就可轻松使用该 crate。请找到 rust_static_library 目标，并在 crate 中的 :lib 目标上添加 dep。

Specifically,

例如：


rust_static_library("my_rust_lib") {
  crate_root = "lib.rs"
  sources = [ "lib.rs" ]
  deps = [ "//third_party/rust/example_rust_crate/v1:lib" ]
}

Auditing Third Party Crates

添加新库须遵守 Chromium 的标准政策，当然也要遵守安全审核标准。您可能不仅要引入单个 crate，而且还要引入传递依赖项，因此可能需要审核大量代码。另一方面，安全的 Rust 代码可能产生的副作用有限。应如何对其进行审核？

随着时间的推移，Chromium 计划逐步采用以 cargo vet 为核心的处理流程。

与此同时，每当添加新的 crate 时，我们都会检查以下内容：

了解每个 crate 的用途。crate 之间存在什么关系？如果每个 crate 的构建系统都包含 build.rs 或过程宏，请确定它们的用途。它们能否与 Chromium 的正常构建方式相兼容。
检查每个 crate 是否得到合理维护。
使用 cd third-party/rust/chromium_crates_io; cargo review 检查已知漏洞（首先需要运行 cargo install cargo-audit，令人意外的是，这个过程中需要从互联网下载大量的依赖项 2）
确保所有 unsafe 代码都符合两大规则的要求
检查是否使用了 fs 或 net API
尽可能地仔细阅读所有代码，查找任何可能属于恶意插入，稍显异常的地方。（但现实中，您不可能做到百无遗漏，因为代码量通常太庞大了。）

以下只是一些指导建议，请与 security@chromium.org 的审核者合租，共同找出能够确保 crate 安全的正确方法。

Checking Crates into Chromium Source Code

git status 应显示以下内容：

//third_party/rust/chromium_crates_io 中的 crate 代码
//third_party/rust/<crate>/<version> 中的元数据（BUILD.gn 和 README.chromium）

此外，请在后面的位置添加 OWNERS 文件。

您应将所有这些内容，以及对 Cargo.toml 和 gnrt_config.toml 的更改一起提交到 Chromium 仓库中。

重要提示：您需要使用 git add -f 命令，否则 .gitignore 文件可能会导致某些文件被跳过。

在此过程中，您可能会发现由于使用了非包容性语言，导致提交前检查失败。这是因为 Rust crate 数据往往包含 git 分支的名称，而许多项目仍然在使用非包容性术语。因此，您需要运行以下命令：


infra/update_inclusive_language_presubmit_exempt_dirs.sh > infra/inclusive_language_presubmit_exempt_dirs.txt
git add -p infra/inclusive_language_presubmit_exempt_dirs.txt # add whatever changes are yours

及时更新 Crate

作为任何第三方 Chromium 依赖项的所有者，您应使用任何安全修复程序，确保该依赖项处于最新状态。我们希望能够尽快实现对 Rust crate 自动执行此操作，但目前仍由您负责执行，就像对待任何其他第三方依赖项一样。

练习

将 uwuify 添加到 Chromium，以停用 crate 的默认功能。假设该 crate 会交付 Chromium 时被使用，但不会用于处理不可信的输入内容。

（在下一个练习中，我们将使用 Chromium 中的 uwuify；但您也可以跳过这一步，现在就开始此操作。或者，您可以创建一个使用 uwuify 的新 rust_executable 目标。

Speaker Notes

Students will need to download lots of transitive dependencies.

The total crates needed are:

instant,
lock_api,
parking_lot,
parking_lot_core,
redox_syscall,
scopeguard,
smallvec, and
uwuify.

If students are downloading even more than that, they probably forgot to turn off the default features.

Thanks to Daniel Liu for this crate!

Bringing It Together — Exercise

在本练习中，您将运用之前所学的全部知识，添加一项全新的 Chromium 功能。

The Brief from Product Management

在偏僻的热带雨林中，发现住着一群小精灵。我们务必尽快创建一款精灵版 Chromium，交付给他们。

要求是将 Chromium 的所有界面字符串翻译为精灵语。

由于时间紧迫，无法等待准确的翻译。但幸运的是，精灵语与英语非常接近，然后我们发现有一个 Rust crate 可以执行此翻译任务。

事实上，您已经在上一个练习中导入了该 crate。

（显然，对 Chrome 进行准确恰当的翻译，需要做到极度细致和全力以赴。请勿交付此产品！）

步骤

修改 ResourceBundle::MaybeMangleLocalizedString，在显示之前对所有字符串进行 uwu 处理。在这个特殊的 Chromium 版本中，无论 mangle_localized_strings_ 的设置如何，都应该始终执行此操作。

如果您正确完成了这些练习中的所有操作，那么恭喜您已经成功创建了一款精灵版 Chrome！

Chromium UI screenshot with uwu language

Speaker Notes

Students will likely need some hints here. Hints include:

UTF16 与 UTF8。学生应注意，Rust 字符串始终采用 UTF8 编码，它们可能会决定在 C++ 端使用 base::UTF16ToUTF8 进行转换，然后再进行反向转换。
如果学生决定在 Rust 端进行转换，则需要考虑使用 String::from_utf16 方法，同时注意错误处理，并确定哪些 CXX 支持的类型可以传输大量 u16s。
学生可以通过多种方式设计 C++/Rust 边界，例如按值传递和返回字符串，或对字符串采取可变引用。如果使用可变引用，CXX 可能会提示学生需要使用 Pin 方法。您可能需要解释“Pin”的用途，以及为何 CXX 需要它来对 C++ 数据进行可变引用：原因是 C++ 数据无法像 Rust 数据那样自由移动，它可能包含自引用指针。
对于包含 ResourceBundle::MaybeMangleLocalizedString 的 C++ 目标，需要依赖于 rust_static_library 目标。学生可能已经执行了此操作。
rust_static_library 目标需要依赖于 //third_party/rust/uwuify/v0_2:lib。

练习解答

Solutions to the Chromium exercises can be found in this series of CLs.

Welcome to Bare Metal Rust

这是单独为裸机 Rust 开设的课程，为期一天，主要面向熟悉 Rust 基础知识的人员（例如已学完 Comprehensive Rust），最好也有一些使用其他语言（例如 C）进行裸机编程的经验。

今天，我们将探讨 bare-metal Rust：即在没有操作系统支持的情况下运行 Rust 代码。该部分主要分为以下内容：

什么是 no_std Rust？
编写微控制器固件。
为应用处理器编写引导加载程序 / 内核代码。
有助于裸机 Rust 开发的一些实用 crate。

在本课程的微控制器部分，我们将使用 BBC micro:bit v2 作为示例。这是一款基于 Nordic nRF51822 微控制器的开发板，配有一些 LED 和按钮、连接 I2C 的加速度计和罗盘，以及板载 SWD 调试程序。

首先，请安装我们稍后需要用到的一些工具。在 gLinux 或 Debian 上：


sudo apt install gcc-aarch64-linux-gnu gdb-multiarch libudev-dev picocom pkg-config qemu-system-arm
rustup update
rustup target add aarch64-unknown-none thumbv7em-none-eabihf
rustup component add llvm-tools-preview
cargo install cargo-binutils cargo-embed

然后，向 plugdev 组中的用户授予 micro:bit 编程器的访问权限：


echo 'SUBSYSTEM=="usb", ATTR{idVendor}=="0d28", MODE="0664", GROUP="plugdev"' |\
  sudo tee /etc/udev/rules.d/50-microbit.rules
sudo udevadm control --reload-rules

在 MacOS 上：


xcode-select --install
brew install gdb picocom qemu
brew install --cask gcc-aarch64-embedded
rustup update
rustup target add aarch64-unknown-none thumbv7em-none-eabihf
rustup component add llvm-tools-preview
cargo install cargo-binutils cargo-embed

`no_std`

`core`	`alloc`	`std`
Slice、`&str`、`CStr` `NonZeroU8`… `Option`、`Result` `Display`、`Debug`、`write!`… `Iterator` `panic!`、`assert_eq!`… `NonNull` 和所有常见的指针相关函数 `Future` 和 `async`/`await` `fence`、`AtomicBool`、`AtomicPtr`、`AtomicU32`… `Duration`	`Box`、`Cow`、`Arc`、`Rc` `Vec`、`BinaryHeap`、`BtreeMap`、`LinkedList`、`VecDeque` `String`、`CString`、`format!`	`Error` `HashMap` `Mutex`、`Condvar`、`Barrier`、`Once`、`RwLock`、`mpsc` `File` 和 `fs` 的其余部分 `println!`、`Read`、`Write`、`Stdin`、`Stdout` 以及 `io` 的其余部分 `Path`, `OsString` `net` `Command`、`Child`、`ExitCode` `spawn`、`sleep` 和 `thread` 的其余部分 `SystemTime`、`Instant`

Speaker Notes

HashMap 依赖于 RNG。
std 会重新导出 core 和 alloc 的内容。

极小的 `no_std` 程序


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

这将编译为空二进制文件。
std 提供了一个 panic 处理程序；如果没有它，我们就必须自行提供。
其他 crate（例如 panic-halt）也可以提供该处理程序。
根据目标不同，可能需要使用 panic = "abort" 进行编译，以避免出现与 eh_personality 相关的错误。
请注意，未提供 main 函数或任何其他入口点；您可以自行定义入口点。通常需要使用链接器脚本和一些汇编代码进行设置工作，以便 Rust 代码能够顺利运行。

`alloc`

如需使用 alloc，您必须实现全局（堆）分配器。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

buddy_system_allocator 是第三方 crate，用于实现基本伙伴系统分配器。还可以使用其他 crate，或者自行编写 crate，或者接入现有分配器。
LockedHeap 的常量参数是分配器的最大阶数；即在本例中，它可以最多分配 2**32 字节大小的区域。
如果依赖项树中的所有 crate 都依赖于 alloc，则您必须在二进制文件中明确定义一个全局分配器。通常，在顶级二进制 crate 中完成此操作。
为了确保能够成功关联 panic_halt crate，以便我们获取其 panic 处理程序，必须使用 extern crate panic_halt as _ 方法。
我们可以构建该示例，但由于没有入口点，无法运行。

微控制器

cortex_m_rt crate 提供针对 Cortex M 微控制器的重置处理程序（以及其他内容）。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

接下来，我们看看随着抽象层级的不断提升，该如何访问外围设备。

Speaker Notes

cortex_m_rt::entry 宏要求函数的类型为 fn() -> !，因为返回重置处理程序会毫无意义。
使用 cargo embed --bin minimum 运行该示例

原始 MMIO

大多数微控制器通过内存映射 IO 访问外围设备。现在试着开启 micro:bit 上的 LED 指示灯：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

将 GPIO 0 的引脚 21 连接到 LED 矩阵的第一列，将引脚 28 连接到第一行。

使用以下命令运行该示例：


cargo embed --bin mmio

外围设备访问 crate

svd2rust 使用 CMSIS-SVD 文件为内存映射外围设备生成了大部分安全的 Rust 封装容器。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

SVD（系统视图描述）文件通常是由芯片供应商提供的 XML 文件，用于描述设备的内存映射。
- 这些文件按照外围设备、寄存器、字段和值进行组织分类，其中包含名称、内容描述、地址等信息。
- 由于 SVD 文件常常存在错误和不完整的情况，因此有许多项目会修复这些错误，补充缺失的相关信息，并发布生成的 crate。
cortex-m-rt 提供矢量表以及其他功能。
如果您使用 cargo install cargo-binutils，则可以运行 cargo objdump --bin pac -- -d --no-show-raw-insn，查看生成的二进制文件。

使用以下命令运行该示例：


cargo embed --bin pac

HAL crates

许多微控制器的 HAL crate 为各种外围设备提供了封装容器。通常，这些封装容器可以实现 embedded-hal 中的各种 trait。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

set_low 和 set_high 是 embedded_hal OutputPin trait 上的方法。
HAL crate 被广泛用于许多 Cortex-M 和 RISC-V 设备，包括各种 STM32、GD32、nRF、NXP、MSP430、AVR 和 PIC 微控制器。

使用以下命令运行该示例：


cargo embed --bin hal

Board support crates

为了方便使用，板级支持 crate 为特定开发板提供了更高级别的封装功能。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

在本例中，该板级支持 crate 仅提供了一些实用的名称和初始化功能。
除微控制器以外，该 crate 还包含一些可用于板载设备的驱动程序。
- microbit-v2 包含一个可用于 LED 矩阵的简单驱动程序。

使用以下命令运行该示例：


cargo embed --bin board_support

类型状态模式


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

引脚无法实现 Copy 或 Clone，因此，每个引脚只能存在一个实例。一旦某个引脚从端口结构体移出，任何人都无法再使用它。
更改引脚的配置会使旧的引脚实例失效，因此您无法再继续使用旧实例。
值的类型表示其所处状态：例如，在本例中，表示 GPIO 引脚的配置状态。通过这种方式，可将状态机编码成类型系统，并确保在未正确完成引脚配置之前，不会尝试以某种方式使用引脚。编译时捕获了非法状态转换。
您可以在输入引脚上调用 is_high，在输出引脚上调用 set_high，反之则不行。
许多 HAL crate 都遵循此模式。

`embedded-hal`

embedded-hal crate 提供许多适用于常见微控制器外围设备的 trait。

GPIO
ADC
I2C、SPI、UART、CAN
RNG
定时器
监控定时器

然后，其他 crate 可以根据这些 trait 实现驱动程序，例如加速度计驱动程序可能需要通过 I2C 或 SPI 总线实现。

Speaker Notes

还有适用于许多微控制器以及其他平台（例如 Raspberry Pi 上的 Linux）的实现。
我们正在开发 async 版本的 embedded-hal，但目前还欠缺稳定性。

`probe-rs` 和 `cargo-embed`

probe-rs 是一个方便的嵌入式调试工具集，跟 OpenOCD 较为相似，但集成度更高。

通过 CMSIS-DAP、ST-Link 和 J-Link 探针实现 SWD（串行调试）和 JTAG
GDB 桩和 Microsoft DAP（调试适配器协议）服务器
Cargo integration

cargo-embed 是一个 cargo 子命令，用于构建和刷写二进制文件、记录 RTT（实时传输）输出以及连接 GDB。您可通过项目目录中的 Embed.toml 文件对其进行配置。

Speaker Notes

CMSIS-DAP 是一项基于 USB 的 Arm 标准协议，使得电路内调试程序能够接入各种 Arm Cortex 处理器的 CoreSight 调试访问端口。BBC micro:bit 的板载调试程序所使用的便是此协议。
ST-Link 属于 ST Microelectronics 产品系列，J-Link 是 SEGGER 系列。
调试访问端口通常为 5 针 JTAG 接口或 2 针串行线调试接口。
probe-rs 是一个库，如有需要，可以将其集成到您的工具中。
借助 Microsoft 调试适配器协议，VSCode 和其他 IDE 可以调试任何受支持的微控制器上运行的代码。
cargo-embed 是使用 probe-rs 库构建的二进制文件。
RTT（实时传输）是一种通过多个环形缓冲区，在调试主机和目标之间进行数据传输的机制。

调试

Embed.toml:


[default.general]
chip = "nrf52833_xxAA"

[debug.gdb]
enabled = true

在 src/bare-metal/microcontrollers/examples/ 目录下某个终端中：


cargo embed --bin board_support debug

在同一目录下的另一个终端中：

在 gLinux 或 Debian 上：


gdb-multiarch target/thumbv7em-none-eabihf/debug/board_support --eval-command="target remote :1337"

在 MacOS 上：


arm-none-eabi-gdb target/thumbv7em-none-eabihf/debug/board_support --eval-command="target remote :1337"

Speaker Notes

在 GDB 中，请尝试运行以下命令：


b src/bin/board_support.rs:29
b src/bin/board_support.rs:30
b src/bin/board_support.rs:32
c
c
c

Other projects

RTIC
- “Real-Time Interrupt-driven Concurrency”
- 共享资源管理、消息传递、任务调度、定时器队列
Embassy
- 具有优先级、计时器、网络和 USB 功能的 async 执行器
TockOS
- 以安全为中心的 RTOS，具有抢占式调度功能和提供内存保护单元支持
Hubris
- Oxide Computer Company 开发的微内核 RTOS，提供内存保护、非特权驱动程序和 IPC 功能
FreeRTOS 的绑定
有些平台可以实现 std，例如 esp-idf。

Speaker Notes

RTIC 可被视为 RTOS 或并发框架，
- 但不包含任何 HAL。
- 它使用 Cortex-M NVIC（嵌套虚拟中断控制器）进行调度，而不是选用适合的内核。
- 仅限 Cortex-M。
Google 在 Haven 微控制器上使用 TockOS 作为 Titan 安全密钥的操作系统。
FreeRTOS 主要使用 C 语言编写，但也提供了专用于编写应用的 Rust 绑定。

习题

我们将从 I2C 罗盘读取方向，并将读数记录到串行端口。

Speaker Notes

After looking at the exercises, you can look at the solutions provided.

罗盘

我们将从 I2C 罗盘读取方向，并将读数记录到串行端口。如有时间，请尝试通过 LED 灯亮起的方式，或者使用按钮来显示方向。

提示：

请参阅 lsm303agr 和 microbit-v2 crate，以及 micro:bit 硬件相关文档。
LSM303AGR 惯性测量装置与内部 I2C 总线相连接。
TWI 是 I2C 的别称，因此 I2C 主外围设备称为 TWIM。
LSM303AGR 驱动程序需要使用某种方法来实现 embedded_hal::blocking::i2c::WriteRead trait。microbit::hal::Twim 结构体便可以做到这一点。
您拥有一个 microbit::Board 结构体，其中包含各种引脚和外围设备的字段。
如有需要，您还可以查看 nRF52833 数据表，但对本练习来说这不是必需的。

下载练习模板并在 compass 目录中查找以下文件。

src/main.rs:


#![no_main]
#![no_std]

extern crate panic_halt as _;

use core::fmt::Write;
use cortex_m_rt::entry;
use microbit::{hal::uarte::{Baudrate, Parity, Uarte}, Board};

#[entry]
fn main() -> ! {
    let board = Board::take().unwrap();

    // Configure serial port.
    let mut serial = Uarte::new(
        board.UARTE0,
        board.uart.into(),
        Parity::EXCLUDED,
        Baudrate::BAUD115200,
    );

    // Use the system timer as a delay provider.
    let mut delay = Delay::new(board.SYST);

    // Set up the I2C controller and Inertial Measurement Unit.
    // TODO

    writeln!(serial, "Ready.").unwrap();

    loop {
        // Read compass data and log it to the serial port.
        // TODO
    }
}

Cargo.toml（无需对此进行更改）：


[workspace]

[package]
name = "compass"
version = "0.1.0"
edition = "2021"
publish = false

[dependencies]
cortex-m-rt = "0.7.3"
embedded-hal = "1.0.0"
lsm303agr = "0.3.0"
microbit-v2 = "0.13.0"
panic-halt = "0.2.0"

Embed.toml（无需对此进行更改）：


[default.general]
chip = "nrf52833_xxAA"

[debug.gdb]
enabled = true

[debug.reset]
halt_afterwards = true

.cargo/config.toml（无需对此进行更改）：


[build]
target = "thumbv7em-none-eabihf" # Cortex-M4F

[target.'cfg(all(target_arch = "arm", target_os = "none"))']
rustflags = ["-C", "link-arg=-Tlink.x"]

运行以下命令查看 Linux 上的串行输出：


picocom --baud 115200 --imap lfcrlf /dev/ttyACM0

或者在 Mac OS 上，如下所示（设备名称可能略有不同）：


picocom --baud 115200 --imap lfcrlf /dev/tty.usbmodem14502

使用 Ctrl+A Ctrl+Q 退出 picocom。

裸机 Rust 上午练习

罗盘

(返回练习)


#![no_main]
#![no_std]

extern crate panic_halt as _;

use core::fmt::Write;
use cortex_m_rt::entry;
use core::cmp::{max, min};
use lsm303agr::{
    AccelMode, AccelOutputDataRate, Lsm303agr, MagMode, MagOutputDataRate,
};
use microbit::display::blocking::Display;
use microbit::hal::prelude::*;
use microbit::hal::twim::Twim;
use microbit::hal::uarte::{Baudrate, Parity, Uarte};
use microbit::hal::{Delay, Timer};
use microbit::pac::twim0::frequency::FREQUENCY_A;
use microbit::Board;

const COMPASS_SCALE: i32 = 30000;
const ACCELEROMETER_SCALE: i32 = 700;

#[entry]
fn main() -> ! {
    let board = Board::take().unwrap();

    // Configure serial port.
    let mut serial = Uarte::new(
        board.UARTE0,
        board.uart.into(),
        Parity::EXCLUDED,
        Baudrate::BAUD115200,
    );

    // Use the system timer as a delay provider.
    let mut delay = Delay::new(board.SYST);

    // Set up the I2C controller and Inertial Measurement Unit.
    writeln!(serial, "Setting up IMU...").unwrap();
    let i2c = Twim::new(board.TWIM0, board.i2c_internal.into(), FREQUENCY_A::K100);
    let mut imu = Lsm303agr::new_with_i2c(i2c);
    imu.init().unwrap();
    imu.set_mag_mode_and_odr(
        &mut delay,
        MagMode::HighResolution,
        MagOutputDataRate::Hz50,
    )
    .unwrap();
    imu.set_accel_mode_and_odr(
        &mut delay,
        AccelMode::Normal,
        AccelOutputDataRate::Hz50,
    )
    .unwrap();
    let mut imu = imu.into_mag_continuous().ok().unwrap();

    // Set up display and timer.
    let mut timer = Timer::new(board.TIMER0);
    let mut display = Display::new(board.display_pins);

    let mut mode = Mode::Compass;
    let mut button_pressed = false;

    writeln!(serial, "Ready.").unwrap();

    loop {
        // Read compass data and log it to the serial port.
        while !(imu.mag_status().unwrap().xyz_new_data()
            && imu.accel_status().unwrap().xyz_new_data())
        {}
        let compass_reading = imu.magnetic_field().unwrap();
        let accelerometer_reading = imu.acceleration().unwrap();
        writeln!(
            serial,
            "{},{},{}\t{},{},{}",
            compass_reading.x_nt(),
            compass_reading.y_nt(),
            compass_reading.z_nt(),
            accelerometer_reading.x_mg(),
            accelerometer_reading.y_mg(),
            accelerometer_reading.z_mg(),
        )
        .unwrap();

        let mut image = [[0; 5]; 5];
        let (x, y) = match mode {
            Mode::Compass => (
                scale(-compass_reading.x_nt(), -COMPASS_SCALE, COMPASS_SCALE, 0, 4)
                    as usize,
                scale(compass_reading.y_nt(), -COMPASS_SCALE, COMPASS_SCALE, 0, 4)
                    as usize,
            ),
            Mode::Accelerometer => (
                scale(
                    accelerometer_reading.x_mg(),
                    -ACCELEROMETER_SCALE,
                    ACCELEROMETER_SCALE,
                    0,
                    4,
                ) as usize,
                scale(
                    -accelerometer_reading.y_mg(),
                    -ACCELEROMETER_SCALE,
                    ACCELEROMETER_SCALE,
                    0,
                    4,
                ) as usize,
            ),
        };
        image[y][x] = 255;
        display.show(&mut timer, image, 100);

        // If button A is pressed, switch to the next mode and briefly blink all LEDs
        // on.
        if board.buttons.button_a.is_low().unwrap() {
            if !button_pressed {
                mode = mode.next();
                display.show(&mut timer, [[255; 5]; 5], 200);
            }
            button_pressed = true;
        } else {
            button_pressed = false;
        }
    }
}

#[derive(Copy, Clone, Debug, Eq, PartialEq)]
enum Mode {
    Compass,
    Accelerometer,
}

impl Mode {
    fn next(self) -> Self {
        match self {
            Self::Compass => Self::Accelerometer,
            Self::Accelerometer => Self::Compass,
        }
    }
}

fn scale(value: i32, min_in: i32, max_in: i32, min_out: i32, max_out: i32) -> i32 {
    let range_in = max_in - min_in;
    let range_out = max_out - min_out;
    cap(min_out + range_out * (value - min_in) / range_in, min_out, max_out)
}

fn cap(value: i32, min_value: i32, max_value: i32) -> i32 {
    max(min_value, min(value, max_value))
}

应用处理器

到目前为止，我们已经讨论了微控制器，例如 Arm Cortex-M 系列。现在，尝试为 Cortex-A 编写一些代码。为简单起见，我们只使用 QEMU 的 aarch64 virt 开发板进行编写。

Speaker Notes

一般来说，微控制器不具备 MMU 或多级特权（例如，Arm CPU 中的异常级别，x86 中的铃声级别）的功能，而应用处理器则具备这些功能。
QEMU 支持针对每种架构模拟不同的机器或板级模型。“虚拟”开发板并不适用于任何特定的真实硬件，而是专为虚拟机设计的。

准备使用 Rust

在开始运行 Rust 代码之前，我们需要进行一些初始化设置。


.section .init.entry, "ax"
.global entry
entry:
    /*
     * Load and apply the memory management configuration, ready to enable MMU and
     * caches.
     */
    adrp x30, idmap
    msr ttbr0_el1, x30

    mov_i x30, .Lmairval
    msr mair_el1, x30

    mov_i x30, .Ltcrval
    /* Copy the supported PA range into TCR_EL1.IPS.*/
    mrs x29, id_aa64mmfr0_el1
    bfi x30, x29, #32, #4

    msr tcr_el1, x30

    mov_i x30, .Lsctlrval

    /*
     * Ensure everything before this point has completed, then invalidate any
     * potentially stale local TLB entries before they start being used.
     */
    isb
    tlbi vmalle1
    ic iallu
    dsb nsh
    isb

    /*
     * Configure sctlr_el1 to enable MMU and cache and don't proceed until this
     * has completed.
     */
    msr sctlr_el1, x30
    isb

    /* Disable trapping floating point access in EL1.*/
    mrs x30, cpacr_el1
    orr x30, x30, #(0x3 << 20)
    msr cpacr_el1, x30
    isb

    /* Zero out the bss section.*/
    adr_l x29, bss_begin
    adr_l x30, bss_end
0:  cmp x29, x30
    b.hs 1f
    stp xzr, xzr, [x29], #16
    b 0b

1:  /* Prepare the stack.*/
    adr_l x30, boot_stack_end
    mov sp, x30

    /* Set up exception vector.*/
    adr x30, vector_table_el1
    msr vbar_el1, x30

    /* Call into Rust code.*/
    bl main

    /* Loop forever waiting for interrupts.*/
2:  wfi
    b 2b

Speaker Notes

这与 C 语言的情况相同：初始化处理器状态，将 BSS 清零，然后设置堆栈指针。
- BSS（由于历史原因，称为代码块起始符）属于对象文件的一部分，其中包含静态分配的变量，这些变量被初始化为零。图像中省略了这些符号，以避免因存储零值而占用过多空间。编译器假定加载器会负责将它们清零。
BSS 可能已经被清零，具体取决于内存的初始化方式以及图像的加载方式，但为了确保起见，我们会将其手动清零。
我们需要先启用 MMU 和缓存功能，然后才能读取或写入任何内存。否则：
- 非对齐访问将会出错。我们为 aarch64-unknown-none 目标构建 Rust 代码，该目标会设置 +Strict-align 以防止编译器生成非对齐访问，因此在本例中应该没有问题，但一般情况下并不一定如此。
- 如果是在虚拟机中运行该命令，可能会导致缓存一致性问题。问题在于，虚拟机是在禁用缓存的情况下直接访问内存，而主机具有同一内存的缓存别名。即使主机并没有明确访问该内存，推测性访问仍然会导致缓存被填充，然后在清除缓存或虚拟机启用缓存时，任何一方对于该内存进行的更改就会丢失。（使用物理地址来键控缓存，而 VA 或 IPA。）
为简单起见，我们只使用硬编码的分页表（请参阅 dmap.S），其通过身份映射将前一个 1 GiB 的地址空间用于设备，紧接着的 1 GiB 用于 DRAM，然后在更高位置预留了 1 GiB 给其他设备。这与 QEMU 使用的内存布局一致。
我们还设置了异常矢量 (vbar_el1)，稍后会对此进行详细介绍。
今天下午的所有示例都假定我们将在异常级别 1 (EL1) 下运行。如果您需要在其他异常级别下运行，则需要修改相应的 entry.S。

内嵌汇编

有时，如果无法通过 Rust 代码实现某些操作，我们就需要使用汇编来解决。例如，如需发出 HVC（Hypervisor 调用）来指示固件关闭系统，请使用以下命令：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

（如果确实想要这样做，请使用 smccc crate，其中包含适用于所有这些函数的封装容器。）

Speaker Notes

PSCI 是 Arm 电源状态协调接口，为一组标准函数，用于管理系统和 CPU 电源状态等。在许多系统中，通过 EL3 固件和 Hypervisor 来实现该函数。
0 => _ 语法表示在运行内嵌汇编代码之前将寄存器初始化为 0，并在之后忽略寄存器中的内容。我们需要使用 inout 而非 in，因为该调用操作可能会破坏寄存器中的内容。
所用 main 函数必须是 #[no_mangle] 和 extern "C"，因为是从 entry.S 中的入口点调用该函数。
_x0–_x3 表示寄存器 x0-x3 的值，引导加载程序通常使用这些值来传递各种内容（例如将指针传递到设备树）。根据标准的 aarch64 调用规范（extern "C"指定使用此规范），需要使用寄存器 x0-x7 将前 8 个参数传递给函数，因此 entry.S 无需执行任何特殊操作，只要确保不会更改这些寄存器。
在 QEMU 中，使用 src/bare-metal/aps/examples 目录下的 make qemu_psci 运行该示例。

MMIO 的易失性内存访问

使用 pointer::read_volatile 和 pointer::write_volatile。
切勿提及引用。
借助 addr_of!，您无需创建中间引用即可获取结构体字段。

Speaker Notes

易失性访问：执行读取或写入操作可能会产生副作用，因此应阻止编译器或硬件对这些操作进行重新排序、复制或省略。
- 通常情况下，如果您先写入操作，紧接着进行读取操作（例如通过可变引用），则编译器可能会认为读取的值是最新写入的值，就不再执行实际的内存读取过程。
虽然在对硬件进行易失性访问时，一些 crate 确实会提及引用，但这很不安全。只要存在引用，编译器就会选择对其进行解引用操作。
使用 addr_of! 宏可以从结构体指针中获取结构体字段的指针。

编写 UART 驱动程序

QEMU “虚拟机”具有 PL011 UART，现在为其编写驱动程序。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

请注意，使用 Uart::new 方法不安全，而其他方法则安全。原因在于，只要 Uart::new 的调用方保证满足其安全要求（即所指定的 UART 只有一个驱动程序实例，且没有其他内容与其地址空间存在重叠），那么后续调用 write_byte 始终是安全的，因为我们假定需要满足的前提条件。
我们也可以采用相反的方式（即确保 new 安全，但 write_byte 不安全），不过这样会很不方便，因为每当调用 write_byte 时都需要推断是否安全。
这是安全地封装不安全代码时常见的策略：即在少数调用代码的地方进行安全验证，而不是在很多地方进行。

更好的 UART 驱动程序

事实上，PL011 具有很多额外的寄存器，但通过添加偏移量构建指针的方式来访问这些寄存器，既容易出错又难以读取。此外，其中有些寄存器是位字段，非常便于以结构化方式访问。

偏移	寄存器名称	宽度
0x00	DR	12
0x04	RSR	4
0x18	FR	9
0x20	ILPR	8
0x24	IBRD	16
0x28	FBRD	6
0x2c	LCR_H	8
0x30	CR	16
0x34	IFLS	6
0x38	IMSC	11
0x3c	RIS	11
0x40	MIS	11
0x44	ICR	11
0x48	DMACR	3

Speaker Notes

为简洁起见，我们省略了一些 ID 寄存器。

Bitflags

bitflags crate 非常适用于处理 bitflag。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

bitflags! 宏会创建类似于 Flags(u16) 的新类型，以及一系列用于获取和设置标记的方法实现。

多个寄存器

我们可以使用结构体来表示 UART 寄存器的内存布局。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

通过运行 #[repr(C)] 命令，指示编译器按顺序布置结构体字段，遵循与 C 语言相同的规则。这是确保结构体具有可预测布局的必要条件，因为默认的 Rust 表示法允许编译器（以及其他内容）按照其认为合适的方式重新排列字段。

驱动程序

现在将新的 Registers 结构体用于我们的驱动程序。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

请注意，只使用 addr_of! / addr_of_mut! 获取指向各个字段的指针，而不创建中间引用，这样很不安全。

开始使用

使用驱动程序编写一个小程序，将数据写入串行控制台，并回显传入的字节。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

与内嵌汇编示例一样，从 entry.S 中的入口点代码调用此 main 函数。如需了解详情，请参阅演讲者备注。
在 QEMU 中，使用 src/bare-metal/aps/examples 目录下的 make qemu 运行该示例。

日志记录

最好能够使用 log crate 中的日志记录宏。可以通过实现“Log”trait 来做到这一点。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

使用 log 进行解封装是安全的，因为我们会在调用 set_logger 之前初始化 LOGGER。

开始使用

需要先初始化日志记录器，然后才能使用它。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

请注意，panic 紧急处理程序现在可以记录各类 panic 详细信息。
在 QEMU 中，使用 src/bare-metal/aps/examples 目录下的 make qemu_logger 运行该示例。

异常

AArch64 定义了一个包含 16 个条目的异常矢量表，适用于处理 4 种状态（当前 EL 使用 SP0，当前 EL 使用 SPx，较低 EL 使用 AArch64，较低 EL 使用 AArch32）下的 4 种异常（同步、IRQ、FIQ、SError）。可以通过汇编方式实现这一操作，以便在调用 Rust 代码之前将易失性寄存器保存到堆栈：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

EL 指异常级别；我们今天下午的所有示例都在 EL1 级别下运行。
为简单起见，我们没有区分当前 EL 异常中的 SP0 和 SPx，也没有区分较低 EL 异常中的 AArch32 和 AArch64。
在本示例中，只需记录异常并进行关机操作，因为预计不会发生任何此类异常。
我们可以将异常处理程序和主执行上下文视为不同的线程。通过 Send 和 Sync 控制它们之间可以共享的内容，就像使用线程进行共享一样。例如，如果想在异常处理程序和程序的其余部分之间共享某个值，并且使用 Send 而非 Sync，则需要将该值封装在诸如 Mutex 之类的内容中，并放入静态变量。

Other projects

oreboot
- “coreboot without the C”
- 支持 x86、aarch64 和 RISC-V。
- 依赖于 LinuxBoot，而不是许多驱动程序本身。
Rust RaspberryPi 操作系统教程
- 初始化、UART 驱动程序、简单引导加载程序、JTAG、异常级别、异常处理、分页表
- 在 Rust 中，有些用于处理缓存维护和初始化的方法并不安全，不适宜照搬到正式版代码中。
cargo-call-stack
- 使用静态分析来确定堆栈用量上限。

Speaker Notes

在 RaspberryPi 操作系统教程中，先运行 Rust 代码然后启用 MMU 和缓存。此操作会读取和写入内存（例如堆栈）。不过：
- 如果不启用 MMU 和缓存，非对齐访问将会出错。它使用 aarch64-unknown-none 进行构建，后者会设置 +strict-align 以防止编译器生成非对齐访问，因此应该没有问题，但一般情况下并不一定如此。
- 如果是在虚拟机中运行该命令，可能会导致缓存一致性问题。问题在于，虚拟机是在禁用缓存的情况下直接访问内存，而主机具有同一内存的缓存别名。即使主机并没有明确访问该内存，推测性访问仍然会导致缓存被填充，然后任何一方对于该内存进行的更改就会丢失。再次强调，尽管在此特定示例中（即在硬件上直接运行且无 Hypervisor）这种做法是可行的，但总的来说，这并非一种良好策略。

Useful crates

接下来介绍几个 crate，用于解决裸机编程中的一些常见问题。

`zerocopy`

zerocopy crate（源自 Fuchsia）提供了 trait 和宏，用于确保在字节序列和其他类型之间进行安全转换。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

这不适用于 MMIO（因为它不使用易失性读取和写入），但在与硬件共享的结构（例如通过 DMA 传输或发送到外部接口）中进行操作时会很有用。

Speaker Notes

对于可以接受任何字节模式的类型，都可以实现 FromBytes方法，因此可以对不受信任的字节序列进行安全转换。
如果尝试为这些类型派生 FromBytes，都将会失败，因为 RequestType 不会将所有可能的 u32 值用作判别标识，所以并非所有的字节模式都有效。
zerocopy::byteorder 提供了适用于字节顺序感知的数字基元类型。
使用 src/bare-metal/useful-crates/zerocopy-example/ 目录下的 cargo run 运行该示例。（由于存在 crate 依赖项，无法在 Playground 中运行该示例。）

`aarch64-paging`

借助 aarch64-paging crate，您可根据 AArch64 虚拟内存系统架构创建分页表。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

目前，该方法仅支持 EL1 级别，但也可以直接添加对其他异常级别的支持。
在 Android 中，该方法适用于受保护的虚拟机固件。
由于此示例需要在真实硬件上或在 QEMU 中运行，因此没有简单的运行方法可用。

`buddy_system_allocator`

buddy_system_allocator 是第三方 crate，用于实现基本伙伴系统分配器。同时，也可将其用于 LockedHeap 以实现 GlobalAlloc，这样便可以使用标准的 alloc crate（正如在之前部分中所示），或者用于分配其他地址空间。例如，我们可能需要为 PCI BAR 分配 MMIO 空间：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

PCI BAR 的对齐方式始终与其大小相等。
使用 src/bare-metal/useful-crates/allocator-example/ 目录下的 cargo run 运行该示例。（由于存在 crate 依赖项，无法在 Playground 中运行该示例。）

`tinyvec`

有时，需要一些像 Vec 一样能够调整大小的特性，但无需进行堆分配。tinyvec 提供了以下特性：由数组或 slice 支持的矢量，该矢量支持进行静态分配或堆分配；用于跟踪使用的元素数量，如果元素使用量超过了分配额度，则会出现 panic。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

根据 tinyvec要求，元素类型需实现初始化 Default。
Rust Playground 中包含 tinyvec，因此本示例将以内嵌方式正常运行。

`spin`

在 core 或 alloc 中无法使用 std::sync::Mutex 和 std::sync 中的其他同步基元。那么该如何管理同步或内部可变性，例如在不同 CPU 之间共享状态？

spin crate 为许多基元提供了基于自旋锁的等效方法。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

在中断处理程序中进行锁定操作时，请注意避免出现死锁的情况。
spin also has a ticket lock mutex implementation; equivalents of RwLock, Barrier and Once from std::sync; and Lazy for lazy initialisation.
once_cell crate 也提供了一些适用于延迟初始化的实用类型，它们与 spin::once::Once 所用方法略有不同。
Rust Playground 中包含 spin，因此本示例将以内嵌方式正常运行。

Android

如需在 AOSP 中构建裸机 Rust 二进制文件，应使用 rust_ffi_static Soong 规则来构建 Rust 代码，然后通过 cc_binary 和链接器脚本生成二进制文件本身，再使用 raw_binary 将 ELF 转换为可以正常运行的原始二进制文件。


rust_ffi_static {
    name: "libvmbase_example",
    defaults: ["vmbase_ffi_defaults"],
    crate_name: "vmbase_example",
    srcs: ["src/main.rs"],
    rustlibs: [
        "libvmbase",
    ],
}

cc_binary {
    name: "vmbase_example",
    defaults: ["vmbase_elf_defaults"],
    srcs: [
        "idmap.S",
    ],
    static_libs: [
        "libvmbase_example",
    ],
    linker_scripts: [
        "image.ld",
        ":vmbase_sections",
    ],
}

raw_binary {
    name: "vmbase_example_bin",
    stem: "vmbase_example.bin",
    src: ":vmbase_example",
    enabled: false,
    target: {
        android_arm64: {
            enabled: true,
        },
    },
}

vmbase

对于在 aarch64 上使用 crosvm 运行的虚拟机，vmbase 库提供了链接器脚本和实用的默认构建规则，以及入口点、UART 控制台日志记录等功能。


#![no_main]
#![no_std]

use vmbase::{main, println};

main!(main);

pub fn main(arg0: u64, arg1: u64, arg2: u64, arg3: u64) {
    println!("Hello world");
}

Speaker Notes

main! 宏用于标记需从 vmbase 入口点调用的 main 函数。
vmbase 入口点用于处理控制台初始化，并在 main 函数返回时发送 PSCI_SYSTEM_OFF 命令以关闭虚拟机。

习题

我们将为 PL031 实时时钟设备编写驱动程序。

Speaker Notes

After looking at the exercises, you can look at the solutions provided.

RTC 驱动程序

QEMU aarch64 虚拟机在 0x9010000 地址处配备了 PL031 实时时钟。对于本练习，应该为其编写驱动程序。

使用该时钟可将当前时间输出到串行控制台。您可以使用 chrono crate 设置日期/时间格式。
通过匹配寄存器和原始中断状态，使得系统在某段指定的时间内一直进行繁忙等待（例如 3 秒后）。（在循环操作中调用 core::hint::spin_loop。）
_进行扩展（如有时间）：_启用并处理由 RTC 匹配产生的中断。可以使用 arm-gic crate 中提供的驱动程序来配置 Arm 通用中断控制器。
- 请使用 RTC 中断，将其作为 IntId::spi(2) 连接到 GIC。
- 启用中断后，可以通过 arm_gic::wfi() 让核心进入休眠状态，直到它收到中断信号。

下载练习模板并在 rtc 目录中查找以下文件。

src/main.rs:


#![no_main]
#![no_std]

mod exceptions;
mod logger;
mod pl011;

use crate::pl011::Uart;
use arm_gic::gicv3::GicV3;
use core::panic::PanicInfo;
use log::{error, info, trace, LevelFilter};
use smccc::psci::system_off;
use smccc::Hvc;

/// Base addresses of the GICv3.
const GICD_BASE_ADDRESS: *mut u64 = 0x800_0000 as _;
const GICR_BASE_ADDRESS: *mut u64 = 0x80A_0000 as _;

/// Base address of the primary PL011 UART.
const PL011_BASE_ADDRESS: *mut u32 = 0x900_0000 as _;

#[no_mangle]
extern "C" fn main(x0: u64, x1: u64, x2: u64, x3: u64) {
    // SAFETY: `PL011_BASE_ADDRESS` is the base address of a PL011 device, and
    // nothing else accesses that address range.
    let uart = unsafe { Uart::new(PL011_BASE_ADDRESS) };
    logger::init(uart, LevelFilter::Trace).unwrap();

    info!("main({:#x}, {:#x}, {:#x}, {:#x})", x0, x1, x2, x3);

    // SAFETY: `GICD_BASE_ADDRESS` and `GICR_BASE_ADDRESS` are the base
    // addresses of a GICv3 distributor and redistributor respectively, and
    // nothing else accesses those address ranges.
    let mut gic = unsafe { GicV3::new(GICD_BASE_ADDRESS, GICR_BASE_ADDRESS) };
    gic.setup();

    // TODO: Create instance of RTC driver and print current time.

    // TODO: Wait for 3 seconds.

    system_off::<Hvc>().unwrap();
}

#[panic_handler]
fn panic(info: &PanicInfo) -> ! {
    error!("{info}");
    system_off::<Hvc>().unwrap();
    loop {}
}

src/exceptions.rs（只需在本练习的第 3 部分更改此项）：


#![allow(unused)]
fn main() {
// Copyright 2023 Google LLC
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
//      http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

use arm_gic::gicv3::GicV3;
use log::{error, info, trace};
use smccc::psci::system_off;
use smccc::Hvc;

#[no_mangle]
extern "C" fn sync_exception_current(_elr: u64, _spsr: u64) {
    error!("sync_exception_current");
    system_off::<Hvc>().unwrap();
}

#[no_mangle]
extern "C" fn irq_current(_elr: u64, _spsr: u64) {
    trace!("irq_current");
    let intid =
        GicV3::get_and_acknowledge_interrupt().expect("No pending interrupt");
    info!("IRQ {intid:?}");
}

#[no_mangle]
extern "C" fn fiq_current(_elr: u64, _spsr: u64) {
    error!("fiq_current");
    system_off::<Hvc>().unwrap();
}

#[no_mangle]
extern "C" fn serr_current(_elr: u64, _spsr: u64) {
    error!("serr_current");
    system_off::<Hvc>().unwrap();
}

#[no_mangle]
extern "C" fn sync_lower(_elr: u64, _spsr: u64) {
    error!("sync_lower");
    system_off::<Hvc>().unwrap();
}

#[no_mangle]
extern "C" fn irq_lower(_elr: u64, _spsr: u64) {
    error!("irq_lower");
    system_off::<Hvc>().unwrap();
}

#[no_mangle]
extern "C" fn fiq_lower(_elr: u64, _spsr: u64) {
    error!("fiq_lower");
    system_off::<Hvc>().unwrap();
}

#[no_mangle]
extern "C" fn serr_lower(_elr: u64, _spsr: u64) {
    error!("serr_lower");
    system_off::<Hvc>().unwrap();
}
}

src/logger.rs（无需对此进行更改）：


#![allow(unused)]
fn main() {
// Copyright 2023 Google LLC
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
//      http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

// ANCHOR: main
use crate::pl011::Uart;
use core::fmt::Write;
use log::{LevelFilter, Log, Metadata, Record, SetLoggerError};
use spin::mutex::SpinMutex;

static LOGGER: Logger = Logger { uart: SpinMutex::new(None) };

struct Logger {
    uart: SpinMutex<Option<Uart>>,
}

impl Log for Logger {
    fn enabled(&self, _metadata: &Metadata) -> bool {
        true
    }

    fn log(&self, record: &Record) {
        writeln!(
            self.uart.lock().as_mut().unwrap(),
            "[{}] {}",
            record.level(),
            record.args()
        )
        .unwrap();
    }

    fn flush(&self) {}
}

/// Initialises UART logger.
pub fn init(uart: Uart, max_level: LevelFilter) -> Result<(), SetLoggerError> {
    LOGGER.uart.lock().replace(uart);

    log::set_logger(&LOGGER)?;
    log::set_max_level(max_level);
    Ok(())
}
}

src/pl011.rs（无需对此进行更改）：


#![allow(unused)]
fn main() {
// Copyright 2023 Google LLC
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
//      http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

#![allow(unused)]

use core::fmt::{self, Write};
use core::ptr::{addr_of, addr_of_mut};

// ANCHOR: Flags
use bitflags::bitflags;

bitflags! {
    /// Flags from the UART flag register.
    #[repr(transparent)]
    #[derive(Copy, Clone, Debug, Eq, PartialEq)]
    struct Flags: u16 {
        /// Clear to send.
        const CTS = 1 << 0;
        /// Data set ready.
        const DSR = 1 << 1;
        /// Data carrier detect.
        const DCD = 1 << 2;
        /// UART busy transmitting data.
        const BUSY = 1 << 3;
        /// Receive FIFO is empty.
        const RXFE = 1 << 4;
        /// Transmit FIFO is full.
        const TXFF = 1 << 5;
        /// Receive FIFO is full.
        const RXFF = 1 << 6;
        /// Transmit FIFO is empty.
        const TXFE = 1 << 7;
        /// Ring indicator.
        const RI = 1 << 8;
    }
}
// ANCHOR_END: Flags

bitflags! {
    /// Flags from the UART Receive Status Register / Error Clear Register.
    #[repr(transparent)]
    #[derive(Copy, Clone, Debug, Eq, PartialEq)]
    struct ReceiveStatus: u16 {
        /// Framing error.
        const FE = 1 << 0;
        /// Parity error.
        const PE = 1 << 1;
        /// Break error.
        const BE = 1 << 2;
        /// Overrun error.
        const OE = 1 << 3;
    }
}

// ANCHOR: Registers
#[repr(C, align(4))]
struct Registers {
    dr: u16,
    _reserved0: [u8; 2],
    rsr: ReceiveStatus,
    _reserved1: [u8; 19],
    fr: Flags,
    _reserved2: [u8; 6],
    ilpr: u8,
    _reserved3: [u8; 3],
    ibrd: u16,
    _reserved4: [u8; 2],
    fbrd: u8,
    _reserved5: [u8; 3],
    lcr_h: u8,
    _reserved6: [u8; 3],
    cr: u16,
    _reserved7: [u8; 3],
    ifls: u8,
    _reserved8: [u8; 3],
    imsc: u16,
    _reserved9: [u8; 2],
    ris: u16,
    _reserved10: [u8; 2],
    mis: u16,
    _reserved11: [u8; 2],
    icr: u16,
    _reserved12: [u8; 2],
    dmacr: u8,
    _reserved13: [u8; 3],
}
// ANCHOR_END: Registers

// ANCHOR: Uart
/// Driver for a PL011 UART.
#[derive(Debug)]
pub struct Uart {
    registers: *mut Registers,
}

impl Uart {
    /// Constructs a new instance of the UART driver for a PL011 device at the
    /// given base address.
    ///
    /// # Safety
    ///
    /// The given base address must point to the MMIO control registers of a
    /// PL011 device, which must be mapped into the address space of the process
    /// as device memory and not have any other aliases.
    pub unsafe fn new(base_address: *mut u32) -> Self {
        Self { registers: base_address as *mut Registers }
    }

    /// Writes a single byte to the UART.
    pub fn write_byte(&self, byte: u8) {
        // Wait until there is room in the TX buffer.
        while self.read_flag_register().contains(Flags::TXFF) {}

        // SAFETY: We know that self.registers points to the control registers
        // of a PL011 device which is appropriately mapped.
        unsafe {
            // Write to the TX buffer.
            addr_of_mut!((*self.registers).dr).write_volatile(byte.into());
        }

        // Wait until the UART is no longer busy.
        while self.read_flag_register().contains(Flags::BUSY) {}
    }

    /// Reads and returns a pending byte, or `None` if nothing has been
    /// received.
    pub fn read_byte(&self) -> Option<u8> {
        if self.read_flag_register().contains(Flags::RXFE) {
            None
        } else {
            // SAFETY: We know that self.registers points to the control
            // registers of a PL011 device which is appropriately mapped.
            let data = unsafe { addr_of!((*self.registers).dr).read_volatile() };
            // TODO: Check for error conditions in bits 8-11.
            Some(data as u8)
        }
    }

    fn read_flag_register(&self) -> Flags {
        // SAFETY: We know that self.registers points to the control registers
        // of a PL011 device which is appropriately mapped.
        unsafe { addr_of!((*self.registers).fr).read_volatile() }
    }
}
// ANCHOR_END: Uart

impl Write for Uart {
    fn write_str(&mut self, s: &str) -> fmt::Result {
        for c in s.as_bytes() {
            self.write_byte(*c);
        }
        Ok(())
    }
}

// Safe because it just contains a pointer to device memory, which can be
// accessed from any context.
unsafe impl Send for Uart {}
}

Cargo.toml（无需对此进行更改）：


[workspace]

[package]
name = "rtc"
version = "0.1.0"
edition = "2021"
publish = false

[dependencies]
arm-gic = "0.1.0"
bitflags = "2.5.0"
chrono = { version = "0.4.37", default-features = false }
log = "0.4.21"
smccc = "0.1.1"
spin = "0.9.8"

[build-dependencies]
cc = "1.0.94"

build.rs（无需对此进行更改）：


// Copyright 2023 Google LLC
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
//      http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

use cc::Build;
use std::env;

fn main() {
    #[cfg(target_os = "linux")]
    env::set_var("CROSS_COMPILE", "aarch64-linux-gnu");
    #[cfg(not(target_os = "linux"))]
    env::set_var("CROSS_COMPILE", "aarch64-none-elf");

    Build::new()
        .file("entry.S")
        .file("exceptions.S")
        .file("idmap.S")
        .compile("empty")
}

entry.S（无需对此进行更改）：


/*
 * Copyright 2023 Google LLC
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *     https://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

.macro adr_l, reg:req, sym:req
    adrp \reg, \sym
    add \reg, \reg, :lo12:\sym
.endm

.macro mov_i, reg:req, imm:req
    movz \reg, :abs_g3:\imm
    movk \reg, :abs_g2_nc:\imm
    movk \reg, :abs_g1_nc:\imm
    movk \reg, :abs_g0_nc:\imm
.endm

.set .L_MAIR_DEV_nGnRE,	0x04
.set .L_MAIR_MEM_WBWA,	0xff
.set .Lmairval, .L_MAIR_DEV_nGnRE | (.L_MAIR_MEM_WBWA << 8)

/* 4 KiB granule size for TTBR0_EL1. */
.set .L_TCR_TG0_4KB, 0x0 << 14
/* 4 KiB granule size for TTBR1_EL1. */
.set .L_TCR_TG1_4KB, 0x2 << 30
/* Disable translation table walk for TTBR1_EL1, generating a translation fault instead. */
.set .L_TCR_EPD1, 0x1 << 23
/* Translation table walks for TTBR0_EL1 are inner sharable. */
.set .L_TCR_SH_INNER, 0x3 << 12
/*
 * Translation table walks for TTBR0_EL1 are outer write-back read-allocate write-allocate
 * cacheable.
 */
.set .L_TCR_RGN_OWB, 0x1 << 10
/*
 * Translation table walks for TTBR0_EL1 are inner write-back read-allocate write-allocate
 * cacheable.
 */
.set .L_TCR_RGN_IWB, 0x1 << 8
/* Size offset for TTBR0_EL1 is 2**39 bytes (512 GiB). */
.set .L_TCR_T0SZ_512, 64 - 39
.set .Ltcrval, .L_TCR_TG0_4KB | .L_TCR_TG1_4KB | .L_TCR_EPD1 | .L_TCR_RGN_OWB
.set .Ltcrval, .Ltcrval | .L_TCR_RGN_IWB | .L_TCR_SH_INNER | .L_TCR_T0SZ_512

/* Stage 1 instruction access cacheability is unaffected. */
.set .L_SCTLR_ELx_I, 0x1 << 12
/* SP alignment fault if SP is not aligned to a 16 byte boundary. */
.set .L_SCTLR_ELx_SA, 0x1 << 3
/* Stage 1 data access cacheability is unaffected. */
.set .L_SCTLR_ELx_C, 0x1 << 2
/* EL0 and EL1 stage 1 MMU enabled. */
.set .L_SCTLR_ELx_M, 0x1 << 0
/* Privileged Access Never is unchanged on taking an exception to EL1. */
.set .L_SCTLR_EL1_SPAN, 0x1 << 23
/* SETEND instruction disabled at EL0 in aarch32 mode. */
.set .L_SCTLR_EL1_SED, 0x1 << 8
/* Various IT instructions are disabled at EL0 in aarch32 mode. */
.set .L_SCTLR_EL1_ITD, 0x1 << 7
.set .L_SCTLR_EL1_RES1, (0x1 << 11) | (0x1 << 20) | (0x1 << 22) | (0x1 << 28) | (0x1 << 29)
.set .Lsctlrval, .L_SCTLR_ELx_M | .L_SCTLR_ELx_C | .L_SCTLR_ELx_SA | .L_SCTLR_EL1_ITD | .L_SCTLR_EL1_SED
.set .Lsctlrval, .Lsctlrval | .L_SCTLR_ELx_I | .L_SCTLR_EL1_SPAN | .L_SCTLR_EL1_RES1

/**
 * This is a generic entry point for an image. It carries out the operations required to prepare the
 * loaded image to be run. Specifically, it zeroes the bss section using registers x25 and above,
 * prepares the stack, enables floating point, and sets up the exception vector. It preserves x0-x3
 * for the Rust entry point, as these may contain boot parameters.
 */
.section .init.entry, "ax"
.global entry
entry:
    /* Load and apply the memory management configuration, ready to enable MMU and caches. */
    adrp x30, idmap
    msr ttbr0_el1, x30

    mov_i x30, .Lmairval
    msr mair_el1, x30

    mov_i x30, .Ltcrval
    /* Copy the supported PA range into TCR_EL1.IPS. */
    mrs x29, id_aa64mmfr0_el1
    bfi x30, x29, #32, #4

    msr tcr_el1, x30

    mov_i x30, .Lsctlrval

    /*
     * Ensure everything before this point has completed, then invalidate any potentially stale
     * local TLB entries before they start being used.
     */
    isb
    tlbi vmalle1
    ic iallu
    dsb nsh
    isb

    /*
     * Configure sctlr_el1 to enable MMU and cache and don't proceed until this has completed.
     */
    msr sctlr_el1, x30
    isb

    /* Disable trapping floating point access in EL1. */
    mrs x30, cpacr_el1
    orr x30, x30, #(0x3 << 20)
    msr cpacr_el1, x30
    isb

    /* Zero out the bss section. */
    adr_l x29, bss_begin
    adr_l x30, bss_end
0:	cmp x29, x30
    b.hs 1f
    stp xzr, xzr, [x29], #16
    b 0b

1:	/* Prepare the stack. */
    adr_l x30, boot_stack_end
    mov sp, x30

    /* Set up exception vector. */
    adr x30, vector_table_el1
    msr vbar_el1, x30

    /* Call into Rust code. */
    bl main

    /* Loop forever waiting for interrupts. */
2:	wfi
    b 2b

exceptions.S（无需对此进行更改）：


/*
 * Copyright 2023 Google LLC
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *     https://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

/**
 * Saves the volatile registers onto the stack. This currently takes 14
 * instructions, so it can be used in exception handlers with 18 instructions
 * left.
 *
 * On return, x0 and x1 are initialised to elr_el2 and spsr_el2 respectively,
 * which can be used as the first and second arguments of a subsequent call.
 */
.macro save_volatile_to_stack
    /* Reserve stack space and save registers x0-x18, x29 & x30. */
    stp x0, x1, [sp, #-(8 * 24)]!
    stp x2, x3, [sp, #8 * 2]
    stp x4, x5, [sp, #8 * 4]
    stp x6, x7, [sp, #8 * 6]
    stp x8, x9, [sp, #8 * 8]
    stp x10, x11, [sp, #8 * 10]
    stp x12, x13, [sp, #8 * 12]
    stp x14, x15, [sp, #8 * 14]
    stp x16, x17, [sp, #8 * 16]
    str x18, [sp, #8 * 18]
    stp x29, x30, [sp, #8 * 20]

    /*
     * Save elr_el1 & spsr_el1. This such that we can take nested exception
     * and still be able to unwind.
     */
    mrs x0, elr_el1
    mrs x1, spsr_el1
    stp x0, x1, [sp, #8 * 22]
.endm

/**
 * Restores the volatile registers from the stack. This currently takes 14
 * instructions, so it can be used in exception handlers while still leaving 18
 * instructions left; if paired with save_volatile_to_stack, there are 4
 * instructions to spare.
 */
.macro restore_volatile_from_stack
    /* Restore registers x2-x18, x29 & x30. */
    ldp x2, x3, [sp, #8 * 2]
    ldp x4, x5, [sp, #8 * 4]
    ldp x6, x7, [sp, #8 * 6]
    ldp x8, x9, [sp, #8 * 8]
    ldp x10, x11, [sp, #8 * 10]
    ldp x12, x13, [sp, #8 * 12]
    ldp x14, x15, [sp, #8 * 14]
    ldp x16, x17, [sp, #8 * 16]
    ldr x18, [sp, #8 * 18]
    ldp x29, x30, [sp, #8 * 20]

    /* Restore registers elr_el1 & spsr_el1, using x0 & x1 as scratch. */
    ldp x0, x1, [sp, #8 * 22]
    msr elr_el1, x0
    msr spsr_el1, x1

    /* Restore x0 & x1, and release stack space. */
    ldp x0, x1, [sp], #8 * 24
.endm

/**
 * This is a generic handler for exceptions taken at the current EL while using
 * SP0. It behaves similarly to the SPx case by first switching to SPx, doing
 * the work, then switching back to SP0 before returning.
 *
 * Switching to SPx and calling the Rust handler takes 16 instructions. To
 * restore and return we need an additional 16 instructions, so we can implement
 * the whole handler within the allotted 32 instructions.
 */
.macro current_exception_sp0 handler:req
    msr spsel, #1
    save_volatile_to_stack
    bl \handler
    restore_volatile_from_stack
    msr spsel, #0
    eret
.endm

/**
 * This is a generic handler for exceptions taken at the current EL while using
 * SPx. It saves volatile registers, calls the Rust handler, restores volatile
 * registers, then returns.
 *
 * This also works for exceptions taken from EL0, if we don't care about
 * non-volatile registers.
 *
 * Saving state and jumping to the Rust handler takes 15 instructions, and
 * restoring and returning also takes 15 instructions, so we can fit the whole
 * handler in 30 instructions, under the limit of 32.
 */
.macro current_exception_spx handler:req
    save_volatile_to_stack
    bl \handler
    restore_volatile_from_stack
    eret
.endm

.section .text.vector_table_el1, "ax"
.global vector_table_el1
.balign 0x800
vector_table_el1:
sync_cur_sp0:
    current_exception_sp0 sync_exception_current

.balign 0x80
irq_cur_sp0:
    current_exception_sp0 irq_current

.balign 0x80
fiq_cur_sp0:
    current_exception_sp0 fiq_current

.balign 0x80
serr_cur_sp0:
    current_exception_sp0 serr_current

.balign 0x80
sync_cur_spx:
    current_exception_spx sync_exception_current

.balign 0x80
irq_cur_spx:
    current_exception_spx irq_current

.balign 0x80
fiq_cur_spx:
    current_exception_spx fiq_current

.balign 0x80
serr_cur_spx:
    current_exception_spx serr_current

.balign 0x80
sync_lower_64:
    current_exception_spx sync_lower

.balign 0x80
irq_lower_64:
    current_exception_spx irq_lower

.balign 0x80
fiq_lower_64:
    current_exception_spx fiq_lower

.balign 0x80
serr_lower_64:
    current_exception_spx serr_lower

.balign 0x80
sync_lower_32:
    current_exception_spx sync_lower

.balign 0x80
irq_lower_32:
    current_exception_spx irq_lower

.balign 0x80
fiq_lower_32:
    current_exception_spx fiq_lower

.balign 0x80
serr_lower_32:
    current_exception_spx serr_lower

idmap.S（无需对此进行更改）


/*
 * Copyright 2023 Google LLC
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *     https://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

.set .L_TT_TYPE_BLOCK, 0x1
.set .L_TT_TYPE_PAGE,  0x3
.set .L_TT_TYPE_TABLE, 0x3

/* Access flag. */
.set .L_TT_AF, 0x1 << 10
/* Not global. */
.set .L_TT_NG, 0x1 << 11
.set .L_TT_XN, 0x3 << 53

.set .L_TT_MT_DEV, 0x0 << 2			// MAIR #0 (DEV_nGnRE)
.set .L_TT_MT_MEM, (0x1 << 2) | (0x3 << 8)	// MAIR #1 (MEM_WBWA), inner shareable

.set .L_BLOCK_DEV, .L_TT_TYPE_BLOCK | .L_TT_MT_DEV | .L_TT_AF | .L_TT_XN
.set .L_BLOCK_MEM, .L_TT_TYPE_BLOCK | .L_TT_MT_MEM | .L_TT_AF | .L_TT_NG

.section ".rodata.idmap", "a", %progbits
.global idmap
.align 12
idmap:
    /* level 1 */
    .quad		.L_BLOCK_DEV | 0x0		    // 1 GiB of device mappings
    .quad		.L_BLOCK_MEM | 0x40000000	// 1 GiB of DRAM
    .fill		254, 8, 0x0			// 254 GiB of unmapped VA space
    .quad		.L_BLOCK_DEV | 0x4000000000 // 1 GiB of device mappings
    .fill		255, 8, 0x0			// 255 GiB of remaining VA space

image.ld（无需对此进行更改）：


/*
 * Copyright 2023 Google LLC
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *     https://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

/*
 * Code will start running at this symbol which is placed at the start of the
 * image.
 */
ENTRY(entry)

MEMORY
{
	image : ORIGIN = 0x40080000, LENGTH = 2M
}

SECTIONS
{
	/*
	 * Collect together the code.
	 */
	.init : ALIGN(4096) {
		text_begin = .;
		*(.init.entry)
		*(.init.*)
	} >image
	.text : {
		*(.text.*)
	} >image
	text_end = .;

	/*
	 * Collect together read-only data.
	 */
	.rodata : ALIGN(4096) {
		rodata_begin = .;
		*(.rodata.*)
	} >image
	.got : {
		*(.got)
	} >image
	rodata_end = .;

	/*
	 * Collect together the read-write data including .bss at the end which
	 * will be zero'd by the entry code.
	 */
	.data : ALIGN(4096) {
		data_begin = .;
		*(.data.*)
		/*
		 * The entry point code assumes that .data is a multiple of 32
		 * bytes long.
		 */
		. = ALIGN(32);
		data_end = .;
	} >image

	/* Everything beyond this point will not be included in the binary. */
	bin_end = .;

	/* The entry point code assumes that .bss is 16-byte aligned. */
	.bss : ALIGN(16)  {
		bss_begin = .;
		*(.bss.*)
		*(COMMON)
		. = ALIGN(16);
		bss_end = .;
	} >image

	.stack (NOLOAD) : ALIGN(4096) {
		boot_stack_begin = .;
		. += 40 * 4096;
		. = ALIGN(4096);
		boot_stack_end = .;
	} >image

	. = ALIGN(4K);
	PROVIDE(dma_region = .);

	/*
	 * Remove unused sections from the image.
	 */
	/DISCARD/ : {
		/* The image loads itself so doesn't need these sections. */
		*(.gnu.hash)
		*(.hash)
		*(.interp)
		*(.eh_frame_hdr)
		*(.eh_frame)
		*(.note.gnu.build-id)
	}
}

Makefile（无需对此进行更改）：


# Copyright 2023 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#      http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

UNAME := $(shell uname -s)
ifeq ($(UNAME),Linux)
    TARGET = aarch64-linux-gnu
else
    TARGET = aarch64-none-elf
endif
OBJCOPY = $(TARGET)-objcopy

.PHONY: build qemu_minimal qemu qemu_logger

all: rtc.bin

build:
    cargo build

rtc.bin: build
    $(OBJCOPY) -O binary target/aarch64-unknown-none/debug/rtc $@

qemu: rtc.bin
    qemu-system-aarch64 -machine virt,gic-version=3 -cpu max -serial mon:stdio -display none -kernel $< -s

clean:
    cargo clean
    rm -f *.bin

.cargo/config.toml（无需对此进行更改）：


[build]
target = "aarch64-unknown-none"
rustflags = ["-C", "link-arg=-Timage.ld"]

使用 make qemu 在 QEMU 中运行代码。

嵌入式 Rust：进阶篇

RTC 驱动程序

(返回练习)

main.rs:


#![no_main]
#![no_std]

mod exceptions;
mod logger;
mod pl011;
mod pl031;

use crate::pl031::Rtc;
use arm_gic::gicv3::{IntId, Trigger};
use arm_gic::{irq_enable, wfi};
use chrono::{TimeZone, Utc};
use core::hint::spin_loop;
use crate::pl011::Uart;
use arm_gic::gicv3::GicV3;
use core::panic::PanicInfo;
use log::{error, info, trace, LevelFilter};
use smccc::psci::system_off;
use smccc::Hvc;

/// Base addresses of the GICv3.
const GICD_BASE_ADDRESS: *mut u64 = 0x800_0000 as _;
const GICR_BASE_ADDRESS: *mut u64 = 0x80A_0000 as _;

/// Base address of the primary PL011 UART.
const PL011_BASE_ADDRESS: *mut u32 = 0x900_0000 as _;

/// Base address of the PL031 RTC.
const PL031_BASE_ADDRESS: *mut u32 = 0x901_0000 as _;
/// The IRQ used by the PL031 RTC.
const PL031_IRQ: IntId = IntId::spi(2);

#[no_mangle]
extern "C" fn main(x0: u64, x1: u64, x2: u64, x3: u64) {
    // SAFETY: `PL011_BASE_ADDRESS` is the base address of a PL011 device, and
    // nothing else accesses that address range.
    let uart = unsafe { Uart::new(PL011_BASE_ADDRESS) };
    logger::init(uart, LevelFilter::Trace).unwrap();

    info!("main({:#x}, {:#x}, {:#x}, {:#x})", x0, x1, x2, x3);

    // SAFETY: `GICD_BASE_ADDRESS` and `GICR_BASE_ADDRESS` are the base
    // addresses of a GICv3 distributor and redistributor respectively, and
    // nothing else accesses those address ranges.
    let mut gic = unsafe { GicV3::new(GICD_BASE_ADDRESS, GICR_BASE_ADDRESS) };
    gic.setup();

    // SAFETY: `PL031_BASE_ADDRESS` is the base address of a PL031 device, and
    // nothing else accesses that address range.
    let mut rtc = unsafe { Rtc::new(PL031_BASE_ADDRESS) };
    let timestamp = rtc.read();
    let time = Utc.timestamp_opt(timestamp.into(), 0).unwrap();
    info!("RTC: {time}");

    GicV3::set_priority_mask(0xff);
    gic.set_interrupt_priority(PL031_IRQ, 0x80);
    gic.set_trigger(PL031_IRQ, Trigger::Level);
    irq_enable();
    gic.enable_interrupt(PL031_IRQ, true);

    // Wait for 3 seconds, without interrupts.
    let target = timestamp + 3;
    rtc.set_match(target);
    info!("Waiting for {}", Utc.timestamp_opt(target.into(), 0).unwrap());
    trace!(
        "matched={}, interrupt_pending={}",
        rtc.matched(),
        rtc.interrupt_pending()
    );
    while !rtc.matched() {
        spin_loop();
    }
    trace!(
        "matched={}, interrupt_pending={}",
        rtc.matched(),
        rtc.interrupt_pending()
    );
    info!("Finished waiting");

    // Wait another 3 seconds for an interrupt.
    let target = timestamp + 6;
    info!("Waiting for {}", Utc.timestamp_opt(target.into(), 0).unwrap());
    rtc.set_match(target);
    rtc.clear_interrupt();
    rtc.enable_interrupt(true);
    trace!(
        "matched={}, interrupt_pending={}",
        rtc.matched(),
        rtc.interrupt_pending()
    );
    while !rtc.interrupt_pending() {
        wfi();
    }
    trace!(
        "matched={}, interrupt_pending={}",
        rtc.matched(),
        rtc.interrupt_pending()
    );
    info!("Finished waiting");

    system_off::<Hvc>().unwrap();
}

#[panic_handler]
fn panic(info: &PanicInfo) -> ! {
    error!("{info}");
    system_off::<Hvc>().unwrap();
    loop {}
}

pl031.rs：


#![allow(unused)]
fn main() {
use core::ptr::{addr_of, addr_of_mut};

#[repr(C, align(4))]
struct Registers {
    /// Data register
    dr: u32,
    /// Match register
    mr: u32,
    /// Load register
    lr: u32,
    /// Control register
    cr: u8,
    _reserved0: [u8; 3],
    /// Interrupt Mask Set or Clear register
    imsc: u8,
    _reserved1: [u8; 3],
    /// Raw Interrupt Status
    ris: u8,
    _reserved2: [u8; 3],
    /// Masked Interrupt Status
    mis: u8,
    _reserved3: [u8; 3],
    /// Interrupt Clear Register
    icr: u8,
    _reserved4: [u8; 3],
}

/// Driver for a PL031 real-time clock.
#[derive(Debug)]
pub struct Rtc {
    registers: *mut Registers,
}

impl Rtc {
    /// Constructs a new instance of the RTC driver for a PL031 device at the
    /// given base address.
    ///
    /// # Safety
    ///
    /// The given base address must point to the MMIO control registers of a
    /// PL031 device, which must be mapped into the address space of the process
    /// as device memory and not have any other aliases.
    pub unsafe fn new(base_address: *mut u32) -> Self {
        Self { registers: base_address as *mut Registers }
    }

    /// Reads the current RTC value.
    pub fn read(&self) -> u32 {
        // SAFETY: We know that self.registers points to the control registers
        // of a PL031 device which is appropriately mapped.
        unsafe { addr_of!((*self.registers).dr).read_volatile() }
    }

    /// Writes a match value. When the RTC value matches this then an interrupt
    /// will be generated (if it is enabled).
    pub fn set_match(&mut self, value: u32) {
        // SAFETY: We know that self.registers points to the control registers
        // of a PL031 device which is appropriately mapped.
        unsafe { addr_of_mut!((*self.registers).mr).write_volatile(value) }
    }

    /// Returns whether the match register matches the RTC value, whether or not
    /// the interrupt is enabled.
    pub fn matched(&self) -> bool {
        // SAFETY: We know that self.registers points to the control registers
        // of a PL031 device which is appropriately mapped.
        let ris = unsafe { addr_of!((*self.registers).ris).read_volatile() };
        (ris & 0x01) != 0
    }

    /// Returns whether there is currently an interrupt pending.
    ///
    /// This should be true if and only if `matched` returns true and the
    /// interrupt is masked.
    pub fn interrupt_pending(&self) -> bool {
        // SAFETY: We know that self.registers points to the control registers
        // of a PL031 device which is appropriately mapped.
        let ris = unsafe { addr_of!((*self.registers).mis).read_volatile() };
        (ris & 0x01) != 0
    }

    /// Sets or clears the interrupt mask.
    ///
    /// When the mask is true the interrupt is enabled; when it is false the
    /// interrupt is disabled.
    pub fn enable_interrupt(&mut self, mask: bool) {
        let imsc = if mask { 0x01 } else { 0x00 };
        // SAFETY: We know that self.registers points to the control registers
        // of a PL031 device which is appropriately mapped.
        unsafe { addr_of_mut!((*self.registers).imsc).write_volatile(imsc) }
    }

    /// Clears a pending interrupt, if any.
    pub fn clear_interrupt(&mut self) {
        // SAFETY: We know that self.registers points to the control registers
        // of a PL031 device which is appropriately mapped.
        unsafe { addr_of_mut!((*self.registers).icr).write_volatile(0x01) }
    }
}

// SAFETY: `Rtc` just contains a pointer to device memory, which can be
// accessed from any context.
unsafe impl Send for Rtc {}
}

欢迎了解 Rust 中的并发

Rust 完全支持使用带有互斥锁和通道的操作系统线程进行并发。

Rust 类型系统能帮助我们把许多并发bug转换为编译期bug 发挥着重要作用。这通常称为“无畏并发”，因为你可以依靠编译器来确保运行时的正确性。

Speaker Notes

Rust lets us access OS concurrency toolkit: threads, sync. primitives, etc.
The type system gives us safety for concurrency without any special features.
The same tools that help with “concurrent” access in a single thread (e.g., a called function that might mutate an argument or save references to it to read later) save us from multi-threading issues.

线程

Rust 线程的运作方式与其他语言中的线程类似：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

线程均为守护程序线程，主线程不会等待这些线程。
线程紧急警报 (panic) 是彼此独立的。
- 紧急警报可以携带载荷，并可以使用 downcast_ref 对载荷进行解压缩。

Speaker Notes

Rust thread APIs look not too different from e.g. C++ ones.
Run the example.
- 5ms timing is loose enough that main and spawned threads stay mostly in lockstep.
- Notice that the program ends before the spawned thread reaches 10!
- This is because main ends the program and spawned threads do not make it persist.
  - Compare to pthreads/C++ std::thread/boost::thread if desired.
How do we wait around for the spawned thread to complete?
thread::spawn returns a JoinHandle. Look at the docs.
- JoinHandle has a .join() method that blocks.
Use let handle = thread::spawn(...) and later handle.join() to wait for the thread to finish and have the program count all the way to 10.
Now what if we want to return a value?
Look at docs again:
- thread::spawn’s closure returns T
- JoinHandle .join() returns thread::Result<T>
Use the Result return value from handle.join() to get access to the returned value.
Ok, what about the other case?
- Trigger a panic in the thread. Note that this doesn’t panic main.
- Access the panic payload. This is a good time to talk about Any.
Now we can return values from threads! What about taking inputs?
- Capture something by reference in the thread closure.
- An error message indicates we must move it.
- Move it in, see we can compute and then return a derived value.
If we want to borrow?
- Main kills child threads when it returns, but another function would just return and leave them running.
- That would be stack use-after-return, which violates memory safety!
- How do we avoid this? see next slide.

范围线程

常规线程不能从它们所处的环境中借用：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

不过，你可以使用范围线程来实现此目的：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

其原因在于，在 thread::scope 函数完成后，可保证所有线程都已联结在一起，使得线程能够返回借用的数据。
此时须遵守常规 Rust 借用规则：你可以通过一个线程以可变的方式借用，也可以通过任意数量的线程以不可变的方式借用。

通道

Rust 通道（Channel）包含两个部分：Sender<T> 和 Receiver<T>。这两个部分通过通道进行连接，但你只能看到端点。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

mpsc 代表多个生产方，单个使用方。Sender 和 SyncSender 会实现 Clone（因此，你可以设置多个生产方），但 Receiver 不会实现。
send() 和 recv() 会返回 Result。如果它们返回 Err，则表示对应的 Sender 或 Receiver 已被丢弃，且通道已关闭。

无界通道

你可以使用 mpsc::channel() 获得无边界的异步通道：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

有界通道

With bounded (synchronous) channels, send can block the current thread:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

调用 send 将阻塞当前线程，直到通道中有足够的空间放置新消息。如果无人从通道读取数据，线程会被无限期地阻塞。
如果通道关闭，调用 send 将中止并返回错误（这就是它会返回 Result 的原因）。当接收器被丢弃时，通道将关闭。
A bounded channel with a size of zero is called a “rendezvous channel”. Every send will block the current thread until another thread calls recv.

`Send` 和 `Sync`

How does Rust know to forbid shared access across threads? The answer is in two traits:

Send：如果跨线程边界移动 T 是安全的，则类型 T 为 Send。
Sync：如果跨线程边界移动 &T 是安全的，则类型 T 为 Sync。

Send 和 Sync 均为不安全特征。只要类型仅包含 Send 和 Sync 类型，编译器就会自动为类型派生这两种特征。你也可以手动实现它们（如果你确定这样有效的话）。

Speaker Notes

不妨将这些特征视为类型包含某些线程安全属性的标记。
它们可以在泛型约束中作为常规特征使用。

`Send`

如果将 T 值移动到另一个线程是安全的，则类型 T 为 Send。

将所有权转移到另一个线程的影响是，“析构函数”将在相应线程中运行。因此，问题在于你何时可以在一个线程中分配某个值，然后在另一个线程中取消分配该值。

Speaker Notes

例如，与 SQLite 库的连接只能通过单个线程访问。

`Sync`

如果同时从多个线程访问 T 值是安全的，则类型 T 为 Sync。

更准确地说，定义是：

当且仅当 &T 为 Send 时，T 为 Sync

Speaker Notes

该语句实质上是一种简写形式，表示如果某个类型对于共享使用是线程安全的，那么跨线程传递对该类型的引用也是线程安全的。

这是因为如果某个类型为 Sync，则意味着它可以在多个线程之间共享，而不存在数据争用或其他同步问题的风险，因此将其移动到另一个线程是安全的。对该类型的引用同样可以安全地移动到另一个线程，因为它引用的数据可以从任何线程安全地访问。

示例

`Send + Sync`

你遇到的类型大都属于 Send + Sync：

i8、f32、bool、char、&str…
(T1, T2)、[T; N]、&[T]、struct { x: T }…
String、Option<T>、Vec<T>、Box<T>…
Arc<T>：明确通过原子引用计数实现线程安全。
Mutex<T>：明确通过内部锁定实现线程安全。
mpsc::Sender<T>: As of 1.72.0.
AtomicBool、AtomicU8…：使用特殊的原子指令。

当类型参数为 Send + Sync 时，泛型类型通常为 Send + Sync。

`Send + !Sync`

这些类型可以移动到其他线程，但它们不是线程安全的。这通常是由内部可变性造成的：

mpsc::Receiver<T>
Cell<T>
RefCell<T>

`!Send + Sync`

这些类型是线程安全的，但它们不能移动到另一个线程：

MutexGuard<T: Sync>: Uses OS level primitives which must be deallocated on the thread which created them.

`!Send + !Sync`

这些类型不是线程安全的，不能移动到其他线程：

Rc<T>：每个 Rc<T> 都具有对 RcBox<T> 的引用，其中包含非原子引用计数。
*const T、*mut T：Rust 会假定原始指针可能在并发方面有特殊的注意事项。

共享状态

Rust 使用类型系统来强制同步共享数据。这主要通过两种类型实现：

Arc<T>，对 T 进行原子计数：用于处理线程之间的共享，并负责在最后一个引用被丢弃时取消分配 T。
Mutex<T>：确保对 T 值的互斥访问。

`Arc`

Arc<T> 允许通过 Arc::clone 实现共享只读权限：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

Arc 代表“原子引用计数”，它是使用原子操作的 Rc 的线程安全版本。
Arc<T> implements Clone whether or not T does. It implements Send and Sync if and only if T implements them both.
Arc::clone() 在执行原子操作方面有开销，但在此之后，T 便可随意使用，而没有任何开销。
请警惕引用循环，Arc 不会使用垃圾回收器检测引用循环。
- std::sync::Weak 对此有所帮助。

互斥器（`Mutex`）

Mutex<T> ensures mutual exclusion and allows mutable access to T behind a read-only interface (another form of interior mutability):


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

请注意我们如何设置 impl<T: Send> Sync for Mutex<T> 通用实现。

Speaker Notes

Mutex in Rust looks like a collection with just one element — the protected data.
- 在访问受保护的数据之前不可能忘记获取互斥量。
你可以通过获取锁，从 &Mutex<T> 中获取 &mut T。MutexGuard 能够确保 &mut T 存在的时间不会比持有锁的时间更长。
Mutex<T> implements both Send and Sync iff (if and only if) T implements Send.
A read-write lock counterpart: RwLock.
Why does lock() return a Result?
- 如果持有 Mutex 的线程发生panic，Mutex 便会“中毒”并发出信号，表明其所保护的数据可能处于不一致状态。对中毒的互斥量调用 lock() 将会失败，并将显示 PoisonError。无论如何，你可以对该错误调用 into_inner() 来恢复数据。

示例

让我们看看 Arc 和 Mutex 的实际效果：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

可能有用的解决方案：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

值得注意的部分：

Arc 和 Mutex 中都封装了 v，因为它们的关注点是正交的。
- 将 Mutex 封装在 Arc 中是一种在线程之间共享可变状态的常见模式。
v: Arc<_> 必须先克隆为 v2，然后才能移动到另一个线程中。请注意，lambda 签名中添加了 move。
我们引入了块，以尽可能缩小 LockGuard 的作用域。

习题

现在通过

哲学家用餐示例来练习我们新学习到的并发技巧：该示例是典型的并发问题。
多线程链接检查器：对于大型项目，需要使用 Cargo 下载依赖项，然后并行检查链接。

Speaker Notes

After looking at the exercises, you can look at the solutions provided.

哲学家就餐问题

哲学家用餐示例是一个典型的并发问题：

五位哲学家在同一桌子上用餐。每位哲学家在桌前都有自己的座位。每个盘子之间都有一把叉子。上的菜品是一种意大利面，需要用两把叉子才能吃。每位哲学家只能交替进行思考和用餐。此外，只有当哲学家们同时拿到左边和右边的叉子才能吃这个意大利面。因此，只有当两旁坐着的人在思考，而非在吃面时，他们才能使用两把叉子。每位哲学家吃完饭后，就会放下手中的两把叉子。

在本练习中，需要使用本地 Cargo 安装。将以下代码复制到名为 src/main.rs 的文件中，并填写空白的地方，然后测试 cargo run 不会死锁：


use std::sync::{mpsc, Arc, Mutex};
use std::thread;
use std::time::Duration;

struct Fork;

struct Philosopher {
    name: String,
    // left_fork: ...
    // right_fork: ...
    // thoughts: ...
}

impl Philosopher {
    fn think(&self) {
        self.thoughts
            .send(format!("Eureka! {} has a new idea!", &self.name))
            .unwrap();
    }

    fn eat(&self) {
        // Pick up forks...
        println!("{} is eating...", &self.name);
        thread::sleep(Duration::from_millis(10));
    }
}

static PHILOSOPHERS: &[&str] =
    &["Socrates", "Hypatia", "Plato", "Aristotle", "Pythagoras"];

fn main() {
    // Create forks

    // Create philosophers

    // Make each of them think and eat 100 times

    // Output their thoughts
}

您可以使用以下 Cargo.toml：


[package]
name = "dining-philosophers"
version = "0.1.0"
edition = "2021"

多线程链接检查器

运用掌握的新知识创建一个多线程链接检查工具。应先从网页入手，并检查网页上的链接是否有效。该工具应以递归方式检查同一网域中的其他网页，并且一直执行此操作，直到所有网页都通过验证。

For this, you will need an HTTP client such as reqwest. You will also need a way to find links, we can use scraper. Finally, we’ll need some way of handling errors, we will use thiserror.

Create a new Cargo project and reqwest it as a dependency with:


cargo new link-checker
cd link-checker
cargo add --features blocking,rustls-tls reqwest
cargo add scraper
cargo add thiserror

如果 cargo add 操作失败并显示 error: no such subcommand，请手动修改 Cargo.toml 文件。添加下面列出的依赖项。

cargo add 调用会将 Cargo.toml 文件更新为如下所示：


[package]
name = "link-checker"
version = "0.1.0"
edition = "2021"
publish = false

[dependencies]
reqwest = { version = "0.11.12", features = ["blocking", "rustls-tls"] }
scraper = "0.13.0"
thiserror = "1.0.37"

您现在可以下载初始页了。请尝试使用一个小网站，例如 https://www.google.org/。

您的 src/main.rs 文件应如下所示：


use reqwest::blocking::Client;
use reqwest::Url;
use scraper::{Html, Selector};
use thiserror::Error;

#[derive(Error, Debug)]
enum Error {
    #[error("request error: {0}")]
    ReqwestError(#[from] reqwest::Error),
    #[error("bad http response: {0}")]
    BadResponse(String),
}

#[derive(Debug)]
struct CrawlCommand {
    url: Url,
    extract_links: bool,
}

fn visit_page(client: &Client, command: &CrawlCommand) -> Result<Vec<Url>, Error> {
    println!("Checking {:#}", command.url);
    let response = client.get(command.url.clone()).send()?;
    if !response.status().is_success() {
        return Err(Error::BadResponse(response.status().to_string()));
    }

    let mut link_urls = Vec::new();
    if !command.extract_links {
        return Ok(link_urls);
    }

    let base_url = response.url().to_owned();
    let body_text = response.text()?;
    let document = Html::parse_document(&body_text);

    let selector = Selector::parse("a").unwrap();
    let href_values = document
        .select(&selector)
        .filter_map(|element| element.value().attr("href"));
    for href in href_values {
        match base_url.join(href) {
            Ok(link_url) => {
                link_urls.push(link_url);
            }
            Err(err) => {
                println!("On {base_url:#}: ignored unparsable {href:?}: {err}");
            }
        }
    }
    Ok(link_urls)
}

fn main() {
    let client = Client::new();
    let start_url = Url::parse("https://www.google.org").unwrap();
    let crawl_command = CrawlCommand{ url: start_url, extract_links: true };
    match visit_page(&client, &crawl_command) {
        Ok(links) => println!("Links: {links:#?}"),
        Err(err) => println!("Could not extract links: {err:#}"),
    }
}

使用以下命令运行 src/main.rs 中的代码


cargo run

任务

通过线程并行检查链接：将要检查的网址发送到某个通道，然后使用多个线程并行检查这些网址。
您可以对此进行扩展，以递归方式从 www.google.org 网域的所有网页中提取链接。设置网页上限（例如 100 个），以免被网站屏蔽。

并发编程：上午练习

哲学家就餐问题

(返回练习)


use std::sync::{mpsc, Arc, Mutex};
use std::thread;
use std::time::Duration;

struct Fork;

struct Philosopher {
    name: String,
    left_fork: Arc<Mutex<Fork>>,
    right_fork: Arc<Mutex<Fork>>,
    thoughts: mpsc::SyncSender<String>,
}

impl Philosopher {
    fn think(&self) {
        self.thoughts
            .send(format!("Eureka! {} has a new idea!", &self.name))
            .unwrap();
    }

    fn eat(&self) {
        println!("{} is trying to eat", &self.name);
        let _left = self.left_fork.lock().unwrap();
        let _right = self.right_fork.lock().unwrap();

        println!("{} is eating...", &self.name);
        thread::sleep(Duration::from_millis(10));
    }
}

static PHILOSOPHERS: &[&str] =
    &["Socrates", "Hypatia", "Plato", "Aristotle", "Pythagoras"];

fn main() {
    let (tx, rx) = mpsc::sync_channel(10);

    let forks = (0..PHILOSOPHERS.len())
        .map(|_| Arc::new(Mutex::new(Fork)))
        .collect::<Vec<_>>();

    for i in 0..forks.len() {
        let tx = tx.clone();
        let mut left_fork = Arc::clone(&forks[i]);
        let mut right_fork = Arc::clone(&forks[(i + 1) % forks.len()]);

        // To avoid a deadlock, we have to break the symmetry
        // somewhere. This will swap the forks without deinitializing
        // either of them.
        if i == forks.len() - 1 {
            std::mem::swap(&mut left_fork, &mut right_fork);
        }

        let philosopher = Philosopher {
            name: PHILOSOPHERS[i].to_string(),
            thoughts: tx,
            left_fork,
            right_fork,
        };

        thread::spawn(move || {
            for _ in 0..100 {
                philosopher.eat();
                philosopher.think();
            }
        });
    }

    drop(tx);
    for thought in rx {
        println!("{thought}");
    }
}

Link Checker

(back to exercise)


use std::sync::{mpsc, Arc, Mutex};
use std::thread;

use reqwest::blocking::Client;
use reqwest::Url;
use scraper::{Html, Selector};
use thiserror::Error;

#[derive(Error, Debug)]
enum Error {
    #[error("request error: {0}")]
    ReqwestError(#[from] reqwest::Error),
    #[error("bad http response: {0}")]
    BadResponse(String),
}

#[derive(Debug)]
struct CrawlCommand {
    url: Url,
    extract_links: bool,
}

fn visit_page(client: &Client, command: &CrawlCommand) -> Result<Vec<Url>, Error> {
    println!("Checking {:#}", command.url);
    let response = client.get(command.url.clone()).send()?;
    if !response.status().is_success() {
        return Err(Error::BadResponse(response.status().to_string()));
    }

    let mut link_urls = Vec::new();
    if !command.extract_links {
        return Ok(link_urls);
    }

    let base_url = response.url().to_owned();
    let body_text = response.text()?;
    let document = Html::parse_document(&body_text);

    let selector = Selector::parse("a").unwrap();
    let href_values = document
        .select(&selector)
        .filter_map(|element| element.value().attr("href"));
    for href in href_values {
        match base_url.join(href) {
            Ok(link_url) => {
                link_urls.push(link_url);
            }
            Err(err) => {
                println!("On {base_url:#}: ignored unparsable {href:?}: {err}");
            }
        }
    }
    Ok(link_urls)
}

struct CrawlState {
    domain: String,
    visited_pages: std::collections::HashSet<String>,
}

impl CrawlState {
    fn new(start_url: &Url) -> CrawlState {
        let mut visited_pages = std::collections::HashSet::new();
        visited_pages.insert(start_url.as_str().to_string());
        CrawlState { domain: start_url.domain().unwrap().to_string(), visited_pages }
    }

    /// Determine whether links within the given page should be extracted.
    fn should_extract_links(&self, url: &Url) -> bool {
        let Some(url_domain) = url.domain() else {
            return false;
        };
        url_domain == self.domain
    }

    /// Mark the given page as visited, returning false if it had already
    /// been visited.
    fn mark_visited(&mut self, url: &Url) -> bool {
        self.visited_pages.insert(url.as_str().to_string())
    }
}

type CrawlResult = Result<Vec<Url>, (Url, Error)>;
fn spawn_crawler_threads(
    command_receiver: mpsc::Receiver<CrawlCommand>,
    result_sender: mpsc::Sender<CrawlResult>,
    thread_count: u32,
) {
    let command_receiver = Arc::new(Mutex::new(command_receiver));

    for _ in 0..thread_count {
        let result_sender = result_sender.clone();
        let command_receiver = command_receiver.clone();
        thread::spawn(move || {
            let client = Client::new();
            loop {
                let command_result = {
                    let receiver_guard = command_receiver.lock().unwrap();
                    receiver_guard.recv()
                };
                let Ok(crawl_command) = command_result else {
                    // The sender got dropped. No more commands coming in.
                    break;
                };
                let crawl_result = match visit_page(&client, &crawl_command) {
                    Ok(link_urls) => Ok(link_urls),
                    Err(error) => Err((crawl_command.url, error)),
                };
                result_sender.send(crawl_result).unwrap();
            }
        });
    }
}

fn control_crawl(
    start_url: Url,
    command_sender: mpsc::Sender<CrawlCommand>,
    result_receiver: mpsc::Receiver<CrawlResult>,
) -> Vec<Url> {
    let mut crawl_state = CrawlState::new(&start_url);
    let start_command = CrawlCommand { url: start_url, extract_links: true };
    command_sender.send(start_command).unwrap();
    let mut pending_urls = 1;

    let mut bad_urls = Vec::new();
    while pending_urls > 0 {
        let crawl_result = result_receiver.recv().unwrap();
        pending_urls -= 1;

        match crawl_result {
            Ok(link_urls) => {
                for url in link_urls {
                    if crawl_state.mark_visited(&url) {
                        let extract_links = crawl_state.should_extract_links(&url);
                        let crawl_command = CrawlCommand { url, extract_links };
                        command_sender.send(crawl_command).unwrap();
                        pending_urls += 1;
                    }
                }
            }
            Err((url, error)) => {
                bad_urls.push(url);
                println!("Got crawling error: {:#}", error);
                continue;
            }
        }
    }
    bad_urls
}

fn check_links(start_url: Url) -> Vec<Url> {
    let (result_sender, result_receiver) = mpsc::channel::<CrawlResult>();
    let (command_sender, command_receiver) = mpsc::channel::<CrawlCommand>();
    spawn_crawler_threads(command_receiver, result_sender, 16);
    control_crawl(start_url, command_sender, result_receiver)
}

fn main() {
    let start_url = reqwest::Url::parse("https://www.google.org").unwrap();
    let bad_urls = check_links(start_url);
    println!("Bad URLs: {:#?}", bad_urls);
}

异步 Rust

“异步”是一种并发模型，可以同时执行多个任务。具体做法是逐个执行任务直至阻塞，然后切换到另一项可以继续进行的任务。该模型支持在有限数量的线程上运行更多任务。原因在于，每个任务的开销通常很低，并且操作系统提供了基元来高效识别能够执行的 I/O 任务。

Rust 的异步操作基于 “Futures” 来实现，即表示未来可能完成的工作。系统会对这些 Future 进行 “轮询”，直到显示全部已完成。

由异步运行时对这些 Future 进行轮询，并且有多种不同的运行时可供选择。

比较

Python 的 asyncio 中也有类似的模型。不过，其 Future 类型基于回调的实现方式，而非通过轮询。使用异步 Python 程序需要类似于 Rust 中运行时的 “循环” 进行管理。
JavaScript 的 Promise 与之类似，但同样基于回调的实现方式。语言运行时实现了事件循环，因此许多与 Promise 解析相关的细节被隐藏起来。

`async`/`await`

从高层次上看，异步 Rust 代码与“正常”的顺序代码非常类似：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

关键点：

请注意，这只是一个简单的示例，用于展示语法。其中没有长时间运行的操作或任何真正的并发！
异步调用的返回类型是什么？
- 在 main 中使用 let future: () = async_main(10); 来查看类型。
The “async” keyword is syntactic sugar. The compiler replaces the return type with a future.
你不能将 main 声明为异步函数，除非在编译器中加入额外的指令来告诉它如何使用返回的 future。
You need an executor to run async code. block_on blocks the current thread until the provided future has run to completion.
.await 会异步地等待另一个操作的完成。与 block_on 不同，.await 不会阻塞当前线程。
.await can only be used inside an async function (or block; these are introduced later).

Futures

Future 是一种 trait，由表示尚未完成的操作的对象所实现。可以轮询 Future，并且 poll 会返回一个 Poll 对象。


#![allow(unused)]
fn main() {
use std::pin::Pin;
use std::task::Context;

pub trait Future {
    type Output;
    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Self::Output>;
}

pub enum Poll<T> {
    Ready(T),
    Pending,
}
}

异步函数会返回 impl Future。对于自定义的类型，也可以实现 Future（但不常见）。例如，从 tokio::spawn 返回的 JoinHandle 会实现 Future，以允许加入该任务。

在 Future 中使用 .await 关键字会导致当前异步函数暂停，直到该 Future 准备就绪，然后计算其输出。

Speaker Notes

Future 和 Poll 类型的实现完全如下所示：请点击链接查看文档中的实现。
我们的重点在于编写异步代码，而不是构建新的异步基元，因此不会涉及 Pin 和 Context。简言之：
- 通过 Context，Future 在事件发生时可自行安排重新进行轮询。
- Pin 确保 Future 不会移到内存中，以便指向该 Future 的指针仍然有效。为了确保使用 .await 之后引用依然有效，必须执行此操作。

Runtimes

运行时 支持异步执行操作（即 _反应器），并负责执行 Future （即 _执行器）。Rust 没有 “内置”运行时，但有以下几个选项可供选择：

Tokio：性能出色，拥有成熟的功能生态系统，例如适用于 HTTP 的 Hyper 或适用于 gRPC 的 Tonic。
async-std：旨在成为 “异步编程的标准库”，并在 async::task 中包含基本运行时。
smol：简单且轻量

有些大型应用具有自己的运行时。例如，Fuchsia 已有一个运行时。

Speaker Notes

请注意，在列出的运行时中，Rust Playground 仅支持 Tokio。该 Playground 也不支持任何 I/O 操作，因此大多数有趣的异步操作无法在该平台上运行。
Future 是 “惯性”的，除非有执行程序对其进行轮询，否则它们不会执行任何操作（甚至不会启动 I/O 操作）。这与 JS promise 不同，例如，后者即使从未使用也会完成运行过程。

Tokio

Tokio provides:

用于执行异步代码的多线程运行时。
An asynchronous version of the standard library.
庞大的库生态系统。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

借助 tokio::main 宏，我们现在可以将 main 设为异步函数。
spawn 函数会创建新的并发 “任务”。
注意：spawn 使用 Future 方法实现，而不是对 count_to 调用 .await。

深入探索：

为何 count_to 通常无法计数到 10？这是一个异步取消的示例。tokio::spawn 会返回一个句柄，可以等待该句柄直至其代表的任务执行完毕。
尝试使用 count_to(10).await，而不是派生方法。
尝试等待 tokio::spawn 返回的任务执行完毕。

任务

Rust 有一个任务系统，这是一种轻量级线程处理形式。

每个任务只有一个顶级 Future，执行器会对此进行轮询来推进任务进度。该 Future 可能包含一个或多个嵌套的 Future，可以通过其 poll 方法对它们进行轮询，类似于调用堆栈。可以通过轮询多个子 Future（例如争用定时器和 I/O 操作）在任务内部实现并发操作。


use tokio::io::{self, AsyncReadExt, AsyncWriteExt};
use tokio::net::TcpListener;

#[tokio::main]
async fn main() -> io::Result<()> {
    let listener = TcpListener::bind("127.0.0.1:0").await?;
    println!("listening on port {}", listener.local_addr()?.port());

    loop {
        let (mut socket, addr) = listener.accept().await?;

        println!("connection from {addr:?}");

        tokio::spawn(async move {
            socket.write_all(b"Who are you?\n").await.expect("socket error");

            let mut buf = vec![0; 1024];
            let name_size = socket.read(&mut buf).await.expect("socket error");
            let name = std::str::from_utf8(&buf[..name_size]).unwrap().trim();
            let reply = format!("Thanks for dialing in, {name}!\n");
            socket.write_all(reply.as_bytes()).await.expect("socket error");
        });
    }
}

Speaker Notes

将此示例复制到准备好的 src/main.rs 文件中，并从该文件运行它。

请尝试使用像 nc 或 telnet 这样的 TCP 连接工具进行连接。

让学生想象一下，当连接多个客户端时，示例服务器会达到怎样的状态。存在哪些任务？具有哪些 Future？
This is the first time we’ve seen an async block. This is similar to a closure, but does not take any arguments. Its return value is a Future, similar to an async fn.
Refactor the async block into a function, and improve the error handling using ?.

异步通道

有些 crate 支持使用异步通道。例如 tokio：


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

将通道大小更改为 3，然后看看对操作执行会有什么影响。
总体而言，该接口类似于上午课程中所讲的 sync 通道。
尝试移除 std::mem::drop 调用。会出现什么情况？这是为什么？
Flume crate 包含可以同时实现 sync、async send 和 recv 的渠道，为涉及 IO 和大量 CPU 处理任务的复杂应用提供了极大便利。
使用 async 通道的优势在于，我们能够将它们与其他 future 结合起来，从而创建复杂的控制流。

Futures Control Flow

可以将很多 Future 组合在一起生成并发计算流图。我们已经介绍过用作独立线程的任务类型。

加入

联接操作会等待一组 Future 全部就绪，然后返回它们的结果集合。这类似于 JavaScript 中的 Promise.al 或 Python 中的 asyncio.gather。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

将此示例复制到准备好的 src/main.rs 文件中，并从该文件运行它。

对于多个类型不相交的 Future，可以使用 std::future::join! 进行处理，但必须要确定在编译时 Future 的数量。目前，可在 futures crate 中使用该功能，但很快也会在 std::future 中正式发布。
The risk of join is that one of the futures may never resolve, this would cause your program to stall.
还可以将 join_all 与 join! 结合使用，并行处理所有对 http 服务的请求和数据库查询。尝试使用 futures::join! 将 tokio::time::sleep 添加到 Future 中。这不是一个超时操作（其需要使用 select!，下一章会详细介绍），而是展示了 join! 的使用方式。

选择

选择操作会等待一组 Future 中的任意一个就绪，并对 Future 产生的结果进行响应。在 JavaScript 中，该操作类似于 Promise.race。在 Python 中，它相当于 asyncio.wait(task_set, return_when=asyncio.FIRST_COMPLETED)。

Similar to a match statement, the body of select! has a number of arms, each of the form pattern = future => statement. When a future is ready, its return value is destructured by the pattern. The statement is then run with the resulting variables. The statement result becomes the result of the select! macro.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

在本示例中，猫和狗之间进行了一场比赛。first_animal_to_finish_race 会同时监听这两个通道，并选择最先到达终点的作为胜者。由于狗用时 50 毫秒，而猫用时 500 毫秒，前者在此比赛中大获全胜。
在本示例中，可以使用 oneshot 通道，因为这些通道只能接收一次 send 信号。
尝试为比赛添加截至时间，演示如何选择不同类型的 Future。
请注意，select! 会丢弃不匹配的分支，相对应的 Future 也会随之取消。最简单的方法是，每次执行 select! 时创建新的 Future。
- 另一种方法是传递 &mut future 而不是 future 本身，但这可能会导致问题，在本幻灯片的 “固定” 部分进行了详细介绍。

关于 async/await 的误区

async / await 为处理并发异步编程提供了一种方便高效的抽象方法。然而，Rust 中的 async/await 模型也存在一些误区和隐患。本章将展示其中的部分内容：

阻塞执行器

大多数异步运行时支持并发运行 IO 任务。这意味着 CPU 的阻塞性任务会阻塞执行器，并阻止执行其他任务。最简单的方法是，尽可能使用异步等效方法。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

运行该代码，您会发现休眠操作是连续发生的，而不是并发进行的。
"current_thread" 变种将所有任务放在单个线程上。这样做效果会更明显，但 bug 仍然存在于多线程变种中。
将 std::thread::sleep 切换为 tokio::time::sleep，并等待结果。
另一个修复方案是 tokio::task::spawn_blocking，其会生成实际线程并将句柄转换为 Future，且不会阻塞执行器。
不应将任务视为操作系统线程。它们之间并非一对一的映射关系，并且大多数执行器都支持在单个操作系统线程上运行多个任务。尤其是通过 FFI 与其他库交互时，会更容易出现问题，因为在 FFI 中，因为该库可能依赖于线程本地存储或映射到特定的操作系统线程（例如，CUDA）。在这些情况下，首选 tokio::task::spawn_blocking。
请谨慎使用同步互斥操作。对 .await 一直执行互斥操作能会导致另一个任务阻塞，并且该任务可能与其在同一线程上运行。

`Pin`

Async blocks and functions return types implementing the Future trait. The type returned is the result of a compiler transformation which turns local variables into data stored inside the future.

Some of those variables can hold pointers to other local variables. Because of that, the future should never be moved to a different memory location, as it would invalidate those pointers.

To prevent moving the future type in memory, it can only be polled through a pinned pointer. Pin is a wrapper around a reference that disallows all operations that would move the instance it points to into a different memory location.


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

您可能认为这是执行器模式的一个示例。执行器通常会循环调用 select!。

本部分是对前面几节课的总结，因此请多花时间用心学习。

只是单纯地在 select! 中添加 _ = sleep(Duration::from_millis(100)) => { println!(..) }，该行代码将不会执行任何操作。这是为什么？

请改为在 loop 外部添加包含该 Future 的 timeout_fut：


#![allow(unused)]
fn main() {
let mut timeout_fut = sleep(Duration::from_millis(100));
loop {
    select! {
        ..,
        _ = timeout_fut => { println!(..); },
    }
}
}

这仍然不起作用。根据编译器提示的错误，通过向 select! 中的 timeout_fut 添加 &mut 解决移动问题，然后使用 Box::pin：


#![allow(unused)]
fn main() {
let mut timeout_fut = Box::pin(sleep(Duration::from_millis(100)));
loop {
    select! {
        ..,
        _ = &mut timeout_fut => { println!(..); },
    }
}
}

可以编译这段代码了，但超时过期后，每次迭代都会变为 Poll::Ready（使用混合 Future 有助于解决此问题）。每次超时过期后，通过更新重置 timeout_fut。

Box 在堆上进行分配。在某些情况下，也可以选择使用 std::pin::pin!（最近才正式发布，较旧的代码通常使用 tokio::pin!），但对于重新分配的 Future，使用此功能较为困难。
另一种替代方案是完全不使用 pin，而是生成另一个任务，该任务每隔 100 毫秒就会发送到 oneshot 通道。
Data that contains pointers to itself is called self-referential. Normally, the Rust borrow checker would prevent self-referential data from being moved, as the references cannot outlive the data they point to. However, the code transformation for async blocks and functions is not verified by the borrow checker.
Pin is a wrapper around a reference. An object cannot be moved from its place using a pinned pointer. However, it can still be moved through an unpinned pointer.
The poll method of the Future trait uses Pin<&mut Self> instead of &mut Self to refer to the instance. That’s why it can only be called on a pinned pointer.

异步特征

Async methods in traits are were stabilized only recently, in the 1.75 release. This required support for using return-position impl Trait (RPIT) in traits, as the desugaring for async fn includes -> impl Future<Output = ...>.

However, even with the native support today there are some pitfalls around async fn and RPIT in traits:

Return-position impl Trait captures all in-scope lifetimes (so some patterns of borrowing cannot be expressed)
Traits whose methods use return-position impl trait or async are not dyn compatible.

If we do need dyn support, the crate async_trait provides a workaround through a macro, with some caveats:


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

async_trait 易于使用，但请注意，它通过堆分配来实现这一点。这种堆分配会产生性能开销。
对于 async trait 的语言支持中的挑战是深入 Rust的，并且可能不值得深入描述。如果您对深入了解感兴趣，Niko Matsakis 在这篇文章中对它们做了很好的解释。
尝试创建一个新的 sleeper 结构，使其随机休眠一段时间，并将其添加到 Vec 中。

消除

丢弃 Future 意味着无法再对其进行轮询。这称为取消，在任何 await 点都可能发生。请务必小心谨慎，确保即使 Future 任务被取消，系统也能正常运行。例如，系统不应死锁或丢失数据。


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX

Speaker Notes

编译器无法确保取消操作的安全性。您需要阅读 API 文档，并考虑 async fn 所持状态。
与 panic 和 ? 不同，取消属于正常控制流的一部分（而非错误处理）。

该示例丢失了字符串的某些部分。

每当 tick() 分支先完成操作时，next() 及其 buf` 均会被丢弃。

通过将 buf 整合到结构体中，`LinesReader 可以确保取消操作的安全性：


#![allow(unused)]
fn main() {
struct LinesReader {
    stream: DuplexStream,
    bytes: Vec<u8>,
    buf: [u8; 1],
}

impl LinesReader {
    fn new(stream: DuplexStream) -> Self {
        Self { stream, bytes: Vec::new(), buf: [0] }
    }
    async fn next(&mut self) -> io::Result<Option<String>> {
        // prefix buf and bytes with self.
        // ...
        let raw = std::mem::take(&mut self.bytes);
        let s = String::from_utf8(raw)
        // ...
    }
}
}

Interval::tick is cancellation-safe because it keeps track of whether a tick has been ‘delivered’.
AsyncReadExt::read is cancellation-safe because it either returns or doesn’t read data.
AsyncBufReadExt::read_line is similar to the example and isn’t cancellation-safe. See its documentation for details and alternatives.

习题

为了练习您的异步 Rust 技能，我们再次为您提供了两个练习：

哲学家进餐：我们已经在上午看到了这个问题。这次你将使用异步 Rust 来实现它。
广播聊天应用：这是一个更大的项目，允许您尝试更高级的异步Rust功能。

Speaker Notes

After looking at the exercises, you can look at the solutions provided.

Dining Philosophers — Async

查看哲学家进餐以获取问题的描述。

As before, you will need a local Cargo installation for this exercise. Copy the code below to a file called src/main.rs, fill out the blanks, and test that cargo run does not deadlock:


use std::sync::Arc;
use tokio::sync::mpsc::{self, Sender};
use tokio::sync::Mutex;
use tokio::time;

struct Fork;

struct Philosopher {
    name: String,
    // left_fork: ...
    // right_fork: ...
    // thoughts: ...
}

impl Philosopher {
    async fn think(&self) {
        self.thoughts
            .send(format!("Eureka! {} has a new idea!", &self.name))
            .await
            .unwrap();
    }

    async fn eat(&self) {
        // Keep trying until we have both forks
        println!("{} is eating...", &self.name);
        time::sleep(time::Duration::from_millis(5)).await;
    }
}

static PHILOSOPHERS: &[&str] =
    &["Socrates", "Hypatia", "Plato", "Aristotle", "Pythagoras"];

#[tokio::main]
async fn main() {
    // Create forks

    // Create philosophers

    // Make them think and eat

    // Output their thoughts
}

因为这次您正在使用异步Rust，您将需要一个 tokio 依赖。您可以使用以下的 Cargo.toml：


[package]
name = "dining-philosophers-async-dine"
version = "0.1.0"
edition = "2021"

[dependencies]
tokio = { version = "1.26.0", features = ["sync", "time", "macros", "rt-multi-thread"] }

另外，请注意，这次您必须使用来自 tokio 包的 Mutex 和 mpsc 模块。

Speaker Notes

Can you make your implementation single-threaded?

广播聊天应用

在本练习中，我们想要使用我们的新知识来实现一个广播聊天应用。我们有一个聊天服务器，客户端连接到该服务器并发布他们的消息。客户端从标准输入读取用户消息，并将其发送到服务器。聊天服务器将收到的每条消息广播给所有客户端。

For this, we use a broadcast channel on the server, and tokio_websockets for the communication between the client and the server.

创建一个新的 Cargo 项目并添加以下依赖：

Cargo.toml:


[package]
name = "chat-async"
version = "0.1.0"
edition = "2021"

[dependencies]
futures-util = { version = "0.3.30", features = ["sink"] }
http = "1.1.0"
tokio = { version = "1.37.0", features = ["full"] }
tokio-websockets = { version = "0.7.0", features = ["client", "fastrand", "server", "sha1_smol"] }

所需的API

You are going to need the following functions from tokio and tokio_websockets. Spend a few minutes to familiarize yourself with the API.

StreamExt::next() implemented by WebSocketStream: for asynchronously reading messages from a Websocket Stream.
SinkExt::send() implemented by WebSocketStream: for asynchronously sending messages on a Websocket Stream.
Lines::next_line(): for asynchronously reading user messages from the standard input.
Sender::subscribe()：用于订阅广播频道。

两个可执行文件

Normally in a Cargo project, you can have only one binary, and one src/main.rs file. In this project, we need two binaries. One for the client, and one for the server. You could potentially make them two separate Cargo projects, but we are going to put them in a single Cargo project with two binaries. For this to work, the client and the server code should go under src/bin (see the documentation).

Copy the following server and client code into src/bin/server.rs and src/bin/client.rs, respectively. Your task is to complete these files as described below.

src/bin/server.rs:


use futures_util::sink::SinkExt;
use futures_util::stream::StreamExt;
use std::error::Error;
use std::net::SocketAddr;
use tokio::net::{TcpListener, TcpStream};
use tokio::sync::broadcast::{channel, Sender};
use tokio_websockets::{Message, ServerBuilder, WebSocketStream};

async fn handle_connection(
    addr: SocketAddr,
    mut ws_stream: WebSocketStream<TcpStream>,
    bcast_tx: Sender<String>,
) -> Result<(), Box<dyn Error + Send + Sync>> {

    // TODO: For a hint, see the description of the task below.

}

#[tokio::main]
async fn main() -> Result<(), Box<dyn Error + Send + Sync>> {
    let (bcast_tx, _) = channel(16);

    let listener = TcpListener::bind("127.0.0.1:2000").await?;
    println!("listening on port 2000");

    loop {
        let (socket, addr) = listener.accept().await?;
        println!("New connection from {addr:?}");
        let bcast_tx = bcast_tx.clone();
        tokio::spawn(async move {
            // Wrap the raw TCP stream into a websocket.
            let ws_stream = ServerBuilder::new().accept(socket).await?;

            handle_connection(addr, ws_stream, bcast_tx).await
        });
    }
}

src/bin/client.rs:


use futures_util::stream::StreamExt;
use futures_util::SinkExt;
use http::Uri;
use tokio::io::{AsyncBufReadExt, BufReader};
use tokio_websockets::{ClientBuilder, Message};

#[tokio::main]
async fn main() -> Result<(), tokio_websockets::Error> {
    let (mut ws_stream, _) =
        ClientBuilder::from_uri(Uri::from_static("ws://127.0.0.1:2000"))
            .connect()
            .await?;

    let stdin = tokio::io::stdin();
    let mut stdin = BufReader::new(stdin).lines();


    // TODO: For a hint, see the description of the task below.

}

运行可执行文件

Run the server with:


cargo run --bin server

and the client with:


cargo run --bin client

任务

在 src/bin/server.rs 中实现 handle_connection 函数。
- 提示：使用 tokio::select! 在一个连续的循环中并发执行两个任务。一个任务从客户端接收消息并广播它们。另一个任务将服务器接收到的消息发送给客户端。
完成 src/bin/client.rs 中的 main 函数。
- Hint: As before, use tokio::select! in a continuous loop for concurrently performing two tasks: (1) reading user messages from standard input and sending them to the server, and (2) receiving messages from the server, and displaying them for the user.
Optional: Once you are done, change the code to broadcast messages to all clients, but the sender of the message.

并发编程：下午练习

Dining Philosophers — Async

(返回练习)


use std::sync::Arc;
use tokio::sync::mpsc::{self, Sender};
use tokio::sync::Mutex;
use tokio::time;

struct Fork;

struct Philosopher {
    name: String,
    left_fork: Arc<Mutex<Fork>>,
    right_fork: Arc<Mutex<Fork>>,
    thoughts: Sender<String>,
}

impl Philosopher {
    async fn think(&self) {
        self.thoughts
            .send(format!("Eureka! {} has a new idea!", &self.name))
            .await
            .unwrap();
    }

    async fn eat(&self) {
        // Keep trying until we have both forks
        let (_left_fork, _right_fork) = loop {
            // Pick up forks...
            let left_fork = self.left_fork.try_lock();
            let right_fork = self.right_fork.try_lock();
            let Ok(left_fork) = left_fork else {
                // If we didn't get the left fork, drop the right fork if we
                // have it and let other tasks make progress.
                drop(right_fork);
                time::sleep(time::Duration::from_millis(1)).await;
                continue;
            };
            let Ok(right_fork) = right_fork else {
                // If we didn't get the right fork, drop the left fork and let
                // other tasks make progress.
                drop(left_fork);
                time::sleep(time::Duration::from_millis(1)).await;
                continue;
            };
            break (left_fork, right_fork);
        };

        println!("{} is eating...", &self.name);
        time::sleep(time::Duration::from_millis(5)).await;

        // The locks are dropped here
    }
}

static PHILOSOPHERS: &[&str] =
    &["Socrates", "Hypatia", "Plato", "Aristotle", "Pythagoras"];

#[tokio::main]
async fn main() {
    // Create forks
    let mut forks = vec![];
    (0..PHILOSOPHERS.len()).for_each(|_| forks.push(Arc::new(Mutex::new(Fork))));

    // Create philosophers
    let (philosophers, mut rx) = {
        let mut philosophers = vec![];
        let (tx, rx) = mpsc::channel(10);
        for (i, name) in PHILOSOPHERS.iter().enumerate() {
            let left_fork = Arc::clone(&forks[i]);
            let right_fork = Arc::clone(&forks[(i + 1) % PHILOSOPHERS.len()]);
            philosophers.push(Philosopher {
                name: name.to_string(),
                left_fork,
                right_fork,
                thoughts: tx.clone(),
            });
        }
        (philosophers, rx)
        // tx is dropped here, so we don't need to explicitly drop it later
    };

    // Make them think and eat
    for phil in philosophers {
        tokio::spawn(async move {
            for _ in 0..100 {
                phil.think().await;
                phil.eat().await;
            }
        });
    }

    // Output their thoughts
    while let Some(thought) = rx.recv().await {
        println!("Here is a thought: {thought}");
    }
}

广播聊天应用

(返回练习)

src/bin/server.rs:


use futures_util::sink::SinkExt;
use futures_util::stream::StreamExt;
use std::error::Error;
use std::net::SocketAddr;
use tokio::net::{TcpListener, TcpStream};
use tokio::sync::broadcast::{channel, Sender};
use tokio_websockets::{Message, ServerBuilder, WebSocketStream};

async fn handle_connection(
    addr: SocketAddr,
    mut ws_stream: WebSocketStream<TcpStream>,
    bcast_tx: Sender<String>,
) -> Result<(), Box<dyn Error + Send + Sync>> {

    ws_stream
        .send(Message::text("Welcome to chat! Type a message".to_string()))
        .await?;
    let mut bcast_rx = bcast_tx.subscribe();

    // A continuous loop for concurrently performing two tasks: (1) receiving
    // messages from `ws_stream` and broadcasting them, and (2) receiving
    // messages on `bcast_rx` and sending them to the client.
    loop {
        tokio::select! {
            incoming = ws_stream.next() => {
                match incoming {
                    Some(Ok(msg)) => {
                        if let Some(text) = msg.as_text() {
                            println!("From client {addr:?} {text:?}");
                            bcast_tx.send(text.into())?;
                        }
                    }
                    Some(Err(err)) => return Err(err.into()),
                    None => return Ok(()),
                }
            }
            msg = bcast_rx.recv() => {
                ws_stream.send(Message::text(msg?)).await?;
            }
        }
    }
}

#[tokio::main]
async fn main() -> Result<(), Box<dyn Error + Send + Sync>> {
    let (bcast_tx, _) = channel(16);

    let listener = TcpListener::bind("127.0.0.1:2000").await?;
    println!("listening on port 2000");

    loop {
        let (socket, addr) = listener.accept().await?;
        println!("New connection from {addr:?}");
        let bcast_tx = bcast_tx.clone();
        tokio::spawn(async move {
            // Wrap the raw TCP stream into a websocket.
            let ws_stream = ServerBuilder::new().accept(socket).await?;

            handle_connection(addr, ws_stream, bcast_tx).await
        });
    }
}

src/bin/client.rs:


use futures_util::stream::StreamExt;
use futures_util::SinkExt;
use http::Uri;
use tokio::io::{AsyncBufReadExt, BufReader};
use tokio_websockets::{ClientBuilder, Message};

#[tokio::main]
async fn main() -> Result<(), tokio_websockets::Error> {
    let (mut ws_stream, _) =
        ClientBuilder::from_uri(Uri::from_static("ws://127.0.0.1:2000"))
            .connect()
            .await?;

    let stdin = tokio::io::stdin();
    let mut stdin = BufReader::new(stdin).lines();

    // Continuous loop for concurrently sending and receiving messages.
    loop {
        tokio::select! {
            incoming = ws_stream.next() => {
                match incoming {
                    Some(Ok(msg)) => {
                        if let Some(text) = msg.as_text() {
                            println!("From server: {}", text);
                        }
                    },
                    Some(Err(err)) => return Err(err.into()),
                    None => return Ok(()),
                }
            }
            res = stdin.next_line() => {
                match res {
                    Ok(None) => return Ok(()),
                    Ok(Some(line)) => ws_stream.send(Message::text(line.to_string())).await?,
                    Err(err) => return Err(err.into()),
                }
            }

        }
    }
}

谢谢！

Thank you for taking Comprehensive Rust 🦀! We hope you enjoyed it and that it was useful.

组织这门课程让我们收获了很多乐趣。本课程并非完美无缺，因此，如果您发现任何错误或有任何改进建议，请在 GitHub 上与我们联系。我们期待收到您的宝贵意见。

词汇表

本页面的词汇表提供了许多 Rust 术语的简要定义。同时提供翻译版本和英语原文的对应。

分配（allocate）：
在堆上进行动态内存分配。
参数（argument）：
传入某个函数或方法中的信息。
裸机 Rust（Bare-metal Rust）：
底层 Rust 开发方式，通常部署于没有操作系统的系统。请参阅裸机 Rust。
代码块（block）：
请参阅代码块和 作用域。
借用（borrow）：
请参阅借用。
借用检查器（borrow checker）：
Rust 编译器的一部分，用于检查所有借用操作是否有效。
大括号（brace）：
{ 和 }。也称为 花括号，用于分隔 代码块。
构建（build）：
将源代码转换为可执行代码或可用程序的过程。
调用（call）：
调用或执行某个函数或方法。
通道（channel）：
用于安全地在线程之间传递消息。
Comprehensive Rust 🦀：
本课程统称为 Comprehensive Rust 🦀。
并发（concurrency）：
同时执行多个任务或进程。
Rust 中的并发（Concurrency in Rust）：
请参阅 Rust 中的并发.
常量（constant）：
在程序执行期间不会改变的值。
控制流（control flow）：
程序中各个语句或指令的执行顺序。
崩溃（crash）：
程序出现意外的、未处理的故障或终止。
枚举（enumeration）：
一种用于保存多个已命名常量中的一个的数据类型，可能还有一个相关的元组或结构体。
错误（error）：
与预期行为存在偏差的意外情况或结果。
错误处理（error handling）：
对程序执行期间发生的错误进行管理和响应的过程。
练习（exercise）：
专为练习和测试编程技能而设计的任务或问题。
函数（function）：
用于执行特定任务且可重复使用的代码块。
垃圾回收器（garbage collector）：
一种自动释放不再使用的对象所占内存的机制。
泛型（generics）：
这项功能支持使用类型占位符编写代码，支持对不同数据类型的代码进行重复使用。
不可变（immutable）：
创建后无法再进行更改。
集成测试（integration test）：
一种验证系统不同部分或组件之间交互的测试类型。
关键字（keyword）：
编程语言中的保留字，具有特定含义且不能用作标识符。
库（library）：
程序可以使用的一组预编译例程或代码。
宏（macro）：
Rust 宏可通过名称中的 ! 符号识别。当普通函数无法满足需求时，可以使用宏。一个典型示例是 format!，其接受可变数量的参数，但 Rust 函数不支持这种类型。
main 函数（main function）：
Rust 程序从 main 函数开始执行。
匹配（match）：
Rust 中的控制流结构，允许对表达式的值进行模式匹配。
内存泄漏（memory leak）：
程序无法释放不再不要的内存的情况，会导致内存用量不断增加。
方法（method）：
与 Rust 中的某个对象或类型相关联的函数。
模块（module）：
Rust 中用于归纳整理代码的命名空间，其中包含函数、类型或特性等定义。
移动（move）：
在 Rust 中，将值的所有权从一个变量转移到另一个变量。
可变（mutable）：
Rust 中的一个属性，支持在声明变量后对其进行修改。
所有权（ownership）：
Rust 中的概念，用于定义代码中的哪一部分负责管理与值关联的内存。
panic：
Rust 中导致程序终止且不可恢复的错误情况。
参数（parameter）：
在调用函数或方法时传入函数或方法的值。
模式（pattern）：
Rust 中可与表达式匹配的值、字面量或结构的组合。
载荷（payload）：
消息、事件或数据结构所携带的数据或信息。
程序（program）：
计算机为执行特定任务或解决特定问题而执行的一组指令。
编程语言（programming language）：
用于向计算机传递指令的正式系统，例如 Rust。
接收器（receiver）：
Rust 方法中的首个参数，表示正在调用该方法的实例。
引用计数（reference counting）：
一种内存管理方法，可以跟踪某个对象的引用数量，并在计数为零时释放该对象。
返回（return）：
Rust 中的一个关键字，用于表示从函数返回的值。
Rust：
一种系统编程语言，专注于安全性、性能和并发性。
Rust 基础（Rust Fundamentals）：
本课程第 1 天到第 4 天的内容。
Android 中的 Rust（Rust in Android）：
请参阅 Android 中的 Rust.
Chromium 中的 Rust（Rust in Chromium）:
请参阅 Chromium 中的 Rust.
安全（safe）：
指代码遵循 Rust 的所有权和借用规则，以防止出现与内存相关的错误。
作用域（scope）：
程序中变量有效且可使用的区域。
标准库（standard library）：
Rust 中提供基本功能的一系列模块。
静态（static）：
Rust 中的关键字，用于定义具有 'static 生命周期的静态变量或项。
字符串（string）：
一种存储文本数据的数据类型。如需了解详情，请参阅 String 与 str。
结构体（struct）：
Rust 中的复合数据类型，可将不同类型的变量归到同一名称下。
测试（test）：
Rust 中的模块，其中包含用于测试其他函数是否正确的函数。
线程（thread）：
程序中的单独执行顺序，支持并发执行。
线程安全（thread safety）：
一种程序属性，用于确保多线程环境中的行为正确无误。
特征（trait）：
用于定义未知类型的一系列方法，为在 Rust 中实现多态性提供了方法。
特征约束（trait bound）:
一种可以要求类型实现一些感兴趣的特性的抽象。
元组（tuple）：
包含不同类型变量的复合数据类型。元组的字段没有名称，需要通过序号访问。
类型（type）：
一种分类方式，用于指定可以对 Rust 中特定类型的值执行哪些操作。
类型推导（type inference）：
Rust 编译器能够推断变量或表达式的类型。
未定义行为（undefined behavior）：
Rust 中未指定结果的操作或条件，通常会导致不可预测的程序行为。
联合体（union）：
一种数据类型，可以存储不同类型的值，但一次只能保存一个值。
单元测试（unit test）：
Rust 内置了运行小型单元测试和大型集成测试的支持功能。请参阅单元测试。
单元类型（unit type）:
不保存数据的类型，写为没有成员的元组。
不安全（unsafe）：
Rust 的子集，允许触发 未定义行为。请参阅不安全 Rust。
变量（variable）：
用于存储数据的内存位置。变量在 作用域 内有效。

其他 Rust 资源

Rust 社区已经创造了丰富的高质量免费资源在线提供。

官方文档

Rust 项目提供了许多资源。这些资源涵盖了 Rust 的一般内容：

Rust 程序设计语言：一部有关 Rust 的免费权威图书。书中详细介绍了该语言，并包含一些可供读者构建的项目。
通过例子学 Rust：通过一系列展示不同结构的示例介绍 Rust 语法。有时会包括一些小练习，会要求您充分地阐述示例中的代码。
Rust 标准库：Rust 标准库的完整文档。
Rust 参考手册：一本未完成的书，介绍了 Rust 语法和内存模型。

Rust 官方网站上有更多专业指南：

Rust 秘典：介绍了不安全 Rust，包括使用原始指针以及与其他语言 (FFI) 交互。
Rust 中的异步编程：介绍了在《Rust 程序设计语言》成书后引入的新异步编程模型。
嵌入式 Rust 之书：介绍如何在没有操作系统的嵌入式设备上使用 Rust。

非官方学习资料

其他 Rust 指南和教程的小选集：

Learn Rust the Dangerous Way（以危险的方式学 Rust）：从低级 C 语言程序员的角度介绍 Rust。
面向嵌入式 C 程序员的 Rust：从使用 C 语言编写固件的开发者的角度介绍 Rust。
Rust for professionals（面向专业人士的 Rust）：通过与其他语言（例如 C、C++、Java、JavaScript 和 Python）进行并排比较，介绍 Rust 的语法。
Rust on Exercism（在 Exercism 上学 Rust）：100 多项练习助您学习 Rust。
Ferrous Teaching Material：一系列小演示文稿，涵盖 Rust 语言的基础知识和高级部分。还涵盖了 WebAssembly 和 async/await 等其他主题。
面向 Rust 的初学者系列和使用 Rust 迈出第一步：两个面向新手开发者的 Rust 指南。第一个指南包含 35 个视频，第二个指南包含 11 个模块，内容涵盖 Rust 语法和基本结构。
通过大量的链表学习Rust：通过实现几种不同类型的列表结构，深入探索 Rust 的内存管理规则。

如需更多 Rust 图书，请查看 Rust 小册。

鸣谢

本课中的资料以众多优秀的 Rust 文档资源为基础。如需查看实用资源的完整列表，请参阅关于其他资源的页面。

The material of Comprehensive Rust is licensed under the terms of the Apache 2.0 license, please see LICENSE for details.