4.5. IO 单子🔗

IO 作为单子可以从两个角度理解，这在运行程序一节中进行了描述。每个角度都可以帮助理解 IO 的 pure 和 bind 的含义。

从第一个视角看，IO 活动是 Lean 运行时系统的指令。例如，指令可能是 “从该文件描述符读取字符串，然后使用该字符串重新调用纯 Lean 代码”。这是一种外部的视角，即从操作系统的视角看待程序。在这种情况下，pure 是一个不请求 RTS 产生任何作用的 IO 活动，而 bind 指示 RTS 首先执行一个产生潜在作用的操作，然后使用结果值调用程序的其余部分。

从第二个视角看，IO 活动会变换整个世界。IO 活动实际上是纯（Pure）的，因为它接受一个唯一的世界作为参数，然后返回改变后的世界。这是一种内部的视角，它对应了 IO 在 Lean 中的表示方式。世界在 Lean 中表示为一个标记，而 IO 单子的结构化可以确保标记刚好使用一次。

为了了解其工作原理，逐层解析它的定义会很有帮助。 #print 命令揭示了 Lean 数据类型和定义的内部结构。例如：

inductive Nat : Type
number of parameters: 0
constructors:
Nat.zero : Nat
Nat.succ : Nat → Nat#print Nat

的结果为

inductive Nat : Type
number of parameters: 0
constructors:
Nat.zero : Nat
Nat.succ : Nat → Nat

而

def Char.isAlpha : Char → Bool :=
fun c => c.isUpper || c.isLower#print Char.isAlpha

的结果为

def Char.isAlpha : Char → Bool :=
fun c => c.isUpper || c.isLower

有时，#print 的输出包含了本书中尚未展示的 Lean 特性。例如：

def List.isEmpty.{u} : {α : Type u} → List α → Bool :=
fun {α} x =>
  match x with
  | [] => true
  | head :: tail => false#print List.isEmpty

会产生

def List.isEmpty.{u} : {α : Type u} → List α → Bool :=
fun {α} x =>
  match x with
  | [] => true
  | head :: tail => false

它在定义名的后面包含了一个 .{u}，并将类型标注为 Type u 而非只是 Type。目前可以安全地忽略它。

打印 IO 的定义表明它是根据更简单的结构定义的：

@[reducible] def IO : Type → Type :=
EIO IO.Error#print IO

@[reducible] def IO : Type → Type :=
EIO IO.Error

IO.Error 表示 IO 活动可能抛出的所有错误：

inductive IO.Error : Type
number of parameters: 0
constructors:
IO.Error.alreadyExists : Option String → UInt32 → String → IO.Error
IO.Error.otherError : UInt32 → String → IO.Error
IO.Error.resourceBusy : UInt32 → String → IO.Error
IO.Error.resourceVanished : UInt32 → String → IO.Error
IO.Error.unsupportedOperation : UInt32 → String → IO.Error
IO.Error.hardwareFault : UInt32 → String → IO.Error
IO.Error.unsatisfiedConstraints : UInt32 → String → IO.Error
IO.Error.illegalOperation : UInt32 → String → IO.Error
IO.Error.protocolError : UInt32 → String → IO.Error
IO.Error.timeExpired : UInt32 → String → IO.Error
IO.Error.interrupted : String → UInt32 → String → IO.Error
IO.Error.noFileOrDirectory : String → UInt32 → String → IO.Error
IO.Error.invalidArgument : Option String → UInt32 → String → IO.Error
IO.Error.permissionDenied : Option String → UInt32 → String → IO.Error
IO.Error.resourceExhausted : Option String → UInt32 → String → IO.Error
IO.Error.inappropriateType : Option String → UInt32 → String → IO.Error
IO.Error.noSuchThing : Option String → UInt32 → String → IO.Error
IO.Error.unexpectedEof : IO.Error
IO.Error.userError : String → IO.Error#print IO.Error

inductive IO.Error : Type
number of parameters: 0
constructors:
IO.Error.alreadyExists : Option String → UInt32 → String → IO.Error
IO.Error.otherError : UInt32 → String → IO.Error
IO.Error.resourceBusy : UInt32 → String → IO.Error
IO.Error.resourceVanished : UInt32 → String → IO.Error
IO.Error.unsupportedOperation : UInt32 → String → IO.Error
IO.Error.hardwareFault : UInt32 → String → IO.Error
IO.Error.unsatisfiedConstraints : UInt32 → String → IO.Error
IO.Error.illegalOperation : UInt32 → String → IO.Error
IO.Error.protocolError : UInt32 → String → IO.Error
IO.Error.timeExpired : UInt32 → String → IO.Error
IO.Error.interrupted : String → UInt32 → String → IO.Error
IO.Error.noFileOrDirectory : String → UInt32 → String → IO.Error
IO.Error.invalidArgument : Option String → UInt32 → String → IO.Error
IO.Error.permissionDenied : Option String → UInt32 → String → IO.Error
IO.Error.resourceExhausted : Option String → UInt32 → String → IO.Error
IO.Error.inappropriateType : Option String → UInt32 → String → IO.Error
IO.Error.noSuchThing : Option String → UInt32 → String → IO.Error
IO.Error.unexpectedEof : IO.Error
IO.Error.userError : String → IO.Error

EIO ε α 表示一个 IO 活动，它将以类型为 ε 的错误表示终止，或者以类型为 α 的值表示成功。这意味着，与 Except ε 单子一样，IO 单子也包括定义错误处理和异常的能力。

剥离另一层，EIO 本身又是根据更简单的结构定义的：

def EIO : Type → Type → Type :=
fun ε => EStateM ε IO.RealWorld#print EIO

def EIO : Type → Type → Type :=
fun ε => EStateM ε IO.RealWorld

EStateM 单子同时包括错误和状态——它是 Except 和 State 的组合。它使用另一个类型 EStateM.Result 定义：

def EStateM.{u} : Type u → Type u → Type u → Type u :=
fun ε σ α => σ → EStateM.Result ε σ α#print EStateM

def EStateM.{u} : Type u → Type u → Type u → Type u :=
fun ε σ α => σ → EStateM.Result ε σ α

换句话说，类型为 EStateM ε σ α 的程序是一个函数，它接受类型为 σ 的初始状态并返回一个 EStateM.Result ε σ α。

EStateM.Result 与 Except 的定义非常相似，一个构造子表示成功终止，另一个构造子表示错误：

inductive EStateM.Result.{u} : Type u → Type u → Type u → Type u
number of parameters: 3
constructors:
EStateM.Result.ok : {ε σ α : Type u} → α → σ → EStateM.Result ε σ α
EStateM.Result.error : {ε σ α : Type u} → ε → σ → EStateM.Result ε σ α#print EStateM.Result

inductive EStateM.Result.{u} : Type u → Type u → Type u → Type u
number of parameters: 3
constructors:
EStateM.Result.ok : {ε σ α : Type u} → α → σ → EStateM.Result ε σ α
EStateM.Result.error : {ε σ α : Type u} → ε → σ → EStateM.Result ε σ α

就像 Except ε α 一样，ok 构造子包含类型为 α 的结果， error 构造子包含类型为 ε 的异常。与 Except 不同，这两个构造子都有一个附加的状态字段，其中包含计算的最终状态。

EStateM ε σ 的 Monad 实例需要 pure 和 bind。与 State 一样，EStateM 的 pure 实现接受一个初始状态并将其返回而不改变，并且与 Except 一样，它在 ok 构造子中返回其参数：

protected def EStateM.pure.{u} : {ε σ α : Type u} → α → EStateM ε σ α :=
fun {ε σ α} a s => EStateM.Result.ok a s#print EStateM.pure

protected def EStateM.pure.{u} : {ε σ α : Type u} → α → EStateM ε σ α :=
fun {ε σ α} a s => EStateM.Result.ok a s

protected 意味着即使打开了 EStateM 命名空间，也需要完整的名称 EStateM.pure。

类似地，EStateM 的 bind 将初始状态作为参数。它将此初始状态传递给其第一个操作。与 Except 的 bind 一样，它然后检查结果是否为错误。如果是，则错误将保持不变，并且 bind 的第二个参数保持未使用。如果结果成功，则将第二个参数应用于返回值和结果状态。

protected def EStateM.bind.{u} : {ε σ α β : Type u} → EStateM ε σ α → (α → EStateM ε σ β) → EStateM ε σ β :=
fun {ε σ α β} x f s =>
  match x s with
  | EStateM.Result.ok a s => f a s
  | EStateM.Result.error e s => EStateM.Result.error e s#print EStateM.bind

protected def EStateM.bind.{u} : {ε σ α β : Type u} → EStateM ε σ α → (α → EStateM ε σ β) → EStateM ε σ β :=
fun {ε σ α β} x f s =>
  match x s with
  | EStateM.Result.ok a s => f a s
  | EStateM.Result.error e s => EStateM.Result.error e s

综上所述，IO 是同时跟踪状态和错误的单子。可用错误的集合由数据类型 IO.Error 给出，该数据类型具有描述程序中可能出错的许多情况的构造子。状态是一种表示现实世界的类型，称为 IO.RealWorld。每个基本的 IO 活动都会接收这个现实世界并返回另一个，与错误或结果配对。在 IO 中，pure 返回未更改的世界，而 bind 将修改后的世界从一个活动传递到下一个活动。

由于计算机内存无法容纳整个宇宙，因此传递的世界仅仅是一种表示。只要不重复使用世界标记，该表示就是安全的。这意味着世界标记根本不需要包含任何数据：

def IO.RealWorld : Type :=
Unit#print IO.RealWorld

def IO.RealWorld : Type :=
Unit