隐含的符号约定

Terence Tao 2014-07-29

隐含的符号约定

[转载自 2010 年 3 月 20 日的一篇 Google Buzz 文章。]

与任何其他人类语言一样,数学符号有许多隐含的约定,这些约定通常不会在语言的正式描述中明确说明。这些约定通过传达数学句子形式逻辑内容之外的额外上下文数据,起到了有益的作用。

一个很好的例子是变量的命名约定。虽然原则上任何符号都可以用于一种变量类型,但实际上单个符号具有预先存在的内涵,使得将它们分配给特定的变量类型更加自然。例如,人们通常表示:

  • x 表示实数
  • z 表示复数
  • n 表示自然数

一个涉及复数 x、自然数 z 和实数 n 的数学论证读起来会非常奇怪。这方面最著名的例子可能是在分析中使用符号 ε;一个涉及非常大或负的 ε 量的分析论证会引起大量不必要的认知失调。相比之下,通过坚持每个符号所扮演的传统角色,论证的符号结构得到加强,更容易记忆;暂时忘记了论证中 z 定义的读者至少可以猜测它应该是一个复数,这有助于回忆起实际的定义是什么。

作为分析中的另一个例子,当陈述不等式如 f(x) ≤ C|S| ≤ N 时,习惯上左侧代表希望控制的”未知”量,右侧代表更能控制的”已知”量;因此,例如 f(x) ≤ C 优于 C ≥ f(x),尽管这两个陈述在逻辑上是等价的。这就是为什么分析学家对”上界”和”下界”做出重要区分的原因;两者并不对称,因为在两种情况下,都是用已知量来界定未知量。(分析中另一个相关的约定是,最好界定非负量而不是非正量,因此例如 f(x) ≥ 0 优于 -f(x) ≤ 0。)

继续上面的例子,如果已知界 C 本身是几个项的和,例如 C = A + B,那么习惯上将”主”项放在前面,“误差”项放在后面;因此例如 f(x) ≤ A + B 优于 f(x) ≤ B + A。通过遵守这个标准约定,可以传达哪些项被视为主项、哪些项被视为误差项的有用信息。

当然,在分析或其他数学领域中还有许多其他隐含的约定;不幸的是,没有这些约定的详尽列表,人们通常必须通过广泛阅读某个主题来掌握它们。

关于分配给如 100 这样的整数的具体内涵的一些后续讨论,可以在这里找到

Implicit notational conventions

[Reprinted from a Google buzz article from Mar 20, 2010.]

Like any other human language, mathematical notation has a number of implicit conventions which are usually not made explicit in the formal description of the langauge. These conventions serve a useful purpose by conveying additional contextual data beyond the formal logical content of the mathematical sentences.

A good example of this is the naming conventions for variables. While in principle any symbol can be used for one type of variable, in practice individual symbols have pre-existing connotations that make it more natural to assign them to specific variable types. For instance, one usually denotes:

  • x to denote a real number
  • z to denote a complex number
  • n to denote a natural number

A mathematical argument involving a complex number x, a natural number z, and a real number n would read very strangely. The most famous example of this is perhaps the use of the symbol ε in analysis; an analysis argument involving a quantity ε which was very large or negative would cause a lot of unnecessary cognitive dissonance. In contrast, by sticking to the conventional roles that each symbol plays, the notational structure of an argument is reinforced and made easier to remember; a reader who has temporarily forgotten the definition of, say, z, in an argument can at least guess that it should be a complex number, which can assist in recalling what the actual definition is.

As another example from analysis, when stating an inequality such as f(x) ≤ C or |S| ≤ N, it is customary that the left-hand side represents an “unknown” that one wishes to control, and the right-hand side represents a more “known” quantity that one is better able to control; thus for instance f(x) ≤ C is preferable to C ≥ f(x), despite the logical equivalence of the two statements. This is why analysts make a significant distinction between “upper bounds” and “lower bounds”; the two are not symmetric, because in both cases one is bounding an unknown quantity by a known quantity. (Another relevant convention in analysis here is that it is preferable to bound non-negative quantities rather than non-positive ones, thus for instance f(x) ≥ 0 is preferable to -f(x) ≤ 0.)

Continuing the above example, if the known bound C is itself the sum of several terms, e.g. C = A + B, then it is customary to put the “main” term first and the “error” terms later; thus for instance f(x) ≤ A + B is preferable to f(x) ≤ B + A. By adhering to this standard convention, one conveys useful information as to which terms are considered main terms and which ones considered error terms.

There are of course many other implied conventions, both in analysis or in other fields of mathematics; unfortunately, there is no exhaustive listing of such conventions, and one generally has to pick them up through extensive reading of a subject.

Some followup discussion, on the specific connotations assigned to round numbers such as 100, can be found here.