撰写数学研究论文

Ashley Reiter

原文(EN)

撰写数学研究论文

Ashley Reiter 1995年9月12日

第一节:引言:为何要费心?

优秀的数学写作,如同优秀的数学思维,是一项需要通过实践和培养才能达到最佳表现的技能。本文旨在为初次撰写论文的年轻数学家提供帮助。其目的不仅在于协助写出一篇优秀的文章,也在于帮助学生开始思考数学写作本身。

我深受一本精彩的小册子《如何写作数学》(“How to Write Mathematics”)的启发,本文的许多实质内容都来源于此。我将引用其中的许多直接引文,特别是来自保罗·哈尔莫斯(Paul Halmos)撰写的部分,但我怀疑本文中几乎所有的观点都源于我对那本小册子的阅读。该书可从美国数学会(American Mathematical Society)购得,认真研究数学写作的学生应当亲自查阅。其他大部分想法则源于我自己对糟糕数学写作的失望。虽然从糟糕的数学作品中学习数学并非学习优秀写作的最佳方式,但它能提供极佳的反面教材。因此,积极的数学读者的一项活动,就是留意数学作品中变得不清晰的地方,并在自己的写作中避免犯同样的错误。

数学交流,无论是书面还是口头,都是外界看待你数学工作的滤镜。如果将数学的创造性比作创作一首乐曲,那么写作的艺术则可视为指挥演奏这首乐曲。作为一名数学家,你拥有指挥演奏自己作品的特权!对听众而言,出色的指挥与创作优秀的作品同等重要。如果你纯粹为了个人乐趣而研究数学,那就没有理由把它写下来。但如果你希望分享你所完成的数学之美,那么仅仅写下来是不够的;你必须努力写得

本文将从关于数学写作的总体思路开始,目的是帮助学生为论文构建一个大纲。下一节将描述论文中“形式”与“非形式”部分之间的区别,并为每一部分提供指导。第四节将讨论单个证明的写法。最后,本文将以一节包含具体建议的内容收尾,供你在撰写和修改论文时参考。

第二节:动笔之前:构建论文结构

几乎所有写作的目的都是为了交流。为了良好地交流,你必须同时考虑你想传达什么,以及你希望与谁交流。这对数学写作来说,与其他任何形式的写作并无二致。数学写作的首要目标是,通过精心构建的逻辑推导,来断言一个数学陈述的真实性。严谨的数学读者不会假设你的工作是 основательнoй;他们需要被说服。这是你在数学写作中的第一个目标。

然而,仅仅说服读者相信你工作的真实性是不够的。当你撰写自己的数学研究时,你会有另一个包含这两个目标在内的更高目标:你希望读者能欣赏你所完成的数学之美,并理解其重要性。如果将整个数学领域,甚至你所工作的子领域,想象成一幅巨大的画作,那么你的研究必然只占整个作品中微不足道的一部分。它的美不仅体现在对你所描绘的特定区域的审视中(尽管这很重要),也体现在观察你的工作如何与整幅画作“融为一体”的方式上。

这两个目标——说服读者相信你推论的真实性,以及让你的受众看到你的工作在整个数学领域中的美——在你构思论文大纲时至关重要。有时,你可能会把自己想象成一名导游,带领读者穿越一片只有你绘制过地图的领域。

一位成功的数学作者会为她的读者铺陈两张逻辑地图:一张展示了她个人工作与广阔数学世界之间的联系,另一张则揭示了她工作内部的逻辑结构。

为了引导你的读者,你必须首先为自己思考你的工作在数学地图上的位置。如果你的读者曾到访过邻近区域,那么你应当唤起他们对那些经历的记忆,以便他们能更好地理解你的补充,并将其与相关数学联系起来。问自己几个问题可能有助于你辨别工作的形态和位置:

  • 你的结果是否通过对某事物给出更精确的刻画,从而加强了先前的结果?
  • 你是否通过减弱假设或加强结论,证明了一个旧定理的更强版本?
  • 你是否证明了两个定义的等价性?
  • 它是否是一个对先前已定义但未被理解的结构的分类定理?
  • 它是否连接了数学中两个先前不相关的方面?
  • 它是否将一种新方法应用于一个老问题?
  • 它是否为一则旧定理提供了新的证明?
  • 它是否是一个更大问题的特例?

你必须明确地考虑这个在数学结构中的定位问题,因为它会一直萦绕在读者的脑海中,直到你给出答案。未能解决这个问题会让读者感到相当不满意。

除了提供一张地图帮助读者在数学领域内定位你的工作,你还必须帮助他们理解你工作内部的组织结构:

  • 你的成果是集中在一个引人注目的定理中吗?
  • 还是你有几个相互关联但同等重要的定理?
  • 你是否找到了重要的反例?
  • 你的研究是纯粹的理论数学,即“定理-证明”式的,还是涉及多种不同类型的活动,例如,在计算机上对问题建模,证明一个定理,然后进行与你工作相关的物理实验?
  • 你的工作是朝着解决一个经典问题迈出的清晰(尽管很小)一步,还是一个全新的问题?

因为你的读者在读完你的论文之前,并不知道你将要证明什么,所以事先告知他们将要读到的内容,就像旅行社为顾客做准备一样,能让他们更享受旅程,并更多地理解你引导他们看到的东西。

要诚实而审慎地解释你的工作在数学研究大局中的位置,可能需要极大的谦逊。你很可能会因自己的成就看似微小而感到沮丧。别担心!数学是数千年来,基于成千上万(甚至数百万)实践者的工作积累而成的。据说,即便是最优秀的数学家,一生中也很少有超过一个真正杰出的想法。如果你在高中时期就有了这样的想法,那才真是令人惊讶呢!

一旦你考虑清楚了你研究的结构和意义,你就可以开始为论文拟定大纲了。与许多其他科学领域相比,数学研究论文的格式定义要宽松得多。你有自由度以一种适合你特定工作的方式来制定大纲。然而,你几乎总会包含几个标准部分:背景、引言、正文和未来工作。背景部分将为你的读者定位,提供你将引导他们去向何方的初步概念。在背景部分,你会对你的问题历史给出最明确的描述,尽管提示和参考文献也可能出现在其他地方。读者希望在这一部分找到以下问题的答案:他为什么要读这篇论文?这篇论文的重点是什么?这个问题从何而来?这个领域已知些什么?为什么作者认为这个问题有趣?例如,如果他不喜欢偏微分方程,他应该尽早得到警告,他将会遇到它们。如果他对概率论的基本概念不熟悉,那么如果你的论文依赖于这种理解,也应提前警告。在这一点上请记住,尽管你可能花了数百小时研究你的问题,你的读者希望在几分钟内清楚地得到所有这些问题的答案。

在论文的第二部分,即引言,你将开始引导读者进入你的具体工作,从宏观画面逐渐聚焦到你的特定结果。这里是介绍该领域标准但你的读者可能不知道的定义和引理的地方。正文将由几个部分组成,包含你大部分的工作。当你到达最后一节“意义”时,你可能已经对你的问题感到厌倦,但这一节对你的读者至关重要。作为你论文主题的世界级专家,你处于一个独特的位置,可以指导该领域的未来研究。喜欢你论文的读者可能希望在你的领域继续工作。他(她)自然会有自己的问题,但你,作为这篇论文的作者,会比你的读者更清楚哪些问题可能有趣,哪些可能不然。如果你要继续研究这个主题,你会问什么问题?此外,对于某些论文,你的工作可能会有重要的意义。如果你研究的是一个物理现象的数学模型,那么你的数学工作在物理世界中会有什么后果?这些都是你的读者希望在论文最后一节找到答案的问题。你应该小心,不要让他们失望!

第三节:形式与非形式论述

一旦你有了论文的基本大纲,你就应该考虑“由定义、定理和证明组成的形式逻辑结构,以及由动机、类比、例子和元数学解释组成的互补的非形式介绍性材料。在任何数学陈述中,都应显著地保持材料的这种划分,因为该学科的性质首先要求逻辑结构清晰。”(第1页)这两种类型的材料并行工作,使你的读者能够从逻辑上和认知上理解你的工作(这两者通常大相径庭——在你们当中,有多少人能在证明微积分基本定理之前,就相信可以用反导数计算积分?)“由于形式结构不依赖于非形式结构,作者可以在添加任何后者之前,先将前者完整详细地写出。”(第2页)

因此,写作过程的下一个阶段可能是为你论文的逻辑结构制定一个大纲。几个问题可能会有所帮助:首先,你到底证明了什么?这些定理所依赖的引理(你自己的或其他人的)是什么?这些定理的推论又是什么?在决定哪些结果称为引理、哪些是定理、哪些是推论时,问问自己哪些是核心思想。哪些是自然而然从其他结果中得出的,哪些是论文中真正的主力?写作的结构要求你的假设和推论必须符合线性顺序。然而,很少有研究论文真正具有线性结构,即引理变得越来越复杂,一个叠一个,直到证明了一个定理,然后是一系列日益复杂的推论。相反,大多数证明可以用非常复杂的图来建模,其中几个基本假设以复杂的方式与一些众所周知的定理相结合。可能会有几条看似独立的推理路线在最后一步汇合。不言而喻,任何断言都应遵循其所依赖的引理和定理。然而,可能有很多满足这一要求的线性顺序。鉴于这一困难,你的责任是,首先,理解这个结构,其次,安排你写作中必然的线性结构,以尽可能好地反映工作的结构。具体如何进行,当然取决于具体情况。

帮助你揭示论文复杂逻辑结构的一种技巧是恰当地命名你的成果。通过恰当地命名你的成果(引理作为基础,定理作为实质内容,推论作为收尾工作),你将在你的引理之间创造一种并行感,并帮助你的读者,在没有与你一起经历研究的挣扎的情况下,领会哪些是真正关键的思想,哪些他们可以更快地浏览。

另一种构建简洁逻辑大纲的技巧源于保罗·哈尔莫斯在《如何写作数学》中的一个警告:永远不要重复一个证明:

如果定理2证明中的几个步骤与定理1证明的部分非常相似,这是一个信号,表明某些东西可能没有被完全理解。同样病症的其他症状是:‘用与定理1证明中相同的技巧(或方法、或手段、或窍门)…’,或者,粗暴地,‘参见定理1的证明’。当这种情况发生时,很有可能存在一个值得发现、阐述和证明的引理,从这个引理可以更容易、更清晰地推导出定理1和定理2。(第35页)

这些结构问题应该在你开始写论文之前就深思熟虑,尽管写作过程本身肯定会帮助你更好地理解结构。

现在我们已经讨论了形式结构,我们转向非形式结构。形式结构包含形式定义、“定理-证明”格式和严谨的逻辑,这是“纯”数学的语言。非形式结构补充了形式结构并与之并行。它使用不那么严谨(但同样准确!)的语言,并在阐明工作的数学定位(如上所述)以及向读者呈现更具认知性的工作展示方面发挥重要作用。因为虽然数学家逻辑的语言写作,但很少有人真正逻辑的语言思考(尽管我们确实进行逻辑思考),所以为了理解你的工作,通过巧妙地展示为什么某件事是真的,以及你是如何证明这样一个定理的,将会极大地帮助他们。在写作之前,勾勒出你希望在这些非形式部分传达的内容,很可能会带来更有效的交流。

在你开始写作之前,你还必须考虑符号。符号的选择是撰写研究论文的关键部分。实际上,你是在发明一种语言,你的读者必须学习这种语言才能理解你的论文。好的符号首先能让读者忘记他正在学习一门新语言,其次提供了一个框架,在这个框架中你的证明要点能被清晰地理解。另一方面,糟糕的符号是灾难性的,甚至可能阻止读者阅读你的论文。在大多数情况下,遵循惯例是明智的。用 epsilon 表示一个素数,或用 x(f) 表示一个函数,当然是可能的,但几乎从来不是一个好主意。

第四节:撰写证明

写好一个证明的第一步是陈述定理。一个措辞恰当的定理会使证明的书写容易得多。首先,定理的陈述应该包含恰到好处的假设。当然,所有必要的假设都必须包括在内。另一方面,无关的假设只会分散定理的重点,应尽可能地删去。

在撰写证明时,就像撰写整篇论文一样,你必须以线性顺序记下一组可能并非线性形式的假设和推论。我建议,在动笔之前,你先规划出假设和推论,并尝试以一种能给读者带来最少困惑的方式来排列这些陈述。

在《如何写作数学》中,哈尔莫斯就撰写证明提出了一些重要的建议:

  1. 向前写证明

    一个糟糕教学中常见的伎俩是以这样一句话开始一个证明:“给定 ε,令 δ 为 ε/2”。这是经典分析学中传统的倒写证明法。它的优点是容易被机器验证(而不是被人类理解),还有一个可疑的优点是,最后得到的东西会小于 ε。让读者任务不那么费力的方法是显而易见的:向前写证明。像作者一样,从某个小于 ε 的东西开始,然后做需要做的事情——在适当的时候乘以 3M² + 7,之后再除以 24,等等——直到你得到最终结果。两种安排都不优雅,但向前写的方式是可把握和可记忆的。(第43页)

  2. 避免不必要的符号。 思考一下:

    一个由一长串用等号分隔的表达式组成的证明。这样的证明很容易写。作者从第一个方程开始,做一个自然的代换得到第二个,合并同类项,排列,插入并立即消去一个灵感迸发的因子,通过这些步骤直到得到最后一个方程。这又一次是编码,读者被迫边学边解码。这种双重努力是不必要的。作者多花十分钟写一个措辞严谨的段落,就可以为每位读者节省半小时和大量的困惑。这个段落应该是一个行动的指南,来取代那个仅仅报告行动结果却让读者去猜测如何得到的无用代码。这个段落可能会这样说:“为了证明,首先用 p 代替 q,然后合并同类项,排列因子,最后,插入并消去一个因子 r。”(第42-43页)

第五节:具体建议

与任何形式的交流一样,某些文体实践会使你的写作或多或少地易于理解。这些最好在写完初稿后进行检查和修正。其中许多想法来自《如何写作数学》,并在那里有更充分的论证。

  • 几页(甚至几段)未用过的符号应附上引用或对其含义的提醒。
  • 文章结构应通过标题和标点符号清晰可辨。
  • 全文应始终对当前问题有清晰的定义。
  • 标题是读者与你论文的第一次接触。它必须向你领域的专家以及感兴趣的新手传达一些实质内容。因此,虽然术语应技术上正确…
  • “不要过度使用像句号或逗号这样的小标点符号。读者很容易忽略它们,而这种忽略会导致回读、困惑和延迟。例如:‘假设 aÂX. X 属于 C 类,…’。两个 X 之间的句号被过度使用了…一个好的通用法则是:永远不要用符号开始一个句子。如果你坚持要用提及该符号所代表的事物来开始句子,请使用同位语,例如:‘集合 X 属于 C 类,…’…过度使用的句号并不比过度使用的逗号好。不要写’对于可逆的 X, X* 也是可逆的’,而要写’对于可逆的 X,其伴随矩阵 X* 也是可逆的。’”(第44页)
  • “我建议在所有数学语境中的’if’后都使用’then’。‘then’的存在永远不会引起混淆;它的缺席则可能。”(第44页)
  • 另一个关键特征是页面的布局或架构。“如果它看起来像实心散文,会有一种令人生畏的说教感;如果它看起来像计算的乱码,满页都是符号,会有一种吓人的复杂感。黄金中庸之道是最好的。把它分段,但不要太零碎;使用散文,但不要太多。穿插足够的展示来让眼睛帮助大脑;使用符号,但要置于足够的散文之中,以免思维淹没在后缀的泥潭中。”(第44-45页)
  • 同一个符号绝不能用于多于一件事物;如果你在一个证明中用 n 作计数器,在下一个证明中就用 m,除非两者在各自的证明中扮演相似的角色。
  • 所有符号都应有意义(没有自由变量):
  • “避免使用无关的符号。例如:‘在一个紧空间上,每个实值连续函数 f 都是有界的。’ 符号 ‘f’ 对该陈述的清晰度有什么贡献?… 一个华丽的说法是‘不要使用多余的字母’,也就是‘不要只使用一个字母一次’。”(第41页)
  • “在日常英语中,‘any’是一个模棱两可的词;根据上下文,它可能暗示存在量词(‘你有羊毛吗?’,‘如果有人能做到,他就能’)或全称量词(‘任何数字都可以玩’)。结论:永远不要在数学写作中使用’any’。用’each’或’every’替换它,或者重塑整个句子。”(第38页)
  • “其他罪责较轻的词是’where’、‘equivalent’和’if… then…if…then’。‘Where’通常是一个懒惰的事后补充的标志,本应事先考虑清楚。‘如果 n 足够大,则 |an| < ε, where ε 是一个预先指定的正数’;病症和疗法都很明显。用’Equivalent’来形容定理在逻辑上是荒谬的。(我所说的“定理”是指一个数学真理,一个被证明了的东西。一个有意义的陈述可以是假的,但定理不能;“一个假的定理”是自相矛盾的)。至于’if…then…if…then’,那只是快手作者常犯的文体错误,却让慢读者后悔。‘如果 p,那么如果 q,那么 r。’ 逻辑上,一切都好,但心理上,这只是另一个不必要的绊脚石。通常避免它所需要的只是重塑句子,但没有普遍适用的好方法;最好的方法取决于具体情况下的重点。可能是’如果 pq,那么 r’,或者’在 p 的前提下,假设 q 意味着结论 r’,或者许多其他版本。”(第38-39页)
  • 使用反例来证明定理条件的必要性。
  • 正确用词:区分函数和值。
Ashley Reiter  
Mathematics Instructor  
Maine School of Science & Mathematics  
77 High Street  
Limestone, ME 04750  
reitera@mssm.lcs.k12.me.us

“How to write Mathematics”, Steenrod, N.E. Steenrod, Amer. Math. Soc. 1983, ISBN 0821800558

中文版

Writing a Research Paper in Mathematics

Ashley Reiter
September 12, 1995

Section 1: Introduction: Why bother?

Good mathematical writing, like good mathematics thinking, is a skill which must be practiced and developed for optimal performance. The purpose of this paper is to provide assistance for young mathematicians writing their first paper. The aim is not only to aid in the development of a well written paper, but also to help students begin to think about mathematical writing.

I am greatly indebted to a wonderful booklet, “How to Write Mathematics,” which provided much of the substance of this essay. I will reference many direct quotations, especially from the section written by Paul Halmos, but I suspect that nearly everything idea in this paper has it origin in my reading of the booklet. It is available from the American Mathematical Society, and serious students of mathematical writing should consult this booklet themselves. Most of the other ideas originated in my own frustrations with bad mathematical writing. Although studying mathematics from bad mathematical writing is not the best way to learn good writing, it can provide excellent examples of procedures to be avoided. Thus, one activity of the active mathematical reader is to note the places at which a sample of written mathematics becomes unclear, and to avoid making the same mistakes his own writing.

Mathematical communication, both written and spoken, is the filter through which your mathematical work is viewed. If the creative aspect of mathematics is compared to the act of composing a piece of music, then the art of writing may be viewed as conducting a performance of that same piece. As a mathematician, you have the privilege of conducting a performance of your own composition! Doing a good job of conducting is just as important to the listeners as composing a good piece. If you do mathematics purely for your own pleasure, then there is no reason to write about it. If you hope to share the beauty of the mathematics you have done, then it is not sufficient to simply write; you must strive to write_ well_.

This essay will begin with general ideas about mathematical writing. The purpose is to help the student develop an outline for the paper. The next section will describe the difference between “formal” and “informal” parts of a paper, and give guidelines for each one. Section four will discuss the writing of an individual proof. The essay will conclude with a section containing specific recommendations to consider as you write and rewrite the paper.

Section 2. Before you write: Structuring the paper

The purpose of nearly all writing is to communicate. In order to communicate well, you must consider both what you want to communicate, and to whom you hope to communicate it. This is no less true for mathematical writing than for any other form of writing. The primary goal of mathematical writing is to assert, using carefully constructed logical deductions, the truth of a mathematical statement. Careful mathematical readers do not assume that your work is well-founded; they must be convinced. This is your first goal in mathematical writing.

However, convincing the reader of the simple truth of your work is not sufficient. When you write about your own mathematical research, you will have another goal, which includes these two; you want your reader to appreciate the beauty of the mathematics you have done, and to understand its importance. If the whole of mathematics, or even the subfield in which you are working, is thought of as a large painting, then your research will necessarily constitute a relatively minuscule portion of the entire work. Its beauty is seen not only in the examination of the specific region which you have painted (although this is important), but also by observing the way in which your own work ‘fits’ in the picture as a whole.

These two goals—to convince your reader of the truth of your deductions, and to allow your audience to see the beauty of your work in relation to the whole of mathematics—will be critical as you develop the outline for your paper. At times you may think of yourself as a travel guide, leading the reader through territory charted only by you.

A successful mathematical writer will lay out for her readers two logical maps, one which displays the connections between her own work and the wide world of mathematics, and another which reveals the internal logical structure of her own work.

In order to advise your reader, you must first consider for yourself where your work is located on the map of mathematics. If your reader has visited nearby regions, then you would like to recall those experiences to his mind, so that he will be better able to understand what you have to add and to connect it to related mathematics. Asking several questions may help you discern the shape and location of your work:

  • Does your result strengthen a previous result by giving a more precise characterization of something?
  • Have you proved a stronger result of an old theorem by weakening the hypotheses or by strengthening the conclusions?
  • Have you proven the equivalence of two definitions?
  • Is it a classification theorem of structures which were previously defined but not understood?
  • Does is connect two previously unrelated aspects of mathematics?
  • Does it apply a new method to an old problem?
  • Does it provide a new proof for an old theorem?
  • Is it a special case of a larger question?

It is necessary that you explicitly consider this question of placement in the structure of mathematics, because it will linger in your readers’ minds until you answer it. Failure to address this very question will leave the reader feeling quite dissatisfied.

In addition to providing a map to help your readers locate your work within the field of mathematics, you must also help them understand the internal organization of your work:

  • Are your results concentrated in one dramatic theorem?
  • Or do you have several theorems which are related, but equally significant?
  • Have you found important counterexamples?
  • Is your research purely theoretical mathematics, in the theorem-proof sense, or does your research involve several different types of activity, for example, modeling a problem on the computer, proving a theorem, and then doing physical experiments related to your work?
  • Is your work a clear (although small) step toward the solution of a classic problem, or is it a new problem?

Since your reader does not know what you will be proving until after he has read your paper, advising him beforehand about what he will read, just as the travel agent prepares his customer, will allow him to enjoy the trip more, and to understand more of the things you lead him to.

To honestly and deliberately explain where your work fits into the big picture of mathematical research may require a great deal of humility. You will likely despair that your accomplishments seem rather small. Do not fret! Mathematics has been accumulating for thousands of years, based on the work of thousands (or millions) of practitioners. It has been said that even the best mathematicians rarely have more than one really outstanding idea during their lifetimes. It would be truly surprising if yours were to come as a high school student!

Once you have considered the structure and relevance of your research, you are ready to outline your paper. The accepted format for research papers is much less rigidly defined for mathematics than for many other scientific fields. You have the latitude to develop the outline in a way which is appropriate for your work in particular. However, you will almost always include a few standard sections: Background, Introduction, Body, and Future Work. The background will serve to orient your reader, providing the first idea of where you will be leading him. In the background, you will give the most explicit description of the history of your problem, although hints and references may occur elsewhere. The reader hopes to have certain questions answered in this section: Why should he read this paper? What is the point of this paper? Where did this problem come from? What was already known in this field? Why did this author think this question was interesting? If he dislikes partial differential equations, for example, he should be warned early on that he will encounter them. If he isn’t familiar with the first concepts of probability, then he should be warned in advance if your paper depends on that understanding. Remember at this point that although you may have spent hundreds of hours working on your problem, your reader wants to have all these questions answered clearly in a matter of minutes.

In the second section of your paper, the introduction, you will begin to lead the reader into your work in particular, zooming in from the big picture towards your specific results. This is the place to introduce the definitions and lemmas which are standard in the field, but which your readers may not know. The body, which will be made up of several sections, contains most of your work. By the time you reach the final section, implications, you may be tired of your problem, but this section is critical to your readers. You, as the world expert on the topic of your paper, are in a unique situation to direct future research in your field. A reader who likes your paper may want to continue work in your field. (S)he will naturally have her/his own questions, but you, having worked on this paper, will know, better than your reader, which questions may be interesting, and which may not. If you were to continue working on this topic, what questions would you ask? Also, for some papers, there may be important implications of your work. If you have worked on a mathematical model of a physical phenomenon, what are the consequences, in the physical world, of your mathematical work? These are the questions which your readers will hope to have answered in the final section of the paper. You should take care not to disappoint them!

Section 3. Formal and Informal Exposition

Once you have a basic outline for your paper, you should consider “the formal or logical structure consisting of definitions, theorems, and proofs, and the complementary informal or introductory material consisting of motivations, analogies, examples, and metamathematical explanations. This division of the material should be conspicuously maintained in any mathematical presentation, because the nature of the subject requires above all else that the logical structure be clear.” (p.1) These two types of material work in parallel to enable your reader to understand your work both logically and cognitively (which are often quite different—how many of you believed that integrals could be calculated using antiderivatives before you could prove the Fundamental Theorem of Calculus?) “Since the formal structure does not depend on the informal, the author can write up the former in complete detail before adding any of the latter.” (p. 2)

Thus, the next stage in the writing process may be to develop an outline of the logical structure of your paper. Several questions may help: To begin, what exactly have you proven? What are the lemmas (your own or others) on which these theorems stand. Which are the corollaries of these theorems? In deciding which results to call lemmas, which theorems, and which corollaries, ask yourself which are the central ideas. Which ones follow naturally from others, and which ones are the real work horses of the paper? The structure of writing requires that your hypotheses and deductions must conform to a linear order. However, few research papers actually have a linear structure, in which lemmas become more and more complicated, one on top of another, until one theorem is proven, followed by a sequence of increasingly complex corollaries. On the contrary, most proofs could be modeled with very complicated graphs, in which several basic hypotheses combine with a few well known theorems in a complex way. There may be several seemingly independent lines of reasoning which converge at the final step. It goes without saying that any assertion should follow the lemmas and theorems on which it depends. However, there may be many linear orders which satisfy this requirement. In view of this difficulty, it is your responsibility to, first, understand this structure, and, second, to arrange the necessarily linear structure of your writing to reflect the structure of the work as well as possible. The exact way in which this will proceed depends, of course, on the specific situation.

One technique to assist you in revealing the complex logical structure of your paper is a proper naming of results. By naming your results appropriately (lemmas as underpinnings, theorems as the real substance, and corollaries as the finishing work), you will create a certain sense of parallelness among your lemmas, and help your reader to appreciate, without having struggled through the research with you, which are the really critical ideas, and which they can skim through more quickly.

Another technique for developing a concise logical outline stems from a warning by Paul Halmos, in HTWM, never to repeat a proof:

If several steps in the proof of Theorem 2 bear a very close resemblance to parts of the proof of Theorem 1, that’s a signal that something may be less than completely understood. Other symptoms of the same disease are: ‘by the same technique (or method, or device, or trick) as in the proof of Theorem 1…’, or, brutally, ‘see the proof of Theorem 1’. When that happens the chances are very good that there is a lemma that is worth finding, formulating, and proving, a lemma from which both Theorem 1 and Theorem 2 are more easily and more clearly deduced. (p. 35)

These issues of structure should be well thought through BEFORE you begin to write your paper, although the process of writing itself which surely help you better understand the structure.

Now that we have discussed the formal structure, we turn to the informal structure. The formal structure contains the formal definitions, theorem-proof format, and rigorous logic which is the language of ‘pure’ mathematics. The informal structure complements the formal and runs in parallel. It uses less rigorous, (but no less accurate!) language, and plays an important part in elucidating both the mathematical location of the work, as we discussed above, and in presenting to the reader a more cognitive presentation of the work. For although mathematicians write in the language of logic, very few actually think in the language of logic (although we do think logically), and so to understand your work, they will be immensely aided by subtle demonstration of why something is true, and how you came to prove such a theorem. Outlining, before you write, what you hope to communicate in these informal sections will, most likely, lead to more effective communication.

Before you begin to write, you must also consider notation. The selection of notation is a critical part of writing a research paper. In effect, you are inventing a language which your readers must learn in order to understand your paper. Good notation firstly allows the reader to forget that he is learning a new language, and secondly provides a framework in which the essentials of your proof are clearly understood. Bad notation, on the other hand, is disastrous and may deter the reader from even reading your paper. In most cases, it is wise to follow convention. Using epsilon for a prime integer, or x(f) for a function, is certainly possible, but almost never a good idea.

Section 4: Writing a Proof

The first step in writing a good proof comes with the statement of the theorem. A well-worded theorem will make writing the proof much easier. The statement of the theorem should, first of all, contain exactly the right hypotheses. Of course, all the necessary hypotheses must be included. On the other hand, extraneous assumptions will simply distract from the point of the theorem, and should be eliminated when possible.

When writing a proof, as when writing an entire paper, you must put down, in a linear order, a set of hypotheses and deductions which are probably not linear in form. I suggest that, before you write you map out the hypotheses and the deductions, and attempt to order the statements in a way which will cause the least confusion to the reader.

In HTWM, Halmos offers several important recommendations about writing proofs:

  1. Write the proof forward

A familiar trick of bad teaching is to begin a proof by saying: “Given e, let d be e/2”. This is the traditional backward proof-writing of classical analysis. It has the advantage of being easily verifiable by a machine (as opposed to understandable by a human being), and it has the dubious advantage that something at the end comes out to be less than e. The way to make the human reader’s task less demanding is obvious: write the proof forward. Start, as the author always starts, by putting something less than e, and then do what needs to be done—multiply by 3M2 + 7 at the right time and divide by 24 later, etc., etc.—till you end up with what you end up with. Neither arrangement is elegant, but the forward one is graspable and rememberable. (p. 43)

  1. Avoid unnecessary notation. Consider:

a proof that consists of a long chain of expressions separated by equal signs. Such a proof is easy to write. The author starts from the first equation, makes a natural substitution to get the second, collects terms, permutes, inserts and immediately cancels an inspired factor, and by steps such as these proceeds till he gets the last equation. This is, once again, coding, and the reader is forced not only to learn as he goes, but, at the same time, to decode as he goes. The double effort is needless. By spending another ten minutes writing a carefully worded paragraph, the author can save each of his readers half an hour and a lot of confusion. The paragraph should be a recipe for action, to replace the unhelpful code that merely reports the results of the act and leaves the reader to guess how they were obtained. The paragraph would say something like this: “For the proof, first substitute p for q, the collect terms, permute the factors, and, finally, insert and cancel a factor r. (p. 42-43)

Section 5. Specific Recommendations

As in any form of communication, there are certain stylistic practice which will make your writing more or less understandable. These may be best checked and corrected after writing the first draft. Many of these ideas are from HTWM, and are more fully justified there.

  • Notation that hasn’t been used in several pages (or even paragraphs) should carry a reference or a reminder of the meaning.
  • The structure should be easily discernible by headings and punctuation.
  • There should be a clear definition of the problem at hand all the way through.
  • The title is the first contact that readers will have with your paper. It must communicate something of the substances to the experts in your field as well as to the novices who will be interested. Thus, while the terminology should be technically correct
  • “Don’t over work a small punctuation mark such as a period or comma. They are easy for the reader to overlook, and the oversight causes backtracking, confusion, delay. Example: “Assume that aÂX. X belongs to the class C, …”. The period between the two X’s is overworked… A good general rule is: never start a sentence with a symbol. If you insist on starting the sentence with a mention of the thing the symbol denotes, put the appropriate word in apposition, thus: “The set X belongs to the class C, …”…The overworked period is no worse than the overworked comma. Not “For invertible X, X* also is invertible”, but “For invertible X, the adjoint X* also is invertible.” (p. 44)
  • “I recommend “then” with “if” in all mathematical contexts. The presence of “then” can never confuse; its absence can.”(p.44)
  • Another critical feature is the layout or architecture of the page. “If it looks like solid prose, it will have a forbidding, sermony aspect; if it looks like computational hash, with a page full of symbols, it will have a frightening, complicated aspect. The golden mean is golden. Break it up, but not too small; use prose, but not too much. Intersperse enough displays to give the eye a chance to help the brain; use symbols, but in the middle of enough prose to keep the mind from drowning in a morass of suffixes.” (p. 44-5)
  • The same symbol should never be used for more than one thing; if you have used n as a counter in one proof, use m in the next proof, unless the two play a similar role in each
  • All notation should be meaningful (no free variables):
  • “avoid the use of irrelevant symbols. Example: “On a compact space every real-valued continuous function f is bounded.” What does the symbol “f” contribute to the clarity of that statement?… A showy way to say “use no superfluous letters” is to say “use no letter only once”. (p. 41)
  • “In everyday English “any” is an ambiguous word; depending on context that may hint at an existential quantifier (“have you any wool?”, “if anyone can do it, he can”) or a universal one (“any number can play”). Conclusion: never use “any” in mathematical writing. Replace it by “each” or “every”, or recast the whole sentence” (p. 38)
  • “Other offenders, charged with lesser crimes, are ‘where’, and ‘equivalent’, and ‘if… then…if…then’. “Where” is usually a sign of a lazy afterthought that should have been thought through before. “If n is sufficiently large, then |an| < e, where e is a preassigned positive number”; both disease and cure are clear. “Equivalent” for theorems is logical nonsense. (By “theorem” I mean a mathematical truth, something that has been proved. A meaningful statement can be false, but a theorem cannot; “a false theorem” is self-contradictory). As for “if…then…if…then”, that is just a frequent stylistic bobble committed by quick writers and rued by slow readers. “If p, then if q, then r.” Logically, all is well, but psychologically it is just another pebble to stumble over, unnecessarily. Usually all that is needed to avoid it is to recast the sentence, but no universally good recasting exists; what is best depends on what is important in the case at hand. It could be “If p and q then r”, or “In the presence of p, the hypothesis q implies the conclusion r”, or many other versions."" (p. 38-39)
  • Use counter-examples to demonstrate the necessity of conditions on theorem
  • Use words correctly: distinguish between function and value
Ashley Reiter  
Mathematics Instructor  
Maine School of Science & Mathematics  
77 High Street  
Limestone, ME 04750  
reitera@mssm.lcs.k12.me.us

“How to write Mathematics”, Steenrod, N.E. Steenrod, Amer. Math. Soc. 1983, ISBN 0821800558