Pythagorean Theorem


 

1.1 INTRODUCTION

1.1.1 Exploring the Pythagorean Theorem: Its History and Significance

In this first lecture, we look at one of the most important theorems in mathematics, the theorem of Pythagoras. The historical roots of the theorem are mesmerizing: the first examples of identities like already appeared in Sumerian mathematics. Triples of numbers like are called Pythagorean triples. The theorem itself is much more than that. The theorem not only lists a few examples for evidence but states and proves that for all triangles, the relation holds if and only if the triangle is a right angle triangle. Without exaggeration, the Pythagorean theorem is one of the most beautiful and most important theorems. It has cameos in various other parts of mathematics. In harmonic analysis for example, it tells that the square of the length of a periodic function is the sum of the squares of its Fourier coefficients. In probability theory it tells that if two random variables are uncorrelated, then the variance of is the sum of the variance of and .

Figure 1. The picture appears in "The story of the greatest nations", (1910) and shows Pythagoras teaching mathematics.

1.1.2 Redefining Vectors

We use here the theorem also while introducing vectors and linear spaces. The language of matrices is not only a matter of notation, but also allows for a slightly more sophisticated approach to vector calculus in which one distinguishes between column vectors and row vectors. Unlike in standard vector analysis courses, this is possible when working closer to linear algebra. Traditionally, many sources define a vector as a quantity with "magnitude" and "direction". This is highly problematic as a "movie" qualifies for this notion: it has a length and has a director. But we don’t need to mock this with a pun: the zero vector is a quantity which would not qualify as a vector because the zero vector does not have a direction. Because of such problems, one usually defines a vector as a quantity defined by two points in space, writes and thinks about the vector as a translation from to or as an "arrow" starting from and ending at . Now, one has the difficulty that two parallel vectors of equal length are identified. One actually uses equivalence classes to get from affine space to linear space. The modern point of view is that one can attach a linear space of vectors at every point and think of as a vector attached to the point . We will see for example the concept of a gradient field, which attaches at every point a row vector. Force fields are examples.

1.1.3 Matrix Foundations in Data Analysis

In any case, introducing spaces of matrices early has advantages also in a time where data analysis is recognized as an important tool. Relational data bases are founded on the concept of matrices. Most familiar are spread sheets which are two-dimensional arrays in which data are organized. More recently, such concept are also displaced with more sophisticated data structures like graph data bases. Still, a graph can also be described by matrices. Given two nodes of the network, write in the matrix entry tells how they are related. In the simplest case put a if the nodes are connected and if they are not connected. In any case, data are always arrays of more basic quantities. The memory structure of a computer is organized as an array. As Alan Turing has shown, all computations we have formalized can be done on a one dimensional tape with entries and . Modern computer storage devices are essentially such Turing tapes, but organized in a more sophisticated way, using partitions or sectors similarly as matrices are organized with rows and columns.

1.2 LECTURE

1.2.1 Matrix Essentials

A finite rectangular array of real numbers is called a matrix. If there are rows and columns in , it is called an matrix. We address the entry in the ’th row and ’th column with . An matrix is a column vector, a matrix is a row vector. A matrix is called a scalar. Given an matrix and a matrix , the matrix is defined as It is called the matrix product. The transpose of an matrix is the matrix . The transpose of a column vector is a row vector.

1.2.2 Vector Space of Matrices

Denote by the set of matrices. It contains the zero matrix with . In the case , it is the zero vector. The addition of two matrices in is defined as The scalar multiplication is defined as if is a real number. These operations make a vector space linear space: the addition is associative, commutative with a unique additive inverse satisfying The multiplications are distributive: and and

1.2.3 Euclidean Spaces, Dot Products, and Length

The space is also called . It is the -dimensional Euclidean space. The vector space is the plane and is the physical space. These spaces are dear to us as we draw on paper and live in space. The dot product between two column vectors is the matrix product Because the dot product is a scalar, the product is also called the scalar product. In the matrix product of two matrices , the entry at position is the dot product of the ’th row in with the ’th column in . More generally, the dot product between two arbitrary matrices can be defined by where the trace of a matrix is the sum of its diagonal entries. This means We just take the product over all matrix entries and add them up. The dot product is distributive and commutative We can use it to define the length of a vector or the length of a matrix, where we took the positive square root. The sum of the squares is zero exactly if all components are zero. The only vector satisfying is therefore .

1.2.4 Cauchy-Schwarz Inequality

An important key result is the Cauchy-Schwarz inequality.

Theorem 1.

Proof. If , there is nothing to prove as both sides are zero. If , then we can divide both sides of the equation by and so achieve that . Define . Now, meaning or . ◻

1.2.5 Angle Between Two Vectors

It follows from the Cauchy-Schwarz inequality that for any two non-zero vectors , the number is in the closed interval : There exists therefore a unique angle such that If this angle between and is equal to , the two vectors are orthogonal. If or the two vectors are called parallel. There exists then a real number such that . The zero vector is considered both orthogonal as well as parallel to any other vector.

1.2.6 Law of Cosines

Two vectors define a (possibly degenerate) triangle in Euclidean space . The above formula defines an angle at the point (which could be the zero angle). The side lengths of the triangle satisfy the following cos formula. It is also called the Al-Kashi identity.

Corollary 1.

Proof. We use the definitions as well as the distributive property (FOIL out):  ◻

1.2.7 Understanding the Pythagorean Theorem: A Special Case of the Law of Cosines

The case is particularly important. It is the Pythagorean theorem:

Theorem 2. In a right angle triangle we have .

1.3 EXAMPLES

Example 1. The dot product is We have and angle .

Example 2. The dot product of and is The length of is , the length of is . The angle between and is

Example 3. and are perpendicular because The angle between them is . The length of is . The length of is . The length of is . We confirm Note that . Multiplication is not commutative.

Example 4. Find the angles in a triangle of length and .

Answer: Al Kashi gives so that Similarly so that and so that

1.4 ILLUSTRATION

Figure 2. A cuboid of integer side length and such that are squares is an Euler brick. Its side diagonals are now integers. The smallest one was found in 1719. If also is a square, meaning that the space diagonal is an integer too, we have a perfect Euler brick. Nobody has found one. It is a famous open problem due to Euler, whether there exists one.
Figure 3. This Povray scene was generated by a method which involves a lot of vector calculus and linear algebra: this open source ray tracer bounces around light in the virtual scene and computes the reflections. A camera then captures the photons, similarly as a real camera does. Textures are implemented by images, here a postcard of Harvard square from 1930. It is a image file encoding three matrices R,G,B, red, green and blue values at each pixel. The scene is an "homage" to the novel "On Time and the River" by Thomas Wolfe who was a Harvard undergraduate here from 1920-1922 (notice the 22!)

1.4.1 Infinite Horizons in Mathematics

Mathematics is not only eternal, but also infinite. To illustrate this, look at the "Eternals" problem.1 Define the Babylonian graph in which the positive integers are the vertices and where are connected, if is a perfect square. Every edge in belongs to a Pythagorean triple . One can ask which type of sub-graphs appear, how many connected components there are, whether the diameter is infinite, or how large closed loops can become. Hundreds of questions could be asked. Embedded triangles in for example are Euler bricks! Are there embedded tetrahedra , cliques of numbers for which every pair is a Pythagorean triple? This would be an Eulerian tesseract. Is there one? Before proving anything, we have a data problem. Experiment!

 
Figure 4. To the left we see the largest component . An experiment like ListPlot[Table[GraphDiameter[Babylonian[n]],n,1000]] gives the diameter of the largest component of . We have .

EXERCISES

Exercise 1. Use the definitions to find the angle between the vector and in . If we think about as data, the value is the correlation between the two data points and . If the cosine is positive, the data have positive correlation. If the cosine is negative, they have negative correlation.

Exercise 2. Given the matrix .

  1. Find , then build and . The first matrix is called symmetric, the second is called anti-symmetric.
  2. Compute and . Then evaluate and .
  3. Why are these two numbers computed in b) the same? Is it true in general for two matrices that ? (There is a short verification using the sum notation).

Exercise 3.

  1. Verify the triangle identity in general by FOILing out , then generate an example of two vectors with integer coordinates in the plane , where one can apply this. Draw the situation.
  2. Verify that if and have the same length, then and are perpendicular. Describe the situation in b) geometrically in a sentence.

Exercise 4. Write the vector as a sum of a vector parallel to and a vector perpendicular to . If we interpret as a force acting on a kite of mass and as the velocity then has an interpretation as power, the rate of change of the energy of the kite. The vector parallel to would by Newton be the acceleration of the kite.

Exercise 5.

  1. Find two vectors in for which all coordinate entries are or and which are both perpendicular to each other.
  2. Design four vectors in for which all coordinate entries are or which are all perpendicular to each other.

Optional: can you invent a strategy which allows you for example to find vectors in which are all perpendicular to each other and have still entries in ?


  1. This problem has been communicated to us by Ajak, who knows thousands of years of mathematics↩︎