Compass & Straightedge Construction and the Impossible Constructions
From Math Images
Line 4: | Line 4: | ||
|ImageIntro=This image shows the step by step construction of a hexagon inscribed in the circle using a compass and a unmarked straightedge. | |ImageIntro=This image shows the step by step construction of a hexagon inscribed in the circle using a compass and a unmarked straightedge. | ||
|ImageDescElem= | |ImageDescElem= | ||
- | Let's assume we only have a compass and a unmarked straightedge. What can we construct and how can we construct them? That were the problems that Euclid pondered not only because those were probably the only instruments that he had at his time but also he wanted to build his theorems with as few assumptions, or {{EasyBalloon|Link=axioms|Balloon=In traditional logic, an axiom or postulate is a proposition that is not proved or demonstrated but considered to be either self-evident, or subject to necessary decision. Therefore, its truth is taken for granted, and serves as a starting point for deducing and inferring other (theory dependent) truths.}}, as possible.< | + | Let's assume we only have a compass and a unmarked straightedge. What can we construct and how can we construct them? That were the problems that Euclid pondered not only because those were probably the only instruments that he had at his time but also he wanted to build his theorems with as few assumptions, or {{EasyBalloon|Link=axioms|Balloon=In traditional logic, an axiom or postulate is a proposition that is not proved or demonstrated but considered to be either self-evident, or subject to necessary decision. Therefore, its truth is taken for granted, and serves as a starting point for deducing and inferring other (theory dependent) truths.}}, as possible.<ref>Peterson, 2003</ref> With these two simple tools, he managed to build myriad of theorems in both plane and solid geometry .<ref>Hartshorne, 2000</ref> These theorem are as good as they were some 2000 years ago and it is this enduring quality of Euclid's work that inspired this page. In the main image of this page, we want to divide the circle into six equal arcs and then connect consecutive points to form the hexagon. It seems to be a fairly simple construction but you should be prompted to ask two questions: Are other polygons constructible and is every polygon constructible, that is able to be constructed using only compass and straightedge? To extend the question, what are constructible and what are not? That is the problem that is resolved in this page. |
|ImageDesc===What is Compass & Straightedge Constructions== | |ImageDesc===What is Compass & Straightedge Constructions== | ||
Line 19: | Line 19: | ||
{{!}}'''''2. that a line segment may be extended into a straight line; | {{!}}'''''2. that a line segment may be extended into a straight line; | ||
{{!}}- | {{!}}- | ||
- | {{!}}'''''3. that given any straight line segment, a circle may be described having the segment as radius and one endpoint as center.< | + | {{!}}'''''3. that given any straight line segment, a circle may be described having the segment as radius and one endpoint as center.<ref>Taylor, 1895, p. 14</ref> |
{{!}}} | {{!}}} | ||
</blockquote> | </blockquote> | ||
Line 28: | Line 28: | ||
{{!}}- | {{!}}- | ||
{{!}} | {{!}} | ||
- | Thus, we define Compass & Straightedge Construction as the construction of points, lengths, angles, and circles using only ideal straightedge and compass. A straightedge is infinite in length, has no markings on it and only one edge. A compass has two legs, one end of which is fixed on the plane of construction and the other end is of given distance away and maintains the distance throughout the construction. It collapses when lifted from the page, so may not be '''directly''' used to transfer distances. However, it turns out that this restriction makes no difference due to the Compass Equivalence Theorem which was stated as Proposition II of Book I of Euclid's Elements. It stated that from a given point, it was possible to construct a line segment equal to a given line segment using collapsible compass in any desirable direction. Euclid's proof for the Compass Equivalence Theorem will be presented after the section of Basic Construction. Since Euclid has proven this using only the three postulates, then he did not have to use a collapsible compass any more.< | + | Thus, we define Compass & Straightedge Construction as the construction of points, lengths, angles, and circles using only ideal straightedge and compass. A straightedge is infinite in length, has no markings on it and only one edge. A compass has two legs, one end of which is fixed on the plane of construction and the other end is of given distance away and maintains the distance throughout the construction. It collapses when lifted from the page, so may not be '''directly''' used to transfer distances. However, it turns out that this restriction makes no difference due to the Compass Equivalence Theorem which was stated as Proposition II of Book I of Euclid's Elements. It stated that from a given point, it was possible to construct a line segment equal to a given line segment using collapsible compass in any desirable direction. Euclid's proof for the Compass Equivalence Theorem will be presented after the section of Basic Construction. Since Euclid has proven this using only the three postulates, then he did not have to use a collapsible compass any more.<ref>Peterson, 2003</ref> |
{{!}}} | {{!}}} | ||
Line 36: | Line 36: | ||
===='''Line Segment Bisection'''==== | ===='''Line Segment Bisection'''==== | ||
[[Image:CS1.png|border|550px|center]] | [[Image:CS1.png|border|550px|center]] | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show construction|HideMessage=Click here to hide construction|PreviewText=Given points <math>{\color{Gray}A}</math> and <math>{\color{Gray}B}</math> and the straight line passing through it. Construct a line that bisects line segment <math>AB</math>.|FullText=Given points <math>A</math> and <math>B</math> and the straight line passing through it. Construct a line that bisects line segment <math>AB</math>. |
# Draw a circle centered at point <math>A</math> with radius equals <math>AB</math>. | # Draw a circle centered at point <math>A</math> with radius equals <math>AB</math>. | ||
Line 48: | Line 48: | ||
===='''Angle Bisection'''==== | ===='''Angle Bisection'''==== | ||
[[Image:CS2.png|border|550px|center]] | [[Image:CS2.png|border|550px|center]] | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show construction|HideMessage=Click here to hide construction|PreviewText=Given angle <math>{\color{Gray}\angle AOB}</math>, construct a line that bisects the angle.|FullText=Given angle <math>\angle AOB</math>, construct a line that bisects the angle. |
# Construct a circle centered at point <math>O</math> with radius <math>OA</math>. This circle intersects <math>OA</math> and <math>OB</math> at point <math>A</math> and <math>B</math>. | # Construct a circle centered at point <math>O</math> with radius <math>OA</math>. This circle intersects <math>OA</math> and <math>OB</math> at point <math>A</math> and <math>B</math>. | ||
Line 57: | Line 57: | ||
===='''Perpendicular Through a Point'''==== | ===='''Perpendicular Through a Point'''==== | ||
[[Image:Perp.png|border|450px|center]] | [[Image:Perp.png|border|450px|center]] | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show construction|HideMessage=Click here to hide construction|PreviewText=Given a point <math>{\color{Gray}M}</math> on a line <math>{\color{Gray}AB}</math>, construct a line that is perpendicular to the given line through <math>M</math>.|FullText=Given a point <math>M</math> on a line <math>AB</math>, construct a line that is perpendicular to the given line through <math>M</math>. |
- | + | ||
# Draw a circle centered at <math>M</math> with radius <math>MB</math> (or <math>MA</math>. It is your choice). | # Draw a circle centered at <math>M</math> with radius <math>MB</math> (or <math>MA</math>. It is your choice). | ||
# Where the circle intersects the original line, construct a perpendicular bisector (see the line segment bisector construction above).}} | # Where the circle intersects the original line, construct a perpendicular bisector (see the line segment bisector construction above).}} | ||
Line 64: | Line 64: | ||
===='''Parallel Line'''==== | ===='''Parallel Line'''==== | ||
[[Image:CS4.png|border|550px|center]] | [[Image:CS4.png|border|550px|center]] | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show construction|HideMessage=Click here to hide construction|PreviewText=Given two points, <math>{\color{Gray}A}</math> and <math>{\color{Gray}B}</math> and the straight line passing through them, construct a line that is parallel to the given line through another given point <math>C</math>.|FullText=Given two points, <math>A</math> and <math>B</math> and the straight line passing through them, construct a line that is parallel to the given line through another given point <math>C</math>. |
# Draw a circle at <math>A</math>, crossing <math>C</math>. Where the circle <math>A</math> intersects <math>AB</math>, call the point <math>D</math>. | # Draw a circle at <math>A</math>, crossing <math>C</math>. Where the circle <math>A</math> intersects <math>AB</math>, call the point <math>D</math>. | ||
Line 74: | Line 74: | ||
===='''Tangent Line to a Circle'''==== | ===='''Tangent Line to a Circle'''==== | ||
[[Image:CS5.png|border|550px|center]] | [[Image:CS5.png|border|550px|center]] | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show construction|HideMessage=Click here to hide construction|PreviewText=Given a circle centered at <math>{\color{Gray}O}</math> and another given point, <math>{\color{Gray}A}</math>, construct a line that is tangent to the circle.|FullText=Given a circle centered at <math>O</math> and another given point, <math>A</math>, construct a line that is tangent to the circle. |
# Draw a line through point <math>A</math> and the center of the circle <math>O</math> | # Draw a line through point <math>A</math> and the center of the circle <math>O</math> | ||
Line 88: | Line 88: | ||
[[Image:CETpic.png|border|center|450px]] | [[Image:CETpic.png|border|center|450px]] | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show construction|HideMessage=Click here to hide construction|PreviewText=From a given point to draw a line segment equal to a given line segment.|FullText=From a given point to draw a line segment equal to a given line segment. <u>Let <math>A</math> be the given point, and <math>BC</math> the given straight line : it is required to draw from the point <math>A</math> a straight line equal to <math>BC</math>.</u> |
- | + | ||
- | <u>Let <math>A</math> be the given point, and <math>BC</math> the given straight line : it is required to draw from the point <math>A</math> a straight line equal to <math>BC</math>.</u> | + | |
Line 106: | Line 104: | ||
- | Wherefore from the given point <math>A</math> a straight line <math>AL</math> has been drawn equal to the given straight line <math>BC</math>.∎< | + | Wherefore from the given point <math>A</math> a straight line <math>AL</math> has been drawn equal to the given straight line <math>BC</math>.∎<ref>Taylor, 1895, p. 18</ref> |
It should be noted that from <math>AL</math>, we could "duplicate" <math>AL</math> in all directions by construct a circle centered at <math>A</math> with radius <math>AL</math>.}} | It should be noted that from <math>AL</math>, we could "duplicate" <math>AL</math> in all directions by construct a circle centered at <math>A</math> with radius <math>AL</math>.}} | ||
Line 114: | Line 112: | ||
==='''What is Algebraicization?'''=== | ==='''What is Algebraicization?'''=== | ||
- | From the few basic constructions, you would have probably realized that the different possibilities seems infinite. Hence, mathematician are curious to find out what are constructible and what aren't and for this purpose, the language of pure geometry seems to have "limited vocabulary"< | + | From the few basic constructions, you would have probably realized that the different possibilities seems infinite. Hence, mathematician are curious to find out what are constructible and what aren't and for this purpose, the language of pure geometry seems to have "limited vocabulary"<ref>Hudson, 1916, p. 3</ref>. Back in ancient times, mathematicians had limited algebraic knowledge and were more familiar with geometry. But in modern times, the reverse is true. Hence, today's mathematicians go back to their familiar realm of Algebra and try to find the link between geometry and algebra. |
''Algebraicization'' is the translation of any problem statements into algebraic problems. In the case of Compass & Straightedge construction, we algebraicize each step of a straightedge and compass construction, and consequently obtaining general results about the nature of constructibility. Hilda P. Hudson put it aptly in his lecture '''''Ruler & Compasses''''', | ''Algebraicization'' is the translation of any problem statements into algebraic problems. In the case of Compass & Straightedge construction, we algebraicize each step of a straightedge and compass construction, and consequently obtaining general results about the nature of constructibility. Hilda P. Hudson put it aptly in his lecture '''''Ruler & Compasses''''', | ||
- | <blockquote>''"each step of a ruler and compass construction is equivalent to a certain analytical process; it is found that the power to use a ruler corresponds exactly to the power to solve linear equations, and the power to use compasses to the power to solve quadratics...... Since each step of a ruler and compass construction is equivalent to the solution of an equation of the first or second degree, we consider that these algebraic processes can lead to , when combined in every possible way, and that enables us to answer the question before us......"''< | + | <blockquote>''"each step of a ruler and compass construction is equivalent to a certain analytical process; it is found that the power to use a ruler corresponds exactly to the power to solve linear equations, and the power to use compasses to the power to solve quadratics...... Since each step of a ruler and compass construction is equivalent to the solution of an equation of the first or second degree, we consider that these algebraic processes can lead to , when combined in every possible way, and that enables us to answer the question before us......"''<ref>Hudson, 1916, p. 3</ref></blockquote> |
Hudson lectured on this in the early 20th century and certain phrases of his could potentially cause confusion. The take-away from this paragraph is that in order to algebraicize straightedge and compass construction, we begin by designating a given point as the origin and the coordinates of another given point (we are given two points at least) as <math>(1,0)</math> or <math>(0,1)</math>. Thus we have established the Cartesian Coordinates. Then, every time we construct a straight line or a circle, we think of it instead as adding a new equation into a system of equations. These equations represent the coordinates of all the points on the line or circle, but that is easy since we all know the expression for a line and a circle as <math>y = ax + b</math> and <math>(x-m)^2 + (y-n)^2 = r^2</math>. However, the only times we can pinpoint a point (and find its coordinates as a result) is when a line intersects with a line, or a circle, or a circle intersects with another circle in which case we can pinpoint 2 points. We then conclude that only those coordinates of the points of intersections are constructible. In this way, a geometric process is translated into an algebraic process. | Hudson lectured on this in the early 20th century and certain phrases of his could potentially cause confusion. The take-away from this paragraph is that in order to algebraicize straightedge and compass construction, we begin by designating a given point as the origin and the coordinates of another given point (we are given two points at least) as <math>(1,0)</math> or <math>(0,1)</math>. Thus we have established the Cartesian Coordinates. Then, every time we construct a straight line or a circle, we think of it instead as adding a new equation into a system of equations. These equations represent the coordinates of all the points on the line or circle, but that is easy since we all know the expression for a line and a circle as <math>y = ax + b</math> and <math>(x-m)^2 + (y-n)^2 = r^2</math>. However, the only times we can pinpoint a point (and find its coordinates as a result) is when a line intersects with a line, or a circle, or a circle intersects with another circle in which case we can pinpoint 2 points. We then conclude that only those coordinates of the points of intersections are constructible. In this way, a geometric process is translated into an algebraic process. | ||
Line 126: | Line 124: | ||
==='''A Simple Derivation'''=== | ==='''A Simple Derivation'''=== | ||
- | Firstly, we define "1" on a straight line as stated previously. Then, once you have chosen that | + | Firstly, we define "1" on a straight line as stated previously. Then, once you have chosen that point to be <math>(1,0)</math>, you have to stick to this specification throughout your construction. Next, it is very obvious that we could construct all the integers, that is <math>\cdots -3,-2,-1,0,1,2,3,\cdots</math> (or <math>x</math> = <math>\{x|- \infty < x < \infty,x \in \mathbb{Z}\}</math>). How so? Well, once we have the "1", all we have to do is to use the Compass Equivalence Theorem finite number of times to duplicate the length "1" that we previously defined. Now, that means that we could have any two random integers, <math>a</math> and <math>b</math>, and for the sake of this discussion and clarity, we are talking about positive integers here. Next, it is shown that from <math>a</math> and <math>b</math>, we could construct <math>a \pm b</math>, <math>a \times b</math>, <math>\frac {a}{b}</math> and <math>\sqrt {a}</math>. |
{{{!}}border="1" | {{{!}}border="1" | ||
{{!}}align="center"{{!}}<math>a \pm b</math>{{!}}{{!}}align="center"{{!}}<math>a \times b</math> | {{!}}align="center"{{!}}<math>a \pm b</math>{{!}}{{!}}align="center"{{!}}<math>a \times b</math> | ||
{{!}}- | {{!}}- | ||
- | {{!}}align="center"{{!}}[[Image:A+-b2.png|center|border|300px]]{{!}}{{!}}align="center"{{!}}[[Image:Achub.png|center|border| | + | {{!}}align="center"{{!}}[[Image:A+-b2.png|center|border|300px]]{{!}}{{!}}align="center"{{!}}[[Image:Achub.png|center|border|400px]] |
{{!}}- | {{!}}- | ||
- | {{!}}To construct <math>a \pm b</math>, we will use <math>a</math> as the center and use <math>b</math> as radius. The two points of intersection with the line will be <math>a+b</math> and <math>a-b</math>.{{!}}{{!}}To construct <math>a \times b</math>, we have <math>0</math>, <math>1</math>, <math>a</math> and <math>b</math> on the straight line. | + | {{!}}To construct <math>a \pm b</math>, we will use <math>a</math> as the center and use <math>b</math> as radius. The two points of intersection with the line will be <math>a+b</math> and <math>a-b</math>.{{!}}{{!}}To construct <math>a \times b</math>, we have <math>0</math>, <math>1</math>, <math>a</math> and <math>b</math> on the straight line <math>l_0</math>. |
- | # Draw a straight line through <math>0</math>, call it <math>l_1</math>. <math>l_1</math> could be constructed in many ways. For example, it could be the tangent line to circle centered at <math>a</math> with radius <math> | + | # Draw a straight line through <math>0</math>, call it <math>l_1</math>. <math>l_1</math> could be constructed in many ways. For example, it could be the tangent line to circle centered at <math>a</math> with radius equals to the line segment connecting <math>b</math> to <math>a</math>. |
# Construct circle centered at <math>0</math> with radius <math>b</math>, intersecting <math>l_1</math> at <math>B</math>. | # Construct circle centered at <math>0</math> with radius <math>b</math>, intersecting <math>l_1</math> at <math>B</math>. | ||
# Connect <math>B</math> and <math>1</math>, call it <math>l_2</math>. | # Connect <math>B</math> and <math>1</math>, call it <math>l_2</math>. | ||
Line 145: | Line 143: | ||
{{!}}align="center"{{!}}<math>\frac {a}{b}</math>{{!}}{{!}}align="center"{{!}}<math>\sqrt {a}</math> | {{!}}align="center"{{!}}<math>\frac {a}{b}</math>{{!}}{{!}}align="center"{{!}}<math>\sqrt {a}</math> | ||
{{!}}- | {{!}}- | ||
- | {{!}}align="center"{{!}}[[Image:Axb.png|center|border| | + | {{!}}align="center"{{!}}[[Image:Axb.png|center|border|400px]]{{!}}{{!}}align="center"{{!}}[[Image:Sqrta2.png|center|border|350px]] |
{{!}}- | {{!}}- | ||
{{!}}To construct <math>\frac {a}{b}</math>, we have <math>0</math>, <math>1</math>, <math>a</math> and <math>b</math> on the straight line. | {{!}}To construct <math>\frac {a}{b}</math>, we have <math>0</math>, <math>1</math>, <math>a</math> and <math>b</math> on the straight line. | ||
Line 204: | Line 202: | ||
\end{cases}</math> | \end{cases}</math> | ||
- | To solve for the points of intersection, we only need the operations of addition, subtraction, multiplication and division along with the <u>extraction of square roots</u>. Therefore, from this analysis, we have turned geometric problem into algebraic problem and come to the conclusion that '''a number is constructible if and only if it may be obtained from the integers by repeated use of addition, subtraction, multiplication, division and the extraction of square roots'''.< | + | Similarly, if you there are two circles intersecting, then the two points of intersections have to satisfy the two equations of the circles. To solve for the points of intersection, we only need the operations of addition, subtraction, multiplication and division along with the <u>extraction of square roots</u>. Therefore, from this analysis, we have turned geometric problem into algebraic problem and come to the conclusion that '''a number is constructible if and only if it may be obtained from the integers by repeated use of addition, subtraction, multiplication, division and the extraction of square roots'''.<ref>Bryant, & Sangwin, 2008, p. 77</ref> |
Line 212: | Line 210: | ||
What I have presented above is a simplified version of the derivation towards the theorem. To see a rigorous proof of this theorem at a college level, refer to the text below which is mainly taken from I. N. Herstein's ''Topics in Algebra, Second Edition''. You need some knowledge in Linear Algebra and/or Abstract Algebra. Also see [http://en.wikipedia.org/wiki/Constructible_number Constructible Numbers]. You should not be discouraged should you find it hard to understand. Instead, you should be marveled by the simplicity and elegance of the algebraic proof. | What I have presented above is a simplified version of the derivation towards the theorem. To see a rigorous proof of this theorem at a college level, refer to the text below which is mainly taken from I. N. Herstein's ''Topics in Algebra, Second Edition''. You need some knowledge in Linear Algebra and/or Abstract Algebra. Also see [http://en.wikipedia.org/wiki/Constructible_number Constructible Numbers]. You should not be discouraged should you find it hard to understand. Instead, you should be marveled by the simplicity and elegance of the algebraic proof. | ||
- | {{ | + | {{SwitchPreview|ShowMessage=Click here to show proof|HideMessage=Click here to hide proof|PreviewText=We have proven that if <math>{\color{Gray}a}</math> and <math>{\color{Gray}b}</math> are constructible numbers,|FullText=<blockquote>We have proven that if <math>a</math> and <math>b</math> are constructible numbers, then so are <math>a \pm b</math>, <math>ab</math>, and when <math>b \ne 0</math>, <math>\frac {a}{b}</math>. Therefore, the set of constructible numbers form a subfield, <math>W</math>, of the [http://en.wikipedia.org/wiki/Field_(mathematics)#Constructible_numbers field] of real numbers. |
Line 238: | Line 236: | ||
# '''If <math>a</math> is constructible then <math>a</math> lies in some extension of the rationals of degree a power of 2.''' | # '''If <math>a</math> is constructible then <math>a</math> lies in some extension of the rationals of degree a power of 2.''' | ||
- | # '''If the real number <math>a</math> satisfies an irreducible polynomial over the field of rational numbers of degree <math>k</math>, and if <math>k</math> is not a power of 2, then <math>a</math> is not constructible.''' < | + | # '''If the real number <math>a</math> satisfies an irreducible polynomial over the field of rational numbers of degree <math>k</math>, and if <math>k</math> is not a power of 2, then <math>a</math> is not constructible.'''<ref>Herstein, 1975, p. 229</ref></blockquote>}} |
Line 250: | Line 248: | ||
<li>From the above impossible construction, it follows that it is impossible to "square the circle (that is to construct a square that has the same area as a given circle)" because given a circle with radius 1, which is constructible, the area of the circle will be <math>\pi</math> and we have to construct square with sides equal to <math>\sqrt \pi</math> which is not constructible. Due to this exception, there is no general method to square the circle.</li> | <li>From the above impossible construction, it follows that it is impossible to "square the circle (that is to construct a square that has the same area as a given circle)" because given a circle with radius 1, which is constructible, the area of the circle will be <math>\pi</math> and we have to construct square with sides equal to <math>\sqrt \pi</math> which is not constructible. Due to this exception, there is no general method to square the circle.</li> | ||
<li>We could not double the volume of a given cube. Say we start with cube of volume 1, which is constructible. Then we have to construct cube of volume 2, which means we have to construct sides of <math>\sqrt [3]{2}</math> which is impossible to construct. So we can double the cube.</li> | <li>We could not double the volume of a given cube. Say we start with cube of volume 1, which is constructible. Then we have to construct cube of volume 2, which means we have to construct sides of <math>\sqrt [3]{2}</math> which is impossible to construct. So we can double the cube.</li> | ||
- | <li>We generally can not trisect any given angle because the process involves taking cube root. For example, it is impossible to trisect <math>60^\circ</math>. See below for proof. For more, refer to [http://www.jimloy.com/geometry/trisect.htm#curves Trisection of an Angle]for explanation in great detail. Proof of <math>60^\circ</math> is impossible to trisect. {{HideShowThis|ShowMessage=Click here to show proof|HideMessage=Click here to hide proof|HiddenText=If we could trisect <math>60^\circ</math> by compass and straightedge, then the length <math>a = \cos 20^\circ</math> would be constructible. Since <math>\cos 3\theta = 4\cos^3\theta - 3 \cos \theta</math>. Substituting <math>\theta = 20^\circ</math> and <math>\cos60^\circ=\frac {1}{2}</math>, we obtain <math>4a^3-3a=\frac {1}{2}</math>. Thus <math>a</math> is a root of a cubic polynomial over the rational field. Since this polynomial is irreducible over the rational field and its degree is 3, <math>a</math> is not constructible. Thus <math>60^\circ</math> cannot be trisected.< | + | <li>We generally can not trisect any given angle because the process involves taking cube root. For example, it is impossible to trisect <math>60^\circ</math>. See below for proof. For more, refer to [http://www.jimloy.com/geometry/trisect.htm#curves Trisection of an Angle]for explanation in great detail. Proof of <math>60^\circ</math> is impossible to trisect. {{HideShowThis|ShowMessage=Click here to show proof|HideMessage=Click here to hide proof|HiddenText=If we could trisect <math>60^\circ</math> by compass and straightedge, then the length <math>a = \cos 20^\circ</math> would be constructible. Since <math>\cos 3\theta = 4\cos^3\theta - 3 \cos \theta</math>. Substituting <math>\theta = 20^\circ</math> and <math>\cos60^\circ=\frac {1}{2}</math>, we obtain <math>4a^3-3a=\frac {1}{2}</math>. Thus <math>a</math> is a root of a cubic polynomial over the rational field. Since this polynomial is irreducible over the rational field and its degree is 3, <math>a</math> is not constructible. Thus <math>60^\circ</math> cannot be trisected.<ref>Herstein, 1975, p. 230</ref>}} |
</li> | </li> | ||
<li>There are certain polygons that are impossible to construct. See [http://en.wikipedia.org/wiki/Constructible_polygon Constructible polygon] for more detail. </li> | <li>There are certain polygons that are impossible to construct. See [http://en.wikipedia.org/wiki/Constructible_polygon Constructible polygon] for more detail. </li> | ||
Line 275: | Line 273: | ||
*):http://www.ams.org/notices/200004/fea-hartshorne.pdf | *):http://www.ams.org/notices/200004/fea-hartshorne.pdf | ||
=Notes= | =Notes= | ||
- | + | <references/> | |
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | + | ||
|References= | |References= | ||
#Peterson, D. (2003, November 21). Collapsible Compass. Retrieved from The Math Forum: http://mathforum.org/library/drmath/view/66052.html | #Peterson, D. (2003, November 21). Collapsible Compass. Retrieved from The Math Forum: http://mathforum.org/library/drmath/view/66052.html |
Revision as of 11:18, 20 July 2010
- This image shows the step by step construction of a hexagon inscribed in the circle using a compass and a unmarked straightedge.
Creating a regular hexagon with a ruler and compass |
---|
Contents |
Basic Description
Let's assume we only have a compass and a unmarked straightedge. What can we construct and how can we construct them? That were the problems that Euclid pondered not only because those were probably the only instruments that he had at his time but also he wanted to build his theorems with as few assumptions, or axioms , as possible.^{[1]} With these two simple tools, he managed to build myriad of theorems in both plane and solid geometry .^{[2]} These theorem are as good as they were some 2000 years ago and it is this enduring quality of Euclid's work that inspired this page. In the main image of this page, we want to divide the circle into six equal arcs and then connect consecutive points to form the hexagon. It seems to be a fairly simple construction but you should be prompted to ask two questions: Are other polygons constructible and is every polygon constructible, that is able to be constructed using only compass and straightedge? To extend the question, what are constructible and what are not? That is the problem that is resolved in this page.A More Mathematical Explanation
- Note: understanding of this explanation requires: *A little Geometry and Some Abstract Algebra
What is Compass & Straightedge Constructions
Introduction
We start by familiarizing ourselves with Euclid's three Postulates in his books Elements.
Let it be granted 1. that a straight line may be drawn from any one point to any other point; 2. that a line segment may be extended into a straight line; 3. that given any straight line segment, a circle may be described having the segment as radius and one endpoint as center.^{[3]}
It should be carefully noted that Euclid started with two given points and produced a line segment, from where he could extend into a straight line if he pleased. Then ONLY from the original two points, he could use one point as center and ONLY spread the legs of compass the distance of the line segment to produce a circle. Even though many translations say that a circle can be described at any center with any radius, we must take it with a pinch of salt. He could not specify any points and lengths other than what was already given, that is to say he could not claim "I wanted to spread the legs of the compass centimeters apart (or any specified denominations) with the center of the circle half way between the two given points". In all, Compass and Straightedge Constructions only allow us to start with points (and hence lengths) we have been given(or constructed from given points), and create the ones we don't. |
Thus, we define Compass & Straightedge Construction as the construction of points, lengths, angles, and circles using only ideal straightedge and compass. A straightedge is infinite in length, has no markings on it and only one edge. A compass has two legs, one end of which is fixed on the plane of construction and the other end is of given distance away and maintains the distance throughout the construction. It collapses when lifted from the page, so may not be directly used to transfer distances. However, it turns out that this restriction makes no difference due to the Compass Equivalence Theorem which was stated as Proposition II of Book I of Euclid's Elements. It stated that from a given point, it was possible to construct a line segment equal to a given line segment using collapsible compass in any desirable direction. Euclid's proof for the Compass Equivalence Theorem will be presented after the section of Basic Construction. Since Euclid has proven this using only the three postulates, then he did not have to use a collapsible compass any more.^{[4]} |
Some Basic Constructions
The constructions below are some basic ones from where many more constructions are possible and they are by no means exhaustive. In the figures below, what we are given are in blue; intermediate steps are in dotted black; the resulting products are in red. The proofs for these constructions are relatively simple and only require the knowledge of congruent triangles. Euclid derived the theories on congruency and congruent triangles directly from his Postulates. Try proving the theorems yourself!
Line Segment Bisection
Given points and and the straight line passing through it. Construc [...]
Given points and and the straight line passing through it. Construct a line that bisects line segment .
- Draw a circle centered at point with radius equals .
- Next, draw a circle centered at point with the same radius.
- Where the two circles intersect, call those points and .
- Draw a line through points and .
Line intersects line segment at the midpoint. It should be noted that as well. Line is the perpendicular bisector of line segment .
Angle Bisection
Given angle , construct a line that bisects the angle.
Given angle , construct a line that bisects the angle.
- Construct a circle centered at point with radius . This circle intersects and at point and .
- Keeping the same radius, draw a circle at point and respectively. Where they intersect each other, call it point .
- Draw a line through point and . This line bisects .
Perpendicular Through a Point
Given a point on a line , construct a line that is perpendicular to [...]
Given a point on a line , construct a line that is perpendicular to the given line through .
- Draw a circle centered at with radius (or . It is your choice).
- Where the circle intersects the original line, construct a perpendicular bisector (see the line segment bisector construction above).
Parallel Line
Given two points, and and the straight line passing through them, c [...]
Given two points, and and the straight line passing through them, construct a line that is parallel to the given line through another given point .
- Draw a circle at , crossing . Where the circle intersects , call the point .
- Centered at points and , draw circles crossing point . Where these two circles intersect each other, call it .
- Draw a line through point and .
is parallel to
Tangent Line to a Circle
Given a circle centered at and another given point, , construct a li [...]
Given a circle centered at and another given point, , construct a line that is tangent to the circle.
- Draw a line through point and the center of the circle
- Let be the midpoint of (construction omitted since we know how to construction mid point). Draw the circle centered at going through and .
- Let the point where the two circles meet be . Draw line segment .
To see that is tangent to the circle, connect . is an inscribed angle in the circle about , so the inscribed is . So is perpendicular to . Since the tangent is perpendicular to the radius to the point of tangency, by the uniqueness of the line through perpendicular to , is tangent to the circle.
Euclid's Proof of Compass Equivalence Theorem
This part refers back to the previous section about the issue of compass being collapsible. Euclid's original proof is presented. Additional comments are contained in the parenthesis.
From a given point to draw a line segment equal to a given line segment.
From a given point to draw a line segment equal to a given line segment. Let be the given point, and the given straight line : it is required to draw from the point a straight line equal to .
- From the point to draw the straight line and on it describe the equilateral triangle (this is done the same way as bisecting a line segment), and produce the straight lines , to and .
- From the center , at the distance (with the radius equals to ), describe the circle , meeting at .
- From the center , at the distance (with the radius equals to ), describe the circle , meeting at .
- shall be equal to .
Because the point is the center of the circle , is equal to . And because the point is the center of the circle , is equal to and , parts of them are equal therefore the remainder is equal to the remainder .
But it has been shewn that is equal to ; therefore and are each of them equal to . But things which are equal to the same thing are equal to one another. Therefore is equal to .
Wherefore from the given point a straight line has been drawn equal to the given straight line .∎^{[5]}
It should be noted that from , we could "duplicate" in all directions by construct a circle centered at with radius .
Algebraicization of Compass & Straightedge Constructions
What is Algebraicization?
From the few basic constructions, you would have probably realized that the different possibilities seems infinite. Hence, mathematician are curious to find out what are constructible and what aren't and for this purpose, the language of pure geometry seems to have "limited vocabulary"^{[6]}. Back in ancient times, mathematicians had limited algebraic knowledge and were more familiar with geometry. But in modern times, the reverse is true. Hence, today's mathematicians go back to their familiar realm of Algebra and try to find the link between geometry and algebra.
Algebraicization is the translation of any problem statements into algebraic problems. In the case of Compass & Straightedge construction, we algebraicize each step of a straightedge and compass construction, and consequently obtaining general results about the nature of constructibility. Hilda P. Hudson put it aptly in his lecture Ruler & Compasses,
"each step of a ruler and compass construction is equivalent to a certain analytical process; it is found that the power to use a ruler corresponds exactly to the power to solve linear equations, and the power to use compasses to the power to solve quadratics...... Since each step of a ruler and compass construction is equivalent to the solution of an equation of the first or second degree, we consider that these algebraic processes can lead to , when combined in every possible way, and that enables us to answer the question before us......"^{[7]}
Hudson lectured on this in the early 20th century and certain phrases of his could potentially cause confusion. The take-away from this paragraph is that in order to algebraicize straightedge and compass construction, we begin by designating a given point as the origin and the coordinates of another given point (we are given two points at least) as or . Thus we have established the Cartesian Coordinates. Then, every time we construct a straight line or a circle, we think of it instead as adding a new equation into a system of equations. These equations represent the coordinates of all the points on the line or circle, but that is easy since we all know the expression for a line and a circle as and . However, the only times we can pinpoint a point (and find its coordinates as a result) is when a line intersects with a line, or a circle, or a circle intersects with another circle in which case we can pinpoint 2 points. We then conclude that only those coordinates of the points of intersections are constructible. In this way, a geometric process is translated into an algebraic process.
A Simple Derivation
Firstly, we define "1" on a straight line as stated previously. Then, once you have chosen that point to be , you have to stick to this specification throughout your construction. Next, it is very obvious that we could construct all the integers, that is (or = ). How so? Well, once we have the "1", all we have to do is to use the Compass Equivalence Theorem finite number of times to duplicate the length "1" that we previously defined. Now, that means that we could have any two random integers, and , and for the sake of this discussion and clarity, we are talking about positive integers here. Next, it is shown that from and , we could construct , , and .
To construct , we will use as the center and use as radius. The two points of intersection with the line will be and . | To construct , we have , , and on the straight line .
The distance between and that point is . |
To construct , we have , , and on the straight line.
The distance between and that point is . |
Therefore, it has been proven that we could construction all the rational numbers since and are any arbitrary integers. The natural question to ask right now is that what else is possible to construct? It is not hard to think of numbers that are not rational. For example, is constructible. Construct a unit square and the diagonal is of length . So is it possible to construct given any constructible number ? It turns out that we could. See below for method. To construct , we start with , , .
.Again, I will leave the proof to you as well using similar triangles. |
I will leave the proofs to you since they are very simple using similar triangles.
Next, we moved to the general solution of the problem.
Assume we have two points and with coordinates and . Take an arbitrary point on the line.
By similar triangle, .
Rearranging the above we have
.
Since , , and are constant we can express this as which is the general expression of a straight line.
Now, if we have two lines specified by four given points, ,...., with coordinates . The intersection of the two lines, will satisfy two equations
You may say that the there might not be a solution. True the two lines do not have to intersect. But if they do, we only need the operations of addition, subtraction, multiplication and division to find the point.
Now, we move onto circle. Say we have circle centered at some point with coordinates and radius . We know that the explicit expression for a circle is . Hence, if that circle intersects with one of the straight lines, then the points of intersection will satisfy
Similarly, if you there are two circles intersecting, then the two points of intersections have to satisfy the two equations of the circles. To solve for the points of intersection, we only need the operations of addition, subtraction, multiplication and division along with the extraction of square roots. Therefore, from this analysis, we have turned geometric problem into algebraic problem and come to the conclusion that a number is constructible if and only if it may be obtained from the integers by repeated use of addition, subtraction, multiplication, division and the extraction of square roots.^{[8]}
A Rigorous Proof
What I have presented above is a simplified version of the derivation towards the theorem. To see a rigorous proof of this theorem at a college level, refer to the text below which is mainly taken from I. N. Herstein's Topics in Algebra, Second Edition. You need some knowledge in Linear Algebra and/or Abstract Algebra. Also see Constructible Numbers. You should not be discouraged should you find it hard to understand. Instead, you should be marveled by the simplicity and elegance of the algebraic proof.
We have proven that if and are constructible numbers,
We have proven that if and are constructible numbers, then so are , , and when , . Therefore, the set of constructible numbers form a subfield, , of the field of real numbers. In particular, since , must contain , the field of rational numbers. If , we can reach from the rational field by a finite number of constructions. Let be any subfield of the field of the field of real numbers. Consider all the points in the real Euclidean plane both of whose coordinates and are in ; we call the set of these points the plane of . Any straight line joining two points in the plane of has an equation of the form where ,, are all in . Moreover,any circle having as center a point in the plane of and having as radius an element of has equation of the form , where all of , , are in . Given two lines in which intersect in the real plane, then their intersection point is a point in the plane of . On the other hand, the intersection of a line in and a circle in need not yield a point in the plane of . But, using the fact that the equation of a line in is of the form and that of a circle in F is of the form , where , , , , , are all in , we can show that when a line and circle of intersect in the real plane, they intersect either in a point in the plane of or in the plane of for some positive in . Finally, the intersection of two circles in can be realized as that of a line in and a circle in , for if these two circles are and , then their intersection is the intersection of either of these with the line , so also yields a point either in the plane of or of for some positive in . Thus lines and circles of lead us to points either in or in quadratic extensions of . If we now are in intersect in in points in the plane of where is a positive number in . A point is constructible from is we can find real numbers , such that , , , such that the point is in the plane of . Conversely, if is such that is real then we can realize as an intersection of lines and circles in . Thus a point is constructible from is and only if we can find a finite number of real numbers , such thatand such that our point lies in the plane of . We have defined a real number to be constructible if by use of straightedge and compass we can construct a line segment of length . But this translates, in terms of the discussion above, into: is constructible if starting from the plane of the rational numbers, , we can imbed a in a field obtained from by a finite number of quadratic extensions. And therefore,
- or ;
- or for ;
- If is constructible then lies in some extension of the rationals of degree a power of 2.
- If the real number satisfies an irreducible polynomial over the field of rational numbers of degree , and if is not a power of 2, then is not constructible.^{[9]}
Why is it interesting?
What is Impossible to Construct (of course, using compass and straightedge alone)?
Below is the brief introduction of a few of the impossible constructions. Remember that a number is constructible if and only if it may be obtained from the integers by repeated use of addition, subtraction, multiplication, division and the extraction of square roots.
- cannot be obtained from the integers by repeated use of addition, subtraction, multiplication, division and the extraction of square roots. In fact, belongs to a special class of numbers called the transcendental number that does not satisfy any rational polynomials. In other words, is not a solution of any polynomials with rational coefficients. Too see complete proof that is transcendental, see Transcendental number and The 15 Most Famous Transcendental Numbers.
- From the above impossible construction, it follows that it is impossible to "square the circle (that is to construct a square that has the same area as a given circle)" because given a circle with radius 1, which is constructible, the area of the circle will be and we have to construct square with sides equal to which is not constructible. Due to this exception, there is no general method to square the circle.
- We could not double the volume of a given cube. Say we start with cube of volume 1, which is constructible. Then we have to construct cube of volume 2, which means we have to construct sides of which is impossible to construct. So we can double the cube.
- We generally can not trisect any given angle because the process involves taking cube root. For example, it is impossible to trisect . See below for proof. For more, refer to Trisection of an Anglefor explanation in great detail. Proof of is impossible to trisect.
- There are certain polygons that are impossible to construct. See Constructible polygon for more detail.
Number 2, 3 and 4 are the so-called Geometric Problems of Antiquity. Though they have been proven impossible to construct with straightedge and compass, it does not deter amateur mathematicians to come up with false proofs even today.
Teaching Materials
- There are currently no teaching materials for this page. Add teaching materials.
About the Creator of this Image
Wikipedia, Powerpoint and Flash
Related Links
Additional Resources
- ):http://planetmath.org/
- ):http://hptgn.tripod.com/
- ):http://en.wikipedia.org/wiki/Compass_and_straightedge_constructions
- ):http://www.mathopenref.com/tocs/constructionstoc.html
- ):http://mathforum.org/library/drmath/view/66052.html
- ):http://mathforum.org/library/drmath/view/52601.html
- ):http://www.ams.org/notices/200004/fea-hartshorne.pdf
Notes
- ↑ Peterson, 2003
- ↑ Hartshorne, 2000
- ↑ Taylor, 1895, p. 14
- ↑ Peterson, 2003
- ↑ Taylor, 1895, p. 18
- ↑ Hudson, 1916, p. 3
- ↑ Hudson, 1916, p. 3
- ↑ Bryant, & Sangwin, 2008, p. 77
- ↑ Herstein, 1975, p. 229
- ↑ Herstein, 1975, p. 230
References
- Peterson, D. (2003, November 21). Collapsible Compass. Retrieved from The Math Forum: http://mathforum.org/library/drmath/view/66052.html
- Hartshorne, Robin . (2000). Teaching geometry according to euclid. NOTICES OF THE AMS, 47(4), 460-465.
- Taylor, H. M. (1895). Euclid's elements of geometry. Cambridge: Cabridge University Press.
- Hudson, H. P. (1916). Ruler & compass. London: Longmans Green & Company, Inc..
- Bryant, John, & Sangwin, Christopher. (2008). How Round is your circle?. Princeton & Oxford: Princeton Univ Pr.
- Herstein, I. N. (1975). Topics in algebra. John Wiley & Sons Inc.
Leave a message on the discussion page by clicking the 'discussion' tab at the top of this image page.