Saturday, February 22, 2014

Vandermonde and Discriminants (Graph Theoretic Proof) (14.6.27)

Dummit and Foote Abstract Algebra, section 14.6, exercise 27:

MathJax TeX Test Page Let f(x) be a monic polynomial with roots α1,...,αn.
(a) Show that the discriminant D of f(x) is equal to the square of the Vandermonde determinant. |1α1α21αn111α2α22αn121α3α23αn131αnα2nαn1n|=i>j(αiαj)
(b) Taking the Vandermonde matrix above, multiplying on the left by its transpose, and taking the determinant, show that one obtains D=|p0p1p2pn1p1p2p3pnp2p3p4pn+1pn1pnpn+1p2n2|
where pk=αki, which can be computed in terms of the coefficients of f(x) using Newton's Formulas from exercise 22. This gives an efficient procedure for calculating the discriminant of a polynomial.

Proof: (a) Expanding i>j(αiαj) into sums, one finds that the summands can be represented by the collection of graphs on n points where for every point αi and αj, a "choice" is made according to whether the element from the term (αiαj) is αi or (negative) αj. Visually, this may be depicted by a regular n-gon with exactly one arrow from each vertex to another, with the vertices successively labeled α1,...,αn. Conversely, a sum may be obtained from a diagram of this type by examining how many arrows point to the vertex αi (notated d(i)) and retrieving ±αd(i)i, where the sign is negative if the number of arrows pointing from higher-ordered vertices to lower-ordered vertices is odd.

First, a lemma.

Lemma: Given such a diagram on n-points, if d(i)=d(j) for any ij, then there exist points a,b,c such that abca.
Proof: The lemma is vacuously true for n=2. We shall prove it inductively for n: Given a counterexample, assume there does not exist z such that d(z)=n1. Then for every point, there is an arrow from that point to some other point. Starting from any point x1, one may follow arrows without dead-ends to obtain a path x1x2...xi...xi, so that the existence of a circuit xi...xi is guaranteed. Observe a circuit y1y2...yky1 of minimum distance k: if k3, then either y1 points to y3 or y3 points to y1. In the former case y1y3...yky1 is a shorter circuit violating minimality, and in the latter case so too is y1y2y3y1. Hence k=3 and the lemma's hypothesis is contradicted.

Therefore there is z such that d(z)=n1, i.e. all other points point to αz. Note that in the equality d(i)=d(j) assumed by a counterexample necessarily i,jz as there cannot be two points toward which every point yields an arrow (since one of the two must yield an arrow to the other). As such, examine the diagram on n1 points given by removing αz from the diagram and all the arrows pointing toward it (αz itself yields no arrows) to obtain a counterexample of the lemma on a smaller diagram, an inductive contradiction. This establishes the lemma.

Returning to the view of the diagrams as individual summands, establish a map between positive and negative diagrams of the type in the lemma (d(i)=d(j) for some ij) given by reversing the directions of the arrows on the "first" triangle abca appearing in the diagram (triangle in order of labels a1,b1,c1 is before triangle a2,b2,c2 if the label of a1 is smaller than the label of a2, then compare b1 with b2 then c1 with c2). This map is a bijection as it is its own inverse. This ultimately implies the summands of the form ±αβ11αβ22...αβnn where βi=βj for some ij cancel out in i>j(αiαj), and the only ones remaining are those corresponding to diagrams where one point receives 0 arrows, another 1 arrow, ..., and the last n1 arrows, of which there are exactly n!; in fact, these are all point-permutations of the graph on n points where point αi receives i1 arrows. These permutations and negations correspond precisely to the definition of the determinant of the Vandermonde matrix above.

(b) If the i,j entry of the Vandermonde matrix V is vij=αj1i, then we see the i,j entry of VtV is, nk=1vkivkj=nk=1α(i1)+(j1)k
which is the matrix written above. As described above, Newton's Formulas provide an efficient means of inductively calculating pi in terms of the symmetric functions which appear in the coefficients of f(x), thus giving an efficient means of calculating the discriminant of f(x). 

No comments:

Post a Comment