Support Vector Machines (SVMs)

Karush-Kuhn-Tucker conditions

Sergios Theodoridis

July 14, 2022

The Karush-Kuhn-Tucker (KKT) conditions

“The Karush-Kuhn-Tucker (KKT) conditions play a central role in both the theory and practice of constrained optimization. For the primal problem above, the KKT conditions may be stated (Fletcher, 1987):

The KKT conditions are satisfied at the solution of any constrained optimization problem (convex or not), with any kind of constraints, provided that the intersection of the set of feasible directions with the set of descent directions coincides with the intersection of the set of feasible directions for linearized constraints with the set of descent directions (see Fletcher, 1987; McCormick, 1983)). This rather technical regularity assumption holds for all support vector machines, since the constraints are always linear. Furthermore, the problem for SVMs is convex (a convex objective function, with constraints which give a convex feasible region), and for convex problems (if the regularity condition holds), the KKT conditions are necessary and sufficient for w, b, α to be a solution (Fletcher, 1987). Thus solving the SVM problem is equivalent to finding a solution to the KKT conditions. This fact results in several approaches to finding the solution (for example, the primal-dual path following method mentioned in Section 5).

As an immediate application, note that, while w is explicitly determined by the training procedure, the threshold b is not, although it is implicitly determined. However b is easily found by using the KKT “complementarity” condition, Eq. (21), by choosing any i for which α_i ≠ 0 and computing b (note that it is numerically safer to take the mean value of b resulting from all such equations).

Notice that all we�ve done so far is to cast the problem into an optimization problem where the constraints are rather more manageable than those in Eqs. (10), (11). Finding the solution for real world problems will usually require numerical methods. We will have more to say on this later. However, let�s first work out a rare case where the problem is nontrivial (the number of dimensions is arbitrary, and the solution certainly not obvious), but where the solution can be found analytically.”
Burgess (1998)

Tags

Sergios Theodoridis

I like math and computers, but I’m not too good at them. Like math and computers, I’m not too good at science. When I was in school, my math and science grades were never high enough to get me into a good college. But now that I have the opportunity to work in an office as a receptionist, I don’t have to worry about it anymore. — Former Harvard University student in Computer Science and Economics, Silicon Valley startup founder, and data engineer. Currently working at Facebook as a data scientist. As a former student at Harvard, I was able to pursue my interests in Computer Science and Economics. My academic background led to a career as a software engineer. I became interested in data because of its significance in understanding the world and human behavior. After working at Facebook on the data science team for two years, I now lead all data engineering teams across the company’s products. — I am a student at Cambridge studying Computer Science. Outside of school, I enjoy working on my website, programming, reading and making music. #svms, #supportvectormachines, #machinelearning

Karush-Kuhn-Tucker conditions

The Karush-Kuhn-Tucker (KKT) conditions

Tags

Leave a Reply Cancel reply