书中有些段落还不错,摘录如下: Page 44: A training and design of a support vector machine is an iterative algorithm and it involves the following steps: (a) define your problem as the classification or as the regression one, (b) preprocess your input data: select the most relevant features, scale the data between [-1,1], or to the ones having zero mean and variances equal to one, check for possible outliers (strange data points), (c) select the kernel function that determines the hypothesis space of the decision and regression function in the classification and regression problems respectively, (d) select the "shape", i.e., "smoothing" parameter of the kernel function (for example, polynomial degree for polynomials and variances of the Gaussian RBF kernels respectively), (e) choose the penalty factor C and, in the regression, select the desired accuracy by defining the insensitivity zone epsilon too, (f) solve the QP problem in l and 2l variables in the case of classification and regression problems respectively, (g) validate the model obtained on some previously, during the training, unseen test data, and if not pleased iterate between steps (d) (or, eventually c) and (g).
Page 115: Global learning summarizes the data and provides the practitioners with knowledge on the structure of data, since with the precise modeling phenomena, the observations can be accurately regenerated and therefore thoroughly studied or analyzed. However, this also presents difficulties in how to choose a valid model to describe all the information. In comparison, local learning directly employs part of information critical for the specifically oriented tasks and does not assume a model for the data. Although demonstrated to be superior to global learning in various machine learning tasks, it misses some critical global information.
图书信息:
Title
Support vector machines : theory and applications / Lipo Wang (ed.).