Statistical Learning Theory
Vladimir N. Vapnik
In case you don't think yourself have a strong background in probability theory, I would recommend the book by Ralf Herbrich "Learning Kernel Classifiers". This book seems hard to read in the beginning because of heavy mathematical notation. It is quite easy to follow when you drink some ice cold water and calm down. Especially noteworthy is the derivation of VC-dimension based bounds, which is the few book/papers I read that explain how those strange equations are obtained. In addition, the book "Kernel Methods for Pattern Analysis by Nello Cristianini is also very good and readable. All in all, Vapnik's books is the foundation.
Ссылка удалена правообладателем
----
The book removed at the request of the copyright holder.