anhinga_anhinga | Vapnik@MIT; Cambridge Machine Learning Colloquium

A bit of information related to machine learning and philosophy talks at MIT.

This Friday, Sep 28, at noon, Vladimir Vapnik will give a philosophical talk, "Inductive principles in machine learning and philosophy of science", during the Course 9.S912 class "What is Intelligence?" by Shimon Ullman and Tomaso Poggio. Location: 46-5193 (it will move to 46-3310 if a larger room is required); outsiders are welcome to come and listen. If I can summarize it, I'll update this post later.

On Wednesday, Vapnik gave a talk, "From Rosenblatt's learning model to the model of learning with nontrivial teacher", at the new Cambridge Machine Learning Colloquium and Seminar Series. The main mathematical content was as follows. A) It is well known that the error of a support vector machine is inversely proportional to the number of training samples if the classes are well separated by the kernel in question, but only inversely proportional to the square root of that number (i.e. much more training data is needed) if the classes overlap. B) Vapnik claimed that by introducing a second kernel, used on the training data only (e.g. built from creative and not necessarily well formalizable annotations by human annotators, ranging from mundane things to assigning poetic qualities to training samples), one can make the error inversely proportional to the number of training samples even when the classes overlap with respect to the main, "production" kernel. He drew some far-reaching philosophical conclusions from that, about the importance of culture in human learning and the like. I don't know whether his conclusions can be transferred from support vector machines to other machine learning schemes. But it certainly looked quite interesting.
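To make the difference between the two rates in A) concrete, here is a minimal sketch. It assumes the error behaves like c/n in the well-separated case and like c/sqrt(n) in the overlapping case, with a made-up constant c = 1. The function name `samples_needed` and the constant are my own illustration, not something from the talk, and real bounds also involve capacity and confidence terms that are ignored here.

```python
import math


def samples_needed(target_error, separable, c=1.0):
    """Rough number of training samples needed to reach target_error.

    Assumed (illustrative) error models:
      separable classes:   error ~ c / n        ->  n ~ c / error
      overlapping classes: error ~ c / sqrt(n)  ->  n ~ (c / error) ** 2
    """
    if target_error <= 0:
        raise ValueError("target_error must be positive")
    ratio = c / target_error
    if separable:
        return math.ceil(ratio)
    return math.ceil(ratio ** 2)


if __name__ == "__main__":
    for eps in (0.1, 0.01):
        fast = samples_needed(eps, separable=True)
        slow = samples_needed(eps, separable=False)
        print(f"target error {eps}: ~{fast} samples vs ~{slow} samples")
```

Under these assumptions, a target error of 0.01 takes on the order of 100 samples in the first regime and on the order of 10,000 in the second, which is why moving from the square-root rate to the linear rate matters so much.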

Update: The second talk was video recorded, so there is some chance that a public video will appear. Some material from the first talk was repeated during the last part of the second one (which was 2 hours long). I won't retell the philosophical part. In the machine learning part, he said that instead of Occam's Razor there is a principle of Large Margin, or more precisely, the principle of admitting as many "contradictions" as possible. The contradictions must be situated on the manifold, and not just anywhere in the embedding space, so to generate artificial contradictions people generate "morphs" (e.g. linear combinations, or mixtures of pixels) of objects of different classes. This also reduces the resulting error while training on a fixed data set.
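A minimal sketch of what generating such "morphs" as linear combinations might look like, assuming each object is a flat list of pixel intensities of the same length. The names `morph` and `make_morphs`, the mixing range 0.3 to 0.7, and the toy data are all my own assumptions. How the morphs are then fed into training was not something I can reconstruct from the talk, so that part is left out.

```python
import random


def morph(a, b, t):
    """Pixel-wise convex combination (1 - t) * a + t * b of two objects."""
    if len(a) != len(b):
        raise ValueError("objects must have the same number of pixels")
    return [(1 - t) * p + t * q for p, q in zip(a, b)]


def make_morphs(class_a, class_b, n, seed=0):
    """Generate n morphs, each mixing one object from each class.

    The mixing weight is drawn from [0.3, 0.7] (an arbitrary choice) so
    that each morph is clearly "between" the two classes.
    """
    rng = random.Random(seed)
    morphs = []
    for _ in range(n):
        a = rng.choice(class_a)
        b = rng.choice(class_b)
        morphs.append(morph(a, b, rng.uniform(0.3, 0.7)))
    return morphs


if __name__ == "__main__":
    # Toy 4-pixel "images" with intensities in [0, 1].
    class_a = [[1.0, 1.0, 0.0, 0.0], [0.9, 1.0, 0.1, 0.0]]
    class_b = [[0.0, 0.0, 1.0, 1.0], [0.1, 0.0, 1.0, 0.9]]
    for m in make_morphs(class_a, class_b, n=3):
        print(m)
```

Mixing only real objects of different classes is one simple way to keep the artificial examples near the data rather than scattered arbitrarily over the whole pixel space, which seems to be the point of the "on the manifold" remark.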

If you find out where the recording can be obtained, please let me know!

OK. One could try writing him an e-mail.

Is there somewhere to read the details of point "B"? How does he introduce the second kernel, and how does he use it?

He uses it to write down an additional term in the formula for the distance between points of the training set.

Edited 2012-09-28 13:29 (UTC)

I don't follow :(
I hope this will appear in written form somewhere soon.

So for the training set we simply take the sum of two kernels, one ordinary, and the other as if we were using these additional annotations as features? And then for classification we drop the terms with the additional features (which are unknown to us)?

I think so, yes, that is exactly how it works.

But his e-mail address is listed in various places; one could try asking for the slides, asking questions...
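A minimal sketch of the reading proposed in the question above (train on the sum of two kernels, classify with the main kernel only). This is only an illustration of that reading, not Vapnik's actual algorithm: it swaps the support vector machine for a plain kernel perceptron to stay short, uses linear kernels for both parts, and all names (`train_gram`, `fit_kernel_perceptron`, `predict`) and the toy data are hypothetical.

```python
def dot(a, b):
    """Linear kernel: plain inner product of two feature lists."""
    return sum(p * q for p, q in zip(a, b))


def train_gram(X, X_star):
    """Training Gram matrix: main kernel plus the annotation kernel."""
    n = len(X)
    return [
        [dot(X[i], X[j]) + dot(X_star[i], X_star[j]) for j in range(n)]
        for i in range(n)
    ]


def fit_kernel_perceptron(gram, y, epochs=20):
    """Kernel perceptron on a precomputed Gram matrix (stand-in for an SVM)."""
    n = len(y)
    alpha = [0] * n
    for _ in range(epochs):
        for i in range(n):
            score = sum(alpha[j] * y[j] * gram[j][i] for j in range(n))
            if y[i] * score <= 0:  # mistake (or zero score): bump the weight
                alpha[i] += 1
    return alpha


def predict(alpha, X, y, x_new):
    """Classify with the main kernel only; annotations are unknown here."""
    score = sum(a * yi * dot(xi, x_new) for a, yi, xi in zip(alpha, y, X))
    return 1 if score >= 0 else -1


# Toy data: ordinary features X, labels y, and extra annotation
# features X_star that exist only for the training samples.
X = [[1.0, 0.2], [0.8, -0.1], [-1.0, 0.1], [-0.9, -0.3]]
y = [1, 1, -1, -1]
X_star = [[1.0], [1.0], [-1.0], [-1.0]]

alpha = fit_kernel_perceptron(train_gram(X, X_star), y)

if __name__ == "__main__":
    print(predict(alpha, X, y, [0.9, 0.0]))
    print(predict(alpha, X, y, [-0.7, 0.1]))
```

The only point of the sketch is the asymmetry: `train_gram` sees both kernels, while `predict` evaluates just the main one. Whether this matches the formulas on the slides is exactly what the question above is asking.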

I have the slides of both talks and can send them by e-mail.

I would be very glad!
algol@mccme.ru

sent

Received. Thank you :)

>claimed that by introducing a second kernel to be used on the training data only (e.g. some creative and not necessarily well formalizable annotations by human annotators, ranging from mundane things to assigning poetic qualities to training samples) one can make the error inversely proportional to the number of training samples even when classes overlap with respect to the main

Isn't this exactly what regularization theory is about (see "Ill-Posed Problems" by A. N. Tikhonov and V. Ya. Arsenin, 1974)?

He said no :-) (But there does seem to be some connection.)

Then please tell him that his book was my desk reference a few years ago.

:-) I'm afraid my acquaintance with him is limited to my having asked him to e-mail me a copy of the slides (which he did) :-)

Happy Birthday!

Thank you!

Congratulations on your birthday! Happy Birthday!

Thank you!

Congratulations!

Thank you!

Happy Birthday!

Thank you!

Happy Birthday!

He seems to have talked about interesting things.
Did he give any links to preprints or technical reports?

Thank you!

> Did he give any links to preprints or technical reports?

As far as I remember, no, but I have his slides. My assumption is that they should not be posted online without his permission, but that it is perfectly fine to send them privately by e-mail.

(deleted comment)

Sent :-)

Thank you very much!
There are references to the 2008-2010 papers in there.

Ah, wonderful, I should take a look too! (I hope the video will still appear: I understood some things because he explained them, partly in the course of answering questions, and the slides alone would have been rather hard for me to follow; but perhaps the papers can be understood without oral explanations.)

Yu. Nesterov talked a lot about the difference between (per-step) convergence rates of 1/k and 1/k^2
in his lectures http://ium.mccme.ru/f12/nesterov.html; there the difference is between
following an oracle (black box) and methods that model the oracle in parallel.
Maybe this is somehow related to Vapnik's new work,
and maybe not; one would have to look at his papers.

Very interesting...

Have you used up your limit for creating syndicated feeds on livejournal yet?
If possible, please try creating one for this RSS feed:
http://club.pdmi.ras.ru/moodle/rss/file.php/1/2/forum/1/rss.xml
(with a name such as clubpdmi, pdmiboard, ...)
Although the site itself and the feed will probably only start working tomorrow:
http://meshulash.livejournal.com/113923.html?thread=1459459&#t1459459

It says one has to wait until it starts working:

"There was an error retrieving this URL. The server may be down or the content unavailable at this time. Please verify the URL you have provided and try again."

It will probably start working tomorrow...

http://clubpdmi.livejournal.com/

It works, thank you very much!
