anhinga_anhinga | A robot with impressive language capabilities

You're viewing

anhinga_anhinga's journal
Create a Dreamwidth Account Learn More

Reload page in style: site light

This is a prototype of an elderly home care robot developed by a very small group at IBM (with the benign indifference from their employer corporation that does not want to deal with robots and headaches and liabilities associated with robots). Its ability to verbally communicate with a human and to learn from a human is very impressive (basically, one can program this robot to a large extent simply by talking to it). Here is a demo video from the talk at the AGI-12 conference:

http://www.youtube.com/watch?v=M2RXDI3QYNU

The paper itself, "An Extensible Language Interface for Robot Manipulation", explaining to some extent how this works can be found here:

http://www.mindmakers.org/boards/18/topics/73

and the free online version of AGI-12 proceedings is here (scroll down to AGI-12 Contributed Paper Sessions):

http://www.mindmakers.org/projects/agiconf-2012/wiki/Schedule

Flat | Top-Level Comments Only

From:

x-ghbdtn.livejournal.com

Ну, фричество - не фричество, но эта статья теоретическая. Матаппарат у них крайне общий, и между ним и когнитивистикой действительно заметный разрыв. Здесь, как от анаграммы Ньютона "полезно решать дифференциальные уравнения" до конкретной механики, - целая пропасть. Но предлагают ли авторы метод, этот разрыв преодолевающий, можно судить, только прочитав их прикладные работы (и есть ли работающие реализации этих методов).

Те моменты теорката, которые затрагиваются у Гольдфарба, можно посмотреть в недавнем обзоре http://arxiv.org/abs/1307.4038 "An alternative Gospel of structure: order, composition, processes", это вступление к "Quantum Physics and Linguistics: A Compositional, Diagrammatic Discourse". Только сам Гольдфарб про теоркат не говорит, а изображает самостоятельное изобретение велосипеда.

From:

bvn-mai.livejournal.com

Я очень рассчитываю, что Вы напишите свое мнение об "..."Articulatory Speech Structures" и "Future of Machine Learning". ..", как обещали :). Спасибо за ссылку, но всего лишь инженер, который знает математику чуть лучше обычного инженера из IT-сферы, у меня чисто утилитарный взгляд на математику.

From:

x-ghbdtn.livejournal.com

Я пока успел посмотреть только "Future of Machine Learning", там одна критика текущего положения дел и никаких предложений для Future. Будем смотреть дальше.

From:

x-ghbdtn.livejournal.com

В "Articulatory Speech Structures" метод никак не расрывается, просто говорится, что при тестовой классификации на данных MOCHA
articulatory corpus (я не знаю, как выглядят артикуляторные корпусы - это необработанный звук?) на 14 классов (типов фонем? почему 14?) у них получилось 77% совпадений. Что там за алгоритм, как в нем используются авторские ETS - ни слова.

Будем смотреть дальше, в поисках чего-нибудь содержательного.

From:

x-ghbdtn.livejournal.com

Предыдущее было про https://www.era.lib.ed.ac.uk/handle/1842/928 "Structural representation and matching of articulatory speech structures based on the evolving transformation system (ETS) formalism". Они работают поверх препроцессора, описанного в https://www.era.lib.ed.ac.uk/handle/1842/942 "Detection of Symbolic Gestural Events in Articulatory Data for Use in Structural Representations of Continuous Speech" - он, собственно и обрабатывает речь в дискретное представление.

Алгоритм там описан словами, без особой математики и оригинальной терминологии, он отношения к ETS не имеет. Исходные данные это не звук, если я правильно понял, а Electromagnetic Articulograph (EMA), Laryngograph and Electropalatograph (EPG) measurements. Им сделали кластеризацию, причем индивидуально по человеку (clustering, making use of an efficient variant of k-means described in [9], is applied to the entire data available for the particular speaker), а дальше фиксировали, в какой кластер попадают данные этих артикулографов, ларингографов и электропалатографов. Какие 14 фонем выбраны - там есть табличка. По кластеризации ссылка на “An Efficient k-Means Clustering Algorithm: Analysis and Implementation,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, а их ETS не причем.

From:

bvn-mai.livejournal.com

А статья по Вашей ссылке действительно интересная.

From:

x-ghbdtn.livejournal.com

Другая статья из того же сборника - "Types and forgetfulness in categorical linguistics and quantum mechanics" http://arxiv.org/abs/1303.3170 продолжает подход, уже удостоенный внимания в этом гостеприимном журнале ( http://anhinga-anhinga.livejournal.com/77367.html ).

From:

anhinga-anhinga.livejournal.com

> Types and forgetfulness in categorical linguistics and quantum mechanics

Спасибо, очень интересная ссылка.

Flat | Top-Level Comments Only

Profile

anhinga_anhinga

Mishka's Page

July 2021

S	M	T	W	T	F	S
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Page Summary

x-ghbdtn.livejournal.com - (no subject)

Style Credit

Style: Neutral Good for Practicality by timeasmymeasure

Expand Cut Tags

No cut tags

Page generated Jun. 21st, 2025 10:25 pm

Anhinga anhinga

A robot with impressive language capabilities

A robot with impressive language capabilities

no subject

no subject

no subject

no subject

no subject

no subject

no subject

no subject

Profile

July 2021

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags