<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dw="https://www.dreamwidth.org">
  <id>tag:dreamwidth.org,2017-07-05:3235205</id>
  <title>Anhinga anhinga</title>
  <subtitle>anhinga_anhinga</subtitle>
  <author>
    <name>anhinga_anhinga</name>
  </author>
  <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/"/>
  <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom"/>
  <updated>2021-07-19T11:21:37Z</updated>
  <dw:journal username="anhinga_anhinga" type="personal"/>
  <entry>
    <id>tag:dreamwidth.org,2017-07-05:3235205:84277</id>
    <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/84277.html"/>
    <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom/?itemid=84277"/>
    <title>OpenAI Codex (статья года), JuliaCon 2021 starts on July 20</title>
    <published>2021-07-19T09:09:29Z</published>
    <updated>2021-07-19T11:21:37Z</updated>
    <dw:security>public</dw:security>
    <dw:reply-count>1</dw:reply-count>
    <content type="html">&amp;quot;Evaluating Large Language Models Trained on Code&amp;quot;, &lt;a href="https://arxiv.org/abs/2107.03374"&gt;arxiv.org/abs/2107.03374&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;A detailed description of early (pre-GitHub Copilot) versions of OpenAI Codex. This is the &amp;quot;paper of the year&amp;quot; so far: we finally have real progress in AI-assisted computer programming (and difficulties of computer programming form the key bottleneck limiting the speed of progress).&lt;br /&gt;&lt;br /&gt;See comments in&amp;nbsp;&lt;a href="https://dmm.dreamwidth.org/44860.html"&gt;dmm.dreamwidth.org/44860.html&lt;/a&gt; for details.&lt;br /&gt;&lt;br /&gt;JuliaCon 2021 starts on July 20 with 8 days of workshops followed by 3 days of main conference. JuliaCon 2020 was great, this is likely to be even better.&lt;br /&gt;&lt;br /&gt;This is a fully virtual conference for the second year in a row; the registration is free and needed to access interactive features, poster sessions, and such, but the bulk of materials will be accessible via YouTube without registration. I created a post&amp;nbsp;&lt;a href="https://dmm.dreamwidth.org/46160.html"&gt;dmm.dreamwidth.org/46160.html&lt;/a&gt; with links and I'll keep populating it with various comments as the conference progresses.&lt;br /&gt;&lt;br /&gt;Cross-post: &lt;a href="https://anhinga-anhinga.livejournal.com/85003.html"&gt;anhinga-anhinga.livejournal.com/85003.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=anhinga_anhinga&amp;ditemid=84277" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
  <entry>
    <id>tag:dreamwidth.org,2017-07-05:3235205:84201</id>
    <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/84201.html"/>
    <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom/?itemid=84201"/>
    <title>9 months since GPT-3 revolution</title>
    <published>2021-02-28T08:44:10Z</published>
    <updated>2021-02-28T08:57:45Z</updated>
    <dw:security>public</dw:security>
    <dw:reply-count>1</dw:reply-count>
    <content type="html">On May 28, 2020 OpenAI published the GPT-3 paper, &amp;quot;Language Models are Few-Shot Learners&amp;quot;, &lt;a href="https://arxiv.org/abs/2005.14165"&gt;arxiv.org/abs/2005.14165&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This was the &amp;quot;AlexNet moment of the Transformer Revolution&amp;quot;, and the qualitative jump was even more significant than the AlexNet 2012 jump.&lt;br /&gt;&lt;br /&gt;One extremely strange and remarkable property of GPT-3  is that purely linguistic knowledge in this model is often sufficient to guess a piece of correct computer code from a natural language description of a problem (even though we don't think this model &amp;quot;truly understands programming&amp;quot;).&lt;br /&gt;&lt;br /&gt;There was already quite a boom in these novel models (invented as recently as 2017), after BERT and GPT-2, but now the field had just exploded: &amp;quot;efficient transformers&amp;quot;, &amp;quot;vision transformers&amp;quot;, &amp;quot;multimodal transformers&amp;quot;, etc.&lt;br /&gt;&lt;br /&gt;And tons of interesting work were done in hybrid models which combined transformers and other attention-based models with all kinds of other techniques. Hybrids of all kinds of methods with transformers and other attention-based models are probably the future. For example, the famous Alpha Fold 2 by DeepMind which &amp;quot;solved&amp;quot; protein folding in November was a hybrid model with an attention-based component at its center: &lt;a href="https://deepmind.com/blog/article/alphafold-a-solution-to-a-50-year-old-grand-challenge-in-biology"&gt;deepmind.com/blog/article/alphafold-a-solution-to-a-50-year-old-grand-challenge-in-biology&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;All this means two things: 1)&amp;nbsp;&amp;quot;True AI&amp;quot; can emerge any moment; we are seeing a flood of breakthroughs now, and one of them (or a short sequence of them) might result in a much more radical shift than anything we've seen so far. I don't know if it happens this year, but it's a possibility (from invention of convolutional neural nets in 1989 it took more than 20 years to AlexNet, from invention of Transformers in 2017 it took only 3 years to GPT-3, things can really start happening very fast).&lt;br /&gt;&lt;br /&gt;2) If you are a practitioner in any field (especially if this field is machine learning of some kind), it makes sense to ponder hybrids between your favorite methods and &amp;quot;attention&amp;quot; (which is just a linear combination of high-dimensional vectors [sometimes with all coefficients being non-negative and summing up to 1]), hybrids between your favorite methods and matrix multiplication (which is just a way to compute a lot of linear combinations of high-dimensional vectors rapidly), hybrids between your favorite methods and Transformers (a certain way of arranging those matrix multiplications and interleaving them with modest neural connectors). This is likely to be a very fruitful thing, and this is how you can supercharge your favorite methods and produce novel results.&lt;br /&gt;&lt;br /&gt;Cross-post: &lt;a href="https://anhinga-anhinga.livejournal.com/84392.html"&gt;anhinga-anhinga.livejournal.com/84392.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=anhinga_anhinga&amp;ditemid=84201" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
  <entry>
    <id>tag:dreamwidth.org,2017-07-05:3235205:83769</id>
    <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/83769.html"/>
    <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom/?itemid=83769"/>
    <title>Julia programming language</title>
    <published>2020-02-16T06:43:45Z</published>
    <updated>2020-02-16T06:44:32Z</updated>
    <dw:security>public</dw:security>
    <dw:reply-count>2</dw:reply-count>
    <content type="html">Julia is an unusual language. It is based around the idea of &amp;quot;eating  your cake and having it too, again and again&amp;quot;. Flexible and very fast at  the same time, friendly readable syntax and Lisp-strength macros and  multiple dispatch, etc:&lt;br /&gt;&lt;br /&gt;&lt;a href="https://julialang.org/blog/2012/02/why-we-created-julia/" target="_self"&gt;https://julialang.org/blog/2012/02/why-we-created-julia/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Julia Flux&lt;/b&gt;  is trying to become the next generation machine learning framework, and  is also characterized by this approach of &amp;quot;eating your cake and having  it too&amp;quot;. If TensorFlow 1.0 is the past, and PyTorch is the leading  state-of-the-art framework of the present, &lt;b&gt;Julia Flux&lt;/b&gt; is quite likely to become the machine learning framework of the future; see the first comment in this blog post for details:&lt;br /&gt;&lt;br /&gt;&lt;a href="https://dmm.dreamwidth.org/23453.html" target="_self"&gt;https://dmm.dreamwidth.org/23453.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Does anyone here use Julia, or does anyone here knows someone who uses Julia?&lt;/strong&gt;  &lt;br /&gt;&lt;br /&gt;Crosspost: &lt;a href="https://anhinga-anhinga.livejournal.com/84046.html"&gt;anhinga-anhinga.livejournal.com/84046.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=anhinga_anhinga&amp;ditemid=83769" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
  <entry>
    <id>tag:dreamwidth.org,2017-07-05:3235205:83583</id>
    <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/83583.html"/>
    <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom/?itemid=83583"/>
    <title>2019: shaders, dreamwidth, and more</title>
    <published>2019-12-31T14:00:12Z</published>
    <updated>2019-12-31T14:08:57Z</updated>
    <dw:security>public</dw:security>
    <dw:reply-count>0</dw:reply-count>
    <content type="html">I hope for us all to have a creative and safe New Year.&lt;br /&gt;&lt;br /&gt;Computer art news: I started to play with OpenGL shaders and with Shadertoy.com: &lt;a href="https://dmm.dreamwidth.org/20076.html"&gt;dmm.dreamwidth.org/20076.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.shadertoy.com/media/shaders/3tSGzV.jpg" alt="" /&gt;&lt;br /&gt;&lt;br /&gt;Other news: books and stories, my own texts, open source activity and software experiments, employment change, etc:&amp;nbsp;&lt;a href="https://dmm.dreamwidth.org/23061.html"&gt;dmm.dreamwidth.org/23061.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I generally shifted quite a bit towards dreamwidth and this blog during  this year, and away from livejournal; this was not planned, but just  happened &amp;quot;organically&amp;quot;. Most of my activity this year was at &lt;a href="https://dmm.dreamwidth.org/"&gt;dmm.dreamwidth.org&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Crosspost: &lt;a href="https://anhinga-anhinga.livejournal.com/83885.html"&gt;anhinga-anhinga.livejournal.com/83885.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=anhinga_anhinga&amp;ditemid=83583" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
  <entry>
    <id>tag:dreamwidth.org,2017-07-05:3235205:83296</id>
    <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/83296.html"/>
    <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom/?itemid=83296"/>
    <title>Self-referential neural nets in 2018</title>
    <published>2018-12-05T18:03:16Z</published>
    <updated>2019-09-01T04:30:28Z</updated>
    <category term="dataflow matrix machines"/>
    <dw:security>public</dw:security>
    <dw:reply-count>12</dw:reply-count>
    <content type="html">Two series of experiments with self-referential neural nets with vector flows (&lt;strong&gt;&amp;quot;dataflow matrix machines&amp;quot;&lt;/strong&gt;) were done by us in 2018.&lt;br /&gt;&lt;br /&gt;The ability of a neural net to modify itself on the fly was used to edit it interactively while it is running (&amp;quot;livecoding&amp;quot;). This also opens the way to have populations of neural nets editing each other.&lt;br /&gt;&lt;br /&gt;Emerging &amp;quot;sleep-wake&amp;quot; behavior and other emerging bistability patterns were observed in randomly initialized neural nets (May 2019 update: a couple of video recordings of those behaviors are posted: &lt;a href="https://youtu.be/_mZVVU8x3bs" rel="nofollow"&gt;https://youtu.be/_mZVVU8x3bs&lt;/a&gt; and &lt;a href="https://youtu.be/CKVwsQEMNjY" rel="nofollow"&gt;https://youtu.be/CKVwsQEMNjY&lt;/a&gt; ). There is no theoretical understanding of this emerging dynamics yet.&lt;br /&gt;&lt;br /&gt;&lt;span class="cut-wrapper"&gt;&lt;span style="display: none;" id="span-cuttag___1" class="cuttag"&gt;&lt;/span&gt;&lt;b class="cut-open"&gt;(&amp;nbsp;&lt;/b&gt;&lt;b class="cut-text"&gt;&lt;a href="https://anhinga-anhinga.dreamwidth.org/83296.html#cutid1"&gt;Details&lt;/a&gt;&lt;/b&gt;&lt;b class="cut-close"&gt;&amp;nbsp;)&lt;/b&gt;&lt;/span&gt;&lt;div style="display: none;" id="div-cuttag___1" aria-live="assertive"&gt;&lt;/div&gt;&lt;br /&gt;Crosspost: &lt;a href="https://anhinga-anhinga.livejournal.com/83697.html"&gt;anhinga-anhinga.livejournal.com/83697.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Other blogs by this author:&lt;br /&gt;&lt;br /&gt;&lt;a href="https://dmm.dreamwidth.org/"&gt;https://dmm.dreamwidth.org/&lt;/a&gt;&amp;nbsp; (partial mirror: &lt;a href="https://anhinga-travel.livejournal.com/"&gt;https://anhinga-travel.livejournal.com/&lt;/a&gt; )&lt;br /&gt;&lt;a href="https://anhinga-drafts.livejournal.com/"&gt;https://anhinga-drafts.livejournal.com/&lt;/a&gt;&amp;nbsp; (mirror: &lt;a href="https://anhinga-drafts.dreamwidth.org/"&gt;https://anhinga-drafts.dreamwidth.org/&lt;/a&gt; )&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=anhinga_anhinga&amp;ditemid=83296" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
  <entry>
    <id>tag:dreamwidth.org,2017-07-05:3235205:83104</id>
    <link rel="alternate" type="text/html" href="https://anhinga-anhinga.dreamwidth.org/83104.html"/>
    <link rel="self" type="text/xml" href="https://anhinga-anhinga.dreamwidth.org/data/atom/?itemid=83104"/>
    <title>Dataflow matrix machines as a bridge between programs and neural nets</title>
    <published>2017-12-31T17:58:51Z</published>
    <updated>2017-12-31T18:01:27Z</updated>
    <category term="dataflow matrix machines"/>
    <dw:security>public</dw:security>
    <dw:reply-count>0</dw:reply-count>
    <content type="html">Continuing the last couple of posts&lt;br /&gt;&lt;br /&gt;&lt;a href="https://anhinga-anhinga.livejournal.com/82953.html"&gt;https://anhinga-anhinga.livejournal.com/82953.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;and&lt;span class="cut-wrapper"&gt;&lt;span style="display: none;" id="span-cuttag___1" class="cuttag"&gt;&lt;/span&gt;&lt;b class="cut-open"&gt;(&amp;nbsp;&lt;/b&gt;&lt;b class="cut-text"&gt;&lt;a href="https://anhinga-anhinga.dreamwidth.org/83104.html#cutid1"&gt;Read more...&lt;/a&gt;&lt;/b&gt;&lt;b class="cut-close"&gt;&amp;nbsp;)&lt;/b&gt;&lt;/span&gt;&lt;div style="display: none;" id="div-cuttag___1" aria-live="assertive"&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;img src="https://www.dreamwidth.org/tools/commentcount?user=anhinga_anhinga&amp;ditemid=83104" width="30" height="12" alt="comment count unavailable" style="vertical-align: middle;"/&gt; comments</content>
  </entry>
</feed>
