{"id":1884,"date":"2024-04-10T16:34:12","date_gmt":"2024-04-10T14:34:12","guid":{"rendered":"https:\/\/www.pauljorion.com\/blog_en\/?p=1884"},"modified":"2024-04-10T16:36:58","modified_gmt":"2024-04-10T14:36:58","slug":"the-transformer-is-a-thinking-object-by-claude-roux","status":"publish","type":"post","link":"https:\/\/www.pauljorion.com\/blog_en\/2024\/04\/10\/the-transformer-is-a-thinking-object-by-claude-roux\/","title":{"rendered":"<b>The <em>transformer<\/em> is a thinking object<\/b>, by Claude Roux *"},"content":{"rendered":"<p><a href=\"https:\/\/www.pauljorion.com\/blog\/wp-content\/uploads\/Stable-Diffusion-Spinoza.jpeg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-139629\" src=\"https:\/\/www.pauljorion.com\/blog\/wp-content\/uploads\/Stable-Diffusion-Spinoza.jpeg\" alt=\"\" width=\"1024\" height=\"1024\" \/><\/a><\/p>\n<blockquote><p>Spinoza&#8217;s portrait by Stable Diffusion<\/p><\/blockquote>\n<p>A few years back, I had devoured Fr\u00e9d\u00e9ric Lordon&#8217;s book <em>Capitalisme, D\u00e9sir et Servitude<\/em>. In particular, I&#8217;d been fascinated by the notion of <i>conatus<\/i>, the desire for power. <a href=\"https:\/\/heconomist.ch\/2020\/04\/18\/je-suis-spinoziste-partie-iii-frederic-lordon-capitalisme-desir-et-servitude-marx-et-spinoza\/\" target=\"_blank\" rel=\"noopener\">Fr\u00e9d\u00e9ric Lordon explained<\/a> that Spinoza, as a mathematician, had defined <i>conatus<\/i> in the form of mathematical vectors, enabling him to transform his philosophical thinking into mathematical axioms and theorems.<\/p>\n<p>And it was here that I was struck by the resemblance to <em>transformers<\/em>.<\/p>\n<p>Indeed, a transformer takes as input a succession of <em>tokens<\/em> or words, which it replaces with their <i>embedding<\/i>. An embedding is a huge vector that has been assigned to each word or token, enabling them to be compared with each other. So we can, for example, perform mathematical operations such as &#8220;boy = woman &#8211; man + girl&#8221;.<\/p>\n<p>The transformer then combines these vectors via a succession of mathematical transformations learned during training to generate a new vector <i>V<\/i>. This vector is then compared with the set of <i>embeddings<\/i> to identify the closest to generate the next word.<\/p>\n<p>[E1 E2 E3 &#8230; En] &#8211;&gt; TRANSFORM &#8211;&gt; V &#8211;&gt; next token<\/p>\n<p>It thus takes as input a sequence of words for which it constructs a true <i>interpretation in the form of a new semantic vector V.<\/i> At the next iteration, it injects this word at the end of the context and resumes its generation until it ends up with the complete text. <\/p>\n<p>I can&#8217;t help seeing the succession of <i>V<\/i> as a form of <i>conatus<\/i>. Indeed, a model learned with a reinforcement algorithm is <i>intentioned<\/i>, unlike other models, which are <i>supervised<\/i>. This intention is defined in particular by RLHF (Reinforcement learning from human feedback), which forces the model to stay in line, to align itself with human desires, their <i>conatus<\/i> according to Spinoza.<\/p>\n<p>This process can be seen as a means of imposing a particular <i>conatus<\/i> on a model.<\/p>\n<p>According to this interpretation, we are faced with a thinking object.<\/p>\n<p>* <a href=\"https:\/\/www.pauljorion.com\/blog\/2024\/03\/08\/grands-modeles-de-langage-pourquoi-les-reseaux-neuronaux-ont-ils-reussi-la-ou-la-linguistique-echouait-par-claude-roux\/\" target=\"_blank\" rel=\"noopener\">Grands Mod\u00e8les de Langage : Pourquoi les r\u00e9seaux neuronaux ont-ils r\u00e9ussi l\u00e0 o\u00f9 la linguistique \u00e9chouait ?<\/a>, by Claude Roux, March 8<sup>th<\/sup> 2024<\/p>\n<p><a href=\"https:\/\/www.pauljorion.com\/blog\/2024\/03\/15\/video-pj-tv-ia-pourquoi-la-linguistique-a-t-elle-echoue\/\" target=\"_blank\" rel=\"noopener\">PJ TV \u2013 IA : Pourquoi la linguistique a-t-elle \u00e9chou\u00e9 ?<\/a> a conversation with Claude Roux, March 15<sup>th<\/sup> 2024<\/p>\n<p><a href=\"https:\/\/www.pauljorion.com\/blog\/wp-content\/uploads\/DALL\u00b7E-2024-04-09-19.58.32-Visualize-a-thinking-machine-as-an-abstract-and-artistic-representation-merging-elements-of-technology-and-organic-thought-processes.-The-machine-s.webp\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-139630\" src=\"https:\/\/www.pauljorion.com\/blog\/wp-content\/uploads\/DALL\u00b7E-2024-04-09-19.58.32-Visualize-a-thinking-machine-as-an-abstract-and-artistic-representation-merging-elements-of-technology-and-organic-thought-processes.-The-machine-s.webp\" alt=\"\" width=\"1024\" height=\"1024\" \/><\/a><\/p>\n<blockquote><p>Portrait of a thinking object according to DALL\u00b7E<\/p><\/blockquote>\n","protected":false},"excerpt":{"rendered":"<p><a href=\"https:\/\/www.pauljorion.com\/blog\/wp-content\/uploads\/Stable-Diffusion-Spinoza.jpeg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-139629\" src=\"https:\/\/www.pauljorion.com\/blog\/wp-content\/uploads\/Stable-Diffusion-Spinoza.jpeg\" alt=\"\" width=\"1024\" height=\"1024\" \/><\/a><\/p>\n<blockquote>\n<p>Spinoza&#8217;s portrait by Stable Diffusion<\/p>\n<\/blockquote>\n<p>A few years back, I had devoured Fr\u00e9d\u00e9ric Lordon&#8217;s book <em>Capitalisme, D\u00e9sir et Servitude<\/em>. In particular, I&#8217;d been fascinated by the notion of <i>conatus<\/i>, the desire for power. <a href=\"https:\/\/heconomist.ch\/2020\/04\/18\/je-suis-spinoziste-partie-iii-frederic-lordon-capitalisme-desir-et-servitude-marx-et-spinoza\/\" target=\"_blank\" rel=\"noopener\">Fr\u00e9d\u00e9ric Lordon explained<\/a> that Spinoza, as a mathematician, had [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_crdt_document":"","footnotes":""},"categories":[3,12],"tags":[401,356,400],"class_list":["post-1884","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","category-human-complex-systems","tag-baruch-spinoza","tag-llm","tag-transformer"],"_links":{"self":[{"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/posts\/1884","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/comments?post=1884"}],"version-history":[{"count":3,"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/posts\/1884\/revisions"}],"predecessor-version":[{"id":1887,"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/posts\/1884\/revisions\/1887"}],"wp:attachment":[{"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/media?parent=1884"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/categories?post=1884"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.pauljorion.com\/blog_en\/wp-json\/wp\/v2\/tags?post=1884"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}