{"id":3233,"date":"2021-10-11T08:14:55","date_gmt":"2021-10-10T23:14:55","guid":{"rendered":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/?p=3233"},"modified":"2021-12-16T12:25:54","modified_gmt":"2021-12-16T03:25:54","slug":"ipi-seminar-1030-1200-thursday-oct-28-2021","status":"publish","type":"post","link":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/news\/3233\/","title":{"rendered":"[finished] ipi seminar 10:30-12:00, Thursday Oct. 28, 2021"},"content":{"rendered":"\n<h3>\u77e5\u306e\u7269\u7406\u5b66\u7814\u7a76\u30bb\u30f3\u30bf\u30fc \/ Institute for Physics of Intelligence (ipi)<\/h3>\n<p style=\"text-align: left\"><br \/>\u3010Speaker\u3011Hajime YOSHINO<span class=\"x_author-d-1gg9uz65z1iz85zgdz68zmqkz84zo2qowz82z5anr4b6129z90zz66zz74zwjicsz79zjz86zz81zz66zcz77zxz72z1gs\"> (Osaka University\uff09<br \/><\/span><\/p>\n<p>\u3010Date\u301110:30-12:00 JST, Thursday, Oct. 28<\/p>\n<p>\u3010Title\u3011\"Design space of a deep neural network - its spatial evolution and robustness\"<br data-rich-text-line-break=\"true\" \/><br data-rich-text-line-break=\"true\" \/>\u3010Abstract\u3011<br \/>Deep neural networks have a large number of free parameters such as synaptic weights.\u00a0Even with fixed architecture, an extremely large number of different machines can be generated by changing the parameters.\u00a0Learning amounts to\u00a0focus\u00a0on a subset of them which meets constraints imposed by a set of training data.\u00a0Increasing the number of the training data, the design space,\u00a0the\u00a0phase space of the valid machines become smaller.\u00a0It may also split into clusters - glass transition - if the frustration effect of the constraints is severe.\u00a0The Parisi order parameter function [1], developed first for spin-glasses, is useful to capture the nature of such a complex phase space.<\/p>\n<p>In this talk, I discuss how one can extend the statistical mechanics approach pioneered by E. Garder\u00a0for a single perceptron [2] based on the replica method to a multi-layered, deep perceptron network [3].\u00a0This amounts to\u00a0construct\u00a0a theory in which the Parisi order function is allowed to evolve in space.\u00a0Specifically, I discuss two scenarios: (1) random training data (2) teacher-student setting.\u00a0In both cases, we found the magnitude of the order parameter evolves in space like in 'wetting transitions':\u00a0it is larger closer to the input\/output boundaries suggesting that the effect of the constraints put by the\u00a0training data\u00a0are\u00a0stronger there. If the system is deep enough, the central region remains in the liquid phase\u00a0meaning that the design space remains very large there. Furthermore, in scenario (1) we found a peculiar\u00a0replica symmetry breaking (RSB) which evolves in space: the design space is clustered in a complex,\u00a0hierarchical manner around the boundaries which become progressively simplified approaching the center.\u00a0More recently we found the same type of spatially evolving RSB also in scenario (2) in the presence of noise\u00a0in the training data. But the latter RSB disappears if the network is made deep enough so that the liquid phase\u00a0survives in the center. Finally, I discuss the implications of the theoretical results on deep learning in practice.<\/p>\n<p>[1] G. Parisi, Phys. Rev. Lett. 43, 1754 (1979). The crucial work for the Nobel prize of this year.<br \/>\u3000\u2192see\u00a0<a href=\"https:\/\/www.nobelprize.org\/prizes\/physics\/2021\/summary\/\" target=\"_blank\" rel=\"noopener noreferrer\" data-auth=\"NotApplicable\" data-linkindex=\"2\">https:\/\/www.nobelprize.org\/prizes\/physics\/2021\/summary\/<\/a><br \/>[2] E. Gardner, J. Phys. A: Math. Gen. 21, 257 (1988), E. Gardner and B. Derrida, J. Phys. A: Math. Gen. 22, 1983 (1989).<br \/>[3] H. Yoshino, SciPostPhysCore 2, 005 (2020).<\/p>\n<div><span class=\"x_author-d-iz88z86z86za0dz67zz78zz78zz74zz68zjz80zz71z9iz90z8z86zdnagz83zz74zikpz70zz72zkz79zkfbz71z2z88zz74zm9gxc1z74zz79zk\">To receive the Zoom invitation and monthly reminders,<span class=\"x_Apple-converted-space\">\u00a0<\/span><\/span><span class=\"x_author-d-iz88z86z86za0dz67zz78zz78zz74zz68zjz80zz71z9iz90z8z86zdnagz83zz74zikpz70zz72zkz79zkfbz71z2z88zz74zm9gxc1z74zz79zk\"><b>please register via this google form<\/b><\/span><span class=\"x_author-d-iz88z86z86za0dz67zz78zz78zz74zz68zjz80zz71z9iz90z8z86zdnagz83zz74zikpz70zz72zkz79zkfbz71z2z88zz74zm9gxc1z74zz79zk\">:<span class=\"x_Apple-converted-space\">\u00a0<\/span><\/span><span class=\"x_attrlink x_url x_author-d-iz88z86z86za0dz67zz78zz78zz74zz68zjz80zz71z9iz90z8z86zdnagz83zz74zikpz70zz72zkz79zkfbz71z2z88zz74zm9gxc1z74zz79zk x_url\"><a class=\"x_attrlink\" href=\"https:\/\/forms.gle\/dqxhpsZXLNYvbSB38\" target=\"_blank\" rel=\"noreferrer nofollow noopener\" data-auth=\"NotApplicable\"><u>https:\/\/forms.gle\/dqxhpsZXLNYvbSB38<\/u><\/a><\/span><\/div>\n<div><span class=\"x_author-d-iz88z86z86za0dz67zz78zz78zz74zz68zjz80zz71z9iz90z8z86zdnagz83zz74zikpz70zz72zkz79zkfbz71z2z88zz74zm9gxc1z74zz79zk\">Your e-mail addresses will be used for this purpose only, you can unsubscribe anytime, and we will not send more than three e-mails per month.<\/span><\/div>\n<div>\u00a0<\/div>\n<h3><a href=\"https:\/\/www.phys.s.u-tokyo.ac.jp\/wp-content\/uploads\/2021\/10\/yoshino-ipi20211028.pdf\">PDF<\/a><\/h3>\n<div>\u00a0<\/div>\n<div>\u00a0<\/div>\n<div style=\"text-align: center\"><span class=\"x_author-d-iz88z86z86za0dz67zz78zz78zz74zz68zjz80zz71z9iz90z8z86zdnagz83zz74zikpz70zz72zkz79zkfbz71z2z88zz74zm9gxc1z74zz79zk\">\u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 <br \/><\/span><\/div>\n<div><a href=\"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/lp\/ipi\/\">\u21e6Top page<\/a><\/div>\n","protected":false},"excerpt":{"rendered":"\u77e5\u306e\u7269\u7406\u5b66\u7814\u7a76\u30bb\u30f3\u30bf\u30fc \/ Institute for Physics of Intelligence (ipi) \u3010Speaker\u3011Hajime YOSHINO (Osaka University\uff09 \u3010Date\u301110: [&hellip;]","protected":false},"author":13,"featured_media":3278,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/posts\/3233"}],"collection":[{"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/comments?post=3233"}],"version-history":[{"count":15,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/posts\/3233\/revisions"}],"predecessor-version":[{"id":3531,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/posts\/3233\/revisions\/3531"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/media\/3278"}],"wp:attachment":[{"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/media?parent=3233"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/categories?post=3233"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.phys.s.u-tokyo.ac.jp\/en\/wp-json\/wp\/v2\/tags?post=3233"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}