If we direct our attention to the impact that agents' actions have on one another, we notice that if an agent's choice of best action is not affected by the other agents' actions then its learning task reduces to that of learning to match the fixed function $M_i(w)$. That is, $M_i$ will be fixed if agent $i$'s choice of best action depends solely on the world state $w$. There are many learning algorithms available that allow an agent to learn such a function, and we assume the agent uses one of them. From this reasoning it follows that: if agent $i$'s choice of best action is not affected by the other agents and it is capable of learning a fixed function, then it will eventually learn $g_i(w) = M_i(w)$ and its behavior will stay fixed after that.
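As a concrete illustration, the minimal sketch below has a single agent learn such a fixed function from stationary feedback. The states, actions, the target $M_i$, and the 0/1 reward scheme are all illustrative assumptions, not part of the text; any standard learning algorithm would do.

```python
import random

# Minimal sketch: when the best action depends only on the world state w,
# learning g_i(w) = M_i(w) is a stationary problem. States, actions, the
# target M_i, and the 0/1 reward below are illustrative assumptions.

STATES = list(range(5))
ACTIONS = list(range(3))

def M_i(w):
    """Fixed target function: the best action in world state w (assumed)."""
    return w % 3

def learn_fixed_function(episodes=2000, epsilon=0.1, alpha=0.1):
    value = {(w, a): 0.0 for w in STATES for a in ACTIONS}
    for _ in range(episodes):
        w = random.choice(STATES)
        if random.random() < epsilon:                 # explore
            a = random.choice(ACTIONS)
        else:                                         # exploit current estimates
            a = max(ACTIONS, key=lambda x: value[(w, x)])
        reward = 1.0 if a == M_i(w) else 0.0          # stationary feedback
        value[(w, a)] += alpha * (reward - value[(w, a)])
    # g_i: the greedy policy induced by the learned values
    return {w: max(ACTIONS, key=lambda x: value[(w, x)]) for w in STATES}

g_i = learn_fixed_function()
print(all(g_i[w] == M_i(w) for w in STATES))   # True: g_i(w) = M_i(w)
```

Because the target never moves, the agent's estimates only have to catch up with it once; this is exactly the sense in which the learning task "reduces" here.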
If, on the other hand, the agent's choice of best action is affected by the other agents' actions, and the other agents are changing their behavior, then there is no fixed function $M_i(w)$ to learn: the target moves as the others adapt, and the MAS becomes a complex adaptive system. Even in this case, however, it is still possible that all the agents will eventually settle on a fixed set of functions with $g_i(w) = M_i(w)$. If this happens then eventually, and concurrently, the agents will all have learned the best action to take in each world state, and these choices will change little, if at all. At this point we say that the system has converged.
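The sketch below illustrates one way such mutual settling can happen. Two agents play an assumed 2x2 coordination game and each best-responds to the empirical frequency of the other's past actions (a fictitious-play-style rule chosen here for illustration; the text does not prescribe a particular learning rule). Each agent's target moves while the other adapts, yet the pair eventually stops changing.

```python
import random
from collections import Counter

# Sketch of the moving-target case: each agent's best action depends on the
# other's behavior, so while both keep adapting there is no fixed M_i.
# The 2x2 coordination payoffs and the fictitious-play rule are assumptions.

PAYOFF = {(0, 0): 1.0, (0, 1): 0.0, (1, 0): 0.0, (1, 1): 1.0}

def best_response(opponent_counts):
    """Best action against the empirical mix of the opponent's past play."""
    total = sum(opponent_counts.values())
    expected = {a: sum(PAYOFF[a, b] * c / total
                       for b, c in opponent_counts.items())
                for a in (0, 1)}
    return max(expected, key=expected.get)

counts = [Counter([random.choice((0, 1))]) for _ in range(2)]  # action histories
policy = [None, None]
for step in range(50):
    policy = [best_response(counts[1]), best_response(counts[0])]
    counts[0][policy[0]] += 1
    counts[1][policy[1]] += 1
print("converged to", policy)   # both agents settle on the same action
```

Once the policies stop changing, each agent is again facing a fixed target, which is precisely the converged state described above.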
Convergence is a general phenomenon that goes by many names. For example, if the system is an instance of the Pursuit Problem, we might say that the agents had agreed on a set of conventions for dealing with all situations, while in a market system, we would say that the system had reached a competitive equilibrium.
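To make the market example concrete, here is a toy tatonnement sketch: a single good with assumed linear demand and supply curves, and a price adjusted in proportion to excess demand. Under these assumptions (which are illustrative, not from the text) the price settles at the competitive-equilibrium level.

```python
# Toy market: assumed linear demand and supply for one good. The price is
# nudged in proportion to excess demand (tatonnement); under these assumed
# curves it contracts to the competitive-equilibrium price.

def demand(p): return max(0.0, 10.0 - p)   # assumed demand curve
def supply(p): return max(0.0, 2.0 * p)    # assumed supply curve

price, eta = 1.0, 0.1
for _ in range(200):
    excess = demand(price) - supply(price)
    price += eta * excess                  # raise price when demand exceeds supply
print(round(price, 3))   # ~3.333, where demand(p) == supply(p)
```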
Unfortunately, we do not have any general rules for predicting which systems will converge and which will not. Such predictions can only be made by examining the particular system (e.g., under certain circumstances we can predict that a market system will reach a price equilibrium, as in the sketch above). Still, we can say that:
If the system has converged then there is no advantage to maintaining deeper models of the other agents: all of an agent's knowledge can be collapsed into knowledge of the form $K_i(f_i(w))$ without losing any information.

Proof If $M$ is fixed then, eventually, agent $i$'s knowledge about the other agents' actions will become fixed, and so will its knowledge about their knowledge, so that the agent in effect holds a (very complicated) function of $w$. Therefore, all the knowledge can be collapsed into knowledge of the form $K_i(f_i(w))$ without losing any information.
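The collapse in the proof can be shown mechanically. In the sketch below (all names illustrative), agent $i$ holds a 1-level model: it predicts the other agent's action $f_j(w)$ and best-responds to it. Once $f_j$ is fixed, that whole chain of reasoning computes a fixed function of $w$ and can be tabulated into a 0-level lookup, i.e., knowledge of the form $K_i(f_i(w))$.

```python
# Sketch of the collapse: once the other agent's policy is fixed, agent i's
# deep model ("j will do f_j(w), so I should do ...") is just a function of
# w and can be flattened into a 0-level lookup table. All names are assumed.

STATES = list(range(4))

def f_j(w):
    """The other agent's now-fixed policy (assumed)."""
    return w % 2

def deep_policy(w):
    """1-level model: predict j's action, then best-respond to it."""
    predicted = f_j(w)
    return (predicted + 1) % 2   # assumed best response to j's action

# Because f_j no longer changes, deep_policy can be tabulated once,
# losing no information:
f_i = {w: deep_policy(w) for w in STATES}
print(all(f_i[w] == deep_policy(w) for w in STATES))   # True
```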
If, on the other hand, the system has not converged then we find that deeper models are sometimes better than shallow ones, depending on exactly what knowledge the agents are trying to learn, how they are doing it and certain aspects of the structure of this knowledge, as we shall see in the next section.