<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet href="client.xsl" type="text/xsl"?>
<article article-type="other">
<front>
<journal-meta>
<journal-id/>
<issn/>
<banner>
<!--<href>banner.jpg</href>-->
<size width="100%"/>
</banner>
</journal-meta>
<article-meta>
<title-group>
<article-title>Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies</article-title>
</title-group>

<author><a href="mailto:varakant@usc.edu"><name>Pradeep Varakantham</name></a></author>
<aff>University of Southern California<br/> Los Angeles, CA 90089</aff>

<author><a href="mailto:marecki@usc.edu"><name>Janusz Marecki</name></a></author>
<aff>University of Southern California<br/> Los Angeles, CA 90089</aff>

<author><name>Yuichi Yabu</name></author>
<aff>Dept. of Intelligent Systems<br/> Kyushu University, Fukuoka, 812-8581 Japan,</aff>

<author><a href="mailto:tambe@usc.edu"><name>Milind Tambe</name></a></author>
<aff>University of Southern California<br/> Los Angeles, CA 90089</aff>

<author><a href="mailto:yokoo@is.kyushu-u.ac.jp"><name>Makoto Yokoo</name></a></author>
<aff>Dept. of Intelligent Systems<br/> Kyushu University, Fukuoka, 812-8581 Japan,</aff>



</article-meta></front>
<body>
<abstract>
<title>ABSTRACT</title>
<p>Distributed Partially Observable Markov Decision Problems (Distributed
POMDPs) are a popular approach for modeling multi-agent
systems acting in uncertain domains. Given the significant complexity
of solving distributed POMDPs, particularly as we scale
up the numbers of agents, one popular approach has focused on
approximate solutions. Though this approach is efficient, the algorithms
within this approach do not provide any guarantees on solution
quality. A second less popular approach focuses on global
optimality, but typical results are available only for two agents,
and also at considerable computational cost. This paper overcomes
the limitations of both these approaches by providing SPIDER, a
novel combination of three key features for policy generation in distributed
POMDPs: (i) it exploits agent interaction structure given
a network of agents (i.e. allowing easier scale-up to larger number
of agents); (ii) it uses a combination of heuristics to speedup policy
search; and (iii) it allows quality guaranteed approximations, allowing
a systematic tradeoff of solution quality for time. Experimental
results show orders of magnitude improvement in performance
when compared with previous global optimal algorithms.</p>
</abstract>
<fpdf>
<href>pdflogo.jpg</href>
<hpdf>AAMAS07_0243_ca9a1c39082e75616d1c2cd7433c01b9</hpdf>
</fpdf>
</body>
</article>
