|
DRUM >
Institute for Systems Research >
Institute for Systems Research Technical Reports >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1903/6029
|
Full metadata record
| DC Field | Value | Language |
| dc.contributor.advisor | Fu, Michael C. | en_US |
| dc.contributor.advisor | Marcus, Steven I. | en_US |
| dc.contributor.author | He, Ying | en_US |
| dc.contributor.author | Fu, Michael C. | en_US |
| dc.contributor.author | Marcus, Steven I. | en_US |
| dc.date.accessioned | 2007-05-23T10:07:22Z | - |
| dc.date.available | 2007-05-23T10:07:22Z | - |
| dc.date.issued | 1999 | en_US |
| dc.identifier.uri | http://hdl.handle.net/1903/6029 | - |
| dc.description.abstract | In this paper, we give a summary of recent development of simulation-based algorithmsfor average cost MDP problems, which are different from those for discounted cost problems or shortest pathproblems. We introduce both simulation-based policy iteration algorithms and simulation-based value iterationalgorithms for average cost problems, and give the pros and cons of each algorithm. | en_US |
| dc.format.extent | 263703 bytes | - |
| dc.format.mimetype | application/pdf | - |
| dc.language.iso | en_US | en_US |
| dc.relation.ispartofseries | ISR; TR 1999-56 | en_US |
| dc.subject | algorithms, Simulation-Based Policy Iteration, Simulation-Based Value Iteration, Markov Decision Processes, Average Cost, Unichain, Systems Integration Methodology | en_US |
| dc.title | Simulation-Based Algorithms for Average Cost Markov Decision Processes | en_US |
| dc.type | Technical Report | en_US |
| dc.contributor.department | ISR | en_US |
| Appears in Collections: | Institute for Systems Research Technical Reports
|
All items in DRUM are protected by copyright, with all rights reserved.
|