Reinforcement Learning and Stochastic Optimization _ A Unified Framework for Sequential Decisions_Warren B. Powell.pdf

Reinforcement Learning and Stochastic Optimization: A Uniﬁed Framework for Sequential Decisions Reinforcement Learning and Stochastic Optimization A Uniﬁed Framework for Sequential Decisions Warren B. Powell Princeton University Princeton, NJ This edition first published 2022 ©2022 John Wiley & Sons, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by law. Advice on how to obtain permission to reuse material from this title is available athttp://www.wiley.com/go/permissions. The right of Warren B. Powell to be identified as the author of this work has been asserted in accordance with law. Registered Office John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, USA Editorial Office 111 River Street, Hoboken, NJ 07030, USA For details of our global editorial offices, customer services, and more information about Wiley products visit us atwww.wiley.com. Wiley also publishes its books in a variety of electronic formats and by print-on-demand. Some content that appears in standard print versions of this book may not be available in other formats. Limit of Liability/Disclaimer of Warranty The contents of this work are intended to further general scientific research, understanding, and discussion only and are not intended and should not be relied upon as recommending or promoting scientific method, diagnosis, or treatment by physicians for any particular patient. In view of ongoing research, equipment modifications, changes in governmental regulations, and the constant flow of information relating to the use of medicines, equipment, and devices, the reader is urged to review and evaluate the information provided in the package insert or instructions for each medicine, equipment, or device for, among other things, any changes in the instructions or indication of usage and for added warni

查看更多收起部分

Reinforcement Learning and Stochastic Optimization _ A Unified Framework for Sequential Decisions_Warren B. Powell.pdf