Deep reinforcement learning : fundamentals, research and applications
書誌情報:Deep reinforcement learning : fundamentals, research and applications
Hao Dong, Zihan Ding, Shanghang Zhang, editors
Singapore : Springer , c2020
xxvii, 514 p. : ill.(chiefly col.) ; 25 cm
WebCatPlus を見る
CiNii Books を見る


  


所蔵一覧
巻号予約人数所在請求記号登録番号資料ID状態貸出区分備考 
1 0太秦南館:5階閲覧室
  • 007.13
  • D53
  •  
1040454109004736 利用可
図書(帯出可) 

選択行を:  

書誌詳細
刊年2020
形態xxvii, 514 p. : ill.(chiefly col.) ; 25 cm
内容注記Preface
Contributors
Acknowledgements
Mathematical Notation
Acronyms
Introduction
Part 1: Foundamentals
Chapter 1: Introduction to Deep Learning
Chapter 2: Introduction to Reinforcement Learning
Chapter 3: Taxonomy of Reinforcement Learning Algorithms
Chapter 4: Deep Q-Networks
Chapter 5: Policy Gradient
Chapter 6: Combine Deep Q-Networks with Actor-Critic
Part II: Research
Chapter 7: Challenges of Reinforcement Learning
Chapter 8: Imitation Learning
Chapter 9: Integrating Learning and Planning
Chapter 10: Hierarchical Reinforcement Learning
Chapter 11: Multi-Agent Reinforcement Learning
Chapter 12: Parallel Computing
Part III: Applications
Chapter 13: Learning to Run
Chapter 14: Robust Image Enhancement
Chapter 15: AlphaZero
Chapter 16: Robot Learning in Simulation
Chapter 17: Arena Platform for Multi-Agent Reinforcement Learning
Chapter 18: Tricks of Implementation
Part IV: Summary
Chapter 19: Algorithm Table
Chapter 20: Algorithm Cheatsheet
注記Includes bibliographical references
出版国シンガポール
標題言語英語
本文言語英語
著者情報Dong, Hao
Ding, Zihan
Zhang, Shanghang
分類LCC:Q325.6
DC23:006.3/1
ISBN9789811540943
件名LCSH:Reinforcementlearning
FREE:Datamining.bicssc
FREE:Imageprocessing.bicssc
FREE:Artificialintelligence.bicssc
FREE:Computerprogramming/softwaredevelopment.bicssc
FREE:Naturallanguage&machinetranslation.bicssc
FREE:Machinelearning.bicssc
FREE:Computers -- DatabaseManagement -- DataMining.bisacsh
FREE:Computers -- ComputerGraphics.bisacsh
FREE:Technology&Engineering -- Robotics.bisacsh
FREE:Computers -- Programming -- General.bisacsh
FREE:Computers -- Speech&AudioProcessing.bisacsh
FREE:Computers -- Intelligence(AI)&Semantics.bisacsh
FREE:Reinforcementlearning.fast(OCoLC)fst01732553
NCIDBC04568244
番号NBN : 019832579,GBC0H8017

WebCatPlus を見る    CiNii Books を見る