Reinforcement Learning in Action: From Foundations to Frontier AI
Autor Uday Kamath, Vedant Vajreen Limba Engleză Paperback – 14 oct 2026
Preț: 381.70 lei
Preț vechi: 434.44 lei
-12% Precomandă
Puncte Express: 573
Carte nepublicată încă
Livrare prin curier în România Precomanda se expediază când titlul devine disponibil.
Transport gratuit de la 400.00 lei Plată online sau ramburs, în funcție de opțiunile comenzii.
Retur gratuit în 14 zile Comandă securizată și suport în română.
Doresc să fiu notificat când acest titlu va fi disponibil:
Se trimite...
Specificații
ISBN-13: 9781041130819
ISBN-10: 1041130813
Pagini: 352
Ilustrații: 90
Dimensiuni: 178 x 254 mm
Ediția:1
Editura: CRC Press
Colecția CRC Press
ISBN-10: 1041130813
Pagini: 352
Ilustrații: 90
Dimensiuni: 178 x 254 mm
Ediția:1
Editura: CRC Press
Colecția CRC Press
Public țintă
Academic, Postgraduate, and Professional Practice & DevelopmentCuprins
List of Figures. List of Tables. Foreword. Preface. Author Bios. Contributors. Notation. Chapter 1: Introduction to Reinforcement Learning. Section I: Basics of Reinforcement Learning. Chapter 2: Fundamentals of Reinforcement Learning. Section II: Classical Reinforcement Learning. Chapter 3: Classical Reinforcement Learning Algorithms. Section III: Deep Reinforcement Learning. Chapter 4: Scaling Reinforcement Learning: Function Approximation and Deep Methods. Section IV: LLMs and Reinforcement Learning. Chapter 5: Preference-Based Alignment: Reward Modeling and Reinforcement Learning for LLMs. Chapter 6: Reinforcement Learning for Reasoning Models. Section V: Agentic and Reinforcement Learning. Chapter 7: Reinforcement Learning Enabled Agentic AI. Appendix A: Mathematical Proofs and Derivations. Bibliography. Index.
Notă biografică
Uday Kamath has over 25 years of experience in AI product development with a Ph.D. in scalable machine learning. His significant contributions span numerous journals, conferences, books, and patents. Notable books include Large Language Models: A Deep Dive, Applied Causal Inference, Explainable Artificial Intelligence, Transformers for Machine Learning, Deep Learning for NLP and Speech Recognition, Mastering Java Machine Learning, and Machine Learning: End-to-End Guide for Java Developers. Currently serving as the Chief Analytics Officer at Smarsh, he spearheads data science and research in communications AI for regulated industries. He is also an active member of the Board of Advisors for entities, including commercial companies and academic institutions.
Vedant Vajre is an aspiring AI researcher with a strong interest in reinforcement learning and intelligent decision-making systems. He is graduating from Penn State University and intends to pursue doctoral research in artificial intelligence. He has authored two peer-reviewed research publications with IEEE and continues to pursue active research in machine learning. Having worked at organizations including NASA, IBM, and early-stage startups, he has gained experience applying machine learning and AI in both research and production settings. Outside of research, he loves spending time with his two Shih Tzus (Pinot and Buzz), playing tennis, and solving Sudoku puzzles.
Vedant Vajre is an aspiring AI researcher with a strong interest in reinforcement learning and intelligent decision-making systems. He is graduating from Penn State University and intends to pursue doctoral research in artificial intelligence. He has authored two peer-reviewed research publications with IEEE and continues to pursue active research in machine learning. Having worked at organizations including NASA, IBM, and early-stage startups, he has gained experience applying machine learning and AI in both research and production settings. Outside of research, he loves spending time with his two Shih Tzus (Pinot and Buzz), playing tennis, and solving Sudoku puzzles.
Descriere
Written for ML engineers, researchers, and advanced students, this book provides both the conceptual depth and implementation fluency needed to understand, build, and extend the RL systems shaping the future of AI.