برای ارتباط با ما می توانید از طریق شماره موبایل زیر از طریق تماس و پیامک با ما در ارتباط باشید

09117307688
09117179751

در صورت عدم پاسخ گویی از طریق پیامک با پشتیبان در ارتباط باشید

دسترسی نامحدود

برای کاربرانی که ثبت نام کرده اند

ضمانت بازگشت وجه

درصورت عدم همخوانی توضیحات با کتاب

پشتیبانی

از ساعت 7 صبح تا 10 شب

دانلود کتاب Grokking Deep Reinforcement Learning

دانلود کتاب Grokking یادگیری تقویت عمیق

مشخصات کتاب

Grokking Deep Reinforcement Learning

دسته بندی: سایبرنتیک: هوش مصنوعی
ویرایش: 1 
نویسندگان: Miguel Morales  
سری:  
ISBN (شابک) : 1617295450, 9781617295454 
ناشر: Manning Publications 
سال نشر: 2020 
تعداد صفحات: 472 
زبان: English 
فرمت فایل : PDF (درصورت درخواست کاربر به PDF، EPUB یا AZW3 تبدیل می شود) 
حجم فایل: 17 مگابایت

قیمت کتاب (تومان) : 50,000

کلمات کلیدی مربوط به کتاب Grokking یادگیری تقویت عمیق: یادگیری عمیق، یادگیری تقویتی، هوش مصنوعی مبتنی بر عامل، روش منتقد، روش خط مشی، روش گرادیان، هوش مصنوعی عمومی

میانگین امتیاز به این کتاب :
تعداد امتیاز دهندگان : 12

در صورت تبدیل فایل کتاب Grokking Deep Reinforcement Learning به فرمت های PDF، EPUB، AZW3، MOBI و یا DJVU می توانید به پشتیبان اطلاع دهید تا فایل مورد نظر را تبدیل نمایند.

توجه داشته باشید کتاب Grokking یادگیری تقویت عمیق نسخه زبان اصلی می باشد و کتاب ترجمه شده به فارسی نمی باشد. وبسایت اینترنشنال لایبرری ارائه دهنده کتاب های زبان اصلی می باشد و هیچ گونه کتاب ترجمه شده یا نوشته شده به فارسی را ارائه نمی دهد.

توضیحاتی در مورد کتاب Grokking یادگیری تقویت عمیق

همه ما از طریق آزمون و خطا یاد می گیریم. ما از چیزهایی که باعث رنج و شکست ما می شود اجتناب می کنیم. ما چیزهایی را که به ما پاداش و موفقیت می‌دهند، در آغوش می‌گیریم و روی آن‌ها بنا می‌کنیم. این الگوی رایج پایه و اساس یادگیری تقویتی عمیق است: ساختن سیستم های یادگیری ماشینی که بر اساس پاسخ های محیط کاوش و یاد می گیرند. Grokking Deep Reinforcement Learning این رویکرد یادگیری ماشینی قدرتمند را با استفاده از مثال‌ها، تصاویر، تمرین‌ها و آموزش شفاف معرفی می‌کند. هنگامی که در این کاوش شگفت انگیز در زمینه مبانی یادگیری تقویتی، تکنیک های یادگیری عمیق موثر و کاربردهای عملی در این زمینه نوظهور جستجو می کنید، آموزش با سرعت عالی و سبک نوشتاری هوشمندانه و جذاب را دوست خواهید داشت. در مورد تکنولوژی ما از طریق تعامل با محیط خود یاد می گیریم و پاداش ها یا تنبیه هایی که تجربه می کنیم رفتار آینده ما را هدایت می کند. یادگیری تقویتی عمیق همان فرآیند طبیعی را به هوش مصنوعی می آورد و نتایج را تجزیه و تحلیل می کند تا کارآمدترین راه ها را کشف کند. عوامل DRL می‌توانند کمپین‌های بازاریابی را بهبود بخشند، عملکرد سهام را پیش‌بینی کنند و استادان بزرگ را در بازی Go و شطرنج شکست دهند. درباره کتاب آموزش تقویت عمیق Grokking از تمرین های جذاب برای آموزش نحوه ساخت سیستم های یادگیری عمیق استفاده می کند. این کتاب کدهای حاشیه نویسی پایتون را با توضیحات بصری ترکیب می کند تا تکنیک های DRL را بررسی کند. خواهید دید که الگوریتم ها چگونه کار می کنند و یاد می گیرید که عوامل DRL خود را با استفاده از بازخورد ارزیابی توسعه دهید.

توضیحاتی درمورد کتاب به خارجی

We all learn through trial and error. We avoid the things that cause us to experience pain and failure. We embrace and build on the things that give us reward and success. This common pattern is the foundation of deep reinforcement learning: building machine learning systems that explore and learn based on the responses of the environment. Grokking Deep Reinforcement Learning introduces this powerful machine learning approach, using examples, illustrations, exercises, and crystal-clear teaching. You'll love the perfectly paced teaching and the clever, engaging writing style as you dig into this awesome exploration of reinforcement learning fundamentals, effective deep learning techniques, and practical applications in this emerging field. About the technology We learn by interacting with our environment, and the rewards or punishments we experience guide our future behavior. Deep reinforcement learning brings that same natural process to artificial intelligence, analyzing results to uncover the most efficient ways forward. DRL agents can improve marketing campaigns, predict stock performance, and beat grand masters in Go and chess. About the book Grokking Deep Reinforcement Learning uses engaging exercises to teach you how to build deep learning systems. This book combines annotated Python code with intuitive explanations to explore DRL techniques. You'll see how algorithms function and learn to develop your own DRL agents using evaluative feedback.

فهرست مطالب

foreword
	preface
	acknowledgments
	about this book
	about the author
Introduction todeep reinforcement learning
	What is deep reinforcement learning?
		The past, present, and future of deep reinforcement learning
		The suitability of deep reinforcement learning
		Setting clear two-way expectations
Mathematical foundationsof reinforcement learning
	Components of reinforcement learning
		MDPs: The engine of the environment
Balancing immediateand long-term goals
	The objective of a decision-making agent
		Planning optimal sequences of actions
Balancing the gatheringand use of information
	The challenge of interpreting evaluative feedback
		Strategic exploration
Evaluatingagents’ behaviors
	Learning to estimate the value of policies
		Learning to estimate from multiple steps
Improvingagents’ behaviors
	The anatomy of reinforcement learning agents
		Learning to improve policies of behavior
		Decoupling behavior from learning
Achieving goals moreeffectively and efficiently
	Learning to improve policies using robust targets
		Agents that interact, learn, and plan
Introduction to value-baseddeep reinforcement learning
	The kind of feedback deep reinforcement learning agents use
		Introduction to function approximation for reinforcement learning
		NFQ: The first attempt at value-based deep reinforcement learning
More stablevalue-based methods
	DQN: Making reinforcement learning more like supervised learning
		Double DQN: Mitigating the overestimation of action-value functions
Sample-efficientvalue-based methods
	Dueling DDQN: A reinforcement-learning-aware neural network architecture
		PER: Prioritizing the replay of meaningful experiences
Policy-gradient andactor-critic methods
	REINFORCE: Outcome-based policy learning
		VPG: Learning a value function
		A3C: Parallel policy updates
		GAE: Robust advantage estimation
		A2C: Synchronous policy updates
Advancedactor-critic methods
	DDPG: Approximating a deterministic policy
		TD3: State-of-the-art improvements over DDPG
		SAC: Maximizing the expected return and entropy
		PPO: Restricting optimization steps
Toward artificialgeneral intelligence
	What was covered and what notably wasn’t?
		More advanced concepts toward AGI
		What happens next?
	index