دسترسی نامحدود
برای کاربرانی که ثبت نام کرده اند
برای ارتباط با ما می توانید از طریق شماره موبایل زیر از طریق تماس و پیامک با ما در ارتباط باشید
در صورت عدم پاسخ گویی از طریق پیامک با پشتیبان در ارتباط باشید
برای کاربرانی که ثبت نام کرده اند
درصورت عدم همخوانی توضیحات با کتاب
از ساعت 7 صبح تا 10 شب
ویرایش: نویسندگان: Rathish Mohan, Shekhar Agrawal, Srinivasa Sunil Chippada سری: ISBN (شابک) : 9789348107947, 9789348107299 ناشر: Orange Education Pvt Ltd, AVA™ سال نشر: 2024 تعداد صفحات: 723 زبان: English فرمت فایل : PDF (درصورت درخواست کاربر به PDF، EPUB یا AZW3 تبدیل می شود) حجم فایل: 15 مگابایت
در صورت تبدیل فایل کتاب Ultimate AWS Data Engineering: Design, Implement, and Optimize Scalable Data Solutions on AWS with Practical Workflows and Visual Aids for Unmatched Impact به فرمت های PDF، EPUB، AZW3، MOBI و یا DJVU می توانید به پشتیبان اطلاع دهید تا فایل مورد نظر را تبدیل نمایند.
توجه داشته باشید کتاب مهندسی داده های نهایی AWS: طراحی ، پیاده سازی و بهینه سازی راه حل های داده مقیاس پذیر در AWS با گردش کار عملی و کمک های بصری برای تأثیر بی نظیر نسخه زبان اصلی می باشد و کتاب ترجمه شده به فارسی نمی باشد. وبسایت اینترنشنال لایبرری ارائه دهنده کتاب های زبان اصلی می باشد و هیچ گونه کتاب ترجمه شده یا نوشته شده به فارسی را ارائه نمی دهد.
Cover Page
Title Page
Copyright Page
Dedication Page
About the Authors
About the Technical Reviewer
Acknowledgements
Preface
Errata
Table of Contents
1. Unveiling the Secrets of Data Engineering
Introduction
Structure
Defining Data Engineering
The Data Landscape: Past, Present, and Future
A Journey through Time: Tracing the Evolution of Data
A Glimpse into the Future: Anticipating the Next Frontier
Demystifying the Role of AWS in Data Engineering
A Comprehensive Ecosystem for Data-Driven Success
Scalability and Flexibility
Reliability and Security
Cost-Effectiveness: Unleashing Efficiency and Value
Breadth of Services
Continuous Innovation
Global Community and Support
Your Journey Begins Now: Embarking on the Path to Data Mastery
Conclusion
2. Architecting for Scalability: Data Replication Techniques
Introduction
Structure
Introduction to Data Replication
Types of Data Replication
Replication Strategies in AWS
Consistency and Availability Trade-offs in Replication
Example: Real-World Application of Data Replication in AWS
Monitoring and Managing Data Replication
Synchronous versus Asynchronous Replication
Synchronous Replication: Strong Consistency at a Cost
Asynchronous Replication: Enhancing Availability and Performance
Use Cases and Trade-offs
Hybrid Replication Models
Monitoring Replication Performance in AWS
Implementing Replication in Distributed Systems on AWS
Key AWS Services for Replication
Configuring Multi-AZ Replication with Amazon RDS
Setting Up DynamoDB Global Tables for Cross-Region Replication
Using Amazon S3 Cross-Region Replication (CRR) for Backup and Compliance
Best Practices for Managing Replication on AWS
Case Study: Implementing a Multi-Region Replication Strategy for a Global SaaS Platform
Replication Best Practices and Use cases
Best Practices for Data Replication
Use Cases for Replication in Distributed Systems
Case Study: Replication Strategy for a Global E-Commerce Platform
Monitoring and Optimizing Replication in AWS
Conclusion
3. Partitioning and Sharding: Optimizing Data Management
Introduction
Structure
Fundamentals of Data Partitioning and Sharding
Types of Data Partitioning
Sharding in Distributed Systems
Choosing Between Partitioning and Sharding
Challenges in Implementing Partitioning and Sharding
Partitioning Strategies in Distributed Systems
Range-Based Partitioning
Hash-Based Partitioning
List-Based Partitioning
Composite Partitioning
Choosing the Right Partitioning Strategy
Challenges in Managing Partitioning in Distributed Systems
Implementing Sharding in Cloud-Based Distributed Systems
Sharding Strategies
Sharding on AWS
Challenges in Implementing Sharding
Best Practices for Sharding in Cloud-Based Systems
Case Study: Implementing Sharding for a Video Streaming Service on AWS
Challenges and Best Practices for Managing Partitioning and Sharding
Best Practices for Partitioning and Sharding in Distributed Systems
Applying Best Practices for Sharding in a Global E-Commerce Platform
Conclusion
4. Ensuring Consistency: Consensus Mechanisms and Models
Introduction
Structure
The CAP Theorem and Consistency Models
Types of Consistency Models
Applying Consistency Models in AWS
Trade-offs in Consistency Models
Consensus Mechanisms in Distributed Systems
Paxos: A Foundation of Consensus
Raft: A Simpler Alternative to Paxos
Comparing Paxos and Raft
Applications of Consensus Mechanisms in Distributed Systems
Practical Applications of Consistency and Consensus Mechanisms in AWS
Consistency in Amazon DynamoDB
Consensus in Amazon RDS Multi-AZ
Distributed Cache Consistency with Amazon ElastiCache
Amazon QLDB and Blockchain-Like Consensus Mechanisms
Implementing Consensus and Consistency in Microservices with AWS Step Functions
Summary of AWS Consistency and Consensus Applications
Trade-offs and Design Considerations in Consistency and Consensus
Balancing Consensus and System Performance
Designing for Consistency and Consensus in Real-World Applications
Summary: Making Informed Design Choices
Conclusion
5. Balancing the Load: Achieving Performance and Efficiency
Introduction
Structure
Load Balancing Fundamentals
Types of Load Balancing
Load Balancing Algorithms
AWS Load Balancing Solutions
Benefits and Challenges of Load Balancing
Example Case Study: Load Balancing for a Social Media Application
Horizontal Scaling Techniques
Techniques for Horizontal Scaling
AWS Solutions for Horizontal Scaling
Best Practices for Horizontal Scaling
Example Case Study: Horizontal Scaling for an E-Commerce Application
Auto Scaling Strategies and Best Practices
Types of Auto Scaling Policies
Auto Scaling in AWS
Best Practices for Implementing Auto Scaling
Example Case Study
Monitoring and Optimizing Load Balancing and Scaling
Key Metrics for Monitoring Load Balancing and Scaling
Best Practices for Optimizing Load Balancing and Scaling
Example Case Study
Conclusion
6. Building Fault-Tolerant Architectures
Introduction
Structure
Principles of Monitoring in Distributed Systems
Key Metrics for Effective Monitoring
Monitoring Levels in Distributed Systems
AWS Tools for Monitoring Distributed Systems
Best Practices for Effective Monitoring
Fault Tolerance in Distributed Systems
Core Principles of Fault Tolerance
Techniques for Achieving Fault Tolerance
AWS Services for Fault Tolerance
Best Practices for Building Fault-Tolerant Distributed Systems
Example Case Study: Fault Tolerance for a Financial Services Application
Designing Self-Healing Systems in Distributed Architectures
Key Components of a Self-Healing System
AWS Services for Building Self-Healing Systems
Best Practices for Designing Self-Healing Architectures
Using AWS Elastic Beanstalk for Self-Healing Applications
Self-Healing Features in AWS Elastic Beanstalk
Setting Up Self-Healing with Elastic Beanstalk
Best Practices for Using Elastic Beanstalk for Self-Healing
Example Case Study: Self-Healing Architecture for a Travel Booking Platform
Best Practices for Building Resilient Architectures and Disaster Recovery
Best Practices for Building Resilient Architectures
Best Practices for Disaster Recovery (DR)
Example Scenario: Disaster Recovery Plan for an E-Learning Platform
Conclusion
7. Exploring the Realm of AWS Data Storage Services
Introduction
Structure
Amazon Simple Storage Service (Amazon S3): The Object Storage Powerhouse
Understanding Object Storage: A New Paradigm
Key Features of Amazon S3: Scalability, Durability, and Flexibility
Exploring S3 Storage Classes: Optimizing Cost and Performance
Accessing and Managing S3 Data: The API and Tools at Your Disposal
Use Cases for S3: From Archiving to Streaming and Analytics
Demystifying NoSQL: A Data Model Built for Agility
Types of NoSQL Databases
Amazon DynamoDB: NoSQL for Speed and Scalability
Unveiling DynamoDB’s Key Features: Fast Performance and High Availability
Designing Data Models in DynamoDB
Primary Keys
Secondary Indexes
Document Structure
Building with DynamoDB: Tools and APIs for Efficient Data Access
DynamoDB API
AWS Management Console
AWS CLI
DynamoDB Mapper
Use Cases for DynamoDB: Powering Mobile Apps, Real-time Applications, and IoT
Amazon Redshift: The Data Warehousing Champion
Unpacking Data Warehousing: A Foundation for Big Data Analytics
Introducing Amazon Redshift: Scalable Data Warehousing in the Cloud
Querying Data with Redshift: SQL and Beyond for Efficient Analysis
Optimizing Redshift Performance: Tuning Queries and Managing Costs
Use Cases for Redshift: Business Intelligence, Reporting, and Advanced Analytics
Amazon Relational Database Service (RDS): The Trusted Choice for Structured Data
Relational Databases: The Bedrock of Data Management
Exploring RDS: Diverse Database Engine Options and Flexible Deployment Choices
Managing RDS Databases: Tools for Administration, Security, and Backup
Scaling RDS: Meeting Your Growing Data Needs with Ease
Use Cases for RDS: Transactional Applications, Web Applications, and E-commerce
Choosing the Right Storage Solution: A Strategic Approach
Matching Data Requirements to Storage Options: Selecting the Perfect Fit
Cost Analysis and Optimization: Balancing Performance with Affordability
Combining Services for Enhanced Functionality: Building Hybrid Solutions
Planning for Future Growth: Choosing Scalable and Flexible Storage Options
Your Journey Begins Now: Embarking on the Path to Data Mastery
Conclusion
8. Orchestrating Data Flow
Introduction
Structure
Unleashing the Symphony: Understanding Data Sources and Formats
Meet the Players: Your Data Flow Instruments
Kinesis Firehose: The Streamlining Screamer
SQS/SNS: The Messengers and Alert System
AWS Lambda: The On-Demand Swiss Army Knife
AWS Glue: The Data Integration Maestro
Kinesis Firehose: Your Real-Time Data Ingestion Powerhouse
SQS/SNS: The Unsung Heroes of Your Data Flow Symphony
SQS: Your Reliable Data Queue Maestro
SNS: Your Data Flow Alert System
Beyond Basic Notifications: Advanced Features
Adapting to Your Data Flow Rhythm
AWS Lambda: Your Serverless Swiss Army Knife for Data Flow Agility
Defining Lambda
AWS Glue: Conducting Your Data Integration Symphony with Ease
Catering to All Data Rhythms
Batch Efficiency: Rehearsing for a Flawless Performance
Real-Time Agility: Improvising with the Flow
Conclusion
9. Advanced Data Pipelines and Transformation
Introduction
Structure
AWS Data Pipeline: Your Orchestrator for Complex Data Workflows
Introduction to AWS Step Functions
Symphony of Services: AWS Data Pipeline versus Glue Jobs versus Lambda versus Step Functions
Choosing the Right Service
Designing for Efficiency: Orchestrating Your Data Flow with Optimization in Mind
Monitoring and Performance Optimization: Conducting Your Data Flow Symphony with Insights
Advanced-Data Transformations: Unveiling Hidden Insights and Building Smarter Models
Anomaly Detection
Feature Engineering
Complex Data Cleaning and Validation
Deep Dive into Step Functions
Chaining Lambda Functions
Conditional Branching
Wait States
Advanced Glue Job Techniques: Unveiling the Power of Custom Transformations
Custom Spark and Python Scripts
Integrating with External Libraries and Tools
Optimizing Glue Job Performance
End-to-End Orchestration Strategies: Conducting Your Data Flow Symphony with Harmony and Security
Combining Different Services for Seamless Data Flow
Managing Dependencies and Data Lineage
Ensuring Data Security and Governance
The Final Curtain: Data Pipelines as Symphonies of Insight
Conclusion
10. Data Warehousing Demystified
Introduction
Structure
Part 1: Data Warehouses and Data Lakes
Understanding Data Warehouses
Demystifying Data Warehouses and Data Lakes
Data Warehouses: The Structured Sanctuary
Data Lakes: The Unstructured Oasis
Part 2: Building and Querying Your Redshift Data Warehouse
Building Your Redshift Data Warehouse: From Blueprint to Oasis
Setting Up Redshift for Your Specific Needs: A Comprehensive Guide
Instance Selection: Power, Storage, and Budget in Harmony
Configuration Options: Beyond the Instance Type
Cost Optimization: Leveraging Strategies and Tools
Integration with Other Services: Building a Seamless Pipeline
Data Warehouse Schema Design: Building a Foundation for Analysis
Understanding Data Model Types
Data Modeling Tools
Denormalization for Performance
Data Partitioning
Getting Your Data In: Strategies for Loading Your Redshift Data Warehouse
Extract, Transform, Load (ETL) with Glue
Copy Commands: A Simpler Approach for Specific Scenarios
Incremental Loading: Efficient Updates for Evolving Data
Monitoring and Optimization: Ensuring Smooth Data Flow
Mastering SQL for Data Exploration: Advanced Features for Complex Queries
JOINs: Uniting Data from Different Tables
Window Functions: Calculations within Data “Windows”
Common Table Expressions (CTEs): Reusable Building Blocks
User-Defined Functions (UDFs): Extending SQL Functionality
Part 3: Advanced Techniques and Best Practices Taking Your Redshift Data Warehouse to the Next Level
Optimizing Your Redshift Data Warehouse: Diving Deeper
Monitoring and Performance Analysis: Unmasking Bottlenecks
Denormalization: A Double-Edged Sword for Performance
Clustering and Partitioning: Organizing for Efficiency
Cost Optimization in Redshift
Security and Governance Considerations: Safeguarding Your Data Warehouse
Advanced Analytics with Redshift: Unlocking Deeper Insights
Redshift Integration with Other Data Analytics Tools: A Deep Dive
Conclusion
11. Visualizing the Unseen
Introduction
Structure
The Power of Data Visualization: From Data to Narrative
Unlocking the Potential of Data Storytelling
Benefits of Data Visualization
Exploring Data Visualization Tools and Techniques: Popular Options
Popular Data Visualization Tools
Essential Visualization Techniques: Unveiling the Power of Data through Visuals
Designing Effective Visualizations: Transforming Data into Captivating Narratives
Connecting Redshift to BI Tools: Interactive Data Exploration
Unlocking the Power of Interactive Insights
The Power of BI Tools
Connecting Redshift to Tableau and Power BI
Exploring Both Batch and Real-Time Data Sets
A Glimpse into the Future of Data Visualization and Storytelling: Embracing New Frontiers
Summary: Visualizing the Unseen - A Quick-Review Guide
Conclusion
12. AWS Machine Learning: Classic AI to Generative AI
Introduction
Structure
Foundations of Machine Learning on AWS
Amazon SageMaker: An End-to-End Solution for ML
Model Training and Optimization with SageMaker
Example Case Study
Classic Machine Learning Workflows on AWS
Data Preparation and Feature Engineering
Training, Validation, and Evaluation
Deploying and Scaling Models
Best Practices for Classic Machine Learning Workflows
Example Case Study
Summary of Classic ML Workflows on AWS
Natural Language Processing (NLP) with AWS
Text Analysis with Amazon Comprehend
Speech Recognition and Synthesis: Amazon Transcribe and Polly
Building Custom NLP Models on Amazon SageMaker
Practical Applications of NLP on AWS
Example Case Study
Generative AI (GenAI) on AWS
Overview of GenAI Capabilities on AWS
Amazon Bedrock for GenAI
Building Custom Generative Models with Amazon SageMaker
Practical Applications of GenAI on AWS
Best Practices for Deploying GenAI on AWS
Example Case Study
Best Practices for Monitoring, Security, and Cost Management in Machine Learning on AWS
Monitoring and Model Drift Detection
Security in Machine Learning Workflows
Optimizing Costs in Machine Learning on AWS
Best Practices for Sustainable ML Operations
Example Case Study
Conclusion
13. Advanced Data Engineering with AWS
Introduction
Structure
The Evolution of Serverless Architectures
Benefits and Trade-offs of Serverless for Data Engineering
Key AWS Serverless Tools for Data Engineering
Selecting Serverless for Data Engineering
Best Practices for Building Serverless Data Architectures
Example Case Study
Real-Time Data Processing and Streaming Analytics
Benefits and Challenges of Real-Time Data Processing
Key AWS Tools for Real-Time Data Processing and Streaming Analytics
Implementing Real-Time Data Pipelines on AWS
Best Practices for Real-Time Data Processing
Example Case Study
Data Governance and Security in Modern Architectures
Key Components of Data Governance in Distributed Architectures
Key AWS Tools for Data Governance and Security
Best Practices for Data Governance and Security
Example Case Study
Conclusion
Final Thoughts
Index