دسترسی نامحدود
برای کاربرانی که ثبت نام کرده اند
برای ارتباط با ما می توانید از طریق شماره موبایل زیر از طریق تماس و پیامک با ما در ارتباط باشید
در صورت عدم پاسخ گویی از طریق پیامک با پشتیبان در ارتباط باشید
برای کاربرانی که ثبت نام کرده اند
درصورت عدم همخوانی توضیحات با کتاب
از ساعت 7 صبح تا 10 شب
ویرایش:
نویسندگان: Mike Kahn
سری:
ISBN (شابک) : 9781805125266
ناشر: Packt
سال نشر: 2023
تعداد صفحات: 264
زبان: English
فرمت فایل : PDF (درصورت درخواست کاربر به PDF، EPUB یا AZW3 تبدیل می شود)
حجم فایل: 7 Mb
در صورت تبدیل فایل کتاب Data Exploration and Preparation with BigQuery: A practical guide helping you clean, transform, and analyze data for business به فرمت های PDF، EPUB، AZW3، MOBI و یا DJVU می توانید به پشتیبان اطلاع دهید تا فایل مورد نظر را تبدیل نمایند.
توجه داشته باشید کتاب کاوش و آمادهسازی داده با BigQuery: راهنمای عملی که به شما کمک میکند دادهها را برای کسبوکار تمیز، تبدیل و تجزیه و تحلیل کنید. نسخه زبان اصلی می باشد و کتاب ترجمه شده به فارسی نمی باشد. وبسایت اینترنشنال لایبرری ارائه دهنده کتاب های زبان اصلی می باشد و هیچ گونه کتاب ترجمه شده یا نوشته شده به فارسی را ارائه نمی دهد.
Data Exploration and Preparation with BigQuery is a comprehensive guide to working with data preparation tools and strategies using BigQuery as a modern data warehouse solution.
Data Exploration and Preparation with BigQuery Contributors About the author About the reviewers Preface Who this book is for What this book covers To get the most out of this book Download the example code files Conventions used Get in touch Share Your Thoughts Download a free PDF copy of this book Part 1: Introduction to BigQuery 1 Introducing BigQuery and Its Components Technical requirements What is BigQuery? Understanding how BigQuery works Dremel, the execution engine Colossus distributed storage The Borg compute platform Jupiter network infrastructure BigQuery administration and access Tools for administration Understanding identity and access management BigQuery best practices and cost management Best practices Understanding and controlling costs Extending your data BigQuery ML External datasets External connections Summary References 2 BigQuery Organization and Design Technical requirements Understanding BigQuery’s resource hierarchy Organizations, folders, and projects BigQuery-specific resources BigQuery storage Exploring architecture patterns The centralized enterprise data warehouse The decentralized data warehouse The cross-org data exchange Schema design Table design Summary Part 2: Data Exploration with BigQuery 3 Exploring Data in BigQuery Technical requirements What is data exploration? Fundamentals Data life cycle Common challenges and solutions Introduction to exploring data in BigQuery Exploring data in the BigQuery SQL workspace Exploring schema and table structure Exploring data using SQL Exploring data using the bq command-line interface Exploring data with visualization tools Enhancing data exploration in BigQuery Advanced approaches Best practices Summary 4 Loading and Transforming Data Technical requirements Exploring data loading techniques Batch loading data Streaming ingestion of data Scheduled loading of data Situations where you do not need to load data Data transformation with BigQuery Evaluating ETL and ELT approaches for data integration Hands-on exercise – data loading and transformation in BigQuery Understanding the scenario Loading data from a local file Preparing and transforming data Summary 5 Querying BigQuery Data Technical requirements Understanding query structure Action command – the SELECT clause Location command – the FROM clause Filtering command – the WHERE clause Selection handling commands – the GROUP BY, HAVING, ORDER BY, and LIMIT clauses Understanding data types Using expressions and aggregations Expressions Aggregations Joining tables Inner joins Outer joins Using functions Advanced querying techniques Subqueries Window functions Common table expressions Array functions Saving, sharing, and scheduling queries Optimizing queries Troubleshooting queries Summary Further reading 6 Exploring Data with Notebooks Technical requirements Understanding the value of using notebooks Jupyter notebooks Using Workbench notebook instances in Vertex AI Creating a managed notebook Executions and schedules Hands-on exercise – analyzing Google Trends data with Workbench Using Colab notebooks Comparing Workbench instances and Colab Summary Further reading 7 Further Exploring and Visualizing Data Technical requirements Understanding data distributions Why is it important to understand data distributions? Uncovering relationships in data Exploring BigQuery data with Google Sheets Connecting to Sheets from BigQuery using Explore with Sheets Connecting to BigQuery using Connected Sheets Column statistics Collaboration with BigQuery data in Sheets Visualizing BigQuery data using Looker Studio Creating the right visualizations Hands-on exercise – creating visualizations with Looker Studio Commonly created charts Calculations in visualization tools Data quality discovery while creating visualizations Filtering data in visualizations Integrating other visualization tools with BigQuery Summary Further reading Part 3: Data Preparation with BigQuery 8 An Overview of Data Preparation Tools Technical requirements Getting started with data preparation Clearly defining your data preparation goals Evaluating your current data quality Data cleansing and transformation Validating prepared data Data preparation approaches Data preparation tools Visual data preparation tools Query and code-based tools Automated data preparation Summary Further reading 9 Cleansing and Transforming Data Technical requirements Using ELT for cleansing and transforming data Assessing dataset integrity The shape of the dataset Skew of the dataset Data profiling Data validation Data visualization Using SQL for data cleansing and transformation SQL data cleansing strategies and examples SQL data transformation strategies and examples Writing query results Using Cloud Dataprep for visual cleansing and transformation Summary Further reading 10 Best Practices for Data Preparation, Optimization, and Cost Control Technical requirements Data preparation best practices Understanding your data and business requirements Denormalizing your data Optimizing schema design Considering nested and repeated fields Using correct data types Data cleansing and validation Partitioning and clustering Optimizing data loading Best practices for optimizing storage Long-term and compressed storage Cross-cloud data analytics with federated access model and BigQuery Omni Backup and recovery Best practices for optimizing compute Analysis cost options Query optimization Query optimization cheat sheet Monitoring and controlling costs Query plan and query performance insights Monitoring, estimating, and optimizing costs Controlling costs Summary Further reading Part 4: Hands-On and Conclusion 11 Hands-On Exercise – Analyzing Advertising Data Technical requirements Exercise and use case overview Loading CSV data files from local upload Data preparation Standardizing date formats Data exploration, analysis, and visualization Analyzing ads and sales data Return on ad spend Visualizations Summary References 12 Hands-On Exercise – Analyzing Transportation Data Technical requirements Exercise and use case overview Loading data from GCS to BigQuery Uploading data files to Google Cloud Storage Loading data into BigQuery Data preparation Data exploration and analysis Visualizing data with BigQuery geography functions Summary Further reading 13 Hands-On Exercise – Analyzing Customer Support Data Technical requirements Exercise and use case overview Data loading from CSV upload Data preparation Data exploration and analysis Count of ticket_type across both datasets The most common support issues using ticket_subject data Average resolution time per ticket_type Customer demographics using customer_age and customer_gender Analyzing emotions with sentiment analysis Creating a connection Granting access to the external connection service account Creating a model Querying the model Summary References and further reading 14 Summary and Future Directions Summary of key points Chapter 1, Introducing BigQuery and Its Components Chapter 2, BigQuery Organization and Design Chapter 3, Exploring Data in BigQuery Chapter 4, Loading and Transforming Data Chapter 5, Querying BigQuery Data Chapter 6, Exploring Data with Notebooks Chapter 7, Further Exploring and Visualizing Data Chapter 8, An Overview of Data Preparation Tools Chapter 9, Cleansing and Transforming Data Chapter 10, Best Practices for Data Preparation, Optimization, and Cost Control Chapter 11, Hands-On Exercise – Analyzing Advertising Data Chapter 12, Hands-On Exercise – Analyzing Transportation Data Chapter 13, Hands-On Exercise – Analyzing Customer Support Data Future directions More integration with AI and ML Generative AI Natural language queries DataOps Hybrid and multi-cloud data analysis Zero-ETL and real-time analytics Data governance and privacy Federated learning Data clean rooms Data monetization Additional resources Final words Index Why subscribe? Other Books You May Enjoy Packt is searching for authors like you Share Your Thoughts Download a free PDF copy of this book