دسترسی نامحدود
برای کاربرانی که ثبت نام کرده اند
برای ارتباط با ما می توانید از طریق شماره موبایل زیر از طریق تماس و پیامک با ما در ارتباط باشید
در صورت عدم پاسخ گویی از طریق پیامک با پشتیبان در ارتباط باشید
برای کاربرانی که ثبت نام کرده اند
درصورت عدم همخوانی توضیحات با کتاب
از ساعت 7 صبح تا 10 شب
ویرایش:
نویسندگان: Rongpeng Li
سری:
ISBN (شابک) : 9781838984847
ناشر: Packt publishing pvt. ltd
سال نشر: 2020
تعداد صفحات: 0
زبان: English
فرمت فایل : EPUB (درصورت درخواست کاربر به PDF، EPUB یا AZW3 تبدیل می شود)
حجم فایل: 9 مگابایت
در صورت تبدیل فایل کتاب Essential Statistics for Non-STEM Data Analysts به فرمت های PDF، EPUB، AZW3، MOBI و یا DJVU می توانید به پشتیبان اطلاع دهید تا فایل مورد نظر را تبدیل نمایند.
توجه داشته باشید کتاب آمار ضروری برای تحلیلگران داده غیرSTEM نسخه زبان اصلی می باشد و کتاب ترجمه شده به فارسی نمی باشد. وبسایت اینترنشنال لایبرری ارائه دهنده کتاب های زبان اصلی می باشد و هیچ گونه کتاب ترجمه شده یا نوشته شده به فارسی را ارائه نمی دهد.
Cover Title Page Copyright and Credits About Packt Contributors Table of Contents Preface Section 1: Getting Started with Statistics for Data Science Chapter 1: Fundamentals of Data Collection, Cleaning, and Preprocessing Technical requirements Collecting data from various data sources Reading data directly from files Obtaining data from an API Obtaining data from scratch Data imputation Preparing the dataset for imputation Imputation with mean or median values Imputation with the mode/most frequent value Outlier removal Data standardization – when and how Examples involving the scikit-learn preprocessing module Imputation Standardization Summary Chapter 2: Essential Statistics for Data Assessment Classifying numerical and categorical variables Distinguishing between numerical and categorical variables Understanding mean, median, and mode Mean Median Mode Learning about variance, standard deviation, quartiles,percentiles, and skewness Variance Standard deviation Quartiles Skewness Knowing how to handle categorical variables and mixed data types Frequencies and proportions Transforming a continuous variable to a categorical one Using bivariate and multivariate descriptive statistics Covariance Cross-tabulation Summary Chapter 3: Visualization with Statistical Graphs Basic examples with the Python Matplotlib package Elements of a statistical graph Exploring important types of plotting in Matplotlib Advanced visualization customization Customizing the geometry Customizing the aesthetics Query-oriented statistical plotting Example 1 – preparing data to fit the plotting function API Example 2 – combining analysis with plain plotting Presentation-ready plotting tips Use styling Font matters a lot Summary Section 2: Essentials of Statistical Analysis Chapter 4: Sampling and Inferential Statistics Understanding fundamental concepts in sampling techniques Performing proper sampling under different scenarios The dangers associated with non-probability sampling Probability sampling – the safer approach Understanding statistics associated with sampling Sampling distribution of the sample mean Standard error of the sample mean The central limit theorem Summary Chapter 5: Common Probability Distributions Understanding important concepts in probability Events and sample space The probability mass function and the probability density function Subjective probability and empirical probability Understanding common discrete probability distributions Bernoulli distribution Binomial distribution Poisson distribution Understanding the common continuous probability distribution Uniform distribution Exponential distribution Normal distribution Learning about joint and conditional distribution Independency and conditional distribution Understanding the power law and black swan The ubiquitous power law Be aware of the black swan Summary Chapter 6: Parametric Estimation Understanding the concepts of parameter estimation and the features of estimators Evaluation of estimators Using the method of moments to estimate parameters Example 1 – the number of 911 phone calls in a day Example 2 – the bounds of uniform distribution Applying the maximum likelihood approach with Python Likelihood function MLE for uniform distribution boundaries MLE for modeling noise MLE and the Bayesian theorem Summary Chapter 7: Statistical Hypothesis Testing An overview of hypothesis testing Understanding P-values, test statistics, and significance levels Making sense of confidence intervals and P-values from visual examples Calculating the P-value from discrete events Calculating the P-value from the continuous PDF Significance levels in t-distribution The power of a hypothesis test Using SciPy for common hypothesis testing The paradigm T-test The normality hypothesis test The goodness-of-fit test A simple ANOVA model Stationarity tests for time series Examples of stationary and non-stationary time series Appreciating A/B testing with a real-world example Conducting an A/B test Randomization and blocking Common test statistics Common mistakes in A/B tests Summary Section 3: Statistics for Machine Learning Chapter 8: Statistics for Regression Understanding a simple linear regression model and its rich content Least squared error linear regression and variance decomposition The coefficient of determination Hypothesis testing Connecting the relationship between regression and estimators Simple linear regression as an estimator Having hands-on experience with multivariate linear regression and collinearity analysis Collinearity Learning regularization from logistic regression examples Summary Chapter 9: Statistics for Classification Understanding how a logistic regression classifier works The formulation of a classification problem Implementing logistic regression from scratch Evaluating the performance of the logistic regression classifier Building a naïve Bayes classifier from scratch Underfitting, overfitting, and cross-validation Summary Chapter 10: Statistics for Tree-Based Methods Overviewing tree-based methods for classification tasks Growing and pruning a classification tree Understanding how splitting works Evaluating decision tree performance Exploring regression tree Using tree models in scikit-learn Summary Chapter 11: Statistics for Ensemble Methods Revisiting bias, variance, and memorization Understanding the bootstrapping and bagging techniques Understanding and using the boosting module Exploring random forests with scikit-learn Summary Section 4: Appendix Chapter 12: A Collection of Best Practices Understanding the importance of data quality Understanding why data can be problematic Avoiding the use of misleading graphs Example 1 – COVID-19 trend Example 2 – Bar plot cropping Fighting against false arguments Summary Chapter 13: Exercises and Projects Exercises Chapter 1 – Fundamentals of Data Collection, Cleaning, and Preprocessing Chapter 2 – Essential Statistics for Data Assessment Chapter 3 – Visualization with Statistical Graphs Chapter 4 – Sampling and Inferential Statistics Chapter 5 – Common Probability Distributions Chapter 6 – Parameter Estimation Chapter 7 – Statistical Hypothesis Testing Chapter 8 – Statistics for Regression Chapter 9 – Statistics for Classification Chapter 10 – Statistics for Tree-Based Methods Chapter 11 – Statistics for Ensemble Methods Project suggestions Non-tabular data Real-time weather data Goodness of fit for discrete distributions Building a weather prediction web app Building a typing suggestion app Further reading Textbooks Visualization Exercising your mind Summary Other Books You May Enjoy Index