
Sampling and Estimation in Statistics


Sampling Distribution

A sampling distribution is the probability distribution of a sample statistic (such as the mean) obtained by drawing all possible samples of a given size ‘n’ from the population and calculating the statistic for each sample.

 

Sampling Distribution of Sample Mean and the Central Limit Theorem

If you draw samples of size ‘n’ from a population, calculate the mean of each sample and then plot the probability distribution of the random variable X̅ (where X̅ denotes the sample mean), the resulting distribution is called the ‘sampling distribution of sample means’. The mean of the sample means is denoted by μx̅. The standard deviation of the sampling distribution of the sample means is denoted by σx̅.

 

Central Limit Theorem: In simple words, the central limit theorem can be stated as follows: 

When you draw a sampling distribution of sample means, where the sample size is sufficiently large, the sampling distribution of the sample means will look like a normal distribution.

 

When is the sample size (n) considered sufficiently large?

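For a non-normally distributed population, ‘n’ should be greater than or equal to 30. (This rule of 30 is a simplification, a rule of thumb rather than a strict threshold: heavily skewed populations may need larger samples.) For a normally distributed population, the sampling distribution of the sample means is normal for any sample size.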
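The effect of sample size can be checked with a small simulation. The sketch below is only an illustration under stated assumptions: the population is taken to be exponential (a clearly skewed, non-normal choice made here, not in the original notes), and the helper names `skewness` and `simulate_sample_means` are hypothetical. Skewness near 0 indicates a symmetric, more normal-looking shape.

```python
import random
import statistics

def skewness(values):
    """Skewness: mean cubed deviation divided by the cubed standard deviation."""
    m = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return statistics.fmean([(v - m) ** 3 for v in values]) / sd ** 3

def simulate_sample_means(n, num_samples=4000, seed=42):
    """Means of num_samples random samples of size n from an exponential population."""
    rng = random.Random(seed)
    return [
        statistics.fmean([rng.expovariate(1.0) for _ in range(n)])
        for _ in range(num_samples)
    ]

# Skewness of the simulated sampling distribution for several sample sizes.
skew_by_n = {n: skewness(simulate_sample_means(n)) for n in (2, 10, 30, 100)}
for n, s in skew_by_n.items():
    print(f"n = {n:3d}  skewness of sample means = {s:.3f}")
```

The skewness should shrink toward 0 as n grows, which is the central limit theorem at work. Exact values depend on the seed and the number of simulated samples.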

 

Significance of the central limit theorem: The central limit theorem states that for a sufficiently large sample size, the sampling distribution is approximately normally distributed. This approximation improves as the sample size increases. Because the sampling distribution of sample means is approximately normal, it has its own normal variate (Z). In the next section, you will see how this Z is used to estimate population parameters.

 

Important properties:

Mean of the sample means (μx̅) = Mean of the population (μ)

Standard deviation of the sample means (σx̅) = σ / √n, where σ is the population standard deviation and n is the size of each sample.
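The two properties above (the mean of the sample means equals μ, and their standard deviation equals σ / √n) can be checked numerically. This is a minimal sketch, assuming a hypothetical population of the integers 1 to 100 and random sampling with replacement. It draws many random samples rather than all possible samples, so the agreement is approximate. The function name `sample_means` is made up for this illustration.

```python
import random
import statistics

def sample_means(population, n, num_samples, seed=0):
    """Draw num_samples random samples of size n (with replacement); return their means."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.choices(population, k=n))
        for _ in range(num_samples)
    ]

# Hypothetical population, chosen only for illustration.
population = list(range(1, 101))
n = 25
means = sample_means(population, n=n, num_samples=5000)

mu = statistics.fmean(population)        # population mean
sigma = statistics.pstdev(population)    # population standard deviation
mean_of_means = statistics.fmean(means)
sd_of_means = statistics.pstdev(means)

print(f"population mean          = {mu:.3f}")
print(f"mean of sample means     = {mean_of_means:.3f}")
print(f"sigma / sqrt(n)          = {sigma / n ** 0.5:.3f}")
print(f"std dev of sample means  = {sd_of_means:.3f}")
```

The first pair and the second pair of printed values should be close to each other, and they get closer as `num_samples` increases.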

So, the normal variate or the Z-score for the sampling distribution of sample means is:

Z = ( x̅ - μ ) / (  σ / √n )
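As a worked example of the Z-score formula, the sketch below uses made-up numbers (μ = 50, σ = 10, n = 25 and an observed sample mean of 53). The function name `z_score` is hypothetical. `NormalDist` comes from Python's standard `statistics` module.

```python
from math import sqrt
from statistics import NormalDist

def z_score(sample_mean, mu, sigma, n):
    """Z = (sample_mean - mu) / (sigma / sqrt(n)) for samples of size n."""
    return (sample_mean - mu) / (sigma / sqrt(n))

# Hypothetical numbers, chosen only for illustration.
z = z_score(sample_mean=53, mu=50, sigma=10, n=25)

# Probability that a sample mean is at most 53, under the normal approximation.
p_below = NormalDist().cdf(z)

print(f"Z = {z:.2f}")                      # Z = 1.50
print(f"P(sample mean <= 53) = {p_below:.4f}")
```

Here the standard deviation of the sample means is 10 / √25 = 2, so a sample mean of 53 lies 1.5 such units above μ.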

 

Estimation

The process of drawing inferences about a population using the information from its samples is known as estimation.

 

Types of estimation

  1. Point estimate: Here, a single statistic obtained from a sample is used to estimate a population parameter. Its accuracy depends on how well the sample represents the population, and the estimates obtained from different samples will generally differ. This is why an interval estimate is usually preferred to a point estimate.

  2. Interval estimate: Here, the lower and upper limits (that is, the confidence interval) within which a population parameter is expected to lie are estimated, together with a stated level of confidence.

 

The mathematics involved in interval estimate:

As discussed above, the normal variate of the sampling distribution of sample means is: 

Z = (X̅ - μ)/(σ/√n)

Rearranging the equation above, you get:  

(X̅ - μ) = Z*(σ/√n)

Since Z can be either positive or negative (it is negative when the sample mean is smaller than the population mean), you have:

(X̅ - μ) = ± Z*(σ/√n)

The equation above can be rearranged to:  

μ = X̅ ± Z*(σ/√n)

 

So, you can say that the population mean μ will lie between:

X̅ - Z*(σ/√n) < μ < X̅ + Z*(σ/√n)

 

 

The formula above is used to calculate the upper and the lower limits of μ for a certain level of confidence (a certain value of Z), where the value of σ is known.
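The interval formula for a known σ can be sketched as follows. The numbers (sample mean 53, σ = 10, n = 25) and the function name `z_confidence_interval` are assumptions made for this illustration. The value of Z for a given confidence level is taken from the standard normal distribution via `NormalDist().inv_cdf`.

```python
from math import sqrt
from statistics import NormalDist

def z_confidence_interval(sample_mean, sigma, n, confidence=0.95):
    """Confidence interval for the population mean when sigma is known."""
    alpha = 1 - confidence
    # Z value that leaves alpha/2 in each tail of the standard normal distribution.
    z = NormalDist().inv_cdf(1 - alpha / 2)
    margin = z * sigma / sqrt(n)
    return sample_mean - margin, sample_mean + margin

# Hypothetical numbers, chosen only for illustration.
low, high = z_confidence_interval(sample_mean=53, sigma=10, n=25, confidence=0.95)
print(f"95% confidence interval for the mean: ({low:.2f}, {high:.2f})")
```

For a 95% confidence level, Z is about 1.96, so the margin here is roughly 1.96 × 2 = 3.92 on either side of the sample mean.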

What if the value of σ is not known? In that case, you use the t-distribution.
 

T-distribution

Properties of T-distribution:

  1. It can only be applied when the samples are drawn from a normally distributed population.

  2. It is flatter than a normal distribution, with heavier tails. As the degrees of freedom increase, it approaches the normal distribution.

 

Degrees of freedom = Sample size - Number of parameters estimated from the sample

Here, only one parameter is estimated from the sample: the sample mean X̅ stands in for μ when the sample standard deviation ‘s’ is calculated. So, the degrees of freedom for the t-distribution are given by ‘sample size (n) - 1’.

 

Test statistic (t-score) for the t-distribution: t = (X̅ - μ)/(s/√n),

where ‘s’ is the sample standard deviation.

 

The formula to find the confidence interval is:

X̅ - t(α/2)*(s/√n) < μ < X̅ + t(α/2)*(s/√n),

where 1-α is the confidence level and t(α/2) is the t-value with n-1 degrees of freedom that leaves an area of α/2 in the upper tail.
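A minimal sketch of the t-based interval follows. The sample values are hypothetical, and so is the function name `t_confidence_interval`. Python's standard library has no t-distribution, so the critical value is passed in as a number read from a standard t-table (2.262 for a 95% confidence level with 9 degrees of freedom).

```python
from math import sqrt
from statistics import mean, stdev

def t_confidence_interval(sample, t_critical):
    """Confidence interval for the population mean when sigma is unknown.

    t_critical is the t-table value for the chosen confidence level
    with len(sample) - 1 degrees of freedom.
    """
    n = len(sample)
    x_bar = mean(sample)
    s = stdev(sample)  # sample standard deviation (divides by n - 1)
    margin = t_critical * s / sqrt(n)
    return x_bar - margin, x_bar + margin

# Hypothetical sample of n = 10 measurements, chosen only for illustration.
sample = [12.1, 11.8, 12.5, 12.0, 11.6, 12.3, 12.2, 11.9, 12.4, 11.7]

# Table value for 95% confidence (alpha/2 = 0.025) and n - 1 = 9 degrees of freedom.
T_CRIT_95_DF9 = 2.262

low, high = t_confidence_interval(sample, T_CRIT_95_DF9)
print(f"95% confidence interval for the mean: ({low:.3f}, {high:.3f})")
```

Because 2.262 is larger than the corresponding Z value of about 1.96, the t-based interval is wider than a Z-based interval would be for the same data, reflecting the extra uncertainty from estimating σ with ‘s’.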

 
