COURSES
MBAData Science & AnalyticsDoctorate Software & Tech AI | ML MarketingManagement
Professional Certificate Programme in HR Management and AnalyticsPost Graduate Certificate in Product ManagementExecutive Post Graduate Program in Healthcare ManagementExecutive PG Programme in Human Resource ManagementMBA in International Finance (integrated with ACCA, UK)Global Master Certificate in Integrated Supply Chain ManagementAdvanced General Management ProgramManagement EssentialsLeadership and Management in New Age BusinessProduct Management Online Certificate ProgramStrategic Human Resources Leadership Cornell Certificate ProgramHuman Resources Management Certificate Program for Indian ExecutivesGlobal Professional Certificate in Effective Leadership and ManagementCSM® Certification TrainingCSPO® Certification TrainingLeading SAFe® 5.1 Training (SAFe® Agilist Certification)SAFe® 5.1 POPM CertificationSAFe® 5.1 Scrum Master Certification (SSM)Implementing SAFe® 5.1 with SPC CertificationSAFe® 5 Release Train Engineer (RTE) CertificationPMP® Certification TrainingPRINCE2® Foundation and Practitioner Certification
Law
Job Linked
Bootcamps
Study Abroad
MS in Data AnalyticsMS in Project ManagementMS in Information TechnologyMasters Degree in Data Analytics and VisualizationMasters Degree in Artificial IntelligenceMBS in Entrepreneurship and MarketingMSc in Data AnalyticsMS in Data AnalyticsMS in Computer ScienceMaster of Science in Business AnalyticsMaster of Business Administration MS in Data ScienceMS in Information TechnologyMaster of Business AdministrationMS in Applied Data ScienceMaster of Business Administration | STEMMS in Data AnalyticsMaster of Business AdministrationMS in Information Technology and Administrative Management MS in Computer Science Master of Business Administration Master of Business Administration-90 ECTSMSc International Business ManagementMS Data Science Master of Business Administration MSc Business Intelligence and Data ScienceMS Data Analytics MS in Management Information SystemsMSc International Business and ManagementMS Engineering ManagementMS in Machine Learning EngineeringMS in Engineering ManagementMSc Data EngineeringMSc Artificial Intelligence EngineeringMPS in InformaticsMPS in Applied Machine IntelligenceMS in Project ManagementMPS in AnalyticsMS in Project ManagementMS in Organizational LeadershipMPS in Analytics - NEU CanadaMBA with specializationMPS in Informatics - NEU Canada Master in Business AdministrationMS in Digital Marketing and MediaMSc Sustainable Tourism and Event ManagementMSc in Circular Economy and Sustainable InnovationMSc in Impact Finance and Fintech ManagementMS Computer ScienceMBA in Technology, Innovation and EntrepreneurshipMSc Data Science with Work PlacementMSc Global Business Management with Work Placement MBA with Work PlacementMS in Robotics and Autonomous SystemsMS in Civil EngineeringMS in Internet of ThingsMSc International Logistics and Supply Chain ManagementMBA- Business InformaticsMSc International ManagementMBA in Strategic Data Driven ManagementMSc Digital MarketingMBA Business and MarketingMSc in Sustainable Global Supply Chain ManagementMSc Digital Business Analytics MSc in International HospitalityMSc Luxury and Innovation ManagementMaster of Business Administration-International Business ManagementMS in Computer EngineeringMS in Industrial and Systems EngineeringMaster in ManagementMSc MarketingMSc Global Supply Chain ManagementMS in Information Systems and Technology with Business Intelligence and Analytics ConcentrationMSc Corporate FinanceMSc Data Analytics for BusinessMaster of Business AdministrationMaster of Business AdministrationMaster of Business AdministrationMSc in International FinanceMSc in International Management and Global LeadershipMaster of Business AdministrationBachelor of BusinessBachelor of Business AnalyticsBachelor of Information TechnologyMaster of Business AdministrationMBA Business AnalyticsMSc in Marketing Analytics and Data IntelligenceMS Biotechnology Management and EntrepreneurshipMSc in Luxury and Fashion ManagementMaster of Business Administration (90 ECTS)Bachelor of Business Administration (180 ECTS)B.Sc. Computer Science (180 ECTS) MSc in International Corporate Finance MSc in Sustainable Luxury and Creative IndustriesMSc Digital MarketingMSc Global Supply Chain Management (PGMP)MSc Marketing (PGMP)MSc Corporate Finance (PGMP)MSc Data Analytics for Business (PGMP)MS Business AnalyticsMaster of Business AdministrationMS Quantitative FinanceMS Fintech ManagementMS Business Analytics PGMPState University of New York Bachelors Program - STEM
For College Students

Quantitative Variables in Univariate Analysis Using Excel

$$/$$

You have learnt how to conduct univariate analysis on categorical variables. Now, let's look at quantitative or numeric variables.

 

Prerequisites

In this segment, Anand will take you through various summary metrics. Knowledge of these concepts is very essential for this topic and the forthcoming topics in other modules, so make sure that you familiarise yourself with those concepts before moving ahead. 

 

Before going further, please go through some basics of statistics given below.

Mean: This is the sum of all the data values, divided by the total number of sample values.

  • Suppose you want to compare two cricket batsmen based on the number of runs that they score for their teams in every match. Mean is the best way to measure the central tendency in this case. It is commonly represented by the symbol 𝝁.    
  • Mean can be calculated using the Excel function AVERAGE(A1:A20) if the data is distributed over A1:A20 in the Excel workbook.
  • For instance, consider 5 people whose scores in a Mathematics test are shown below.
  • Student nameScore (out of 20 marks)
    Raj12
    Pawrush14
    Srijan19
    Anjali20
    Anamika20

In the above example, the mean value would be the sum of all the score values (85) divided by the number of values (5), which is 17.

 

Mode: In your sample data, the value that has the highest frequency is the mode.

  • In the case of categorical data, it is not possible to measure the mean for a central tendency. This is because mathematical operations cannot be performed on categorical data.
  •  If you consider democratic elections, the decision is made on the basis of who received the maximum number of votes. Essentially, the mode wins in this case.   
  • Note: There can be more than one mode in a sample. For instance, there can be elections in which three parties participate, two of those get 40% of the votes each, and the third party gets 20% of the votes. In this case, there are two modes since two parties have the highest (equal) number of votes.

 

Median: If you arrange the sample data in ascending order of frequency, from left to right, the value in the middle is called the median.

  • The reason why it is a good measure of central tendency is that the number of samples is the same on both its sides (left and right).
  • When there are extremes or outliers in a sample of numerical data, the median is a better measure of central tendency.
  • Note: For even number of data points or intervals, there are two medians, and for an odd number of data points, there is one median.

 

Let’s now learn how to analyse quantitative variables.

$$/$$

Mean and median are single values that give a broad representation of the entire data. As Anand clearly stated, it is very important to understand when to use these metrics to avoid inaccurate analysis.


While mean gives an average of all the values, median gives a typical value that could be used to represent the entire group. As a simple rule of thumb, always question someone if they use mean because median is almost always a better measure of ‘representativeness’.

 

Let’s now look at some other summary descriptive statistics such as mode, interquartile distance, standard deviation, etc.

$$/$$

Standard deviation and interquartile difference are both used to represent the spread of the data.

 

Interquartile difference is a much better metric than standard deviation if there are outliers in the data. This is because the standard deviation will be influenced by outliers, whereas the interquartile difference will simply ignore them.

 

You also saw how box plots are used to understand the spread of data.