COURSES
MBAData Science & AnalyticsDoctorate Software & Tech AI | ML MarketingManagement
Professional Certificate Programme in HR Management and AnalyticsPost Graduate Certificate in Product ManagementExecutive Post Graduate Program in Healthcare ManagementExecutive PG Programme in Human Resource ManagementMBA in International Finance (integrated with ACCA, UK)Global Master Certificate in Integrated Supply Chain ManagementAdvanced General Management ProgramManagement EssentialsLeadership and Management in New Age BusinessProduct Management Online Certificate ProgramStrategic Human Resources Leadership Cornell Certificate ProgramHuman Resources Management Certificate Program for Indian ExecutivesGlobal Professional Certificate in Effective Leadership and ManagementCSM® Certification TrainingCSPO® Certification TrainingLeading SAFe® 5.1 Training (SAFe® Agilist Certification)SAFe® 5.1 POPM CertificationSAFe® 5.1 Scrum Master Certification (SSM)Implementing SAFe® 5.1 with SPC CertificationSAFe® 5 Release Train Engineer (RTE) CertificationPMP® Certification TrainingPRINCE2® Foundation and Practitioner Certification
Law
Job Linked
Bootcamps
Study Abroad
MS in Data AnalyticsMS in Project ManagementMS in Information TechnologyMasters Degree in Data Analytics and VisualizationMasters Degree in Artificial IntelligenceMBS in Entrepreneurship and MarketingMSc in Data AnalyticsMS in Data AnalyticsMS in Computer ScienceMaster of Science in Business AnalyticsMaster of Business Administration MS in Data ScienceMS in Information TechnologyMaster of Business AdministrationMS in Applied Data ScienceMaster of Business Administration | STEMMS in Data AnalyticsMaster of Business AdministrationMS in Information Technology and Administrative Management MS in Computer Science Master of Business Administration Master of Business Administration-90 ECTSMSc International Business ManagementMS Data Science Master of Business Administration MSc Business Intelligence and Data ScienceMS Data Analytics MS in Management Information SystemsMSc International Business and ManagementMS Engineering ManagementMS in Machine Learning EngineeringMS in Engineering ManagementMSc Data EngineeringMSc Artificial Intelligence EngineeringMPS in InformaticsMPS in Applied Machine IntelligenceMS in Project ManagementMPS in AnalyticsMS in Project ManagementMS in Organizational LeadershipMPS in Analytics - NEU CanadaMBA with specializationMPS in Informatics - NEU Canada Master in Business AdministrationMS in Digital Marketing and MediaMSc Sustainable Tourism and Event ManagementMSc in Circular Economy and Sustainable InnovationMSc in Impact Finance and Fintech ManagementMS Computer ScienceMBA in Technology, Innovation and EntrepreneurshipMSc Data Science with Work PlacementMSc Global Business Management with Work Placement MBA with Work PlacementMS in Robotics and Autonomous SystemsMS in Civil EngineeringMS in Internet of ThingsMSc International Logistics and Supply Chain ManagementMBA- Business InformaticsMSc International ManagementMBA in Strategic Data Driven ManagementMSc Digital MarketingMBA Business and MarketingMSc in Sustainable Global Supply Chain ManagementMSc Digital Business Analytics MSc in International HospitalityMSc Luxury and Innovation ManagementMaster of Business Administration-International Business ManagementMS in Computer EngineeringMS in Industrial and Systems EngineeringMaster in ManagementMSc MarketingMSc Global Supply Chain ManagementMS in Information Systems and Technology with Business Intelligence and Analytics ConcentrationMSc Corporate FinanceMSc Data Analytics for BusinessMaster of Business AdministrationMaster of Business AdministrationMaster of Business AdministrationMSc in International FinanceMSc in International Management and Global LeadershipMaster of Business AdministrationBachelor of BusinessBachelor of Business AnalyticsBachelor of Information TechnologyMaster of Business AdministrationMBA Business AnalyticsMSc in Marketing Analytics and Data IntelligenceMS Biotechnology Management and EntrepreneurshipMSc in Luxury and Fashion ManagementMaster of Business Administration (90 ECTS)Bachelor of Business Administration (180 ECTS)B.Sc. Computer Science (180 ECTS) MSc in International Corporate Finance MSc in Sustainable Luxury and Creative IndustriesMSc Digital MarketingMSc Global Supply Chain Management (PGMP)MSc Marketing (PGMP)MSc Corporate Finance (PGMP)MSc Data Analytics for Business (PGMP)MS Business AnalyticsMaster of Business AdministrationMS Quantitative FinanceMS Fintech ManagementMS Business Analytics PGMPState University of New York Bachelors Program - STEM
For College Students

Analysis Approach: Deriving New Columns in Excel

$$/$$

In the previous segment, you learnt about the five simple patterns through which you can classify any insight into. Now you’ll learn the methods through which you can actually generate those insights.

$$/$$

As explained by Anand above, the five patterns are sufficient in categorising the insights. But, the dataset that you might have may not necessarily be in the format where you would be applying the patterns directly. Therefore, you also need to learn a few techniques to analyse your data so that you can apply the five patterns on them and start generating insights. These analyses can be categorised into two types: exploratory data analysis and hypothesis-driven analysis. Furthermore, these analysis techniques utilise 2 methods of data manipulation in order to extract insights - creating new columns and reducing the number of rows.

 

Let’s start with the first method through which you can analyse the data to extract insights -  by deriving new columns

$$/$$

The first method through which you can analyse the data is by deriving new columns. For example, if you have a dataset, where only the revenue and cost information is available, you can go ahead and create a separate column where you would be calculating the profit and applying the five patterns on them. As discussed in the video, some of the most common ways of adding new columns to the data are:

  • Metadata Lookup: The metadata is essentially an additional dataset or a data-sheet that is available with you that provides information on the original data. In the example mentioned by Anand, the population of the country was available in a separate sheet and you performed a VLOOKUP to create that column in the original dataset.
  • Calculations: You can perform a variety of calculations using the numeric columns in your dataset. For example, in the video above, you created a new column named Suicide Rate (%)  from the Suicides and the Population columns.
  • Binning: This process essentially bins a given numeric column to specific categories. In the above example, you converted the Suicide Rate into specific bins and categorised them as High, Medium or Low
  • Business-Specific Metrics: This part would be specific to your domain and hence the metrics or KPIs that you might be using would be a useful additional column.

These methods may not seem exhaustive as a variety of analysis procedures can be used to derive new columns. But they more or less encompass a standard way through which you should proceed once you have the data with you to check for new insights.

Another nifty way of creating new columns and the one which is heavily used right now is through the use of machine learning. Watch the following video to understand the different ways in which we can derive new information from a given dataset.

$$/$$

As explained above, the various machine learning techniques that you can use to create new columns are as follows.

  • Classification
  • Clustering
  • Time Series Analysis
  • Feature Extraction
  • Sentiment Analysis

(Note: You would be learning some of these techniques later in the course)

 

To summarise, there are broadly two techniques through which you can create new columns - By performing calculations and through models- either statistical or machine learning. The statistical models often take a sample of the original data and infer from it the behaviour of the entire population whereas in machine learning models you run algorithms on a set of predefined data called "train data" to formulate the model and then run it again  on another set of data called "test data" to test the model's accuracy and precision. If some of the terms in the previous sentence seem like some kind of jargon to you, then don't worry. You'll be learning these concepts in detail in the next two courses. For the time being, a cursory understanding of the difference is sufficient.

 

After deriving these new columns, you can go ahead and apply the five patterns that you learnt earlier to generate insights. In the next segment, you’ll learn about the other way of analysing the given data- that is through summarising the rows.