Mathematics used in various Machine learning concepts

19/05/2020

Mathematics is the building block of data science. This blog focuses on the mathematical concepts that are used in machine learning. These concepts fall into three categories: statistics, probability, and differential calculus. Let’s discuss them one by one.

1.Statistics

In mathematical terms, statistics is the discipline of collecting, summarizing, and interpreting data. In machine learning, statistics plays a very important role in understanding the data in a dataset. Statistical analyses help us understand the distribution of the data and summarize it.

1.1.Exploratory data analysis

EDA, or exploratory data analysis, is one of the critical steps in data science. It helps us find patterns, errors, and outliers in the data. Statistics is the backbone of this step, which relies on measures such as the mean, median, variance, and standard deviation.

As a general rule, we treat data points that lie more than three standard deviations from the mean as outliers. We understand the data distribution by plotting a histogram, which shows whether the data is spread symmetrically around the mean or skewed towards one side.
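The three-standard-deviation rule can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the function name `find_outliers`, the `threshold` parameter, and the sample `readings` are made up for this example, and the population standard deviation is one possible choice among several.

```python
import statistics

def find_outliers(values, threshold=3.0):
    """Return the values lying more than `threshold` standard deviations from the mean."""
    mean = statistics.mean(values)
    stdev = statistics.pstdev(values)  # population standard deviation (an assumption)
    if stdev == 0:
        return []  # all values are identical, so nothing can be an outlier
    return [v for v in values if abs(v - mean) > threshold * stdev]

# Hypothetical sample: twenty typical readings plus one extreme value.
readings = [10, 11, 9, 10, 12, 10, 11, 9, 10, 11,
            10, 9, 11, 10, 12, 10, 9, 11, 10, 10, 100]

outliers = find_outliers(readings)
mean_value = statistics.mean(readings)
median_value = statistics.median(readings)

print("outliers:", outliers)
print("mean:", mean_value, "median:", median_value)
```

In this sample the extreme value pulls the mean well above the median, which is the kind of one-sided skew the plot would reveal.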

2.Probability

Probability is the branch of mathematics concerned with numerically describing how likely an event is to occur. Probability theory is very useful in making predictions. Estimation and prediction constitute an important part of data science, and thus many of its concepts rely on probability theory.

2.1.Classification algorithms

Most classification problems in data science involve predicting classes, where we assign each observation to exactly one class. The basic idea behind classification is probability. The model, fitted on the training data, calculates a probability for every class, and the class with the highest probability is assigned to the observation.
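As a rough sketch of this idea, the snippet below turns raw model scores into class probabilities and picks the most probable class. The softmax function is one common way to obtain such probabilities and is an assumption here, not something the post prescribes; the class names and scores are hypothetical.

```python
import math

def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]  # subtract the max for numerical stability
    total = sum(exps)
    return [e / total for e in exps]

def predict_class(scores, class_names):
    """Return the class with the highest probability, plus all probabilities."""
    probs = softmax(scores)
    best_index = max(range(len(probs)), key=lambda i: probs[i])
    return class_names[best_index], probs

# Hypothetical classes and raw scores for a single observation.
class_names = ["cat", "dog", "bird"]
scores = [2.0, 1.0, 0.1]

label, probs = predict_class(scores, class_names)
print(label, [round(p, 3) for p in probs])
```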

2.2.Loss function

One of the loss functions used for classification problems is the cross-entropy loss, which measures the performance of a classification model whose output is a probability. Cross-entropy loss increases as the predicted probability diverges from the actual label. It is one of the most important calculations in machine learning for classification.
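To make this behaviour concrete, here is a minimal sketch of cross-entropy for a single observation with one true class. The function name, the `eps` clipping value, and the example probabilities are illustrative assumptions.

```python
import math

def cross_entropy(true_index, predicted_probs, eps=1e-12):
    """Cross-entropy loss for one observation whose true class is `true_index`."""
    # Clip the probability so that log(0) can never occur.
    p = min(max(predicted_probs[true_index], eps), 1.0)
    return -math.log(p)

# The true class is index 0 in all three hypothetical predictions.
confident_right = cross_entropy(0, [0.90, 0.05, 0.05])
unsure = cross_entropy(0, [0.40, 0.30, 0.30])
confident_wrong = cross_entropy(0, [0.05, 0.90, 0.05])

print(confident_right, unsure, confident_wrong)
```

The loss grows from the first prediction to the last, as the probability assigned to the true class shrinks.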

3.Differential calculus

Data science is incomplete without differential calculus. Differentiation forms an intrinsic part of data science, especially in machine learning. Differential calculus is the study of the rates at which quantities change.

3.1.Gradient Descent

In machine learning, our goal is to minimize the cost of the model on our training data. We use a cost function, which measures the error in the predictions of the model. The main goal of gradient descent is to reach the lowest possible value of the cost function, which in turn improves accuracy. Gradient descent uses differentiation: the partial derivatives of the cost function with respect to each parameter are calculated, and together they form the gradient, which points in the direction of steepest increase. Moving the parameters in the opposite direction reduces the cost and leads towards a minimum, which is the global minimum when the cost function is convex. The size of each step is controlled by the learning rate.
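The procedure can be sketched for the simplest case, fitting a straight line y = w*x + b. Mean squared error is assumed as the cost function, and the data, learning rate, and step count are hypothetical values chosen only so the example converges.

```python
def mean_squared_error(w, b, xs, ys):
    """Cost function: average squared difference between predictions and targets."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def gradient_descent(xs, ys, learning_rate=0.05, steps=2000):
    """Fit y = w*x + b by repeatedly stepping against the gradient of the cost."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Partial derivatives of the mean squared error with respect to w and b.
        grad_w = (2.0 / n) * sum(e * x for e, x in zip(errors, xs))
        grad_b = (2.0 / n) * sum(errors)
        # Move against the gradient; the learning rate sets the step size.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

# Hypothetical data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

initial_cost = mean_squared_error(0.0, 0.0, xs, ys)
w, b = gradient_descent(xs, ys)
final_cost = mean_squared_error(w, b, xs, ys)

print(round(w, 4), round(b, 4), initial_cost, final_cost)
```

A learning rate that is too large makes the updates overshoot and diverge, while one that is too small makes convergence slow, so in practice it is usually tuned.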

The same concept is applied to deep learning models, where a gradient-descent-based optimizer uses the partial derivatives of the loss with respect to each weight to adjust the weights towards their optimal values.
