What is Dummy Regression?

09/12/2016

What is a Dummy variable?
A dummy variable, or indicator variable, is an artificial variable created to represent an attribute with two or more distinct categories (levels). In other words, dummy variables are the binary (0/1) variables created from a categorical variable that has multiple levels.

Why is it used?

Regression analysis treats all independent (X) variables in the analysis as numerical. Numerical variables are interval or ratio scale variables whose values are directly comparable, e.g. ‘10 is twice as much as 5’, or ‘3 minus 1 equals 2’. Often, however, you might want to include an attribute or nominal scale variable such as ‘Product Brand’ or ‘Type of Defect’ in your study. Say you have three types of defects, numbered ‘1’, ‘2’ and ‘3’. In this case, ‘3 minus 1’ doesn’t mean anything: you can’t subtract defect 1 from defect 3. The numbers here only identify the levels of ‘Defect Type’ and have no intrinsic meaning of their own. Dummy variables are created in this situation to ‘trick’ the regression algorithm into correctly analysing attribute variables.
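To make the problem concrete, here is a minimal sketch in Python. The data are made up for illustration, and the helper name fit_line is hypothetical. It fits a straight line to the defect codes 1, 2 and 3 as if they were real numbers, then compares the line's predictions with the average cost actually observed for each defect type.

```python
def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    slope = sxy / sxx
    return y_mean - slope * x_mean, slope


# Made-up data: defect type stored as the codes 1, 2, 3, plus a repair cost.
defect_code = [1, 1, 2, 2, 3, 3]
repair_cost = [10.0, 12.0, 30.0, 32.0, 14.0, 16.0]

intercept, slope = fit_line(defect_code, repair_cost)

# What the straight line predicts for each code ...
predicted = {code: intercept + slope * code for code in (1, 2, 3)}

# ... versus the average cost actually observed for each defect type.
observed = {
    code: sum(c for d, c in zip(defect_code, repair_cost) if d == code)
    / defect_code.count(code)
    for code in (1, 2, 3)
}

print(predicted)  # {1: 17.0, 2: 19.0, 3: 21.0}
print(observed)   # {1: 11.0, 2: 31.0, 3: 15.0}
```

The fitted line assumes that moving from defect 1 to 2 to 3 changes cost by the same amount each time, so it misses the fact that defect 2 is by far the most expensive in this toy data set.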

Things to keep in mind about dummy variables:

Dummy variables assign the numbers ‘0’ and ‘1’ to indicate membership in any mutually exclusive and exhaustive category.

The number of dummy variables necessary to represent a single attribute variable is equal to the number of levels (categories) in that variable minus one.
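For a given attribute variable, none of the dummy variables constructed can be redundant. That is, no dummy variable may be a constant multiple or a linear combination of the others. This is why one level is left uncoded: if every level had its own dummy, the dummies would always sum to 1 and duplicate the regression intercept.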
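The interaction of two two-level attribute variables (e.g. Gender and Marital Status) is represented by a third dummy variable which is simply the product of the two individual dummy variables. When an attribute has more than two levels, one product is needed for each pair of dummies.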
The decision as to which level is not coded is often arbitrary. The level which is not coded is the reference category to which all other categories will be compared. For that reason, the largest group is often chosen as the uncoded category.
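The points above can be sketched in plain Python. Everything here is an illustration under stated assumptions: the data are made up, and dummy_code and ols are hypothetical helper names, not part of any library. The sketch codes a three-level attribute with two dummies, fits a regression with an intercept, and builds an interaction dummy as a product.

```python
def dummy_code(values, reference):
    """Return {level: 0/1 column} for every level except the reference."""
    levels = sorted(set(values))
    if reference not in levels:
        raise ValueError("reference must be one of the observed levels")
    return {
        level: [1 if v == level else 0 for v in values]
        for level in levels
        if level != reference
    }


def ols(columns, y):
    """Least-squares coefficients from the normal equations (X'X)b = X'y."""
    n, k = len(y), len(columns)
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(columns[i][r] * columns[j][r] for r in range(n)) for j in range(k)]
        + [sum(columns[i][r] * y[r] for r in range(n))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("columns are linearly dependent (redundant dummy?)")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [x - factor * p for x, p in zip(a[r], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]


# Made-up data: three defect types and a repair cost for each observation.
defect = ["scratch", "dent", "crack", "scratch", "dent", "crack"]
cost = [10.0, 30.0, 14.0, 12.0, 32.0, 16.0]

# Three levels -> two dummies; "scratch" is the uncoded reference level.
dummies = dummy_code(defect, reference="scratch")
intercept = [1] * len(defect)
coef = ols([intercept, dummies["crack"], dummies["dent"]], cost)
# coef[0] is about 11: the mean cost of the reference level "scratch".
# coef[1] is about 4 and coef[2] about 20: how far the "crack" and "dent"
# means (15 and 31) sit above the reference mean.

# Interaction of two two-level attributes: the product of their dummies.
female = [1, 0, 1, 0]
married = [1, 1, 0, 0]
female_x_married = [f * m for f, m in zip(female, married)]  # 1 only if both
```

Each coefficient on a dummy is read as a difference from the uncoded level, which is why the choice of reference category changes the coefficients but not the fitted group means.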
