Data Mining Assignment | Homework for You
ISM 6136 Summer 2020
Assignment 1
Using the data set, describe the structure of the variables. Your analysis should include the key descriptive
statistics with your evaluation of them. Your evaluation should be primarily an evaluation of the key variables
not a laundry list of each variable’s characteristics. One of the key elements to consider is the suitability for
further analysis of that variable or if it can be used in combination with other variables to identify interesting
relationships.
Your description should be in memo form with appropriate appendixes (screen shots of the statistical
output). The memo should be no longer than 3 pages and should be organized with the most important
elements toward the beginning of the memo. The audience for your memo is a technically savvy senior vice
president.
There are 65,031 records in the Excel data file (Satisfaction Survey.xlsx) that contains data on flights between
1 Jan 14 to 31 Mar 14.
The fields are:
Col: Title Comments .
A Satisfaction, code 1-5 5 being best
B Airline Status: blue, gold, silver, platinum
C Age: in years
D Gender: Male, Female
E Year of First Flight 4 digit year
F No. of flights numeric
G Percent of Flight with other Airlines numeric
H Type of Travel Business travel, Personal Travel, Mileage ticket
I No. of other Loyalty Cards numeric
J Shopping Amount at Airport dollars
K Eating and Drinking at Airport dollars
L Class: Business Eco, Eco Plus,
M Day of Month: numeric field
N Flight Date: dd-mm-yy
O Airline Code 2 digit character
P Airline Name character
Q Origin City character
R Origin State character
S Destination City character
T Destination State character
U Scheduled Departure Hour 2 digit value
V Departure Delay in Minutes minutes
W Arrival Delay in Minutes minutes
X Flight Cancelled: No/Yes
Y Flight time in Minutes minutes
Z Flight Distance: in miles
AA Arrival Delay Greater 5 Minutes no/yes
Homework for You