πŸ“š MGM3165 Β· Business Analytics

Business Analytics Homework

πŸ“˜ Chapter 2 β€” Descriptive Statistics πŸ“Š Chapter 3 β€” Data Visualization πŸ‘€ FAN BEN Β· BC254574
πŸ“˜
Chapter 2
Descriptive Statistics
Frequency distributions, central tendency, dispersion, percentiles, box plots, and growth rates.
7 Questions
πŸ“Š
Chapter 3
Data Visualization
Line charts, column charts, stacked and clustered bar charts for business data.
4 Questions
Chapter 2 β€” Descriptive Statistics
Q10Cumulative Frequency Distribution
β–Ύ
Consider the following frequency distribution. Construct a cumulative frequency distribution.
Answer
ClassFrequencyCumulative Frequency
10–191010
20–291424
30–391741
40–49748
50–59250
Total5050
Cumulative frequency = running total of all frequencies. The final row always equals total sample size n = 50.
Q13Mean & Median
β–Ύ
a) Data: 10, 20, 12, 17, 16 β€” Compute mean and median.
b) Add value 12 β†’ Data: 10, 20, 12, 17, 16, 12 β€” Compute and compare.
Part a
xΜ„ = (10+20+12+17+16) / 5 = 75 / 5 = 15
Sorted: 10, 12, [16], 17, 20 β†’ Median = 16 (middle of n=5)
Mean
15
Median
16
Part b
xΜ„ = (10+20+12+17+16+12) / 6 = 87 / 6 = 14.5
Sorted: 10, 12, [12, 16], 17, 20 β†’ Median = (12+16)/2 = 14
Mean
14.5
Median
14
Adding 12 (below original mean of 15) pulls both values lower. Mean: 15β†’14.5 Β· Median: 16β†’14.
Q14Percentiles (20th, 25th, 65th, 75th)
β–Ύ
Data: 27, 25, 20, 15, 30, 34, 28, 25 β€” Compute the 20th, 25th, 65th, and 75th percentiles.
Method
  • Sort the data: 15, 20, 25, 25, 27, 28, 30, 34 (n = 8)
  • Compute location: L = (P / 100) Γ— n
  • Non-integer L β†’ round up, take that position's value
  • Integer L β†’ average of position L and L+1
Results
PercentileL = (P/100)Γ—8RuleAnswer
20th1.6Non-integer β†’ position 220
25th (Q1)2.0Integer β†’ avg(pos 2, 3)22.5
65th5.2Non-integer β†’ position 628
75th (Q3)6.0Integer β†’ avg(pos 6, 7)29
Q15Mean, Median & Mode
β–Ύ
Data: 53, 55, 70, 58, 64, 57, 53, 69, 57, 68, 53 β€” Compute mean, median, mode.
Answer
xΜ„ = 657 / 11 = 59.73
Sorted: 53,53,53,55,57,[57],58,64,68,69,70 β†’ Median = 57 (6th of 11)
Mode = 53 (appears 3 times β€” highest frequency)
Mean
59.73
Median
57
Mode
53
Q16Geometric Mean β€” Annual Growth Rate
β–Ύ
Asset declines from $5,000 to $3,500 over nine years. What is the mean annual growth rate?
Formula & Calculation
xΜ„g = (End Value / Start Value)^(1/n) βˆ’ 1
xΜ„g = (3500 / 5000)^(1/9) βˆ’ 1 = (0.7)^0.1111 βˆ’ 1 = βˆ’0.0374
Annual Growth Rate
βˆ’3.74%
The asset loses 3.74% per year on average. Geometric mean is used because ordinary averaging of percentage changes over multiple periods gives misleading results.
Q17Mutual Fund Comparison
β–Ύ
Stivers fund: $10,000 β†’ $18,000 over 8 years.
Trippi fund: $5,000 β†’ $10,600 over 8 years.
Which mutual fund performed better?
Geometric Mean Annual Return
Stivers: (18000/10000)^(1/8) βˆ’ 1 = (1.8)^0.125 βˆ’ 1 = 7.63%
Trippi: (10600/5000)^(1/8) βˆ’ 1 = (2.12)^0.125 βˆ’ 1 = 9.85%
Stivers
7.63%
Trippi
9.85%
Trippi performed better β€” 9.85% annual return vs Stivers at 7.63%. Despite a lower initial investment, Trippi achieved a higher compound growth rate over 8 years.
Q19Wait-Tracking System Analysis
β–Ύ
Compare patient wait times (minutes) for offices with vs. without a wait-tracking system.
a) Mean & median   b) Variance & std dev   c & d) Box plots   e) Conclusion
a & b β€” Summary Statistics
StatisticWith SystemWithout System
Data (sorted)9,11,12,12,13,14,15,18,31,3712,16,17,20,23,24,31,37,44,67
Mean (xΜ„)17.229.1
Median13.523.5
Variance (sΒ²)86.18275.66
Std Dev (s)9.2816.60
c & d β€” Box Plot Five-Number Summary
StatisticWith SystemWithout System
Min912
Q11217
Median13.523.5
Q31837
IQR620
Upper Fence2767
Outliers3137None
e) Yes β€” offices with a wait-tracking system have significantly shorter and more consistent wait times. Mean: 17.2 vs 29.1 min. Std Dev: 9.28 vs 16.60. The system effectively reduces and stabilizes patient wait times.
Q24Covariance & Correlation Coefficient
β–Ύ
x = {4, 6, 11, 3, 16}  |  y = {50, 50, 40, 60, 30}
a) Scatter diagram   b) Relationship   c) Covariance   d) Correlation coefficient
a β€” Scatter Diagram
b β€” Relationship
The scatter diagram shows a negative relationship β€” as x increases, y tends to decrease.
c β€” Sample Covariance
xΜ„ = 8 | Θ³ = 46
xα΅’yα΅’xα΅’ βˆ’ xΜ„yα΅’ βˆ’ Θ³(xα΅’βˆ’xΜ„)(yα΅’βˆ’Θ³)
450βˆ’44βˆ’16
650βˆ’24βˆ’8
11403βˆ’6βˆ’18
360βˆ’514βˆ’70
16308βˆ’16βˆ’128
Ξ£βˆ’240
s_xy = βˆ’240 / (5βˆ’1) = βˆ’60
d β€” Correlation Coefficient
sβ‚“ = √(118/4) = 5.43 | s_y = √(520/4) = 11.40
r_xy = βˆ’60 / (5.43 Γ— 11.40) β‰ˆ βˆ’0.97
Covariance
βˆ’60
Correlation r
βˆ’0.97
r = βˆ’0.97 indicates a very strong negative linear relationship. Values close to βˆ’1 mean near-perfect inverse correlation between x and y.
Chapter 3 β€” Data Visualization
Q11Vehicle Production β€” Line & Clustered Bar
β–Ύ
OICA: Toyota, GM, Volkswagen, Hyundai vehicle production (millions) over 5 years.
a) Line chart   b) Discuss trends   c) Clustered bar β€” leading manufacturer each year?
a β€” Line Chart
b β€” Discussion
GM led in Years 1–2 but declined sharply in Year 4. Toyota showed consistent performance and led from Year 3 onward. Hyundai showed the strongest growth β€” nearly doubling from Year 1 to Year 5. The Year 4 dip likely reflects a major global market disruption.
c β€” Clustered Bar Chart
Leading manufacturer by year: Year 1: GM (8.97M) Β· Year 2: GM (9.35M) Β· Year 3: Toyota (9.24M) Β· Year 4: Toyota (7.23M) Β· Year 5: Toyota (8.56M)
Q13Insurance Sales β€” Column Chart
β–Ύ
Top 6 salespeople: Harish(24), David(41), Kristina(19), Steven(23), Tim(53), Mona(39).
a) Column chart   b) Sort mostβ†’fewest   c) Add data labels
a, b & c β€” Column Chart (sorted, with labels)
Tim leads with 53 contracts. Order: Tim(53) β†’ David(41) β†’ Mona(39) β†’ Harish(24) β†’ Steven(23) β†’ Kristina(19).
Q16Smartphone Ownership by Age
β–Ύ
Survey: smartphone ownership % by age group (18–24 to 65+).
a) Stacked column   b) Clustered column   c) Which is better and why?
a β€” Stacked Column Chart
b β€” Clustered Column Chart
c) As age increases, smartphone ownership drops sharply while "No Cell Phone" rises. The clustered chart is better for comparing individual categories across age groups. The stacked chart shows totals well but makes segment comparisons harder. Clustered is preferred here.
Q17Store Manager Time Allocation
β–Ύ
Logan Outdoor Equipment Co. β€” 6 locations, 4 task categories (% of time).
a) Stacked bar   b) Clustered bar   c) Multiple bar   d) Which is preferable?   e) Inferences
a β€” Stacked Bar Chart
b β€” Clustered Bar Chart
d β€” Preferable form?
The clustered bar chart is preferable β€” it allows direct comparison of each task type across all six locations simultaneously.
e β€” Inferences
Customer interaction dominates at Boise (64%) and Olympia (54%). Portland spends the most time in required meetings (52%). Missoula has the highest idle time (30%), suggesting potential inefficiency. Time allocation varies significantly across locations, indicating inconsistent management practices.