J. Niessing, INSEAD
T. Evgeniou, INSEAD
A boating company had become a victim of the crisis in the boating industry. The business problem of the “Boat” case study, although hypothetical, depicts very well the sort of business problems faced by many real companies in an increasingly data-intensive business environment. The management team was now exploring various growth options. Expanding further in some markets, in particular North America, was no longer something to consider for the distant future. It was becoming an immediate necessity.
The team believed that in order to develop a strategy for North America, they needed a better understanding of their current and potential customers in that market. They believed that they had to build more targeted boats for their most important segments there. To that purpose, the boating company had been commissioned a segmentation project for that market. Being a data-friendly company, the decision was made to develop a segmentation in a data-driven way - none of that soft hold-hands-and-discuss stuff.
With the aid of a market research firm, the boating company gathered various data about the boating market in the US through interviews with almost 3,000 boat owners and intenders. The data consisted, among others, 29 attitudes towards boating, which respondents indicated on a 5-point scale (Q. 1, 1-29 shown in the survey in the appendix).
Other types of information had been collected, such as demographics (Q. 11-15 in the survey). Finally, the boating company had compiled information about the boats, such as the length of the boat they owned, how they used their boats, and the price of the boats (Q. 2-10 in the survey). A segmentation and profile of the market would be done using all this data.
Based on past experience, the decision was made to develop a segmentation that was mostly driven by attitudes that consumer held regarding boating - the attitudinal data from the 29 questions in Q.1.
As a first step, the company decided to simplify the data by grouping the 29 attitudinal statements in Q1 into a few meaningful sets/factors. This could be done using a standard method: factor analysis. The objective is to summarize information (the variation) in correlated raw attributes to a smaller number of manageable factors - which are typically uncorrelated or independent. In the process one decreases the number of raw attributes while keeping most of the information in the data in a statistical sense. This data reduction technique is a very useful step in helping interpreting the data and make better decisions. It also helps to identify the key variables that should be used in the segmentation procedure.
We first have to read the whole dataset, which is in CSV format.
For the Factor Analysis, we will only use the 29 items of the data that describe the attitudes towards boating of people in the sample. Such 29 items are measured on a Likert scale of 1-5. All the 29 items are integer variables that take values between 1 (=strongly disagree) and 5 (=strongly agree).
Every factor analysis has three basic decision points:
Decide the number of factors,
Choose an extraction method,
Choose a rotation method.
Sometimes it makes sense to run a quick correlation matrix before the factor analysis. It usually gives a first glance at the inter-relationships between variables and it is often an input for factor analysis (i.e. eigenvalue decomposition of the correlation matrix). In this example the correlaton matrix for the 29 attitudes looks like this:
Q1_1_When buying a boat I do a lot of shopping around and visit multiple dealers | Q1_2_When buying a boat getting the lowest price is more important than the boat brand | Q1_3_The brand of boat I buy says a lot about who I am | Q1_4_I only consider buying a boat from a reputable brand | Q1_5_I am willing to pay a premium for a brand with a reputation for high quality | Q1_6_Owning a boat is a way of rewarding myself for my hard work | Q1_7_Owning a boat gives me a sense of achievement | Q1_8_When buying a boat functionality is more important than style | Q1_9_I see my boat as a status symbol | Q1_10_When buying a boat I rely more on expert opinion than my own e g consumer reports salespeople | Q1_11_I tend to perform minor boat repairs and maintenance on my own | Q1_12_When it comes to boating I tend to prefer a basic boat with little to no frills | Q1_13_When buying a boat I tend to buy the latest and greatest | Q1_14_When buying accessories for my boat I tend to buy the latest and greatest | Q1_15_I am serious about the technology on my boat | Q1_16_People tend to come to me for advice about boating | Q1_17_I consider myself more knowledgeable about boating than the rest of my boating peers | Q1_18_Boating gives me a feeling of adventure | Q1_19_Boating allows me to experience nature | Q1_20_When not on my boat I often find myself doing boating related activities | Q1_21_Boating helps me escape from everyday life and relax | Q1_22_Boating helps me stay active | Q1_23_Boating allows me to excel in the sports that I am passionate about e g fishing racing water sports | Q1_24_Boating gives me an outlet to socialize with family and or friends | Q1_25_Being in charge of my boat is very important to me | Q1_26_Having a powerful boat is what is most important to me | Q1_27_Boating is the number one thing I do with my spare time | Q1_28_Boating is my true passion in life | Q1_29_Boating is one of many things I do in my spare time | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Q1_1_When buying a boat I do a lot of shopping around and visit multiple dealers | 1.00 | 0.008983 | 0.1075 | 0.1983 | 0.1835 | 0.2678 | 0.1846 | 0.08861 | 0.0778 | 0.1071 | 0.1449 | -0.05305 | 0.1225 | 0.1785 | 0.2611 | 0.1572 | 0.1478 | 0.2533 | 0.274 | 0.1941 | 0.2353 | 0.2315 | 0.1876 | 0.2091 | 0.2271 | 0.1038 | 0.1268 | 0.176 | 0.2016 |
Q1_2_When buying a boat getting the lowest price is more important than the boat brand | 0.008983 | 1.00 | -0.02956 | -0.2101 | -0.2058 | -0.03727 | 0.02025 | 0.1967 | 0.08722 | 0.1583 | 0.04285 | 0.3702 | 0.006216 | -0.03048 | -0.07769 | -0.01528 | 0.03892 | -0.03771 | -0.03631 | 0.05335 | -0.09945 | -0.07645 | 0.002847 | -0.0755 | 0.008886 | 0.0738 | 0.04546 | 0.01602 | -0.02895 |
Q1_3_The brand of boat I buy says a lot about who I am | 0.1075 | -0.02956 | 1.00 | 0.2636 | 0.3997 | 0.3401 | 0.4436 | -0.05102 | 0.5789 | 0.1356 | 0.09649 | -0.08893 | 0.4845 | 0.4615 | 0.3762 | 0.3872 | 0.379 | 0.2379 | 0.1414 | 0.3856 | 0.182 | 0.2814 | 0.3396 | 0.2277 | 0.358 | 0.471 | 0.4008 | 0.4253 | 0.1659 |
Q1_4_I only consider buying a boat from a reputable brand | 0.1983 | -0.2101 | 0.2636 | 1.00 | 0.3667 | 0.2015 | 0.1753 | 0.003764 | 0.1676 | 0.1001 | 0.0588 | -0.1598 | 0.2727 | 0.2941 | 0.2961 | 0.1821 | 0.1749 | 0.1845 | 0.1855 | 0.1833 | 0.183 | 0.2335 | 0.1642 | 0.2291 | 0.218 | 0.1923 | 0.165 | 0.2124 | 0.1865 |
Q1_5_I am willing to pay a premium for a brand with a reputation for high quality | 0.1835 | -0.2058 | 0.3997 | 0.3667 | 1.00 | 0.2929 | 0.2854 | -0.03322 | 0.3287 | 0.1357 | 0.06923 | -0.1725 | 0.4504 | 0.4599 | 0.4176 | 0.3553 | 0.3184 | 0.2342 | 0.1842 | 0.3229 | 0.1873 | 0.2737 | 0.2934 | 0.2492 | 0.2903 | 0.3372 | 0.294 | 0.325 | 0.1778 |
Q1_6_Owning a boat is a way of rewarding myself for my hard work | 0.2678 | -0.03727 | 0.3401 | 0.2015 | 0.2929 | 1.00 | 0.546 | 0.04036 | 0.3489 | 0.1245 | 0.1461 | -0.1154 | 0.2857 | 0.3138 | 0.3143 | 0.2718 | 0.2362 | 0.4375 | 0.3555 | 0.354 | 0.4242 | 0.4088 | 0.3179 | 0.367 | 0.42 | 0.3074 | 0.3447 | 0.3924 | 0.2715 |
Q1_7_Owning a boat gives me a sense of achievement | 0.1846 | 0.02025 | 0.4436 | 0.1753 | 0.2854 | 0.546 | 1.00 | -0.005651 | 0.4908 | 0.1217 | 0.1152 | -0.1091 | 0.3489 | 0.3619 | 0.3449 | 0.3091 | 0.2852 | 0.4009 | 0.2815 | 0.3633 | 0.3274 | 0.3902 | 0.3044 | 0.3328 | 0.4194 | 0.3865 | 0.3706 | 0.3994 | 0.2406 |
Q1_8_When buying a boat functionality is more important than style | 0.08861 | 0.1967 | -0.05102 | 0.003764 | -0.03322 | 0.04036 | -0.005651 | 1.00 | -0.09188 | 0.08512 | 0.1426 | 0.2354 | -0.04788 | -0.02173 | 0.06132 | 0.02284 | 0.04813 | 0.07381 | 0.09498 | 0.04331 | 0.06314 | 0.05412 | 0.09931 | 0.0186 | 0.09906 | -0.04454 | 0.0292 | 0.04579 | 0.09732 |
Q1_9_I see my boat as a status symbol | 0.0778 | 0.08722 | 0.5789 | 0.1676 | 0.3287 | 0.3489 | 0.4908 | -0.09188 | 1.00 | 0.143 | 0.05987 | -0.04386 | 0.4801 | 0.4267 | 0.3327 | 0.3892 | 0.3656 | 0.218 | 0.0744 | 0.3659 | 0.1435 | 0.2325 | 0.2887 | 0.2304 | 0.3191 | 0.5027 | 0.3997 | 0.4025 | 0.1147 |
Q1_10_When buying a boat I rely more on expert opinion than my own e g consumer reports salespeople | 0.1071 | 0.1583 | 0.1356 | 0.1001 | 0.1357 | 0.1245 | 0.1217 | 0.08512 | 0.143 | 1.00 | -0.08855 | 0.1223 | 0.1614 | 0.1051 | 0.1111 | -0.03012 | -0.02844 | 0.1406 | 0.09459 | 0.09628 | 0.07687 | 0.09113 | 0.06955 | 0.1312 | 0.07601 | 0.1348 | 0.07711 | 0.06922 | 0.05107 |
Q1_11_I tend to perform minor boat repairs and maintenance on my own | 0.1449 | 0.04285 | 0.09649 | 0.0588 | 0.06923 | 0.1461 | 0.1152 | 0.1426 | 0.05987 | -0.08855 | 1.00 | 0.08748 | 0.07991 | 0.1345 | 0.205 | 0.3192 | 0.308 | 0.1119 | 0.1233 | 0.25 | 0.1291 | 0.1701 | 0.1921 | 0.0773 | 0.2489 | 0.09386 | 0.157 | 0.1829 | 0.1659 |
Q1_12_When it comes to boating I tend to prefer a basic boat with little to no frills | -0.05305 | 0.3702 | -0.08893 | -0.1598 | -0.1725 | -0.1154 | -0.1091 | 0.2354 | -0.04386 | 0.1223 | 0.08748 | 1.00 | -0.1131 | -0.1652 | -0.1677 | -0.02316 | 0.01641 | -0.1194 | -0.09364 | 0.008689 | -0.1682 | -0.1144 | -0.03205 | -0.1693 | -0.0547 | -0.06386 | 0.002133 | -0.009516 | -0.03549 |
Q1_13_When buying a boat I tend to buy the latest and greatest | 0.1225 | 0.006216 | 0.4845 | 0.2727 | 0.4504 | 0.2857 | 0.3489 | -0.04788 | 0.4801 | 0.1614 | 0.07991 | -0.1131 | 1.00 | 0.6382 | 0.4577 | 0.4271 | 0.4259 | 0.2035 | 0.1111 | 0.3868 | 0.1354 | 0.2301 | 0.3187 | 0.1963 | 0.3234 | 0.481 | 0.3981 | 0.4041 | 0.1903 |
Q1_14_When buying accessories for my boat I tend to buy the latest and greatest | 0.1785 | -0.03048 | 0.4615 | 0.2941 | 0.4599 | 0.3138 | 0.3619 | -0.02173 | 0.4267 | 0.1051 | 0.1345 | -0.1652 | 0.6382 | 1.00 | 0.4999 | 0.4272 | 0.3976 | 0.2493 | 0.1773 | 0.4065 | 0.2082 | 0.2852 | 0.3565 | 0.209 | 0.3533 | 0.4649 | 0.3934 | 0.4049 | 0.2109 |
Q1_15_I am serious about the technology on my boat | 0.2611 | -0.07769 | 0.3762 | 0.2961 | 0.4176 | 0.3143 | 0.3449 | 0.06132 | 0.3327 | 0.1111 | 0.205 | -0.1677 | 0.4577 | 0.4999 | 1.00 | 0.4117 | 0.3943 | 0.3164 | 0.2634 | 0.407 | 0.21 | 0.3264 | 0.353 | 0.2655 | 0.428 | 0.3732 | 0.3504 | 0.3784 | 0.2449 |
Q1_16_People tend to come to me for advice about boating | 0.1572 | -0.01528 | 0.3872 | 0.1821 | 0.3553 | 0.2718 | 0.3091 | 0.02284 | 0.3892 | -0.03012 | 0.3192 | -0.02316 | 0.4271 | 0.4272 | 0.4117 | 1.00 | 0.6314 | 0.2029 | 0.138 | 0.5216 | 0.1584 | 0.30 | 0.3996 | 0.1941 | 0.3858 | 0.402 | 0.4758 | 0.4964 | 0.202 |
Q1_17_I consider myself more knowledgeable about boating than the rest of my boating peers | 0.1478 | 0.03892 | 0.379 | 0.1749 | 0.3184 | 0.2362 | 0.2852 | 0.04813 | 0.3656 | -0.02844 | 0.308 | 0.01641 | 0.4259 | 0.3976 | 0.3943 | 0.6314 | 1.00 | 0.1709 | 0.1219 | 0.4456 | 0.1255 | 0.2584 | 0.3631 | 0.1486 | 0.3624 | 0.3981 | 0.4379 | 0.4625 | 0.2094 |
Q1_18_Boating gives me a feeling of adventure | 0.2533 | -0.03771 | 0.2379 | 0.1845 | 0.2342 | 0.4375 | 0.4009 | 0.07381 | 0.218 | 0.1406 | 0.1119 | -0.1194 | 0.2035 | 0.2493 | 0.3164 | 0.2029 | 0.1709 | 1.00 | 0.4873 | 0.2811 | 0.4728 | 0.437 | 0.2903 | 0.4227 | 0.3722 | 0.2386 | 0.2467 | 0.3128 | 0.2992 |
Q1_19_Boating allows me to experience nature | 0.274 | -0.03631 | 0.1414 | 0.1855 | 0.1842 | 0.3555 | 0.2815 | 0.09498 | 0.0744 | 0.09459 | 0.1233 | -0.09364 | 0.1111 | 0.1773 | 0.2634 | 0.138 | 0.1219 | 0.4873 | 1.00 | 0.213 | 0.44 | 0.38 | 0.2436 | 0.3658 | 0.3239 | 0.1394 | 0.1756 | 0.232 | 0.2788 |
Q1_20_When not on my boat I often find myself doing boating related activities | 0.1941 | 0.05335 | 0.3856 | 0.1833 | 0.3229 | 0.354 | 0.3633 | 0.04331 | 0.3659 | 0.09628 | 0.25 | 0.008689 | 0.3868 | 0.4065 | 0.407 | 0.5216 | 0.4456 | 0.2811 | 0.213 | 1.00 | 0.2258 | 0.3346 | 0.4045 | 0.2375 | 0.4056 | 0.4034 | 0.5033 | 0.5203 | 0.2473 |
Q1_21_Boating helps me escape from everyday life and relax | 0.2353 | -0.09945 | 0.182 | 0.183 | 0.1873 | 0.4242 | 0.3274 | 0.06314 | 0.1435 | 0.07687 | 0.1291 | -0.1682 | 0.1354 | 0.2082 | 0.21 | 0.1584 | 0.1255 | 0.4728 | 0.44 | 0.2258 | 1.00 | 0.4212 | 0.2391 | 0.4242 | 0.3022 | 0.1478 | 0.2356 | 0.2604 | 0.2943 |
Q1_22_Boating helps me stay active | 0.2315 | -0.07645 | 0.2814 | 0.2335 | 0.2737 | 0.4088 | 0.3902 | 0.05412 | 0.2325 | 0.09113 | 0.1701 | -0.1144 | 0.2301 | 0.2852 | 0.3264 | 0.30 | 0.2584 | 0.437 | 0.38 | 0.3346 | 0.4212 | 1.00 | 0.3368 | 0.3828 | 0.3662 | 0.232 | 0.3458 | 0.3756 | 0.3412 |
Q1_23_Boating allows me to excel in the sports that I am passionate about e g fishing racing water sports | 0.1876 | 0.002847 | 0.3396 | 0.1642 | 0.2934 | 0.3179 | 0.3044 | 0.09931 | 0.2887 | 0.06955 | 0.1921 | -0.03205 | 0.3187 | 0.3565 | 0.353 | 0.3996 | 0.3631 | 0.2903 | 0.2436 | 0.4045 | 0.2391 | 0.3368 | 1.00 | 0.2303 | 0.3238 | 0.331 | 0.3904 | 0.4386 | 0.2344 |
Q1_24_Boating gives me an outlet to socialize with family and or friends | 0.2091 | -0.0755 | 0.2277 | 0.2291 | 0.2492 | 0.367 | 0.3328 | 0.0186 | 0.2304 | 0.1312 | 0.0773 | -0.1693 | 0.1963 | 0.209 | 0.2655 | 0.1941 | 0.1486 | 0.4227 | 0.3658 | 0.2375 | 0.4242 | 0.3828 | 0.2303 | 1.00 | 0.3058 | 0.2051 | 0.2442 | 0.2547 | 0.2713 |
Q1_25_Being in charge of my boat is very important to me | 0.2271 | 0.008886 | 0.358 | 0.218 | 0.2903 | 0.42 | 0.4194 | 0.09906 | 0.3191 | 0.07601 | 0.2489 | -0.0547 | 0.3234 | 0.3533 | 0.428 | 0.3858 | 0.3624 | 0.3722 | 0.3239 | 0.4056 | 0.3022 | 0.3662 | 0.3238 | 0.3058 | 1.00 | 0.3364 | 0.3532 | 0.4013 | 0.2334 |
Q1_26_Having a powerful boat is what is most important to me | 0.1038 | 0.0738 | 0.471 | 0.1923 | 0.3372 | 0.3074 | 0.3865 | -0.04454 | 0.5027 | 0.1348 | 0.09386 | -0.06386 | 0.481 | 0.4649 | 0.3732 | 0.402 | 0.3981 | 0.2386 | 0.1394 | 0.4034 | 0.1478 | 0.232 | 0.331 | 0.2051 | 0.3364 | 1.00 | 0.4504 | 0.4729 | 0.1499 |
Q1_27_Boating is the number one thing I do with my spare time | 0.1268 | 0.04546 | 0.4008 | 0.165 | 0.294 | 0.3447 | 0.3706 | 0.0292 | 0.3997 | 0.07711 | 0.157 | 0.002133 | 0.3981 | 0.3934 | 0.3504 | 0.4758 | 0.4379 | 0.2467 | 0.1756 | 0.5033 | 0.2356 | 0.3458 | 0.3904 | 0.2442 | 0.3532 | 0.4504 | 1.00 | 0.6153 | 0.2336 |
Q1_28_Boating is my true passion in life | 0.176 | 0.01602 | 0.4253 | 0.2124 | 0.325 | 0.3924 | 0.3994 | 0.04579 | 0.4025 | 0.06922 | 0.1829 | -0.009516 | 0.4041 | 0.4049 | 0.3784 | 0.4964 | 0.4625 | 0.3128 | 0.232 | 0.5203 | 0.2604 | 0.3756 | 0.4386 | 0.2547 | 0.4013 | 0.4729 | 0.6153 | 1.00 | 0.2608 |
Q1_29_Boating is one of many things I do in my spare time | 0.2016 | -0.02895 | 0.1659 | 0.1865 | 0.1778 | 0.2715 | 0.2406 | 0.09732 | 0.1147 | 0.05107 | 0.1659 | -0.03549 | 0.1903 | 0.2109 | 0.2449 | 0.202 | 0.2094 | 0.2992 | 0.2788 | 0.2473 | 0.2943 | 0.3412 | 0.2344 | 0.2713 | 0.2334 | 0.1499 | 0.2336 | 0.2608 | 1.00 |
There are three common ways to select the optimal number of factors from a statistical perspective:
All factors with eigenvalue larger than 1
Based on the cumulative variance we would like to explain (e.g. at least 50% for example)
Pick the number of factors corresponding to the “elbow” of the scree plot
The first two approaches to select the number of factors/derived variables can be seen in the following table:
In this data 5 factors have an eigenvalue > 1. But the team didn't wanted to select the number based on statistical criteria only. They looked at different factor solutions (varying from 4 to 14 factors in total) and realized that they would loose too much information if they would only go with 5 factors. Looking also at the explained variance and based on business sense the team decided to use 10 factors. As shown in the table above these 10 factors explain 66.61 % of the variance. If they would stick to the 5 factors with an eigenvalue over 1 they would only explain a little bit more than 50% of the variance.
A common basic visualization when running the factor analysis is the Screeplot, which depicts the eigenvalues of the factors and thus allows gaining a quick overview on how many factors should be retained:
Based on this screeplot you could say that the selected number of factors should be between 3 and 6 factors but as mentioned above the team decided to use 10 factors based on the explained variance because the explained variance was the key selection criterion in this case.
The extraction method will produce factor loadings for every item on every extracted factor. Researchers hope their results will show what is called simple structure, with most items having a large loading on one factor but small loadings on other factors.
There at least five extraction methods. The evidence supports the use of one of these two:
The extraction and rotation steps are usually specificed in the same command in R (next subsection shows such commands).
Once an initial solution is obtained, the loadings are rotated. Rotation is a way of maximizing high loadings and minimizing low loadings so that the simplest possible structure is achieved.
There are two commen types:
no rotation
varimax, which is one of the key rotation methods used
We use the fa() function from the library Phsych because it allows us to specify different factoring methods (e.g. PAF, ML) and also different types of rotation (e.g. varimax, Oblimin and others). In such a function we have to specify r (the corelation matrix: corMat), nfactors (number of factors), rotate (type of rotation or transformations of the solution: oblimin), and fm (factoring method: principal axis).
Below are the tables of factor loadings and other statistics for the principal axis factoring and two rotation scenarios. Only the factors selected based on variance are shown. The first table is showing the unrotated 10-factor solution while the second table is showing the results from the variamx rotation:
From the factor analysis above, we can identify the attitudinal questions that clearly load into a single derived factor (i.e. a construct). Based on this, we can have an idea of the underlying construct (not observed in the data) that determines the attitudes of different customers. For instance, Q1_27, Q1_28 and Q1_20 load strongly into the same factor (besides other attitudes). Looking at the description of these questions, we can have an idea of the construct that determines the customer responses to these attitude questions. The questions have the following statements:
Q1_27: “Boating is the number one thing I do with my spare time”
Q1_28: “Boating is my true passion in life”
Q1_20: “When not on my boat, I often find myself doing boating related activities”
Based on the descriptions of the attitudes that are loading into the first factor, we could say that the first factor measures a construct of “Passion and expertise around boating”. We do not observe the extent to which passion is important for the respondents (the questionnaire does not ask a direct question about something called “personal image”); however, the actual attitudes allow deriving such construct.
Based on the factor analsyis described above we can obtain 10 factor scores for each of almost 3000 customers. These 10 new derived variables can be used instead of the whole 29 attitudinal variables for further analysis. Once we decided the factors to use (like in this example now 10), we typically replace the original data with a new dataset where each observation (row) is now described not using the original raw attributes but using instead the selected factors/derived attributes.
The factor scores are useful because they allow comparing customers based on few attitude dimensions (in this case 10 of them). For creating market segments, we could use these 10 derived variables (constructs) instead of the 29 raw attitudinal variables. For example the factor scores for the first 5 people are:
Derived Variable (Factor) 1 | Derived Variable (Factor) 2 | Derived Variable (Factor) 3 | Derived Variable (Factor) 4 | Derived Variable (Factor) 5 | Derived Variable (Factor) 6 | Derived Variable (Factor) 7 | Derived Variable (Factor) 8 | Derived Variable (Factor) 9 | Derived Variable (Factor) 10 | |
---|---|---|---|---|---|---|---|---|---|---|
1 | 0.1 | 1.2 | 2.1 | 1.2 | 1.3 | -1.2 | -0.5 | -0.9 | 0.6 | 0.4 |
2 | -0.2 | 0.2 | 1.2 | 1.1 | 0.2 | -0.6 | -0.0 | -0.7 | -1.3 | -0.3 |
3 | 0.1 | -0.7 | 2.5 | 0.3 | 0.8 | -0.7 | -0.7 | -1.7 | -1.4 | -0.1 |
4 | -0.3 | 0.3 | 2.6 | -0.7 | 0.1 | 0.0 | -0.4 | -0.4 | 1.3 | 0.4 |
5 | 0.7 | 0.1 | 2.2 | 0.4 | 0.9 | 0.8 | -0.2 | -0.0 | -0.4 | -0.9 |
Instead of the derived constructs we could also use 'a' single statement from each factor as a proxy for the factor. Usually researchers pick the attitude per factor with the highest loading for that specific factor (eg. looking at the rotated factor loadings in the table above we would choose Q1_27 for the first factor: “Boating is the number one thing I do with my spare time”). This is the approach that we would like to follow here. So for the 2nd step - the cluster analysis - we are using the following 10 statements instead of the full list of 29 statements. Each of these statements has the highest loading within the factor they belong to:
Raw Attributes | |
---|---|
Factor 1 | Q1_27_Boating is the number one thing I do with my spare time |
Factor 2 | Q1_18_Boating gives me a feeling of adventure |
Factor 3 | Q1_14_When buying accessories for my boat I tend to buy the latest and greatest |
Factor 4 | Q1_7_Owning a boat gives me a sense of achievement |
Factor 5 | Q1_4_I only consider buying a boat from a reputable brand |
Factor 6 | Q1_11_I tend to perform minor boat repairs and maintenance on my own |
Factor 7 | Q1_10_When buying a boat I rely more on expert opinion than my own e g consumer reports salespeople |
Factor 8 | Q1_8_When buying a boat functionality is more important than style |
Factor 9 | Q1_1_When buying a boat I do a lot of shopping around and visit multiple dealers |
Factor 10 | Q1_29_Boating is one of many things I do in my spare time |
Clustering organizes observations that are close into groups. This will allow us to define different market segments. In cluster analysis we have to define:
How do we define close? How do we group things? How do we visualize the grouping? How do we interpret the grouping?
Clustering analysis involves calculating how different indivudals in the sample are based on a series of attributes. For instance, in the factor analysis we calculated for each individual in the sample 10 factor scores that summarize well the variance of the 29 attitudional questions.
People that are similar in terms of the 10 selected attitudinal characteristics can be grouped together. The dendrogram allows having a quick visualization of how different customers stick together. The dendrogram groups customers 'little by little' based on the how different (similar) the 10 attitudes are. People with similar attitudes would stick together and depending on where you cut the dendogram tree you can have different groups or segments. The question of how many groups or clusters we will have depends on the structure of the data but also involves some intuition or knowledge about the market so that the solutions makes not only sense from a statistical perspective but also from a business perspective. So clustering is a very iterative process where the team needs to look at many different solutions to identify the optimal solutions. The optimal solution can only be identified if we profile these solutions and interpret and compare the different segment profiles.
After a few iterations the team looked at a few different segment profiles (3-7 segment solutions). Based on the inital segment profiles we have decided to move on with a 5 cluster solution as the optimal solution from a business perspective in the boating industry.
The figure below is the Dendrogram.
The next figure (distance plot) displays the heights of the dendrogram branches as we move from smaller to larger segments:
Once the team has decided on the number of factors they have to profile and describe these segments in a better way. Based on the groups identified by the cluster analysis (recall that cluster analysis calculates differences/similarities of customers based on 10 attitudes that stand as proxies for the 10 identified factors) we now have to profile 5 market segments. Below we have the number of observations (i.e. the number of customers of the sample) in each market segment:
Size of Cluster 1 | Size of Cluster 2 | Size of Cluster 3 | Size of Cluster 4 | Size of Cluster 5 | |
---|---|---|---|---|---|
1 | 467 | 624 | 552 | 682 | 488 |
In a next step we can fully profile each of the 5 segments using the other variables from the survey. We can for example look at differences across groups in variables such as income, age, boat preferences, etc. and determine if those differences are significant from a statistical point of view when the data is continuous (e.g. the level income). The usual approach is to use ANOVA (or MANOVA if we want to compare multiple variables across groups at the same time).
In this example most of the variables are categorical. In such cases we can look at the frequency of each of the categories of a variable for each segment group (segment profile). Each variable can be used to decscribe the segments. Ideally we find variables where segments differ a lot but it could also be the case that segments are very similar for some variables. Eg. in the table below we can see that each of the 5 segments have a similar proportion of people in each income category. Therefore, we cannot say that one segment is necessarily wealthier than the other.
Average income in cluster:1 | Average income in cluster:2 | Average income in cluster:3 | Average income in cluster:4 | Average income in cluster:5 |
---|
This is why finding the optimal solution is a very itereative process. We need to interpret not only the statistical criteria. It is even more important to look at different segment profiles to find the solution that makes the most business sense.
The tables below show a more detailed segment profile for the 5 segments that the team decided to use. The first table is showing the average scores for all 29 attitudes for the 5 segments:
Average of Cluster 1 | Average of Cluster 2 | Average of Cluster 3 | Average of Cluster 4 | Average of Cluster 5 | |
---|---|---|---|---|---|
Q1_1_When buying a boat I do a lot of shopping around and visit multiple dealers | 4.0 | 4.3 | 4.2 | 3.8 | 3.8 |
Q1_2_When buying a boat getting the lowest price is more important than the boat brand | 3.1 | 2.8 | 2.9 | 2.9 | 2.7 |
Q1_3_The brand of boat I buy says a lot about who I am | 3.3 | 3.8 | 3.1 | 2.6 | 2.7 |
Q1_4_I only consider buying a boat from a reputable brand | 3.7 | 4.2 | 4.1 | 3.6 | 3.8 |
Q1_5_I am willing to pay a premium for a brand with a reputation for high quality | 3.5 | 4.1 | 3.7 | 3.1 | 3.4 |
Q1_6_Owning a boat is a way of rewarding myself for my hard work | 4.0 | 4.5 | 4.0 | 3.6 | 3.6 |
Q1_7_Owning a boat gives me a sense of achievement | 3.7 | 4.4 | 3.7 | 3.2 | 3.3 |
Q1_8_When buying a boat functionality is more important than style | 3.7 | 3.6 | 4.0 | 3.8 | 3.5 |
Q1_9_I see my boat as a status symbol | 3.1 | 3.6 | 2.9 | 2.3 | 2.5 |
Q1_10_When buying a boat I rely more on expert opinion than my own e g consumer reports salespeople | 3.8 | 3.4 | 3.6 | 2.9 | 3.4 |
Q1_11_I tend to perform minor boat repairs and maintenance on my own | 2.8 | 4.1 | 4.1 | 4.0 | 1.8 |
Q1_12_When it comes to boating I tend to prefer a basic boat with little to no frills | 2.9 | 2.7 | 2.8 | 3.1 | 2.8 |
Q1_13_When buying a boat I tend to buy the latest and greatest | 3.2 | 3.7 | 3.1 | 2.4 | 2.7 |
Q1_14_When buying accessories for my boat I tend to buy the latest and greatest | 3.4 | 4.1 | 3.4 | 2.6 | 2.8 |
Q1_15_I am serious about the technology on my boat | 3.7 | 4.2 | 3.8 | 3.3 | 3.2 |
Q1_16_People tend to come to me for advice about boating | 3.2 | 3.9 | 3.2 | 2.8 | 2.4 |
Q1_17_I consider myself more knowledgeable about boating than the rest of my boating peers | 3.1 | 3.7 | 3.2 | 2.9 | 2.4 |
Q1_18_Boating gives me a feeling of adventure | 4.2 | 4.6 | 4.3 | 3.7 | 3.8 |
Q1_19_Boating allows me to experience nature | 4.1 | 4.5 | 4.3 | 4.0 | 4.0 |
Q1_20_When not on my boat I often find myself doing boating related activities | 3.3 | 3.9 | 3.2 | 2.8 | 2.5 |
Q1_21_Boating helps me escape from everyday life and relax | 4.2 | 4.6 | 4.3 | 4.0 | 4.0 |
Q1_22_Boating helps me stay active | 4.0 | 4.5 | 4.1 | 3.8 | 3.7 |
Q1_23_Boating allows me to excel in the sports that I am passionate about e g fishing racing water sports | 3.6 | 4.1 | 3.7 | 3.2 | 3.1 |
Q1_24_Boating gives me an outlet to socialize with family and or friends | 4.1 | 4.5 | 4.2 | 3.9 | 3.9 |
Q1_25_Being in charge of my boat is very important to me | 3.8 | 4.4 | 3.9 | 3.5 | 3.2 |
Q1_26_Having a powerful boat is what is most important to me | 3.1 | 3.7 | 3.0 | 2.4 | 2.6 |
Q1_27_Boating is the number one thing I do with my spare time | 3.6 | 4.1 | 2.9 | 2.6 | 2.5 |
Q1_28_Boating is my true passion in life | 3.5 | 4.1 | 3.3 | 2.8 | 2.8 |
Q1_29_Boating is one of many things I do in my spare time | 3.9 | 4.4 | 4.1 | 3.9 | 3.7 |
Average of Cluster 1 | Average of Cluster 2 | Average of Cluster 3 | Average of Cluster 4 | Average of Cluster 5 | |
---|---|---|---|---|---|
Q11_Gender | 1.5 | 1.4 | 1.4 | 1.3 | 1.6 |
Q12_HOUSEHOLD income | 13.3 | 13.5 | 13.6 | 13.1 | 13.7 |
Q13_Are you now married widowed divorced separated never married or living with a partner | 2.3 | 2.2 | 2.0 | 1.9 | 2.0 |
Q14_Current Employment Status | 2.2 | 2.0 | 2.3 | 2.4 | 2.4 |
Q15_Age in cluster | 2.3 | 2.2 | 2.6 | 2.8 | 2.7 |
Average of Cluster 1 | Average of Cluster 2 | Average of Cluster 3 | Average of Cluster 4 | Average of Cluster 5 | |
---|---|---|---|---|---|
Q2_How many powerboats do you currently own | 0.8 | 1.0 | 0.9 | 0.9 | 0.8 |
Q2_Cluster_Own a boat | 0.7 | 0.8 | 0.8 | 0.8 | 0.7 |
Q3_How likely would you be to purchase a new boat in the future | 4.2 | 4.4 | 4.2 | 4.0 | 4.0 |
Q4_Approximately how much did you pay for your current Boat | 4.1 | 4.3 | 3.9 | 3.4 | 3.9 |
Q5_What is the horse power of your engine brand current boat engine the boat you want to buy | 3.4 | 4.1 | 3.3 | 2.7 | 2.8 |
Q6_Length of Boat in Feet | 23.3 | 24.7 | 22.5 | 21.5 | 22.2 |
Q7_1_I normally boat alone | 2.2 | 2.4 | 2.2 | 2.3 | 1.9 |
Q7_2_I boat with my spouse or significant other | 4.0 | 4.1 | 4.0 | 3.9 | 4.0 |
Q7_3_I boat with my family including kids | 3.8 | 4.0 | 3.9 | 3.6 | 3.8 |
Q7_4_I boat with my friends | 3.7 | 3.9 | 3.7 | 3.5 | 3.6 |
Q8_How would you rate your experience regarding boating | 2.1 | 2.5 | 2.4 | 2.4 | 2.0 |
Q9_1_Fishing Below is a list of activities that may or may not do while boating Please indicate how often you engage in each of these activities | 3.4 | 3.8 | 3.7 | 3.6 | 3.2 |
Q9_2_Swimming Below is a list of activities that may or may not do while boating Please indicate how often you engage in each of these activities | 3.5 | 3.7 | 3.4 | 3.2 | 3.2 |
Q9_3_Cruising Below is a list of activities that may or may not do while boating Please indicate how often you engage in each of these activities | 3.8 | 4.0 | 3.7 | 3.4 | 3.7 |
Q9_4_Water Sports Below is a list of activities that may or may not do while boating Please indicate how often you engage in each of these activities | 3.3 | 3.5 | 3.2 | 2.9 | 3.1 |
Q9_5_Entertaining socializing Below is a list of activities that may or may not do while boating Please indicate how often you engage in each of these activities | 3.6 | 3.8 | 3.5 | 3.1 | 3.5 |
Q10_During your boating season how many days out of the year do you typically use your boat | 46.7 | 51.5 | 47.4 | 43.7 | 41.4 |
ATTITUDES / NEEDS / MOTIVATIONS
Strongly Disagree (1)
Disagree (2)
Neither Agree or Disagree (3)
Agree (4)
Strongly Agree (5)
PURCHASE PROCESS AND BEHAVIOR
How many powerboats do you currently own?
How likely would you be to purchase a new boat in the future?
Definitely will not (1)
Probably will not (2)
Might or might not (3)
Probably will (4)
Definitely will (5)
Not including any taxes, fees or accessories, approximately how much did you pay for your boat? How much are you planning to spend on your new boat?
l0.$1,000,000 or more 11. Don't know
What is the horse power of your engine? What horse power are you looking for?
0 - 14 HP
15- 60 HP
61 - 115 HP
116 - 150 HP
151 - 200 HP
201 - 250 HP
251 - 350 HP
more than 350 HP
Don't Know
What is the length of your boat? What is the length of the boat you are considering buying? (e.g., 20 feet)
Which of the following statements best describes WHO you boat with?
Never (1), Rarely (2), Sometimes (3), Often (4), Most of the time (5)
I boat with my friends
How would you rate your experience regarding boating?
Beginner
Intermediate
Advanced
Expert
Below is a list of activities that may or may not do while boating. Using the scale provided, please indicate how often you engage in each of these activities.
Never (1) Rarely (2) Sometimes (3) Often (4) Most of the time (5)
Entertaining/socializing
During your boating season, how many days out of the year do you typically use your boat?
DEMOGRAPHICS
Female
We would like to get a better estimate of your total HOUSEHOLD income in the past 12 months before taxes.
Less than $5,000
$5,000 to $7,499
$7,500 to $9,999
$10,000 to $12,499
$12,500 to $14,999
$15,000 to $19,999
$20,000 to $24,999
$25,000 to $29,999
$30,000 to $34,999
$35,000 to $39,999
$40,000 to $49,999
$50,000 to $59,999
$60,000 to $74,999
$75,000 to $84,999
$85,000 to $99,999
$100,000 to $124,999
$125,000 to $149,999
$150,000 to $174,999
$175,000 or more
Are you now married, widowed, divorced, separated, never married, or living with a partner?
Married
Widowed
Divorced
Separated
Never married
Living with partner
Which statement best describes your current employment status?
Working - as a paid employee 1
Working - self-employed 2
Not working - on temporary layoff from a job
Not working - looking for work
Not working - retired
Not working - disabled
Not working - other
What is your age?
*BOATS: BRAND EQUITY *
Strongly Disagree (1)
Disagree (2)
Neither Agree or Disagree (3)
Agree (4)
Strongly Agree (5)
Origin/ Heritage
Is a brand that has been around for a long time Brand reputation
Has best in-class customer service
Has a strong dealer network Brand leadership
Is a leader in cutting edge technology
Is a leader in safety
Is known for its innovative products
Users of the brand
Is a brand for people who are serious about boating
Is a good brand for people that are new to boating
Is a brand I see in the water all the time
Boating Experience
Offers boats that provide a fast and powerful boating experience
Offers the best boats for socializing
Offers the best boats for water sports (e.g., tubing, ski, wakeboard)
Aesthetics
Offers boats with superior interior style
Offers boats with superior exterior style
Offers boats that stand out from the crowd
Offers boats that look cool
Product Quality/ Reliability/ Innovation
Offers boats that can handle rough weather or choppy water
Offers boats that can handle frequent and heavy usage
Offers a wide breadth of product offerings and accessories
Product Safety
Ease of use / convenience
Offers boats that are easy to maintain and/or repair
Offers boats that are easy to use
Offers boats that are easy to clean up
Value/Price
Emotional Benefits
Is a brand that gives me peace of mind
Makes me feel I made a smart decision
Self Expressive Benefits
BOATS:
BRAND FUNNEL
0=would not recommend/don't know
1=would definitely recommend
0=no
1=yes