Discussion:
Clustering of Ordinal Variables
(too old to reply)
m***@gmail.com
2016-07-05 13:15:42 UTC
Permalink
Hi,

I need to perform clustering on ordinal variables. I have responses of certain questionnaire , which has 12 questions. The response of 12000 customers is recorded into 12 columns. The values in columns can be anything from 1 to 5.

What method we can use to cluster them. I need output like which questions are key to study one cluster.

Any help is much appreciated.

R's
Mohit
Rich Ulrich
2016-07-06 18:13:54 UTC
Permalink
Post by m***@gmail.com
Hi,
I need to perform clustering on ordinal variables. I have responses of certain questionnaire , which has 12 questions. The response of 12000 customers is recorded into 12 columns. The values in columns can be anything from 1 to 5.
What method we can use to cluster them. I need output like which questions are key to study one cluster.
Any help is much appreciated.
If these questions make up a "rating scale" where the designers
have attempted to provide equal intervals, then the responses
that range from 1-5 can be (probably) regarded as "interval";
"ordinal" should probably be reserved to describe responses that
are more questionable.

The simple clustering of items for a rating scale is achieved by
performing a Factor Analysis, with varimax rotation to get a
set of factors that you will describe by the strongest loadings.

If your questions are not designed as a rating scale, there could
be a number of other possibilities. For instance, if two questions
are paramount, your best "clusters" might be formed by grouping
continuous cells in their 5x5 table of crosstabulations.

For more comments, you might repeat your question and
provide specifics about the questions and the anchor points.
--
Rich Ulrich
Kenneth M. Lin
2016-10-16 01:10:28 UTC
Permalink
Did you check to see how many distinct ways did 12,000 customers answer the
12 questions?

What’s "R's"?

wrote in message news:e2b154c6-28c3-4989-a3de-***@googlegroups.com...

Hi,

I need to perform clustering on ordinal variables. I have responses of
certain questionnaire , which has 12 questions. The response of 12000
customers is recorded into 12 columns. The values in columns can be anything
from 1 to 5.

What method we can use to cluster them. I need output like which questions
are key to study one cluster.

Any help is much appreciated.

R's
Mohit

Loading...