Questions tagged [nominal-data]

Categorical data come in two types: ordinal and nominal. According to Wikipedia, nominal data are data objects that can be grouped on the basis of a qualitative property. Specifically, nominal data cannot be measured and, unlike ordinal data, cannot be ordered. For example, Gender is a nominal variable whose values “Male” and “Female” do not follow an order. In data science, two common ways to handle nominal data are dummy encoding and one-hot encoding.
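
As a rough illustration of the two encodings named above, here is a minimal pure-Python sketch. The helper names `one_hot_encode` and `dummy_encode` are hypothetical (not from any library), and the choice of the alphabetically first category as the dropped reference level is an assumption made only for this example.

```python
def one_hot_encode(values):
    """Return (categories, rows) with one 0/1 column per distinct category."""
    # Sorting only fixes the column order; it does not imply the categories are ordered.
    categories = sorted(set(values))
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, rows


def dummy_encode(values):
    """Like one-hot encoding, but drop the first category as the reference level."""
    categories, rows = one_hot_encode(values)
    return categories[1:], [row[1:] for row in rows]


genders = ["Male", "Female", "Female", "Male"]

one_hot_cols, one_hot_rows = one_hot_encode(genders)
dummy_cols, dummy_rows = dummy_encode(genders)

print(one_hot_cols, one_hot_rows)
print(dummy_cols, dummy_rows)
```

One-hot encoding keeps a column for every category, while dummy encoding keeps one fewer, so the dropped category is represented by all zeros. Libraries such as pandas and scikit-learn provide their own encoders, which are usually preferable in practice.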

15 questions
22
votes
3 answers

How to specify a variable in pandas as ordinal/categorical?

I am trying to run a machine learning algorithm on a dataset using scikit-learn. My dataset has some features that are like categories. For example, one feature is A, which has values 1, 2, 3 specifying the quality of something. 1: Upper, 2: Second, 3: Third…
Baktaawar
12
votes
4 answers

Highcharts percentage of total for simple bar chart

I have a simple 1-series bar chart where each bar has a nominal value. I can plot this fine with the data labels and axis representing the value for each bar but I'd like to have the data label and axis show the percentage of the total of the series…
harryg
2
votes
1 answer

Create dataframe with cluster assignment using nomclust in R

I am using nomclust to run hierarchical cluster analysis over nominal data. In order to explain what I need, I am using the dataset CA.methods which is in the same package. I will run the nomclust() function and later I will plot by using…
antecessor
2
votes
1 answer

How to implement feature importance on nominal categorical features in tree based classifiers?

I am using the SKLearn XGBoost model for my binary classification problem. My data contains nominal categorical features (such as race), for which one-hot encoding should be used to feed them to tree-based models. On the other hand, using…
2
votes
2 answers

MATLAB - Directly convert nominal data to numeric

Is there a MATLAB function to directly convert nominal data to numeric data? N.B. currently, after several searches, I use NumericGroup=str2num(char(NominalGroup)). Thanks,
1
vote
2 answers

How to change attributes or value labels of a contrast?

I want to perform a regression analysis with R, using a difference contrast for a nominal independent variable. However, the contrast produces factor level names that are not suitable for publication, so I want to change them. The problem is how to…
1
vote
1 answer

How to pass a record to a decision tree?

I'm working on a small project. Let's say I have a table of around 100K records containing columns like Age, Gender, Region, Life (nominal - range of days the product is used), etc. Here Life is a dependent variable and all others are independent…
pegasus
0
votes
0 answers

SEM/CFA in lavaan with nominal exogenous variable

I am trying to fit a measurement model with some latent variables as exogenous variables and a non-latent endogenous variable. Then I use two nominal variables as controls, gender (SPOL) and entrepreneurial family background (DRUZ_PODJ). The values…
0
votes
0 answers

Discriminant Correspondence Analysis - Nominal Data as Input or Integer Data as Input?

Here's where I'm getting the definition of Discriminant Correspondence Analysis from: https://www.semanticscholar.org/paper/Discriminant-Correspondence-Analysis-Abdi/55b6fa9ad5c3fae17cd7123656556abbadf5deb6 I'd like to use Discriminant…
0
votes
2 answers

K-means clustering on a nominal data?

I am performing clustering on some data points which are in various degrees, like low, medium, and high. Is it advisable to convert them into numbers like low-1, med-2, high-3, and apply k-means directly, or should I use another method? I…
asif
0
votes
2 answers

machine learning, nominal data normalization

I am working on k-means clustering. I have a 3D dataset: no. of days, frequency, food. -> Day is normalized by mean and standard deviation (SD), or better to say, standardized, which gives me a range of [-2 to 14]. -> For frequency and food, which are NOMINAL data in…
0
votes
1 answer

Clustering nominal data

I am trying to apply a clustering algorithm to my data set. My data set is of movies, and some of the attributes are nominal. For example: movie 1: [ IMDB popularity: 1.02 Genre: Drama Sub-genre: Horror Rating: 1.23% ] movie 2: [ IMDB popularity:…
Binyamin Even
0
votes
1 answer

How to handle the Nominal Data by Weka J48

When I ran Weka's J48 with the binary split option, the following decision tree was built. http://www.fastpic.jp/viewer.php?file=2693704973.jpg The input explanatory variable is one nominal attribute made from question id + answer id. 1 nominal data, 1…
keita
0
votes
2 answers

Qualitative Classification in Neural Network on Weka

I have a training set where the input vectors are speed, acceleration and turn angle change. The output is a crisp class: an activity state from the given set {rest, walk, run}. E.g., for input vectors [3.1 1.2 2]-->run; [2.1 1 1]-->walk and so on.…
user26161
-1
votes
2 answers

Plotting clusters of nominal data in R

Imagine we have 7 categories (e.g. religion), and we would like to plot them not in a linear way, but in clusters that are automatically chosen to be nicely aligned. Here the individuals within groups have the same response, but should not be…
PascalVKooten