Questions tagged [data-mining]

Data mining is the process of analyzing large amounts of data in order to find patterns and commonalities.

Data mining, also known as knowledge discovery, is the process of digging through and analyzing enormous sets of data and then extracting meaning from them. Data mining tools, such as SQL Server Analysis Services, predict behaviors and future trends, allowing businesses to make proactive, knowledge-driven decisions. They can answer business questions that were traditionally too time-consuming to resolve. They scour databases for hidden patterns, finding predictive information that experts may miss because it lies outside their expectations. The inputs to a mining algorithm are variously called cases, samples, examples, instances, events, or observations.

The fields of machine-learning, artificial-intelligence, and statistics provide many of the techniques used in data mining, in combination with database technologies for efficiency. Please use the appropriate tag (e.g. machine-learning) when a question is about the underlying methods themselves.
Cluster analysis (dataclustering) and outlier detection (outliers) are two of the main challenges in data mining.
Wiki Links
Data Mining Introduction

3094 questions

279

votes

15 answers

What is the difference between linear regression and logistic regression?

When we have to predict the value of a categorical (or discrete) outcome, we use logistic regression. I believe we also use linear regression to predict the value of an outcome given the input values. Then, what is the difference between the two…

machine-learning data-mining linear-regression

asked Aug 27 '12 at 17:49

London guy

27,522
44
121
179

209

votes

12 answers

Can someone give an example of cosine similarity, in a very simple, graphical way?

Cosine Similarity article on Wikipedia. Can you show the vectors here (in a list or something) and then do the math, and let us see how it works?

text data-mining cosine-similarity

asked Nov 17 '09 at 04:03

TIMEX

259,804
351
777
1,080
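As a rough illustration of what the cosine similarity question above asks for, here is a minimal sketch that turns two short sentences into word-count vectors and then does the math. The two sentences and the helper names `term_vectors` and `cosine_similarity` are assumptions made up for this example; they are not taken from the question or its answers.

```python
import math
from collections import Counter


def term_vectors(text_a, text_b):
    """Build word-count vectors for two texts over their shared vocabulary."""
    counts_a = Counter(text_a.lower().split())
    counts_b = Counter(text_b.lower().split())
    vocab = sorted(set(counts_a) | set(counts_b))
    # Counter returns 0 for words that do not occur in a text.
    return vocab, [counts_a[w] for w in vocab], [counts_b[w] for w in vocab]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical example sentences, chosen only for illustration.
vocab, vec_a, vec_b = term_vectors(
    "julie loves me more than linda loves me",
    "jane likes me more than julie loves me",
)
print(vocab)
print(vec_a)
print(vec_b)
print(cosine_similarity(vec_a, vec_b))
```

Printing the vocabulary next to the two vectors shows which dimension each word occupies, so the dot product and the two norms can be checked by hand.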

203

votes

20 answers

Difference between classification and clustering in data mining?

Can someone explain what the difference is between classification and clustering in data mining? If you can, please give examples of both to understand the main idea.

machine-learning classification cluster-analysis data-mining terminology

asked Feb 21 '11 at 10:39

Kristaps

2,047
2
14
5

146

votes

8 answers

How does the Amazon Recommendation feature work?

What technology goes on behind the scenes of Amazon's recommendation technology? I believe that Amazon recommendation is currently the best in the market, but how do they provide us with such relevant recommendations? Recently, we have been involved…

algorithm language-agnostic data-mining

asked Feb 24 '10 at 04:57

Rachel

100,387
116
269
365

136

votes

6 answers

Why is the F-Measure a harmonic mean and not an arithmetic mean of the Precision and Recall measures?

When we calculate the F-Measure considering both Precision and Recall, we take the harmonic mean of the two measures instead of a simple arithmetic mean. What is the intuitive reason behind taking the harmonic mean and not a simple average?

machine-learning classification data-mining

asked Oct 14 '14 at 08:22

London guy

27,522
44
121
179
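One way to build intuition for the F-Measure question above is to compare the two means on a lopsided classifier. The sketch below is only an illustration under assumed precision and recall values; the function names `f_measure` and `arithmetic_mean` are hypothetical.

```python
def arithmetic_mean(precision, recall):
    """Simple average of precision and recall."""
    return (precision + recall) / 2


def f_measure(precision, recall):
    """Harmonic mean of precision and recall (the balanced F-Measure, F1)."""
    if precision + recall == 0:
        return 0.0  # convention: both measures are zero
    return 2 * precision * recall / (precision + recall)


# Assumed values: a classifier that is always right when it predicts
# positive, but finds only 1% of the actual positives.
precision, recall = 1.0, 0.01

print(arithmetic_mean(precision, recall))
print(f_measure(precision, recall))
```

With these assumed numbers the arithmetic mean sits near the middle of the scale, while the harmonic mean stays close to the smaller of the two values, so a classifier cannot score well by maximizing only one measure.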

132

votes

3 answers

Why does one hot encoding improve machine learning performance?

I have noticed that when One Hot encoding is used on a particular data set (a matrix) and used as training data for learning algorithms, it gives significantly better results with respect to prediction accuracy, compared to using the original matrix…

machine-learning data-mining scikit-learn data-analysis

asked Jul 04 '13 at 12:04

maheshakya

2,198
7
28
43

120

votes

8 answers

What is an intuitive explanation of the Expectation Maximization technique?

Expectation Maximization (EM) is a kind of probabilistic method to classify data. Please correct me if I am wrong and it is not a classifier. What is an intuitive explanation of this EM technique? What is expectation here and what is being…

machine-learning cluster-analysis data-mining mathematical-optimization expectation-maximization

asked Aug 04 '12 at 10:56

London guy

27,522
44
121
179

109

votes

6 answers

1D Number Array Clustering

So let's say I have an array like this: [1,1,2,3,10,11,13,67,71] Is there a convenient way to partition the array into something like this? [[1,1,2,3],[10,11,13],[67,71]] I looked through similar questions yet most people suggested using k-means…

arrays cluster-analysis data-mining dimension partition-problem

asked Jul 16 '12 at 22:25

E.H.

3,271
4
19
18
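For the 1D array question above, one simple heuristic (not necessarily what the answers recommend) is to sort the values and cut at the largest gaps between neighbors. The sketch below assumes the number of groups `k` is known in advance; the function name `split_by_largest_gaps` is hypothetical.

```python
def split_by_largest_gaps(values, k):
    """Split 1D values into at most k groups by cutting at the k-1 largest gaps."""
    if k < 1:
        raise ValueError("k must be at least 1")
    data = sorted(values)
    if not data:
        return []
    # Pair each gap with the index of the element on its left.
    gaps = [(data[i + 1] - data[i], i) for i in range(len(data) - 1)]
    # Indices after which to cut, restored to left-to-right order.
    cut_after = sorted(i for _, i in sorted(gaps, reverse=True)[: k - 1])
    groups, start = [], 0
    for i in cut_after:
        groups.append(data[start : i + 1])
        start = i + 1
    groups.append(data[start:])
    return groups


print(split_by_largest_gaps([1, 1, 2, 3, 10, 11, 13, 67, 71], 3))
# → [[1, 1, 2, 3], [10, 11, 13], [67, 71]]
```

This works for the array in the question because its groups are separated by clearly larger gaps; data without such gaps would need a different approach.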

votes

6 answers

Mixing categorical and continuous data in Naive Bayes classifier using scikit-learn

I'm using scikit-learn in Python to develop a classification algorithm to predict the gender of certain customers. Amongst others, I want to use the Naive Bayes classifier but my problem is that I have a mix of categorical data (ex: "Registered…

python machine-learning data-mining classification scikit-learn

asked Jan 10 '13 at 09:08

user1499144

1,063
2
9
9

votes

5 answers

What is the difference between Gradient Descent and Newton's Gradient Descent?

I understand what Gradient Descent does. Basically it tries to move towards the local optimal solution by slowly moving down the curve. I am trying to understand what is the actual difference between the plain gradient descent and the Newton's…

machine-learning data-mining mathematical-optimization gradient-descent newtons-method

asked Aug 22 '12 at 05:27

London guy

27,522
44
121
179

votes

3 answers

What information can we access from the client?

I'm trying to compile a list of information that is accessible via JavaScript, such as: geo-location, IP address, browser software, exit location, entrance location. I understand that a user can alter any of this information and that its reliability is…

javascript data-mining data-retrieval

asked Nov 18 '11 at 09:29

George Reith

13,132
18
79
148

votes

7 answers

PCA For categorical features?

My understanding was that PCA can be performed only on continuous features. But while trying to understand the difference between one-hot encoding and label encoding, I came across a post at the following link: When to use One Hot Encoding vs…

python machine-learning scikit-learn data-mining

asked Nov 24 '16 at 22:11

data_person

4,194
7
40
75

votes

1 answer

Decision tree vs. Naive Bayes classifier

I am doing some research about different data mining techniques and came across something that I could not figure out. If anyone has any idea, that would be great. In which cases is it better to use a decision tree, and in which cases a Naive Bayes…

data-mining decision-tree bayesian-networks

asked Apr 25 '12 at 14:33

Y2theZ

10,162
38
131
200

votes

11 answers

Calculate AUC in R?

Given a vector of scores and a vector of actual class labels, how do you calculate a single-number AUC metric for a binary classifier in the R language or in simple English? Page 9 of "AUC: a Better Measure..." seems to require knowing the class…

r machine-learning data-mining auc

asked Feb 04 '11 at 21:24

Andrew

1,619
3
19
24

votes

6 answers

How many principal components to take?

I know that principal component analysis does an SVD on a matrix and then generates an eigenvalue matrix. To select the principal components we have to take only the first few eigenvalues. Now, how do we decide on the number of eigenvalues that we…

machine-learning data-mining svd

asked Aug 22 '12 at 06:31

London guy

27,522
44
121
179
