Questions tagged [machine-learning]

Implementation questions about machine learning algorithms. General questions about machine learning (concepts, theory, methodology, terminology, etc.) should be posted to their specific communities.

Machine learning revolves around developing self-learning computer algorithms that function by virtue of discovering patterns in data and making intelligent decisions based on such patterns.

Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. Machine learning explores the construction and study of algorithms that can learn from and make predictions about data. Such algorithms operate by building a model from example inputs in order to make data-driven predictions or decisions rather than following strictly static program instructions.

NOTE: If you want to use this tag for a question not directly concerning implementation, then consider posting on Cross Validated, Data Science, or Artificial Intelligence instead; otherwise you're probably off-topic. Please choose one site only and do not cross-post to more than one - see Is cross-posting a question on multiple Stack Exchange sites permitted if the question is on-topic for each site? (tl;dr: no).

Classic Problems:

Classification (supervised learning) classification supervised-learning
Regression (supervised learning) regression
Clustering (unsupervised learning) cluster-analysis unsupervised-learning
Density estimation
Sampling
Reinforcement Learning reinforcement-learning

Relevant Algorithms:

Principal component analysis (PCA) pca
Artificial neural networks (ANN) neural-network
Support vector machines (SVM) svm support-vector-machines
K-nearest neighbor (kNN) knn nearest-neighbor
k-means k-means
Bayesian networks bayesian-networks
Gaussian mixture model (GMM) mixture-model
Decision trees decisiontrees
Genetic algorithms genetic-algorithm
Simulated annealing simulated-annealing
Hidden Markov model (HMM) hidden-markov-models
Conditional Random Field (CRF)
Gaussian Processes gaussian-process
Kalman filter kalman kalman-filter
Particle filter particle-filter
Gibbs sampling
Graphical models
Ensemble methods (bagging, boosting, ...) ensemble-learning
Deep learning deep-learning
Q-Learning q-learning

Applications:

Computer vision (e.g, object tracking, gesture recognition) computer-vision
Image recognition (e.g, face, gait, iris, handwriting) image-recognition face-recognition ocr
Speech recognition speech-recognition
Speaker recognition voice-recognition
Natural language processing (NLP) nlp
Music information retrieval (MIR)
Bioinformatics bioinformatics
Spam filtering spam-filtering
Anomaly detection anomaly-detection
Automatic vehicle driving
Recommendation system recommendation-engine
Machine translation machine-translation

Software:

LibSVM libsvm
Weka weka
Orange orange
Shogun shogun
scikit-learn scikit-learn
PyBrain pybrain
Apache Mahout mahout
RapidMiner rapidminer
KNIME knime
Waffles
Azure Machine Learning azure-machine-learning
nltk nltk
Caffe caffe
TensorFlow tensorflow
Theano theano
Keras keras
OpenNMT opennmt
XGBoost xgboost
CatBoost catboost
Stanford CoreNLP stanford-nlp

Related-tags:

Video Lectures:-

Machine Learning with Python

55241 questions

957

votes

18 answers

What is the role of the bias in neural networks?

I'm aware of the gradient descent and the back-propagation algorithm. What I don't get is: when is using a bias important and how do you use it? For example, when mapping the AND function, when I use two inputs and one output, it does not give the…

machine-learning neural-network artificial-intelligence backpropagation

asked Mar 19 '10 at 21:18

Karan

11,509
8
34
38

654

votes

13 answers

What is the difference between a generative and a discriminative algorithm?

algorithm machine-learning generative

asked May 18 '09 at 19:44

unj2

52,135
87
247
375

575

votes

5 answers

A simple explanation of Naive Bayes Classification

I am finding it hard to understand the process of Naive Bayes, and I was wondering if someone could explain it with a simple step by step process in English. I understand it takes comparisons by times occurred as a probability, but I have no idea…

algorithm machine-learning dataset classification naivebayes

asked Apr 08 '12 at 00:56

Aeonitis

5,887
3
14
8

476

votes

14 answers

Epoch vs Iteration when training neural networks

What is the difference between epoch and iteration when training a multi-layer perceptron?

machine-learning neural-network deep-learning artificial-intelligence terminology

asked Jan 20 '11 at 21:11

mohammad

4,905
4
16
13

473

votes

8 answers

What are logits? What is the difference between softmax and softmax_cross_entropy_with_logits?

In the tensorflow API docs they use a keyword called logits. What is it? A lot of methods are written like: tf.nn.softmax(logits, name=None) If logits is just a generic Tensor input, why is it named logits? Secondly, what is the difference between…

python machine-learning tensorflow

asked Dec 12 '15 at 14:03

Shubhashis

10,411
11
33
48

464

votes

18 answers

How does the Google "Did you mean?" Algorithm work?

I've been developing an internal website for a portfolio management tool. There is a lot of text data, company names etc. I've been really impressed with some search engines ability to very quickly respond to queries with "Did you mean: xxxx". I…

algorithm machine-learning nlp spell-checking text-search

asked Nov 20 '08 at 23:34

Andrew Harry

13,773
18
67
102

429

votes

10 answers

What is the meaning of the word logits in TensorFlow?

In the following TensorFlow function, we must feed the activation of artificial neurons in the final layer. That I understand. But I don't understand why it is called logits? Isn't that a mathematical function? loss_function =…

tensorflow machine-learning neural-network deep-learning cross-entropy

asked Jan 04 '17 at 02:02

Milad P.

4,707
3
12
9

395

votes

6 answers

What are advantages of Artificial Neural Networks over Support Vector Machines?

ANN (Artificial Neural Networks) and SVM (Support Vector Machines) are two popular strategies for supervised machine learning and classification. It's not often clear which method is better for a particular project, and I'm certain the answer is…

machine-learning neural-network classification svm

asked Jul 24 '12 at 13:59

Channel72

24,139
32
108
180

352

votes

22 answers

Convert array of indices to one-hot encoded array in NumPy

Given a 1D array of indices: a = array([1, 0, 3]) I want to one-hot encode this as a 2D array: b = array([[0,1,0,0], [1,0,0,0], [0,0,0,1]])

python numpy machine-learning numpy-ndarray one-hot-encoding

asked Apr 23 '15 at 18:24

James Atwood

4,289
2
17
17

303

votes

26 answers

How to implement the Softmax function in Python

From the Udacity's deep learning class, the softmax of y_i is simply the exponential divided by the sum of exponential of the whole Y vector: Where S(y_i) is the softmax function of y_i and e is the exponential and j is the no. of columns in the…

python numpy machine-learning logistic-regression softmax

asked Jan 23 '16 at 20:52

alvas

115,346
109
446
738

293

votes

27 answers

What is the difference between supervised learning and unsupervised learning?

In terms of artificial intelligence and machine learning, what is the difference between supervised and unsupervised learning? Can you provide a basic, easy explanation with an example?

machine-learning artificial-intelligence supervised-learning unsupervised-learning

asked Dec 02 '09 at 10:37

TIMEX

259,804
351
777
1,080

279

votes

15 answers

What is the difference between linear regression and logistic regression?

When we have to predict the value of a categorical (or discrete) outcome we use logistic regression. I believe we use linear regression to also predict the value of an outcome given the input values. Then, what is the difference between the two…

machine-learning data-mining linear-regression

asked Aug 27 '12 at 17:49

London guy

27,522
44
121
179

271

votes

3 answers

How to interpret loss and accuracy for a machine learning model

When I trained my neural network with Theano or Tensorflow, they will report a variable called "loss" per epoch. How should I interpret this variable? Higher loss is better or worse, or what does it mean for the final performance (accuracy) of my…

machine-learning neural-network mathematical-optimization deep-learning objective-function

asked Dec 29 '15 at 20:33

mamatv

3,581
4
19
25

255

votes

7 answers

Is there a rule-of-thumb for how to divide a dataset into training and validation sets?

Is there a rule-of-thumb for how to best divide data into training and validation sets? Is an even 50/50 split advisable? Or are there clear advantages of having more training data relative to validation data (or vice versa)? Or is this choice…

machine-learning

asked Nov 28 '12 at 16:42

robguinness

16,266
14
55
65

254

votes

10 answers

How do I initialize weights in PyTorch?

How do I initialize weights and biases of a network (via e.g. He or Xavier initialization)?

python machine-learning deep-learning neural-network pytorch

asked Mar 22 '18 at 16:34

Fábio Perez

23,850
22
76
100

2 3

…

99 100 Next