Questions tagged [feature-extraction]

In pattern recognition and image processing, feature extraction is a special form of dimensionality reduction: it transforms the input data into a set of features. If the features are chosen carefully, the feature set is expected to capture the information relevant to the task, so that the task can be performed on this reduced representation instead of the full-size input.

Feature extraction reduces the amount of resources required to describe a large set of data accurately. When analyzing complex data, one of the major problems stems from the number of variables involved. Analysis with a large number of variables generally requires a large amount of memory and computation power, and it may cause a classification algorithm to overfit the training sample and generalize poorly to new samples. Feature extraction is a general term for methods that construct combinations of the variables to get around these problems while still describing the data with sufficient accuracy.

The best results are achieved when an expert constructs a set of application-dependent features. If no such expert knowledge is available, general dimensionality reduction techniques may help.

Source: Wikipedia
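
As a rough illustration of "constructing combinations of the variables", the sketch below replaces two correlated variables with a single extracted feature: the projection of each point onto the first principal component. It is a minimal example restricted to 2-D data, using a closed-form angle for the leading eigenvector of the 2x2 covariance matrix. The function names `first_principal_component` and `extract_feature` are hypothetical and chosen only for this example.

```python
import math


def first_principal_component(points):
    """Return (mean, unit direction) of the first principal component of 2-D points."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    # Entries of the 2x2 sample covariance matrix.
    sxx = sum((p[0] - mean_x) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - mean_y) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points) / (n - 1)
    # Orientation of the leading eigenvector of a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    return (mean_x, mean_y), (math.cos(theta), math.sin(theta))


def extract_feature(points):
    """Reduce each 2-D point to one number: its coordinate along the first component."""
    (mean_x, mean_y), (dx, dy) = first_principal_component(points)
    return [(x - mean_x) * dx + (y - mean_y) * dy for x, y in points]


if __name__ == "__main__":
    # Illustrative data only: the points lie exactly on the line y = 2x,
    # so one extracted feature describes them without loss.
    pts = [(0.0, 0.0), (1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
    print(extract_feature(pts))
```

For real data with more than two variables, a library implementation of principal component analysis would normally be used instead of this closed form.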

1664 questions

votes

9 answers

The easiest way for getting feature names after running SelectKBest in Scikit Learn

I'm trying to conduct a supervised machine-learning experiment using the SelectKBest feature of scikit-learn, but I'm not sure how to create a new dataframe after finding the best features: Let's assume I would like to conduct the experiment…

asked Oct 03 '16 at 19:35

Aviade

votes

2 answers

What is the difference between feature detection and descriptor extraction?

Does anyone know the difference between feature detection and descriptor extraction in OpenCV 2.3? I understand that the latter is required for matching using DescriptorMatcher. If that's the case, what is FeatureDetection used for?

image-processing opencv computer-vision feature-detection feature-extraction

asked Jul 26 '11 at 15:55

Chris Arriola

votes

5 answers

Feature Selection and Reduction for Text Classification

I am currently working on a project, a simple sentiment analyzer, such that there will be 2 and 3 classes in separate cases. I am using a corpus that is pretty rich in terms of unique words (around 200,000). I used bag-of-words method for feature…

python nlp svm sentiment-analysis feature-extraction

asked Nov 28 '12 at 11:21

clancularius

votes

2 answers

What is a feature descriptor in image processing (algorithm or description)?

I often get confused by the meaning of the term descriptor in the context of image features. Is a descriptor the description of the local neighborhood of a point (e.g. a float vector), or is a descriptor the algorithm that outputs the description?…

image-processing computer-vision feature-detection feature-extraction

asked Dec 22 '14 at 01:09

Richard

votes

5 answers

Linear Regression :: Normalization (Vs) Standardization

I am using linear regression to predict data, but I am getting totally contrasting results when I normalize vs. standardize variables. Normalization = (x - xmin) / (xmax - xmin); z-score standardization = (x - xmean) / xstd. a) Also,…

machine-learning linear-regression feature-extraction

asked Aug 20 '15 at 01:32

Santosh Kumar
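
The two rescalings named in the excerpt above can be compared side by side with a short sketch. This is only an illustration under stated assumptions: the helper names `min_max_normalize` and `z_score_standardize` are hypothetical, the standard deviation is the population one (`statistics.pstdev`), and the input is assumed to contain at least two distinct values so that neither denominator is zero.

```python
from statistics import mean, pstdev


def min_max_normalize(values):
    """Rescale values to the range [0, 1]: (x - xmin) / (xmax - xmin)."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def z_score_standardize(values):
    """Rescale values to zero mean and unit standard deviation: (x - xmean) / xstd."""
    mu = mean(values)
    sigma = pstdev(values)  # assumption: population standard deviation
    return [(v - mu) / sigma for v in values]


if __name__ == "__main__":
    data = [2.0, 4.0, 6.0, 8.0]  # made-up sample values
    print(min_max_normalize(data))
    print(z_score_standardize(data))
```

Both transforms are affine, so they change the scale and offset of a variable but not its ordering or its linear relationship to other variables.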

votes

4 answers

Extracting HoG Features using OpenCV

I am trying to extract features using OpenCV's HoG API; however, I can't seem to find the API that allows me to do that. What I am trying to do is to extract features using HoG from my whole dataset (a set number of positive and negative images), then…

opencv computer-vision feature-detection object-recognition feature-extraction

asked Jul 24 '12 at 07:33

sub_o

votes

7 answers

Issue with OneHotEncoder for categorical features

I want to encode 3 categorical features out of 10 features in my datasets. I use preprocessing from sklearn.preprocessing to do so as the following: from sklearn import preprocessing cat_features = ['color', 'director_name', 'actor_2_name'] enc =…

scikit-learn feature-extraction categorical-data

asked Apr 24 '17 at 12:56

Medo

votes

2 answers

Convolutional Neural Network (CNN) for Audio

I have been following the tutorials on DeepLearning.net to learn how to implement a convolutional neural network that extracts features from images. The tutorials are well explained and easy to understand and follow. I want to extend the same CNN to…

neural-network convolution feature-extraction supervised-learning deep-learning

asked Mar 18 '14 at 05:28

moeabdol

votes

7 answers

Are there any fast alternatives to SURF and SIFT for scale-invariant feature extraction?

SURF is patented, as is SIFT. ORB and BRIEF are not patented, but their features are not scale-invariant, seriously limiting their usefulness in complex scenarios. Are there any feature extractors that can extract scale-invariant features as fast as…

opencv computer-vision feature-detection feature-extraction

asked Apr 14 '12 at 22:16

Diego

votes

2 answers

Which OCR Engine is better: Tesseract or OCRopus?

I have tried Tesseract on iPhone and assessed its accuracy to be 70% without image preprocessing. I also noticed that it might be poor at extracting digits. I have heard about the OCRopus OCR engine: which is better, Tesseract or OCRopus, in terms of…

ocr tesseract feature-extraction

asked Apr 05 '12 at 17:08

Ahmed Hussein

votes

3 answers

What does the distance attribute in DMatches mean?

I have a short question: When I do feature matching in OpenCV, what does the distance attribute of DMatches in MatOfMatches mean? I know that I have to filter out matches with a bigger distance because they aren't as good as those with a lower distance. But…

opencv feature-detection feature-extraction

asked Jun 08 '13 at 06:35

stetro

votes

3 answers

scikit-learn TfidfVectorizer meaning?

I was reading about the TfidfVectorizer implementation of scikit-learn, and I don't understand what the output of the method is, for example: new_docs = ['He watches basketball and baseball', 'Julie likes to play basketball', 'Jane loves to play…

machine-learning nlp scikit-learn feature-extraction document-classification

asked Sep 17 '14 at 23:50

anon

votes

5 answers

How are HoG features represented graphically?

I'm implementing the Histogram of Oriented Gradient features from "Histograms of oriented gradients for human detection" and I'd like to visualise the result. All papers on these features use a standard visualisation, but I can't find any…

image-processing opencv computer-vision feature-extraction

asked Nov 02 '12 at 16:02

theotherphil

votes

4 answers

Why do we maximize variance during Principal Component Analysis?

I'm trying to read through PCA and saw that the objective was to maximize the variance. I don't quite understand why. Any explanation of other related topics would be helpful.

machine-learning feature-extraction

asked Sep 12 '12 at 20:10

karthik A

votes

2 answers

Getting feature names from within a FeatureUnion + Pipeline

I am using a FeatureUnion to join features found from the title and description of events: union = FeatureUnion( transformer_list=[ # Pipeline for pulling features from the event's title ('title', Pipeline([ ('selector',…

python-3.x scikit-learn nlp feature-extraction

asked Feb 27 '17 at 06:44

Huey
