Questions tagged [scikits]

Scientific Python toolkits

Scikits is the collective name of a number of Python libraries for various scientific tasks, such as statistics, machine learning and image and audio processing. These scikits build on Numpy and Scipy.

An overview of available scikits is given at http://scikits.appspot.com/scikits

218 questions
38
votes
1 answer

Fitting a scikits.learn.hmm.GaussianHMM to variable length training sequences

I'd like to fit a scikits.learn.hmm.GaussianHMM to training sequences of different length. The fit method, however, prevents using sequences of different length by doing obs = np.asanyarray(obs) which only works on a list of equally shaped arrays.…
Christian
  • 2,214
  • 4
  • 26
  • 37
36
votes
7 answers

Missing values in scikits machine learning

Is it possible to have missing values in scikit-learn ? How should they be represented? I couldn't find any documentation about that.
Vladtn
  • 2,506
  • 3
  • 27
  • 23
29
votes
5 answers

Cannot import scikits-learn even though it seems to be installed

Per the scikit-learn user guide, I installed scikit-learn using pip install -U scikit-learn. So using pip search scikit-learn, I get this search result: scikit-learn - A set of python modules for machine learning and data mining INSTALLED: 0.12.1…
Alan
  • 7,066
  • 5
  • 30
  • 38
28
votes
3 answers

Python statistics package: difference between statsmodel and scipy.stats

I need some advice on selecting statistics package for Python, I've done quite some search, but not sure if I get everything right, specifically on the differences between statsmodels and scipy.stats. One thing that I know is those with scikits…
herrfz
  • 4,814
  • 4
  • 26
  • 37
20
votes
3 answers

How do I create a sklearn.datasets.base.Bunch object in scikit-learn from my own data?

In most of the Scikit-learn algorithms, the data must be loaded as a Bunch object. For many example in the tutorial load_files() or other functions are used to populate the Bunch object. Functions like load_files() expect data to be present in…
David
  • 4,634
  • 7
  • 35
  • 42
14
votes
4 answers

Numpy: How to randomly split/select an matrix into n-different matrices

I have a numpy matrix with shape of (4601, 58). I want to split the matrix randomly as per 60%, 20%, 20% split based on number of rows This is for Machine Learning task I need Is there a numpy function that randomly selects rows?
daydreamer
  • 87,243
  • 191
  • 450
  • 722
14
votes
4 answers

Removing features with low variance using scikit-learn

scikit-learn provides various methods to remove descriptors, a basic method for this purpose has been provided by the given tutorial below, http://scikit-learn.org/stable/modules/feature_selection.html but the tutorial does not provide any method…
jax
  • 3,927
  • 7
  • 41
  • 70
14
votes
2 answers

scikits learn and nltk: Naive Bayes classifier performance highly different

I am comparing two Naive Bayes classifiers: one from NLTK and and one from scikit-learn. I'm dealing with a multi-class classification problem (3 classes: positive (1), negative (-1), and neutral (0)). Without performing any feature selection (that…
D T
  • 677
  • 12
  • 23
13
votes
2 answers

Re-compose a Tensor after tensor factorization

I am trying to decompose a 3D matrix using python library scikit-tensor. I managed to decompose my Tensor (with dimensions 100x50x5) into three matrices. My question is how can I compose the initial matrix again using the decomposed matrix produced…
Jose Ramon
  • 5,572
  • 25
  • 76
  • 152
11
votes
4 answers

How to whiten matrix in PCA

I'm working with Python and I've implemented the PCA using this tutorial. Everything works great, I got the Covariance I did a successful transform, brought it make to the original dimensions not problem. But how do I perform whitening? I tried…
mabounassif
  • 2,311
  • 6
  • 29
  • 46
11
votes
2 answers

How does one install scikits.audiolab 0.11.0 on Windows

I am currently trying to install scikits.audiolab 0.11.0 for Python 3.5.4 on Windows 10 and it requires libsndfile from http://www.mega-nerd.com/libsndfile/. I have installed this into my conda virtual environment so that the file path looks like…
9
votes
3 answers

Iteratively fitting polynomial curve

I want to iteratively fit a curve to data in python with the following approach: Fit a polynomial curve (or any non-linear approach) Discard values > 2 standard deviation from mean of the curve repeat steps 1 and 2 till all values are within…
user308827
  • 21,227
  • 87
  • 254
  • 417
9
votes
1 answer

How to aggregate timeseries in Python?

I have two different timeseries with partially overlapping timestamps: import scikits.timeseries as ts from datetime import datetime a = ts.time_series([1,2,3], dates=[datetime(2010,10,20), datetime(2010,10,21), datetime(2010,10,23)], freq='D') b =…
eumiro
  • 207,213
  • 34
  • 299
  • 261
9
votes
1 answer

Error installing scikits.audiolab when using python setup.py egg_info

I am trying to install scikits.audiolab with using the pip tool. Pip appears to run the command python setup.py egg_info from within the scikits.audiolab source directory. When it does so, I get this…
andrew
  • 2,524
  • 2
  • 24
  • 36
9
votes
3 answers

Python package for signal processing

I am looking for a Python package to perform an efficient Constant Q Transform (ie using an FFT to speed up the process). I found a toolbox named CQ-NSGT/sliCQ Toolbox, but I get the following error: File "build\bdist.win32\egg\nsgt\__init__.py",…
Flavian Hautbois
  • 2,940
  • 6
  • 28
  • 45
1
2 3
14 15