Questions tagged [vowpalwabbit]

Vowpal Wabbit is a highly scalable, open source, online machine learning software written in C++. It supports, amongst other features, classification, regression, matrix-factorization, multiple loss functions, multiple update strategies, and regularization.

301 questions
19
votes
2 answers

How to perform logistic regression using vowpal wabbit on very imbalanced dataset

I am trying to use vowpal wabbit for logistic regression. I am not sure if this is the right syntax to do it For training, I do ./vw -d ~/Desktop/new_data.txt --passes 20 --binary --cache_file cache.txt -f lr.vw --loss_function logistic --l1…
user34790
  • 2,020
  • 7
  • 30
  • 37
16
votes
1 answer

Correctness of logistic regression in Vowpal Wabbit?

I have started using Vowpal Wabbit for logistic regression, however I am unable to reproduce the results it gives. Perhaps there is some undocumented "magic" it does, but has anyone been able to replicate / verify / check the calculations for…
sling
  • 163
  • 1
  • 4
14
votes
1 answer

Vowpal Wabbit how to represent categorical features

I have the following data with all categorical variables: class education income social_standing 1 basic low good 0 low high V_good 1 high low not_good 0 …
Ashok K Harnal
  • 1,191
  • 2
  • 15
  • 28
14
votes
6 answers

write a gzip file from data frame

I'm trying to write a data frame to a gzip file but having problems. Here's my code example: df1 <- data.frame(id = seq(1,10,1), var1 = runif(10), var2 = runif(10)) gz1 <- gzfile("df1.gz","w" ) writeLines(df1) Error in writeLines(df1) : invalid…
screechOwl
  • 27,310
  • 61
  • 158
  • 267
9
votes
2 answers

VowpalWabbit: Differences and scalability

I am trying to ascertain how VowpalWabbit's "state" is maintained as the size of our input set grows. In a typical machine learning environment, if I have 1000 input vectors, I would expect to send all of those at once, wait for a model building…
9
votes
1 answer

How to demo Vowpal Wabbit's contextual bandits in real online mode?

Following the available docs and resources, it is not really clear how to accomplish a simple getting-started flow where you'd launch Vowpal Wabbit as a daemon (possibly even without any pre-learnt model) and have it online learn and explore ― I'm…
matanster
  • 15,072
  • 19
  • 88
  • 167
9
votes
1 answer

Get raw predictions from Vowpal Wabbit in daemon mode

I'm starting Vowpal Wabbit in daemon mode with the following command: vw --loss_function hinge --ect 250 --quiet --passes 5 -b 25 --daemon --port 10001 --pid_file pidfile This works well and I'm able to get predictions by connecting to the socket…
Osvald Ivarsson
  • 1,243
  • 9
  • 6
9
votes
1 answer

Vowpal Wabbit inverted_hash option produces empty output, but why?

I'm trying to get a vowpal wabbit model saved with inverted hashes. I have a valid model produced with the following: vw --oaa 2 -b 24 -d mydata.vw --readable_model mymodel.readable which produces a model file like this: Version 7.7.0 Min…
Ben Collins
  • 20,538
  • 18
  • 127
  • 187
9
votes
1 answer

Calculating AUC when using Vowpal Wabbit

Is there anyway to compute AUC within Vowpal Wabbit? One of the reasons I am using Vowpal Wabbit is the large size of the data file. I can calculate the AUC outside of the Vowpal Wabbit environment using the output of Vowpal Wabbit but this might be…
nirajanrk
  • 103
  • 5
8
votes
1 answer

Vowpal Wabbit: Low-rank matrix factorization?

I have a very basic question. I'd like to do low-rank matrix factorization and I was looking at the Vowpal Wabbit documentation on the topic. My question is: Is there a difference between these two approaches? (implementation or otherwise) $ vw…
Kris
  • 22,079
  • 3
  • 30
  • 35
8
votes
3 answers

Dealing with class imbalance in multi-label classification

I've seen a few questions on class imbalance in a multiclass setting. However, I have a multi-label problem, so how would you deal with it in this case? I have a set of around 300k text examples. As mentioned in the title, each example has at least…
8
votes
4 answers

Errors due to vowpal wabbit's dependencies on boost library

I'm trying real hard to install vowpal wobbit and it fails when i run the make file, throwing: cd library; make; cd .. g++ -g -o ezexample temp2.cc -L ../vowpalwabbit -l vw -l allreduce -l boost_program_options -l z -l pthread ld:…
madCode
  • 3,733
  • 5
  • 26
  • 31
7
votes
1 answer

Gradient boosting on Vowpal Wabbit

Is there a way to use gradient boosting on regression using Vowpal Wabbit? I use various techniques that come with Vowpal Wabbit that are helpful. I want to try gradient boosting along with that, but I can't find a way to implement gradient boosting…
breadnbutter
  • 113
  • 5
7
votes
1 answer

Vowpal Wabbit Logistic Regression

I am performing logistic regression using Vowpal Wabbit on a dataset with 25 features and 48 million instances. I have a question on current predict values. Should it be within 0 or 1. average since example example current …
7
votes
2 answers

Need some pointers on Vowpal Wabbit with respect to cross validation

My goal is to do a grid search over various VW models in their parameter space (trying different loss functions and regularizations etc). Since the model could use multiple passes, I would like to use cross validation. I am wondering if I should…
vkmv
  • 1,345
  • 1
  • 14
  • 24
1
2 3
20 21