Vowpal Wabbit is a highly scalable, open-source online machine learning system written in C++. It supports, among other features, classification, regression, matrix factorization, multiple loss functions, multiple update strategies, and regularization.
Questions tagged [vowpalwabbit]
301 questions
19
votes
2 answers
How to perform logistic regression using Vowpal Wabbit on a very imbalanced dataset
I am trying to use Vowpal Wabbit for logistic regression. I am not sure if this is the right syntax to do it.
For training, I run:
./vw -d ~/Desktop/new_data.txt --passes 20 --binary --cache_file cache.txt -f lr.vw --loss_function logistic --l1…

user34790
16
votes
1 answer
Correctness of logistic regression in Vowpal Wabbit?
I have started using Vowpal Wabbit for logistic regression, however I am unable to reproduce the results it gives. Perhaps there is some undocumented "magic" it does, but has anyone been able to replicate / verify / check the calculations for…

sling
14
votes
1 answer
Vowpal Wabbit: how to represent categorical features
I have the following data with all categorical variables:
class  education  income  social_standing
1      basic      low     good
0      low        high    V_good
1      high       low     not_good
0 …

Ashok K Harnal
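One common way to approach the categorical-features question above is to emit each column as a single `name=value` token, so that every distinct category hashes to its own feature. The sketch below is only an illustration, not an accepted answer from the thread: the helper `to_vw_line` is hypothetical, and it assumes a binary task where class 0 is mapped to -1, since Vowpal Wabbit's logistic and hinge losses expect labels of -1 and +1.

```python
def to_vw_line(label, features):
    """Format one row as a VW example using categorical name=value tokens.

    Hypothetical helper for illustration; not part of Vowpal Wabbit itself.
    """
    # Assumption: binary classification, so class 0 becomes -1 and class 1 stays +1.
    vw_label = 1 if int(label) == 1 else -1
    # Each column/value pair becomes one token; VW hashes the whole token
    # as a feature name with an implicit value of 1.
    tokens = [f"{name}={value}" for name, value in features.items()]
    return f"{vw_label} | " + " ".join(tokens)


# The three complete rows shown in the question excerpt.
rows = [
    (1, {"education": "basic", "income": "low", "social_standing": "good"}),
    (0, {"education": "low", "income": "high", "social_standing": "V_good"}),
    (1, {"education": "high", "income": "low", "social_standing": "not_good"}),
]

vw_lines = [to_vw_line(label, feats) for label, feats in rows]
for line in vw_lines:
    print(line)
```

The token values must not contain spaces, `|`, or `:`, because those characters are significant in the VW input format.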
14
votes
6 answers
Write a gzip file from a data frame
I'm trying to write a data frame to a gzip file but am having problems.
Here's my code example:
df1 <- data.frame(id = seq(1,10,1), var1 = runif(10), var2 = runif(10))
gz1 <- gzfile("df1.gz","w" )
writeLines(df1)
Error in writeLines(df1) : invalid…

screechOwl
9
votes
2 answers
VowpalWabbit: Differences and scalability
I am trying to ascertain how VowpalWabbit's "state" is maintained as the size of our input set grows. In a typical machine learning environment, if I have 1000 input vectors, I would expect to send all of those at once, wait for a model building…

jayunit100
9
votes
1 answer
How to demo Vowpal Wabbit's contextual bandits in real online mode?
Following the available docs and resources, it is not really clear how to accomplish a simple getting-started flow where you'd launch Vowpal Wabbit as a daemon (possibly even without any pre-learnt model) and have it online learn and explore ― I'm…

matanster
9
votes
1 answer
Get raw predictions from Vowpal Wabbit in daemon mode
I'm starting Vowpal Wabbit in daemon mode with the following command:
vw --loss_function hinge --ect 250 --quiet --passes 5 -b 25 --daemon --port 10001 --pid_file pidfile
This works well and I'm able to get predictions by connecting to the socket…

Osvald Ivarsson
9
votes
1 answer
Vowpal Wabbit inverted_hash option produces empty output, but why?
I'm trying to get a Vowpal Wabbit model saved with inverted hashes. I have a valid model produced with the following:
vw --oaa 2 -b 24 -d mydata.vw --readable_model mymodel.readable
which produces a model file like this:
Version 7.7.0
Min…

Ben Collins
9
votes
1 answer
Calculating AUC when using Vowpal Wabbit
Is there any way to compute AUC within Vowpal Wabbit?
One of the reasons I am using Vowpal Wabbit is the large size of the data file.
I can calculate the AUC outside of the Vowpal Wabbit environment using the output of Vowpal Wabbit but this might be…

nirajanrk
8
votes
1 answer
Vowpal Wabbit: Low-rank matrix factorization?
I have a very basic question. I'd like to do low-rank matrix factorization and I was looking at the Vowpal Wabbit documentation on the topic. My question is:
Is there a difference between these two approaches? (implementation or otherwise)
$ vw…

Kris
8
votes
3 answers
Dealing with class imbalance in multi-label classification
I've seen a few questions on class imbalance in a multiclass setting. However, I have a multi-label problem, so how would you deal with it in this case?
I have a set of around 300k text examples. As mentioned in the title, each example has at least…

richizy
8
votes
4 answers
Errors due to Vowpal Wabbit's dependencies on the Boost library
I'm trying really hard to install Vowpal Wabbit, and it fails when I run the makefile, throwing:
cd library; make; cd ..
g++ -g -o ezexample temp2.cc -L ../vowpalwabbit -l vw -l allreduce -l boost_program_options -l z -l pthread
ld:…

madCode
7
votes
1 answer
Gradient boosting on Vowpal Wabbit
Is there a way to use gradient boosting on regression using Vowpal Wabbit? I use various techniques that come with Vowpal Wabbit that are helpful. I want to try gradient boosting along with that, but I can't find a way to implement gradient boosting…

breadnbutter
7
votes
1 answer
Vowpal Wabbit Logistic Regression
I am performing logistic regression using Vowpal Wabbit on a dataset with 25 features and 48 million instances. I have a question about the current predicted values: should they be between 0 and 1?
average since example example current …

user1586694
7
votes
2 answers
Need some pointers on Vowpal Wabbit with respect to cross validation
My goal is to do a grid search over various VW models in their parameter space (trying different loss functions, regularizations, etc.). Since the model could use multiple passes, I would like to use cross validation. I am wondering if I should…

vkmv