Questions tagged [bart]

14 questions
3 votes, 1 answer

google-cloud/aiplatform vertex AI predictionserviceclient truncated response NodeJS

I am trying to get the aiplatform client working in a NodeJS project. It seems to work: the credentials are fine, and I get a "valid" response back. But the predictions' content comes back truncated (using curl I get the complete list) just the…
2 votes, 1 answer

BART loading from HuggingFace requires logging in

I'm trying to use a pretrained model from HuggingFace. However, I get the following error: OSError: bart-large is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' If this is a private repository, make…
cerebrou
2 votes, 2 answers

Is there a way to retrieve the data from a BART package model in R?

I was wondering if there was a way to retrieve the data from a model built from the BART package in R? It seems to be possible using other bart packages, such as dbarts... but I can't seem to find a way to get the original data back from a BART…
Electrino
1 vote, 1 answer

Using tidymodels in R, my BART workflow changes after I fit it once. Why?

I have been trying to train a BART model using the tidymodels framework but I am running into some problems. I can declare the model, the recipe, and the workflow alright, but once I fit the workflow, two unwanted things happen: The original model…
Martin
1 vote, 0 answers

Retraining of facebook/bart-large-mnli possible?

I want to retrain bart-large-mnli on my own datasets and labels. Currently I am using zero-shot classification to classify my dataset. from transformers import pipeline classifier = pipeline("zero-shot-classification", …
Sunny
1 vote, 1 answer

How to pretrain BART using a custom dataset (not fine-tuning!)

I'm using huggingface transformers 4.19.0. I want to pretrain a BART model on my custom dataset. To be clear, I'm not asking about fine-tuning BART for a downstream task, but about pretraining BART. But I can't find a method or class for…
1 vote, 0 answers

How to write a config.json to train a Language model

Looking at NLP models on Huggingface, I can see that each of them has a so-called config.json file. Now I want to train a BART model from scratch following this repo. To do so, I have to "write" a config.json myself. However, I cannot find any documentation…
JoWohln
0 votes, 0 answers

Empty value for state['args'] when I load model with BARTmodel.from_pretained()

First I train the model with this code: module load libffi source $HOME/env38/bin/activate TOTAL_NUM_UPDATES=20000 WARMUP_UPDATES=500 …
0 votes, 0 answers

DAE pretraining and fine-tuning a BART model from Hugging Face

This is my first NLP task, and I would like to use the BART model and tokenizer from Hugging Face to pretrain and fine-tune. The code is shown below. tokenizer = BertTokenizer.from_pretrained('bert-base-uncased') model =…
VictorZhu
0 votes, 1 answer

set.seed() Machine learning models in R

Can anyone clarify the best procedure for calling set.seed() before running machine learning algorithms? I have built a random forest model, a gbm model and a bart model. Does each of them require a seed for reproducible results? I have not split…
Anjeline
0 votes, 0 answers

How to train a language model for my data

I have a dataset of IDs that are meaningful to me. I want to use language models to generate IDs based on a few IDs that I give as a starting point. Let's say my dataset is like a sequence of IDs in each line separated by whitespace, more…
Ali
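
For the ID-sequence question above, a first preprocessing step might look like the sketch below. It assumes, as the excerpt describes, one whitespace-separated sequence of IDs per line. The names `build_vocab` and `encode` are hypothetical helpers introduced for illustration, not part of any library, and the sample IDs are made up.

```python
def build_vocab(lines):
    """Map each distinct ID to an integer index, in order of first appearance."""
    vocab = {}
    for line in lines:
        for token in line.split():  # IDs are assumed to be whitespace-separated
            if token not in vocab:
                vocab[token] = len(vocab)
    return vocab


def encode(line, vocab):
    """Convert one line of IDs into a list of integer indices."""
    return [vocab[token] for token in line.split()]


# Made-up sample data, for illustration only.
sample_lines = ["id7 id3 id7", "id3 id9"]
vocab = build_vocab(sample_lines)
encoded = [encode(line, vocab) for line in sample_lines]
```

Integer sequences like `encoded` are the usual input to a sequence model. Which model to train on them is a separate choice that this sketch does not address.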
0 votes, 0 answers

IndexError: index out of range in self error while running a pretrained BART model for text summarization

I have the following pipeline for text summarization from a pretrained bart model: summarizer = pipeline("summarization", model="philschmid/bart-large-cnn-samsum") but I get the following error when I try longer documents: "IndexError: index out of…
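
Since the error in the question above appears only with longer documents, one plausible cause (an assumption, as the excerpt is cut off) is input that exceeds the model's maximum input length. A hedged workaround is to split the document into smaller pieces and summarize each piece separately. The helper `chunk_words` below is hypothetical. It counts whitespace-separated words rather than model tokens, so the default budget of 400 is only a rough illustrative value.

```python
def chunk_words(text, max_words=400):
    """Split text into consecutive chunks of at most max_words words."""
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    # Step through the word list in strides of max_words and rejoin each slice.
    return [
        " ".join(words[start:start + max_words])
        for start in range(0, len(words), max_words)
    ]
```

Each returned chunk could then be passed to the `summarizer` from the question. Because words and model tokens do not map one to one, a smaller budget may be needed in practice.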
0 votes, 0 answers

BART for classification in Python

I have a problem fitting a BART model for classification in Python. I have tried to follow https://github.com/JakeColtman/bartpy; unfortunately, I get an error for a multi-class outcome. Does anyone know how to implement BART for multiclass problems in…
0 votes, 1 answer

BART Tokenizer tokenises same word differently?

I have noticed that if I tokenize a full text with many sentences, I sometimes get a different number of tokens than if I tokenize each sentence individually and add up the tokens. I have done some debugging and have this small reproducible example…