Questions tagged [distilbert]

37 questions
38 votes • 5 answers

ValueError: TextEncodeInput must be Union[TextInputSequence, Tuple[InputSequence, InputSequence]] - tokenizing BERT / DistilBERT error

def split_data(path): df = pd.read_csv(path) return train_test_split(df , test_size=0.1, random_state=100) train, test = split_data(DATA_DIR) train_texts, train_labels = train['text'].to_list(), train['sentiment'].to_list() test_texts,…
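
This error from the fast tokenizers usually means a non-string value (NaN or None from the CSV) slipped into the text list. A minimal sketch of the usual fix, assuming the 'text' column from the excerpt and a hypothetical file path:

import pandas as pd
from transformers import DistilBertTokenizerFast

tokenizer = DistilBertTokenizerFast.from_pretrained("distilbert-base-uncased")

df = pd.read_csv("reviews.csv")                 # hypothetical path
df = df.dropna(subset=["text"])                 # NaN rows break the fast tokenizer
train_texts = df["text"].astype(str).to_list()  # every element must be a str

encodings = tokenizer(train_texts, truncation=True, padding=True)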
7 votes • 1 answer

Text generation using huggingface's distilbert models

I've been struggling with huggingface's DistilBERT model for some time now, since the documentation seems very unclear and their examples (e.g.…
George • 3,521 • 4 • 30 • 75
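
DistilBERT is a masked language model, so it does not generate text left to right; out of the box it can only fill in masked tokens. A hedged sketch of both options, with distilgpt2 as the usual stand-in when free-form generation is actually needed:

from transformers import pipeline

# DistilBERT can only predict [MASK] positions.
unmasker = pipeline("fill-mask", model="distilbert-base-uncased")
print(unmasker("HuggingFace is creating a [MASK] that the community uses."))

# For open-ended generation, a causal model is the usual choice instead.
generator = pipeline("text-generation", model="distilgpt2")
print(generator("HuggingFace is creating a", max_length=20))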
6 votes • 2 answers

What's the meaning of "Using bos_token, but it is not set yet."?

When I run the demo.py from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("distilbert-base-multilingual-cased") model = AutoModel.from_pretrained("distilbert-base-multilingual-cased", return_dict=True) #…
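
The message is only the tokenizer reporting that DistilBERT defines no bos_token or eos_token (it uses [CLS] and [SEP] instead), so asking for them returns None. A small sketch of how one might inspect or, if needed, alias them; treating [CLS]/[SEP] as bos/eos is an assumption, not something the model requires:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-multilingual-cased")
print(tokenizer.special_tokens_map)   # no 'bos_token' / 'eos_token' entries for DistilBERT

# Optional: silence the warning by aliasing the existing special tokens.
tokenizer.bos_token = tokenizer.cls_token
tokenizer.eos_token = tokenizer.sep_token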
4 votes • 0 answers

Hugging Face DistilBERT classification using multiprocessing

I am trying to use torch multiprocessing to parallelize the predictions from two separate Hugging Face DistilBERT classification models. It seems to be deadlocked at the prediction step. I am using Python 3.6.5, torch 1.5.0, and huggingface…
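
Without the full code this is only a guess, but deadlocks like this often come from fork-based workers inheriting torch state. A hedged sketch of a common workaround: spawn start method, load each model inside its worker, and cap intra-op threads (model name and texts are placeholders):

import torch
import torch.multiprocessing as mp
from transformers import DistilBertForSequenceClassification, DistilBertTokenizerFast

def predict(model_name, texts, queue):
    torch.set_num_threads(1)   # avoid thread contention between workers
    tok = DistilBertTokenizerFast.from_pretrained(model_name)
    model = DistilBertForSequenceClassification.from_pretrained(model_name)
    model.eval()
    with torch.no_grad():
        batch = tok(texts, padding=True, truncation=True, return_tensors="pt")
        queue.put(model(**batch).logits.argmax(dim=-1).tolist())

if __name__ == "__main__":
    mp.set_start_method("spawn", force=True)   # fork + torch is a frequent deadlock source
    queue = mp.Queue()
    texts = ["great movie", "terrible movie"]
    procs = [mp.Process(target=predict, args=("distilbert-base-uncased", texts, queue))
             for _ in range(2)]
    for p in procs:
        p.start()
    results = [queue.get() for _ in procs]   # drain the queue before joining
    for p in procs:
        p.join()
    print(results)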
2 votes • 0 answers

How can I train distilBERT more efficiently on my large text classification task?

I've been thrown into the deep end a bit with a task at work. I need to use DistilBERT for a multi-class text classification problem, but here's the kicker: the dataset is gigantic, we're talking millions of samples! I've been messing around with…
user22280248
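
With millions of samples the usual levers in the Trainer API are mixed precision, a larger effective batch via gradient accumulation, and (not shown here) streaming the dataset instead of loading it all. A hedged sketch under assumed column names ('text', 'label') and a hypothetical train.csv:

from datasets import load_dataset
from transformers import (DistilBertForSequenceClassification,
                          DistilBertTokenizerFast, Trainer, TrainingArguments)

tokenizer = DistilBertTokenizerFast.from_pretrained("distilbert-base-uncased")
model = DistilBertForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=5)   # num_labels is a placeholder

dataset = load_dataset("csv", data_files={"train": "train.csv"})["train"]
dataset = dataset.map(lambda ex: tokenizer(ex["text"], truncation=True), batched=True)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=32,
    gradient_accumulation_steps=4,   # effective batch of 128
    fp16=True,                       # mixed precision on GPU
    num_train_epochs=1,
)
Trainer(model=model, args=args, train_dataset=dataset, tokenizer=tokenizer).train()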
2 votes • 1 answer

ValueError: The model did not return a loss from the inputs, only the following keys: logits

I want to make a metaphor detector model. As a pretrained model, I use a DistilBERT model that I have previously fine-tuned with masking (this is the model I use to make the new metaphor detection model). The new model is the one that gives me an…
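
That error usually means the Trainer never received a column named "labels" (or "label"), so the forward pass can only return logits. A small sketch of the typical fix with a toy, hypothetical dataset; renaming the label column is the key step:

from datasets import Dataset
from transformers import DistilBertTokenizerFast

tokenizer = DistilBertTokenizerFast.from_pretrained("distilbert-base-uncased")

# Toy data; the Trainer looks for a column literally named "labels" (or "label").
ds = Dataset.from_dict({"text": ["a metaphor", "plain text"], "is_metaphor": [1, 0]})
ds = ds.rename_column("is_metaphor", "labels")
ds = ds.map(lambda ex: tokenizer(ex["text"], truncation=True, padding="max_length"),
            batched=True)
# With `labels` present, model(**batch) returns a loss alongside the logits.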
2 votes • 0 answers

HuggingFace Trainer() does nothing - only on Vertex AI workbench, works on colab

I am having issues getting the Hugging Face Trainer() to actually do anything in Vertex AI Workbench notebooks. I'm totally stumped and have no idea how to even begin to debug this. I made this small notebook:…
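
Hard to diagnose without logs, but two things worth ruling out in a fresh, headless environment (these are hedged guesses, not a confirmed fix) are a logging integration waiting for interactive input and the job silently running on CPU:

import torch
from transformers import TrainingArguments

print(torch.cuda.is_available())   # confirm the notebook actually sees a GPU

args = TrainingArguments(
    output_dir="out",
    report_to="none",      # disable wandb/tensorboard callbacks that may block on credentials
    disable_tqdm=False,    # keep the progress bar so "doing nothing" is at least visible
    logging_steps=10,
)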
2 votes • 1 answer

How to get output_attentions of a pretrained Distilbert Model?

I am using a pretrained DistilBert model: from transformers import TFDistilBertModel,DistilBertConfig dbert = 'distilbert-base-uncased' config = DistilBertConfig(max_position_embeddings=256 , dropout=0.2, …
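
With the TF classes from the excerpt, attentions are returned once output_attentions=True is set in the config or forward call. A minimal sketch (model name as in the question, input text is arbitrary):

from transformers import DistilBertTokenizerFast, TFDistilBertModel

tokenizer = DistilBertTokenizerFast.from_pretrained("distilbert-base-uncased")
model = TFDistilBertModel.from_pretrained("distilbert-base-uncased",
                                          output_attentions=True)

inputs = tokenizer("Attention please", return_tensors="tf")
outputs = model(inputs)
print(len(outputs.attentions))       # one tensor per layer: 6 for DistilBERT
print(outputs.attentions[0].shape)   # (batch, heads, seq_len, seq_len)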
2 votes • 3 answers

How to use DistilBERT Huggingface NLP model to perform sentiment analysis on new data?

I am using DistilBERT to do sentiment analysis on my dataset. The dataset contains text and a label for each row which identifies whether the text is a positive or negative movie review (eg: 1 = positive and 0 = negative). Here is the code from the…
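
Once the model is fine-tuned, scoring new data is tokenize, forward pass, argmax. A hedged PyTorch sketch, assuming the fine-tuned checkpoint was saved to a hypothetical ./sentiment-model directory:

import torch
from transformers import DistilBertForSequenceClassification, DistilBertTokenizerFast

model_dir = "./sentiment-model"   # hypothetical fine-tuned checkpoint
tokenizer = DistilBertTokenizerFast.from_pretrained(model_dir)
model = DistilBertForSequenceClassification.from_pretrained(model_dir)
model.eval()

new_reviews = ["An absolute masterpiece.", "Two hours I will never get back."]
batch = tokenizer(new_reviews, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    preds = model(**batch).logits.argmax(dim=-1)
print(preds.tolist())   # 1 = positive, 0 = negative, per the question's labels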
2 votes • 1 answer

Input to reshape doesn't match requested shape

I know others have posted similar questions already, but I couldn't find a solution that was appropriate here. I've written a custom keras layer to average outputs from DistilBert based on a mask. That is, I have dim=[batch_size, n_tokens_out, 768]…
bmt • 21 • 2
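
Reshape mismatches in this kind of layer usually come from the mask being rank 2 while the hidden states are rank 3; broadcasting the mask to [batch, n_tokens, 1] avoids any explicit reshape. A hedged Keras sketch of masked mean pooling, not the asker's exact layer:

import tensorflow as tf

class MaskedAverage(tf.keras.layers.Layer):
    """Mean-pool DistilBERT token embeddings, ignoring padded positions."""
    def call(self, hidden_states, attention_mask):
        # hidden_states: [batch, n_tokens, 768]; attention_mask: [batch, n_tokens]
        mask = tf.cast(attention_mask, hidden_states.dtype)[:, :, tf.newaxis]
        summed = tf.reduce_sum(hidden_states * mask, axis=1)     # [batch, 768]
        counts = tf.maximum(tf.reduce_sum(mask, axis=1), 1e-9)   # avoid divide-by-zero
        return summed / counts

pooled = MaskedAverage()(tf.random.uniform((2, 8, 768)), tf.ones((2, 8), tf.int32))
print(pooled.shape)   # (2, 768)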
2 votes • 1 answer

cannot import name 'DISTILBERT_PRETRAINED_MODEL_ARCHIVE_MAP' from 'transformers.modeling_distilbert'

I am trying to train the DistilBERT model for question answering. I have installed simpletransformers and everything, but when I try to run the following command: model = QuestionAnsweringModel('distilbert', …
1 vote • 0 answers

DistilBert for self-supervision - switch heads for pre-training: MaskedLM and SequenceClassification

Say I want to train a model for sequence classification. And so I define my model to be: model = DistilBertForSequenceClassification.from_pretrained("bert-base-uncased") My question is - what would be the optimal way if I want to pre-train this…
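
The encoder weights are shared between the two head classes, so one common pattern (hedged, not the only valid one) is to pre-train with the MLM head, save, then reload the same directory into the classification class; only the new head comes back randomly initialized:

from transformers import DistilBertForMaskedLM, DistilBertForSequenceClassification

# Stage 1: domain-specific masked-LM pre-training (training loop omitted).
mlm_model = DistilBertForMaskedLM.from_pretrained("distilbert-base-uncased")
# ... run MLM training here ...
mlm_model.save_pretrained("./distilbert-mlm")

# Stage 2: reuse the adapted encoder; the classification head is freshly initialized.
clf_model = DistilBertForSequenceClassification.from_pretrained(
    "./distilbert-mlm", num_labels=2)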
1 vote • 1 answer

Difference between from_config and from_pretrained in HuggingFace

num_labels = 3 if task.startswith("mnli") else 1 if task=="stsb" else 2 preconfig = DistilBertConfig(n_layers=6) model1 = AutoModelForSequenceClassification.from_config(preconfig) model2 =…
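
The practical difference is weight initialization: from_config builds the architecture with random weights, while from_pretrained loads the trained checkpoint. A small sketch to make that visible; the weight comparison is illustrative:

import torch
from transformers import AutoModelForSequenceClassification, DistilBertConfig

config = DistilBertConfig(n_layers=6, num_labels=2)
model_random = AutoModelForSequenceClassification.from_config(config)       # random init
model_trained = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2)                                # pretrained encoder

w1 = model_random.distilbert.embeddings.word_embeddings.weight
w2 = model_trained.distilbert.embeddings.word_embeddings.weight
print(torch.allclose(w1, w2))   # False: only from_pretrained carries learned weights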
1 vote • 1 answer

PyTorch - FineTuning bert - Oscillating loss - Very bad accuracy

I have been trying to train a model on vulnerability detection through source code. And, after a little bit of searching, I thought a very good starting point could be using a pre-trained transformer model from HuggingFace with PyTorch and…
Desperados • 434 • 5 • 13
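
Oscillating loss when fine-tuning BERT-style models is most often a learning-rate issue; the commonly cited recipe (hedged, the numbers here are illustrative defaults rather than something tuned to the asker's data) is a small learning rate with AdamW plus linear warmup:

from torch.optim import AdamW
from transformers import DistilBertForSequenceClassification, get_linear_schedule_with_warmup

model = DistilBertForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2)

num_training_steps = 3 * 1000   # placeholder: epochs * len(train_loader)
optimizer = AdamW(model.parameters(), lr=2e-5, weight_decay=0.01)
scheduler = get_linear_schedule_with_warmup(
    optimizer,
    num_warmup_steps=int(0.1 * num_training_steps),   # 10% warmup steadies early updates
    num_training_steps=num_training_steps,
)
# In the training loop: loss.backward(); optimizer.step(); scheduler.step(); optimizer.zero_grad()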
1 vote • 1 answer

How to solve this problem with the DistilBERT tokenizer?

from transformers import DistilBertTokenizerFast tokenizer = DistilBertTokenizerFast.from_pretrained('distilbert-base-uncased') tokenized_input = tokenizer( sentences, truncation=True, is_split_into_words=True, padding='max_length',…
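
With is_split_into_words=True the tokenizer expects each example to already be a list of word strings, and every element must be a str (no NaN/None). A small sketch of the expected input shape, with toy sentences:

from transformers import DistilBertTokenizerFast

tokenizer = DistilBertTokenizerFast.from_pretrained("distilbert-base-uncased")

# Each example is a list of pre-split words (typical for token classification).
sentences = [["Hello", "world"], ["DistilBERT", "tokenizers", "are", "fast"]]

encoded = tokenizer(
    sentences,
    is_split_into_words=True,
    truncation=True,
    padding="max_length",
    max_length=32,
)
print(encoded.word_ids(batch_index=1))   # maps subword pieces back to word positions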