Questions tagged [huggingface]

The huggingface tag can be used for all libraries made by Hugging Face. Please ALWAYS use the more specific tags huggingface-transformers, huggingface-tokenizers, or huggingface-datasets if your question concerns one of those libraries.

606 questions
20 votes, 1 answer

Early stopping in Bert Trainer instances

I am fine-tuning a BERT model for a multiclass classification task. My problem is that I don't know how to add "early stopping" to those Trainer instances. Any ideas?
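A minimal sketch of the usual approach, using transformers' EarlyStoppingCallback; `model`, `train_dataset` and `eval_dataset` are placeholders for the question's own objects, and argument names assume a reasonably recent transformers release:

```python
from transformers import EarlyStoppingCallback, Trainer, TrainingArguments

# Early stopping needs periodic evaluation plus "load best model at end".
training_args = TrainingArguments(
    output_dir="./results",
    evaluation_strategy="epoch",   # evaluate every epoch
    save_strategy="epoch",         # checkpointing must match the eval strategy
    load_best_model_at_end=True,
    metric_for_best_model="eval_loss",
    greater_is_better=False,
)

trainer = Trainer(
    model=model,                   # the fine-tuned BERT model from the question
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
    # Stop if eval_loss does not improve for 3 consecutive evaluations.
    callbacks=[EarlyStoppingCallback(early_stopping_patience=3)],
)
trainer.train()
```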
13 votes, 1 answer

No module named 'huggingface_hub.snapshot_download'

When I try to run the quick start notebook of this repo, I get the error ModuleNotFoundError: No module named 'huggingface_hub.snapshot_download'. How can I fix it? I already installed huggingface_hub using pip. I get the error after compiling the…
albus_c • 6,292 • 14 • 36 • 77
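One common cause is code written against an older huggingface_hub that imports the function from a submodule which no longer exists in newer releases. A hedged sketch of the usual workaround (the version number in the comment is illustrative only):

```python
# Recent huggingface_hub versions expose snapshot_download at the package root,
# so importing it from there avoids the missing-submodule error:
from huggingface_hub import snapshot_download

# Download a full model repository into the local cache and get its path.
local_dir = snapshot_download(repo_id="bert-base-uncased")
print(local_dir)

# If the failing import lives in a third-party repo you cannot edit, pinning
# huggingface_hub to the version that repo expects is the other option:
#   pip install "huggingface_hub==0.10.1"   # illustrative version, check the repo's requirements
```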
10 votes, 2 answers

How do I save a Huggingface dataset?

How do I write a HuggingFace dataset to disk? I have made my own HuggingFace dataset using a JSONL file: Dataset({ features: ['id', 'text'], num_rows: 18 }) I would like to persist the dataset to disk. Is there a preferred way to do this? Or, is…
Campbell Hutcheson • 549 • 2 • 4 • 12
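A minimal sketch of the standard round trip with the datasets library; the file and directory names are placeholders:

```python
from datasets import load_dataset, load_from_disk

# Build the dataset from the JSONL file, as in the question.
dataset = load_dataset("json", data_files="my_data.jsonl", split="train")

# Persist it to a directory (Arrow files plus metadata)...
dataset.save_to_disk("my_dataset_dir")

# ...and load it back later without re-processing the JSONL.
reloaded = load_from_disk("my_dataset_dir")
print(reloaded)
```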
7 votes, 5 answers

SSLError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /dslim/bert-base-NER/resolve/main/tokenizer_config.json

I am facing the issue below while loading a pretrained BERT model from HuggingFace due to an SSL certificate error. Error: SSLError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url:…
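This usually points at a proxy or corporate certificate chain rather than at transformers itself. A rough sketch of two commonly suggested workarounds, with all paths as placeholders:

```python
import os

# Option 1: point the underlying requests/urllib3 stack at the corporate CA
# bundle BEFORE importing transformers (path is a placeholder).
os.environ["REQUESTS_CA_BUNDLE"] = "/path/to/corporate-ca-bundle.pem"

from transformers import AutoModelForTokenClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("dslim/bert-base-NER")
model = AutoModelForTokenClassification.from_pretrained("dslim/bert-base-NER")

# Option 2: download the repository on a machine where SSL works (for example
# with huggingface_hub.snapshot_download or git clone) and then load it from
# the local folder instead of huggingface.co:
# tokenizer = AutoTokenizer.from_pretrained("/path/to/local/bert-base-NER")
```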
6 votes, 2 answers

How can I solve ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1` when using Huggingface's TrainingArguments?

I'm using the transformers library in Google Colab, and when I use TrainingArguments from the transformers library I get an ImportError with this code: from transformers import TrainingArguments training_args = TrainingArguments( …
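The usual fix is simply installing a new enough accelerate and then restarting the Colab runtime so the upgraded package is actually imported. A small sketch of how to verify it afterwards:

```python
# In a Colab cell, upgrade both packages together, then RESTART the runtime:
#   !pip install -U transformers "accelerate>=0.20.1"

# After restarting, check the installed version before building TrainingArguments.
import importlib.metadata

print(importlib.metadata.version("accelerate"))   # should print 0.20.1 or higher

from transformers import TrainingArguments
training_args = TrainingArguments(output_dir="test_trainer")
```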
6 votes, 4 answers

How to fix the NSFW error for Stable Diffusion?

I always get the "Potential NSFW content was detected in one or more images. A black image will be returned instead. Try again with a different prompt and/or seed." error when using stable diffusion, even with the code that was given on…
Niklas Mohler • 61 • 1 • 1 • 3
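The black image comes from the pipeline's safety checker. A minimal sketch of loading a diffusers pipeline without it (model id as in the standard examples; whether disabling the filter is appropriate depends on your use case):

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    torch_dtype=torch.float16,
    safety_checker=None,            # disables the NSFW filter
    requires_safety_checker=False,  # suppresses the corresponding warning
).to("cuda")

image = pipe("a photograph of an astronaut riding a horse").images[0]
image.save("astronaut.png")
```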
6 votes, 1 answer

StableDiffusion Colab - How to "make sure you're logged in with `huggingface-cli login`?"

I'm trying to run the Colab example of the Huggingface StableDiffusion generative text-to-image…
Twenkid • 825 • 7 • 15
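In a notebook the interactive login widget is usually the easiest route; a short sketch (the token string is a placeholder, created under your Hugging Face account settings):

```python
# Opens a widget where you paste a token from https://huggingface.co/settings/tokens
from huggingface_hub import notebook_login
notebook_login()

# Alternatively, log in non-interactively with the token itself:
# from huggingface_hub import login
# login(token="hf_...")   # placeholder token
```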
6 votes, 3 answers

Does Huggingface's "resume_from_checkpoint" work?

I currently have my trainer set up as: training_args = TrainingArguments( output_dir=f"./results_{model_checkpoint}", evaluation_strategy="epoch", learning_rate=5e-5, per_device_train_batch_size=4, per_device_eval_batch_size=4, …
Penguin • 1,923 • 3 • 21 • 51
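It does, provided checkpoints were actually saved (a save_strategy/save_steps setting) and you pass the argument to train(), not to TrainingArguments. A minimal sketch, where `trainer` is the Trainer built in the question:

```python
# True makes the Trainer pick up the most recent checkpoint-* folder in output_dir;
# a path resumes from that specific checkpoint instead.
trainer.train(resume_from_checkpoint=True)
# trainer.train(resume_from_checkpoint="./results_.../checkpoint-500")
```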
5 votes, 0 answers

Starcoder finetuning - How to select the GPU and how to estimate the time it will take to finetune

I'd like to finetune Starcoder (https://huggingface.co/bigcode/starcoder) on my dataset on a GCP VM instance. It says in the documentation that for training the model they used 512 Tesla A100 GPUs and it took 24 days. I also saw the model…
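For the "select the GPU" part, the standard mechanism is restricting CUDA visibility before anything initializes CUDA; for the time estimate there is no formula, so timing a short run and extrapolating is the usual approach. A rough sketch:

```python
import os

# Restrict training to one chosen GPU on the VM; must be set before
# torch/transformers touch CUDA.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"

import torch
print(torch.cuda.get_device_name(0))

# Rough time estimate: time a few hundred training steps on your own data,
# then extrapolate:  total_steps / measured_steps_per_second ≈ wall-clock seconds.
```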
5 votes, 1 answer

Which HuggingFace summarization models support more than 1024 tokens? Which model is more suitable for programming related articles?

If this is not the best place to ask this question, please point me to a more appropriate one. I am planning to use one of the Huggingface summarization models (https://huggingface.co/models?pipeline_tag=summarization) to summarize my lecture video…
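Long-input checkpoints such as LED (allenai/led-base-16384) or the LongT5 family accept inputs well beyond the usual 1024 tokens. A small sketch using the summarization pipeline; the transcript path is a placeholder:

```python
from transformers import pipeline

# LED (Longformer Encoder-Decoder) handles up to 16384 input tokens for this checkpoint.
summarizer = pipeline("summarization", model="allenai/led-base-16384")

long_text = open("lecture_transcript.txt").read()  # placeholder path
print(summarizer(long_text, max_length=256, min_length=64, truncation=True))
```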
4 votes, 2 answers

Why does llama-index still require an OpenAI key when using Hugging Face local embedding model?

I am creating a very simple question and answer app based on documents using llama-index. Previously, I had it working with OpenAI. Now I want to try using no external APIs so I'm trying the Hugging Face example in this link. It says in the example…
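The usual explanation is that llama-index still defaults to an OpenAI LLM even when the embedding model is local, so the LLM has to be disabled or replaced as well. A rough sketch, assuming a recent llama-index with the Settings API and the HuggingFace embedding extra installed; note that with the LLM disabled, querying returns mock text unless you plug in a real local LLM:

```python
from llama_index.core import Settings, SimpleDirectoryReader, VectorStoreIndex
from llama_index.embeddings.huggingface import HuggingFaceEmbedding

# Local embedding model (name is just an example).
Settings.embed_model = HuggingFaceEmbedding(model_name="BAAI/bge-small-en-v1.5")
# Disable the default OpenAI LLM so no API key is needed for indexing/retrieval.
Settings.llm = None

documents = SimpleDirectoryReader("./data").load_data()   # placeholder folder
index = VectorStoreIndex.from_documents(documents)
query_engine = index.as_query_engine()
print(query_engine.query("What does the document say about X?"))
```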
4 votes, 1 answer

Target modules for applying PEFT / LoRA on different models

I am looking at a few different examples of using PEFT on different models. The LoraConfig object contains a target_modules array. In some examples, the target modules are ["query_key_value"], sometimes it is ["q", "v"], sometimes something else. I…
ahron • 803 • 6 • 29
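The names depend entirely on how the architecture labels its attention projections ("query_key_value" for GPT-NeoX/Falcon-style models, "q_proj"/"v_proj" for LLaMA/OPT-style models, "q"/"v" for T5, and so on). Listing the Linear modules of the concrete model shows which names exist; a small sketch with an OPT checkpoint as the example:

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")

# Collect the distinct names of all Linear submodules.
linear_names = {name.split(".")[-1]
                for name, module in model.named_modules()
                if module.__class__.__name__ == "Linear"}
print(linear_names)   # e.g. {'q_proj', 'k_proj', 'v_proj', 'out_proj', 'fc1', 'fc2', ...}

# Then pick the attention projections for LoRA:
# LoraConfig(target_modules=["q_proj", "v_proj"], ...)
```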
4 votes, 1 answer

What is the official way to run a wandb sweep with hugging face (HF) transformers so that all the HF features work e.g. distributed training?

Initially I wanted to set up a Hugging Face run such that, if the user wanted to run a sweep, they could (merging the sweep parameters with the command-line arguments given) or just execute the run with the arguments from the command line. The merging is so that the train…
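A rough sketch of the common pattern: a wandb sweep whose train function reads wandb.config and builds TrainingArguments with report_to="wandb", so sweep parameters and Trainer logging share one run. The sweep values are illustrative and `model`, `train_dataset`, `eval_dataset` are placeholders for the question's own objects:

```python
import wandb
from transformers import Trainer, TrainingArguments

sweep_config = {
    "method": "random",
    "metric": {"name": "eval/loss", "goal": "minimize"},
    "parameters": {
        "learning_rate": {"values": [1e-5, 3e-5, 5e-5]},
        "num_train_epochs": {"values": [2, 3]},
    },
}

def train():
    # Each sweep run receives its hyperparameters through wandb.config.
    with wandb.init() as run:
        args = TrainingArguments(
            output_dir="./sweep_out",
            report_to="wandb",              # Trainer logs into the same wandb run
            learning_rate=run.config.learning_rate,
            num_train_epochs=run.config.num_train_epochs,
            evaluation_strategy="epoch",
        )
        trainer = Trainer(model=model, args=args,
                          train_dataset=train_dataset, eval_dataset=eval_dataset)
        trainer.train()

sweep_id = wandb.sweep(sweep_config, project="hf-sweep-demo")
wandb.agent(sweep_id, function=train, count=5)
```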
4 votes, 3 answers

Getting RuntimeError: expected scalar type Half but found Float on AWS P3 instances when fine-tuning opt-6.7B

I have a simple script which takes an opt-6.7B model and fine-tunes it. When I run this code in Google Colab (Tesla T4, 16 GB) it runs without any problem. But when I try to run the same code in an AWS p3.2xlarge environment (Tesla V100 GPU, 16 GB) it…
SRC • 2,123 • 3 • 31 • 44
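A commonly reported workaround for this dtype mismatch on V100s (which lack bfloat16 support) is to wrap training in an autocast context, so fp16 and fp32 modules can be mixed; `trainer` is the Trainer from the question:

```python
import torch

# Mixed fp16/fp32 modules from 8-bit loading can trigger the Half/Float mismatch;
# autocast lets the forward pass cast between them.
with torch.autocast("cuda"):
    trainer.train()

# Alternatively, keep the sensitive layers (LayerNorms, lm_head) in fp32 when
# preparing the 8-bit model, as peft's prepare_model_for_int8_training did in
# older peft releases.
```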
4 votes, 1 answer

How to fine tune a Huggingface Seq2Seq model with a dataset from the hub?

I want to train the "flax-community/t5-large-wikisplit" model with the "dxiao/requirements-ner-id" dataset (just for some experiments). I think my general procedure is not correct, but I don't know how to go further. My code: Load tokenizer and…
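A hedged sketch of the usual Seq2SeqTrainer recipe with a hub dataset, assuming a recent transformers release; the "source_text"/"target_text" column names are placeholders that have to be mapped onto whatever input/output fields the chosen dataset actually provides:

```python
from datasets import load_dataset
from transformers import (AutoModelForSeq2SeqLM, AutoTokenizer,
                          DataCollatorForSeq2Seq, Seq2SeqTrainer,
                          Seq2SeqTrainingArguments)

model_name = "flax-community/t5-large-wikisplit"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

raw = load_dataset("dxiao/requirements-ner-id")

def preprocess(batch):
    # Placeholder field names: adapt to the dataset's real columns.
    model_inputs = tokenizer(batch["source_text"], max_length=256, truncation=True)
    labels = tokenizer(text_target=batch["target_text"], max_length=256, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = raw.map(preprocess, batched=True,
                    remove_columns=raw["train"].column_names)

args = Seq2SeqTrainingArguments(
    output_dir="./t5-wikisplit-finetuned",
    per_device_train_batch_size=4,
    num_train_epochs=1,
    evaluation_strategy="epoch",
    predict_with_generate=True,
)

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized.get("validation", tokenized["train"]),
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    tokenizer=tokenizer,
)
trainer.train()
```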