
I'm trying to use the deepset/roberta-base-squad2 model to go through a column of work-related activities and have it answer the question "What are the necessary skills for this job?" However, the model is simply handing me back my context, or my question plus my context, and I'm not sure why.

Here's what I'm running:

import pandas as pd
import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

# Your DataFrame loading code here
# df = pd.read_csv("your_data.csv")

# Load the tokenizer and model once, instead of reloading them for every row
tokenizer = AutoTokenizer.from_pretrained("deepset/roberta-base-squad2")
model = AutoModelForQuestionAnswering.from_pretrained("deepset/roberta-base-squad2")

def generate_skills(question, context):
    # Encode the question/context pair and run the QA head
    inputs = tokenizer(question, context, return_tensors='pt')
    with torch.no_grad():
        outputs = model(**inputs)

    # Most likely start and end token positions of the answer span
    start_index = torch.argmax(outputs.start_logits)
    end_index = torch.argmax(outputs.end_logits) + 1

    tokens = inputs['input_ids'][0][start_index:end_index]
    answer = tokenizer.decode(tokens, skip_special_tokens=True)

    return answer

def generate_skills_for_row(row):
    context = row['top_words']
    question = "What are the necessary skills a data scientist should have?"
    return generate_skills(question, context)

# Create a new column 'skills' based on the 'top_words' column
df['skills'] = df.apply(generate_skills_for_row, axis=1)
Moe_blg
  • Can you show us one of your contexts? I assume the text differs too much from the Wikipedia articles the model was trained on. The model you are using is not a foundation model; it is entirely domain-specific. Maybe you can try Llama or Flan? – cronoik May 03 '23 at 06:17
  • @cronoik Example context: machine learning, data analysis, etl process, design development, data warehouse. This gives an answer such as: machine learning, data analysis, etl process, design development, data warehouse. So it just repeats the context. – Moe_blg May 03 '23 at 06:55
  • That is not a good example to prove my point, since I would say returning the entire context is acceptable (!) in that case. What I was trying to say is that the model was trained on Wikipedia articles, where the context contained enough information to find an answer without knowledge of the topic (note that I'm simplifying here). Your question/context pairs require far more topic knowledge than SQuAD does. If all of your contexts are just lists of words, you could try a zero-shot model, as sketched below. – cronoik May 03 '23 at 07:29
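To illustrate cronoik's zero-shot suggestion, here is a minimal sketch using the zero-shot-classification pipeline with facebook/bart-large-mnli. The candidate skill labels are hypothetical placeholders; you would supply your own skill taxonomy:

from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

# One of the keyword-list contexts from the question
context = "machine learning, data analysis, etl process, design development, data warehouse"
# Hypothetical candidate labels -- replace with your own taxonomy
candidate_skills = ["machine learning", "statistics", "cloud computing", "communication"]

# multi_label=True scores each label independently instead of softmaxing over them
result = classifier(context, candidate_skills, multi_label=True)
for label, score in zip(result["labels"], result["scores"]):
    print(f"{label}: {score:.3f}")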

1 Answer


This is an extractive model (on the model's page it states "Downstream-task: Extractive QA"). That means it simply extracts the answer from the context you give it. If the answer isn't available in the context, you'll get an empty result. It also generally locates the answer in only one place in the context, so if the answer is spread over several places, it will only give you part of the correct answer.
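You can see this behavior with the question-answering pipeline. A minimal sketch: because the model was trained on SQuAD 2.0, passing handle_impossible_answer=True lets it return an empty answer (with a score) when nothing in the context matches:

from transformers import pipeline

qa = pipeline("question-answering", model="deepset/roberta-base-squad2")

result = qa(
    question="What are the necessary skills a data scientist should have?",
    context="machine learning, data analysis, etl process, design development",
    handle_impossible_answer=True,  # allow an empty answer when nothing matches
)
# result is a dict like {'score': ..., 'start': ..., 'end': ..., 'answer': ...}
print(result["answer"], result["score"])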

Also, this model will probably perform better if you fine-tune it for your specific task. Check out the base model's page on HF; they explicitly mention: "You can use the raw model for masked language modeling, but it's mostly intended to be fine-tuned on a downstream task".
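For reference, here is a rough sketch of what that fine-tuning could look like with the Trainer API. It uses a small slice of SQuAD v2 as a stand-in for your own labeled (question, context, answer-span) data, and the hyperparameters are placeholders, not recommendations:

from datasets import load_dataset
from transformers import (AutoModelForQuestionAnswering, AutoTokenizer,
                          DefaultDataCollator, Trainer, TrainingArguments)

model_name = "deepset/roberta-base-squad2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForQuestionAnswering.from_pretrained(model_name)

# Small SQuAD v2 slice as a stand-in for your own labeled data
raw = load_dataset("squad_v2", split="train[:1000]")

def preprocess(examples):
    tokenized = tokenizer(examples["question"], examples["context"],
                          truncation="only_second", max_length=384,
                          padding="max_length", return_offsets_mapping=True)
    start_positions, end_positions = [], []
    for i, offsets in enumerate(tokenized["offset_mapping"]):
        answers = examples["answers"][i]
        if not answers["text"]:  # unanswerable -> point at the <s> token
            start_positions.append(0)
            end_positions.append(0)
            continue
        start_char = answers["answer_start"][0]
        end_char = start_char + len(answers["text"][0])
        seq_ids = tokenized.sequence_ids(i)
        # Map the character span onto token indices within the context;
        # if the answer was truncated away, fall back to (0, 0)
        tok_start, tok_end = 0, 0
        for t, (s, e) in enumerate(offsets):
            if seq_ids[t] != 1:
                continue  # skip question, special, and padding tokens
            if s <= start_char < e:
                tok_start = t
            if s < end_char <= e:
                tok_end = t
        start_positions.append(tok_start)
        end_positions.append(tok_end)
    tokenized["start_positions"] = start_positions
    tokenized["end_positions"] = end_positions
    tokenized.pop("offset_mapping")
    return tokenized

train_dataset = raw.map(preprocess, batched=True, remove_columns=raw.column_names)

# Placeholder hyperparameters -- tune for your data
args = TrainingArguments(output_dir="roberta-squad2-skills",
                         num_train_epochs=1,
                         per_device_train_batch_size=8,
                         learning_rate=3e-5)

trainer = Trainer(model=model, args=args, train_dataset=train_dataset,
                  data_collator=DefaultDataCollator())
trainer.train()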

tt40kiwi