0

When accessing google_patents_research.publications table from BigQuery I can filter results by the top_terms column. This column uses Google's machine learning algo to search the text inside a patent and extract the words it finds most meaningful. I was wondering if there is a similar search using BigQuery that can be done on PubMed's database. According to NCBI docs there is a dataset called nih-sra-datastore which should be accessible from BigQuery, but it doesn't show up when searching for it in BigQuery's console search box.

David912
  • 378
  • 1
  • 2
  • 11

1 Answers1

1

If you are referring to nih-sra-datastore as explained on this page The National Center for Biotechnology Information it looks that its making reference to the project-id not the dataset as such.

So in order to have access to it you have to follow these steps (on BigQuery explorer):

  • Click on +ADD DATA
  • Hover over Pin a Project and select Enter a Project Name
  • Add nih-sra-datastore

You will pin this public project to your BigQuery explorer.

Betjens
  • 1,353
  • 2
  • 4
  • 13
  • 1
    Has the bigQuery UI been updated because I'm not seeing 'Pin a project' anywhere in the current dialogue when I click `+ ADD DATA`? I've been trying to figure out how to add `nih-sra-datastore` and haven't been able to figure it out using the docs (https://www.ncbi.nlm.nih.gov/sra/docs/sra-bigquery/) seems like they are out of date or missing a step – Richard J. Acton Oct 13 '22 at 14:12