8

BigQuery can read from Google Drive as a federated source. See here. I want to be able to read a table in BigQuery into my Dataflow pipeline that is pointing to a Drive document.

Hooking up BigQuery to the file in Drive works perfectly fine:

enter image description here

But, when I then try to read that table into my Dataflow pipeline I (understandably) get the following error:

No suitable credentials found to access Google Drive. Contact the table owner for assistance.

[..]
PCollection<TableRow> results = pipeline.apply("whatever",
BigQueryIO.Read.fromQuery("SELECT * from [CPT_7414_PLAYGROUND.google_drive_test]"))
.apply(ParDo.of(new DoFn<TableRow, TableRow>() {
[..]

How do I permission Dataflow to be able to read from a table in BigQuery that is pointing to Drive?

Graham Polley
  • 14,393
  • 4
  • 44
  • 80
  • You can read some details here http://stackoverflow.com/questions/37756157/queries-done-in-sheets-pulling-from-bigquery-permissions-help-needed . As I understand you need to ask for Drive permissions too. – Pavel Kulikov Jun 19 '16 at 11:25

1 Answers1

2

Dataflow does not currently support reading from a federated table backed by Drive, but this is coming soon.

danielm
  • 3,000
  • 10
  • 15