3

I would like to retrieve Google Analytics Data using the Google Analytics API. I want to do this with Hadoop Flume to store them on hdfs.

For example, I want to retrieve the result of this http call:

https://www.googleapis.com/analytics/v3/data/ga?ids=ga%3A92525220&start-date=2016-08-25&end-date=2016-08-25&metrics=ga%3Asessions&access_token=XXX

I can find a flume configuration to collect HTTP queries on this post: How to setup a HTTP Source for testing Flume setup?

However I cannot see an http URL and I am confused about how to use it.

Could you provide an example of how to retrieve Google Analytics API data or any API with Hadoop Flume?

Do I need intermediate coding (java, python...) to call the API? Isn't it the role of Flume to directly call the API?

Community
  • 1
  • 1
rom
  • 3,592
  • 7
  • 41
  • 71
  • Questions asking us to recommend or find a book, tool, software library, tutorial or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it. – Linda Lawton - DaImTo Aug 29 '16 at 12:00

1 Answers1

0

You can try Http-source and if it doesn't fit your needs, then go with Custom source

  • In the HTTP source example I give in my post, there is no URL with parameters, so I don't understand how to use it. Is there a url parameter in HTTP source? – rom Sep 12 '16 at 15:41
  • @rom, sorry, my bad. Http source will not work, its a listener for get & post requests. You have to go with custom Flume source. Flume doc has a good example to start with it – Mykhail Martsyniuk Sep 29 '16 at 11:05