3

Is it possible to run Apache Pig jobs from within a Java application, without forking an external process?

It seems both Pig and Hadoop are written in Java but don't really offer Java APIs. Rather than relying on shell scripts, I'd rather use these tools from within a Java Spring application.

DeejUK

3 Answers

2

It seems there is a Java API for Pig.

According to this API, there is a PigRunner class.

With that, you could easily add it to your Spring application, by creating a dedicated Spring bean.
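
As a rough sketch of what such a bean might look like: PigJobLauncher and its method names below are hypothetical, not part of Pig or Spring. The Pig call is injected as a function so the class compiles without the Pig jar; the arguments it builds are the usual pig command-line options (-x, -param, -f), which is what PigRunner.run is expected to accept.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.function.ToIntFunction;

/**
 * Hypothetical wrapper that could be registered as a Spring bean.
 * The actual Pig call is injected, so nothing here depends on the Pig jar.
 */
final class PigJobLauncher {

    private final ToIntFunction<String[]> runner;

    PigJobLauncher(ToIntFunction<String[]> runner) {
        this.runner = runner;
    }

    /** Builds the same arguments one would pass to the pig command line. */
    static String[] buildArgs(String execType, String scriptPath, Map<String, String> params) {
        List<String> args = new ArrayList<>();
        args.add("-x");
        args.add(execType);
        for (Map.Entry<String, String> p : params.entrySet()) {
            args.add("-param");
            args.add(p.getKey() + "=" + p.getValue());
        }
        args.add("-f");
        args.add(scriptPath);
        return args.toArray(new String[0]);
    }

    /** Runs the script in-process and returns whatever code the injected runner reports. */
    int run(String execType, String scriptPath, Map<String, String> params) {
        return runner.applyAsInt(buildArgs(execType, scriptPath, params));
    }
}

// With Pig on the classpath, the runner could be wired roughly like this
// (an assumption to check against the Pig version in use):
//   new PigJobLauncher(args -> org.apache.pig.PigRunner.run(args, null).getReturnCode());
```

Injecting the runner also makes the bean easy to unit test without a Hadoop setup, since a test can pass a stub function instead of the real PigRunner call.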

ndeverge
2

From what I've seen in the documentation and examples, the way to do this is to use the PigServer class. They have examples of using it here: http://pig.apache.org/docs/r0.8.1/setup.html#Sample+Code
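
For a feel of the embedded style, here is a minimal sketch loosely modeled on that kind of sample. IdScript and QueryRegistrar are hypothetical names introduced only so the snippet compiles without the Pig jar; the interface mirrors the shape of PigServer.registerQuery(String), and the file names are placeholders.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.List;

final class IdScript {

    /** Hypothetical seam with the same shape as PigServer.registerQuery(String). */
    interface QueryRegistrar {
        void registerQuery(String statement) throws IOException;
    }

    /** Pig Latin statements that load a colon-delimited file and keep the first field. */
    static List<String> statements(String inputFile) {
        return Arrays.asList(
            "A = load '" + inputFile + "' using PigStorage(':');",
            "B = foreach A generate $0 as id;");
    }

    /** Registers each statement in order with the given registrar. */
    static void register(QueryRegistrar pig, String inputFile) throws IOException {
        for (String statement : statements(inputFile)) {
            pig.registerQuery(statement);
        }
    }
}

// With Pig on the classpath, usage would look roughly like this (assumed, not tested here):
//   PigServer pigServer = new PigServer("local");
//   IdScript.register(pigServer::registerQuery, "passwd");
//   pigServer.store("B", "id.out");
```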

NerdyNick
1

See the Spring Hadoop project and its Pig support.