parquet-tools
is just one module of parquet-mr
. It depends on some of the other modules.
When you build from a source version that corresponds to a release, those other modules will be available to Maven, because release artifacts are published as a part of the release process.
However, when building from a snapshot version, you have to make those dependencies available yourself. There are two ways to do so:
Option 1: Build and install all modules of the parent directory:
git clone https://github.com/apache/parquet-mr
cd parquet-mr
mvn install -Plocal
This will put the snapshot artifacts in your local ~/.m2
directory. Subsequently, you can (re)build just parquet-tools
like you initially tried, because now the snapshot artifacts will already be available from ~/.m2
.
Option 2: Build the parquet-mr
modules from the parent directory, while asking Maven to build needed modules as well along the way:
git clone https://github.com/apache/parquet-mr
cd parquet-mr
mvn package -pl parquet-tools -am -Plocal
Option 1 will build more projects than option 2, so if you only need parquet-tools
, you are better off with the latter. Please note though that probably both will require installation of a thrift compiler.