0

I have a parquet file located in hdfs system. I am using webhdfs API to read file but not getting the schema details in proper format.

Any help would be appriciated?

  • You may take advantage from [How do I get schema column names from Parquet file?](https://stackoverflow.com/questions/33883640/) or [Inspect Parquet](https://stackoverflow.com/questions/36140264/). – U880D Oct 20 '20 at 12:09
  • 1
    Hi but this has command line interface. i have to write a java code to execute webhdfs api in read mode and get schema details. Actually i tried but API response is not in correct format. Is there any way that we can get schema details from parquet file using java code from hdfs location? – ProfessionSDET Oct 20 '20 at 14:45
  • 2
    Maybe reviewing `parquet-tools` source (Java) can help? https://github.com/apache/parquet-mr/tree/parquet-1.11.x/parquet-tools – mazaneicha Oct 20 '20 at 17:46
  • but parquet-tools available for unix system but how can i execute these query in window using java code? – ProfessionSDET Oct 21 '20 at 08:59

0 Answers0