I have a simple Pig script that uses only the FILTER
operator. It looks something like this:
--sample_script.pig
some_data = load './a_file' as (col1:chararray, col2:chararray);
contains_ = filter some_data by (col2 == '1') OR (col2 == '2');
store contains_ into './a_new_file';
When I run this script, it outputs a folder a_new_file with 3 files in it: part-m-00000, part-m-00001, and _SUCCESS.
From what I can gather, the way I have written my script doesn't require a reduce phase, so each map task writes its own part-m file. Is there a different way to write this so that the script outputs only one file?
Thanks.