I have two tables: netpack_busstop has 100,000,000 rows, and ic_card_trade has 100,000 rows. My query is:
SELECT
  count(*)
FROM
  ic_card_trade tmpic
  LEFT JOIN netpack_busstop tmpnp
    ON tmpic.line_no = tmpnp.line_no
    AND tmpic.bus_no = tmpnp.bus_no
This job takes more than 40 minutes to run on Hadoop, which is far too long.
I want the Hive query to run faster, but I don't know how to achieve that in SQL.