correct query design? cross joins driving ad-hoc reporting interface

Question

I'm hoping some of the more experienced database/dwh developers or DBAs can weigh in on this one:

My team is using OBIEE as a front-end tool to drive ad-hoc reporting being done by our business units.

There is a lot of latency when generating sets that are relatively small. We are facing ~1 hour to produce ~50k records.

I looked into one of the queries that is behaving this way, and I was surprised to find that all of the tables being referenced are being cross-joined, and then filters are being applied in the WHERE clause.

So, to illustrate, the queries tend to look like this:

SELECT ...
FROM tbl1
    ,tbl2
    ,tbl3
    ,tbl4
WHERE tbl1.col1 = tbl2.col1
and tbl3.col2 = tbl2.col2
and tbl4.col3 = tbl3.col3

instead of like this:

SELECT ...
FROM tbl1
INNER JOIN tbl2
    ON tbl1.col1 = tbl2.col1
INNER JOIN tbl3
    ON tbl3.col2 = tbl2.col2
INNER JOIN tbl4
    ON tbl4.col3 = tbl3.col3

Now, from what I know about the order of query operations, the FROM clause gets performed before the WHERE clause, so the first example would perform much more slowly than the latter example. Am I correct (please answer only if you know the answer in the context of Oracle DB)? Unfortunately, I don't have the admin rights to run a trace against the 2 different versions of the query.

Is there a reason to set up the query the first way, related to how the OBIEE interface works? Remember, the query is the result of a user drag-and-dropping attributes into a sandbox, from a 'bank' of attributes. Selecting any combination of the attributes is supposed to generate output (if the data exists). The attributes come from many different tables. I don't have any experience in designing the mecahnism that generates the SQL based on this kind of ad-hoc attribute selection, so I don't know whether the query design in the first example is required to service this kind of reporting tool.

These two query variants likely produce the same execution plan, which you should review anyway if you want to find the bottleneck. — mustaccio, Nov 17 '14 at 21:00

Sebas · Accepted Answer · 2014-11-17T23:24:44.830

4

Don't worry, historically Oracle used the first notation for inner joins but later on adopted ANSI SQL standards.

The results in terms of performance and returned recordsets are exactly the same, the implicit 'comma' joins are not crossing resultset but effectively integrating the WHERE filters. If you doubt it, run an EXPLAIN SELECT command for both queries and you will see the forcasted algorithms will be identical.

Expanding this answer you may notice in the future the analogous notation (+) in place of outer joins. This answer will also stand correct in that context.

The real issue comes when both notations (implicit and explicit joins) are mixed in the same query. This would be asking for trouble big time, but I doubt you find such a case in OBIEE.

edited Nov 17 '14 at 23:24

answered Nov 17 '14 at 20:39

Sebas

21,192
9
55
109

nice, thanks for the info. +1. I will keep this open for a while longer just in case someone else has something to add. – sion_corn Nov 17 '14 at 20:45
The usage of the `(+)` operator is discouraged by Oracle. Oracle recommends to use an explicit `OUTER JOIN` operator rather than the proprietary `(+)`. – Nov 17 '14 at 22:53
1

Will try the `EXPLAIN SELECT` - thanks for your help. – sion_corn Nov 18 '14 at 15:12
@a_horse_with_no_name: Thanks! I always wanted to know what Oracle's preference was. Do you have a source for that? – wolφi May 29 '18 at 14:11
@wolφi: [the manual](https://docs.oracle.com/database/121/SQLRF/queries006.htm#SQLRF52354): "*Oracle recommends that you use the FROM clause OUTER JOIN syntax rather than the Oracle join operator*" – May 29 '18 at 14:16

jackohug · Answer 2 · 2014-11-18T14:08:23.567

1

Those are inner joins, not cross joins, they just use the old syntax for doing it rather than ANSI as you were expecting.

Most join queries contain at least one join condition, either in the FROM clause or in the WHERE clause. (Oracle Documentation)

For a simple query such as in your example the execution should be exactly the same.

Where you have set outer joins (in the business model join) you will see OBI produce a query where the inner joins are made in the WHERE clause and the outer joins are done ANSI in the FROM statement – just to make things really hard to debug!

SELECT ...
FROM tbl1
    ,tbl2
    ,tbl3 left outer join
         tbl4 on tbl3.col1 = tbl4.col2
WHERE tbl1.col1 = tbl2.col1
and tbl3.col2 = tbl2.col2
and tbl4.col3 = tbl3.col3

edited Nov 18 '14 at 14:08

answered Nov 18 '14 at 13:50

jackohug

440
2
9

thanks for the info, +1. You are right, I was seeing a combination of the Oracle-specific inner joins and the ANSI LOJs, and was very alarmed by it. Good to clear this up. Thanks. – sion_corn Nov 18 '14 at 15:11
I'm not sure when you found this query but this is the worst thing to do, ever. see: http://stackoverflow.com/a/11180050/1291428 – Sebas Nov 18 '14 at 16:00
@Sebas I didn't say it was good. That's what OBI produces. Has for 5+ years so Oracle don't seem to be in any hurry to change it. – jackohug Nov 19 '14 at 16:43

correct query design? cross joins driving ad-hoc reporting interface

2 Answers2