Multi-column IndexScan plan selection fix #1305

vkonagar · 2018-04-17T18:28:02Z

This PR fixes the issue #1299. This changes the way we find index match for predicate columns in IndexScan rule implementation. Specifically, this change makes sure that the optimizer picks a multi-column index only if the predicate columns match the index columns in order.

coveralls · 2018-04-17T19:37:39Z

Coverage decreased (-0.002%) to 77.267% when pulling 3a20b40 on vkonagar:master into 54b02a4 on cmu-db:master.

apavlo · 2018-04-18T21:10:18Z

@vkonagar Can you provide test cases for this to know what you fixed? Thanks!

vkonagar · 2018-04-19T00:49:26Z

@apavlo Andy, I have added a test to verify the query plan correctness with respect to multi-column indexes.

This doesn't fix the cost model for multi-column indexes, which is not currently supported in the optimizer. I have talked to bowei and we will look into that.

chenboy · 2018-04-19T05:45:06Z

As discussed in today's meeting, we want to fix the cost model to consider multi-column indices. Let me see if I can fix it. @GustavoAngulo @nappelson I'm wondering if we have a testing infrastructure for cost model correctness right now?

linmagit

Please see the comment.

Forgot to add the comment... will submit again

linmagit

Please see the comment.

linmagit · 2018-04-21T00:28:32Z

src/optimizer/rule_impls.cpp

          index_expr_type_list.push_back(expr_type_list[offset]);
          index_value_list.push_back(value_list[offset]);
+        } else {


I don't think we should check for an exact same ordering here. For example, if you have an index on column (a, b), and your predicates are "b = 5 and a = 1", then we should be able to use the index scan. However, the check here won't identify that because it requires the order in the predicates to be exactly the same as in the index.

After thinking about this, I actually think that you should just keep the old index_key_column_id_list. You just need to add a flag about whether the lead (highest) column in the index has been referenced in the index. As long as that is true, we should be able to use the index for the scan. Thoughts? @chenboy @vkonagar

Agree. I also think we don't need to consider order here. The way to fix this issue is letting the cost model compute the correct cost for these indices.

Hi, @chenboy! @vkonagar and I are discussing some implementation details for this on Slack. We've added you into the channel. There's some cost model related issue we think you probably have a better idea on what's going on. Can you take a look at Slack? Thanks!

apavlo · 2018-06-21T13:31:26Z

This is another important fix that we are going to need for TPC-C.

vkonagar added 2 commits April 17, 2018 13:23

Fix multi-column index rule in optimizer

1d7314b

Fix formatting

f7c4035

vkonagar requested review from linmagit, GustavoAngulo, sivaprasadsudhir and pbollimp April 17, 2018 18:28

vkonagar added ready_for_review in progress and removed ready_for_review labels Apr 17, 2018

Add test for multi-column index scan plans

a711428

linmagit previously requested changes Apr 21, 2018

View reviewed changes

linmagit suggested changes Apr 21, 2018

View reviewed changes

Merge branch 'master' into master

3a20b40

apavlo added ready_for_review and removed in progress labels Jun 21, 2018

apavlo added this to the tpcc milestone Jun 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-column IndexScan plan selection fix #1305

Multi-column IndexScan plan selection fix #1305

vkonagar commented Apr 17, 2018

coveralls commented Apr 17, 2018 •

edited

Loading

apavlo commented Apr 18, 2018

vkonagar commented Apr 19, 2018

chenboy commented Apr 19, 2018

linmagit left a comment

linmagit left a comment

linmagit Apr 21, 2018

chenboy Apr 21, 2018

linmagit Apr 22, 2018

apavlo commented Jun 21, 2018

Multi-column IndexScan plan selection fix #1305

Are you sure you want to change the base?

Multi-column IndexScan plan selection fix #1305

Conversation

vkonagar commented Apr 17, 2018

coveralls commented Apr 17, 2018 • edited Loading

apavlo commented Apr 18, 2018

vkonagar commented Apr 19, 2018

chenboy commented Apr 19, 2018

linmagit left a comment

Choose a reason for hiding this comment

linmagit left a comment

Choose a reason for hiding this comment

linmagit Apr 21, 2018

Choose a reason for hiding this comment

chenboy Apr 21, 2018

Choose a reason for hiding this comment

linmagit Apr 22, 2018

Choose a reason for hiding this comment

apavlo commented Jun 21, 2018

coveralls commented Apr 17, 2018 •

edited

Loading