[fix](statistics) Avoid re-estimating pruned partition predicates in stats#63764
Open
foxtail463 wants to merge 1 commit into
Open
[fix](statistics) Avoid re-estimating pruned partition predicates in stats#63764foxtail463 wants to merge 1 commit into
foxtail463 wants to merge 1 commit into
Conversation
Contributor
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
Contributor
Author
|
run buildall |
79c9f8e to
84aed36
Compare
Contributor
Author
|
run buildall |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Problem Summary:
For OLAP scans, partition pruning can already reduce the scan row count to the selected partitions. However, the original partition predicate is intentionally kept in the filter until post-processing so MV rewrite can still match the original query predicate.
During CBO stats calculation, this means the filter estimator may apply the same partition predicate again on top of the already-pruned scan row count, causing row count underestimation. For example, after pruning to one partition,
id = 1may already be reflected in the scan cardinality, butcomputeFilterstill estimates selectivity forid = 1.This change reuses the recorded
PartitionPrunablePredicateon OLAP scans and skips those already-pruned conjuncts during filter statistics estimation, while preserving the existing plan shape and post-processing behavior.