-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Research performance improvements in N-way merging #2148
Comments
I'm interested in this. Is there more information/context about it? |
Hi @jackwener, currently @richox has two efforts trying to achieve this: The first one intended to use The second attempt tries to use Perhaps you guys could discuss this to see if it's possible to come up with a faster solution? |
Yes, I will provide some more coherent thoughts later today
…On Tue, Apr 5, 2022 at 9:54 AM Yijie Shen ***@***.***> wrote:
I created this issue from an item @alamb <https://github.com/alamb> has
listed in the umbrella PR. Maybe @alamb <https://github.com/alamb> can
also share some insights here.
—
Reply to this email directly, view it on GitHub
<#2148 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AADXZMPFETOUOE7YTV26GMTVDRAZZANCNFSM5SOKD74A>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Thoughts on N-way merging: We (Influxdb IOx in general and myself in particular) are very interested in this as well because the Here is what I was thinking about how to proceed:
Areas for investigation / things to spike out:
cc @tustvold |
One other thing to throw into the mix would be to optimise sorts of dictionary encoded columns. If the dictionary is sorted, the savings could be significant as you only need to compare the integer keys. Even if the dictionary isn't sorted it might be faster to sort the dictionary first, and then sort the now sorted keys. Just an idea as at least in the case of IOx, we will only be sorting on dictionary encoded string columns and not plain columns. |
I filed an issue for this #2150 |
During my school days, I tried to implement an external sorting algorithm using the |
Closing this ticket as I believe it is not tracking anything anymore, feel free to reopen if I am mistaken. SortPreservingMerge is now implemented as an n-way tournament tree making use of an order-preserving row encoding for multi-column sorts, and specialized cursors for single column sorts. I'm not aware of any major low-hanging fruit to make it run faster |
No description provided.
The text was updated successfully, but these errors were encountered: