-
Notifications
You must be signed in to change notification settings - Fork 773
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ParserError when use OVER for Hive and Spark dialect #2043
Comments
what’s the difference between partition by and distribute by or are they the same? |
the second link you posted has no reference to distribute by |
They are different. Distribute by adjust the allocation to reducer, e.g., send the rows with the same col value to the same reducer. The link for distribute by: |
i don't think that's right, that link is for select * from x SORT BY y, that's not the same as a window function |
That's true. But I have tested on Hive 3.1.0 and the expression is executed correctly. So I guess this means this is still a valid expression? BTW, does over(distribute by) for spark make sense to you? |
yes, databricks documentation confirms it's simply an alias |
Hi @tobymao , This issues is not resolved? I just tried on master branch but seems: |
you need to specify the dialect
|
@tobymao Another issue is the current parser does not support: The SORT BY clause is claimed to be the alias for order by in: |
Before you file an issue
Fully reproducible code snippet
Official Documentation
https://spark.apache.org/docs/latest/sql-ref-syntax-qry-select-window.html
https://docs.cloudera.com/HDPDocuments/HDP2/HDP-2.0.0.2/ds_Hive/language_manual/ptf-window.html
The text was updated successfully, but these errors were encountered: