Details
-
Bug
-
Status: Patch Available
-
Major
-
Resolution: Unresolved
-
3.1.0
-
None
Description
Test case:
create table table_a (a int, b string,c string);
create table table_b (a int, b string,c string);
insert into table_a values (11,'a','aa'),(22,'b','bb'),(33,'c','cc'),(44,'d','dd'),(5,'e','ee'),(6,'f','ff'),(7,'g','gg');
insert into table_b values (11,'a','aa'),(22,'b','bb'),(33,'c','cc'),(44,'d','dd'),(5,'e','ee'),(6,'f','ff'),(7,'g','gg');
alter table table_a set tblproperties ("bucketing_version"='1');
alter table table_b set tblproperties ("bucketing_version"='2');
Hivesql:
set hive.auto.convert.join=false;
set mapred.reduce.tasks=2;
select ta.a as a_a, tb.b as b_b from table_a ta join table_b tb on(ta.a=tb.a);
set hive.execution.engine=mr;
----------+
| a_a | b_b |
----------+
| 5 | e |
| 6 | f |
| 7 | g |
| 11 | a |
| 22 | b |
| 33 | c |
| 44 | d |
----------+
set hive.execution.engine=tez;
----------+
| a_a | b_b |
----------+
| 6 | f |
| 5 | e |
| 11 | a |
| 33 | c |
----------+
Attachments
Attachments
Issue Links
- is fixed by
-
HIVE-21304 Make bucketing version usage more robust
-
- Closed
-
- relates to
-
HIVE-21304 Make bucketing version usage more robust
-
- Closed
-
-
HIVE-22098 Data loss occurs when multiple tables are join with different bucket_version
-
- Patch Available
-