fantasy12345zsq opened a new pull request, #43185:
URL: https://github.com/apache/doris/pull/43185

   Proposed changes
    1. original issue:
        we found presto scan byte much more bigger then Doris, such as query:
        select l_orderkey from dev.lineitem_orc_100g_backup1 where l_orderkey = 
7192579;
   
![image](https://github.com/user-attachments/assets/9306b159-8be0-416c-806d-d96b559d5e9a)
       this query just have 1 rows result, scan bytes presto vs Doris:
       presto:
   
![image](https://github.com/user-attachments/assets/754778ba-9031-4139-a5a3-68a7f45923eb)
       Doris:
   
![image](https://github.com/user-attachments/assets/8055a128-19ee-441a-8ddd-a051ab6e0af2)
   
![image](https://github.com/user-attachments/assets/b0e3fa88-b6bb-4041-94d1-b04d4f1a4283)
     2. root cause:
        we use hive 1.x version,query predicate column name not match orc 
column name, in this case: 
        schame is l_orderkey name, in orc field name is _col1。
   3. how to fix:
       in _init_search_argument change l_orderkey -> _col1。
   4. profile:
       query latency:
   
![image](https://github.com/user-attachments/assets/7044f0fd-28b6-41a0-901a-01845a585e41)
       scan bytes:
   
![image](https://github.com/user-attachments/assets/e661e379-6ef1-44c1-94bb-14592020210b)
   
       
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to