Hi Joice:
Thanks your reporting.

It seems the root cause of this issue is missing "_orc_acid_version" file.
>From Hive version >= 3.0, delta/base files will always have file
'_orc_acid_version' with value >= '2'.
Maybe the hive3 of HDP has similar issue?
https://issues.apache.org/jira/browse/HIVE-16964

A workaround is try to create table without transactional props.
TBLPROPERTIES("transactional"="true")
pls try it, thanks.

Best regards,
Kaka


Joice Jacob <joicejacob1...@gmail.com> 于2023年10月6日周五 22:58写道:

> Hi,
> I've checked the Hive data directory, and I couldn't find the
> "_orc_acid_version" file. I have attached screenshots for your reference.
>
> Are there any recommended workarounds or alternative approaches that I can
> consider to resolve this issue? I'm open to exploring different solutions
> to ensure the successful integration of Doris with Hive in CDP. Any
> guidance or suggestions would be greatly appreciated.
>
> Screenshots attached for your reference.
>
> Best regards,
> Joice
>
> [image: image.png]
>
> On Fri, Oct 6, 2023 at 6:28 PM Mingyu Chen <morning...@163.com> wrote:
>
>> Could you list your hive data dir, to see what files exist? like:
>> _orc_acid_version
>>
>>
>>
>>
>>
>>
>>
>>
>> --
>>
>> Best Regards
>> Mingyu Chen
>>
>> Email:
>> morning...@apache.org
>>
>>
>>
>>
>>
>> At 2023-10-05 22:31:58, "Mingyu Chen" <morning...@163.com> wrote:
>> >Oh, I see, let me check it again.
>> >
>> >
>> >
>> >
>> >--
>> >
>> >Best Regards
>> >Mingyu Chen
>> >
>> >Email:
>> >morning...@apache.org
>> >
>> >
>> >
>> >
>> >
>> >At 2023-10-05 21:49:28, "Joice Jacob" <joicejacob1...@gmail.com> wrote:
>> >>Hi,
>> >>I wanted to share some important information regarding the CDP Hive
>> version
>> >>that I am currently using, which is Hive 3.1.3000.7.1.7.0-551.
>> >>
>> >>In CDP Hive version 3.x, a significant change has been introduced
>> regarding
>> >>managed tables. By default, managed tables in Hive 3.x are considered
>> >>transactional. This means that if you create a managed table explicitly
>> >>specifying it as transactional with a value of "false," it will be
>> treated
>> >>as an external table instead.
>> >>Thank you for your attention to this matter, and I appreciate your
>> >>continued support.
>> >>
>> https://www.thecodersstop.com/hadoop/apache-hive-3-changes-in-cdp-upgrade-part-1/
>> >>Best regards,
>> >>Joice
>> >>
>> >>On Thu, Oct 5, 2023 at 7:01 PM Mingyu Chen <morning...@163.com> wrote:
>> >>
>> >>> I saw that your hive table is with property "transactional" = "true",
>> >>> And Doris only support ACID table with Hive 3, not support with Hive
>> 2.x.
>> >>> So you may need to create non-transactional hive table for Doris to
>> visit.
>> >>>
>> >>>
>> >>> --
>> >>> Best Regards
>> >>> Mingyu Chen
>> >>>
>> >>> Email:
>> >>> morning...@apache.org
>> >>>
>> >>>
>> >>> 在 2023-10-05 13:30:25,"Joice Jacob" <joicejacob1...@gmail.com> 写道:
>> >>>
>> >>> Dear Doris Community & Jiafeng.Zhang
>> >>> ,
>> >>> Thank you for your prompt response and your willingness to assist
>> with the
>> >>> issue I've been encountering with the integration of Doris and
>> Cloudera's
>> >>> Hive.
>> >>>
>> >>> To provide you with the necessary information, here are the details
>> you
>> >>> requested:
>> >>>
>> >>> *1. Hive Catalog Creation Statement:*
>> >>>
>> >>> My Hive catalog creation statement is as follows:
>> >>>
>> >>> CREATE CATALOG hive PROPERTIES (
>> >>>     'type'='hms',
>> >>>     'hive.metastore.uris' = 'thrift://10.128.0.4:9083',
>> >>> 'hive.version' = '3.1.3',
>> >>>     'hive.metastore.sasl.enabled' = 'true',
>> >>>     'hive.metastore.kerberos.principal' = 'hive/instanc...@hadoop.com
>> ',
>> >>>     'hadoop.security.authentication' = 'kerberos',
>> >>>     'hadoop.kerberos.keytab' = '/home/techuser/doris/hive.keytab',
>> >>>     'hadoop.kerberos.principal' = 'hive/instanc...@hadoop.com',
>> >>>     'yarn.resourcemanager.principal' = 'yarn/instanc...@hadoop.com'
>> >>> );
>> >>>
>> >>> *2. CDP Hive Version:*
>> >>>
>> >>> The CDP Hive version I am using is: Hive 3.1.3000.7.1.7.0-551
>> >>>
>> >>> *3. Configuration Attempt:*
>> >>>
>> >>> I have tried specifying the Hive version in my catalog configuration
>> with
>> >>> the following statement:
>> >>> "hive.version=2.1.0"
>> >>> Despite attempting to set the Hive version to 2.1.0, I continue to
>> >>> experience the same issue, which is detailed in my previous emails.
>> >>>
>> >>> *4. Logs: *
>> >>> I will attach both the fe.log and fe.warn logs to this email for your
>> >>> reference. These logs should provide additional context regarding the
>> issue
>> >>> I'm facing.
>> >>>
>> >>> Once again, I want to express my gratitude for your assistance and
>> support
>> >>> in resolving this matter. I look forward to your insights and
>> >>> recommendations based on the provided logs and catalog configuration.
>> >>>
>> >>> Please feel free to let me know if you require any further
>> information or
>> >>> if there are additional steps I should take to assist in diagnosing
>> and
>> >>> resolving the issue.
>> >>>
>> >>> [image: doris_error.JPG]
>> >>>  fe.log log at hive catlog creation time
>> >>> [image: fe_log_at_hivecatalog.JPG]
>> >>> Hive managed table script
>> >>> [image: image.png]
>> >>>
>> >>>
>> >>> Best regards,
>> >>> Joice
>> >>>
>> >>> On Thu, Oct 5, 2023 at 8:10 AM Jiafeng.Zhang <zhang...@gmail.com>
>> wrote:
>> >>>
>> >>>> Can you provide your hive catalog creation statement, your fe.log
>> log at
>> >>>> that time, and your cdp hive version? This will help us locate the
>> >>>> problem,
>> >>>> thank you.
>> >>>> You can also try specifying your hive version in your catalog
>> statement:
>> >>>> "hive.version=2.1.0"
>> >>>>
>> >>>> Joice Jacob <joicejacob1...@gmail.com> 于2023年10月5日周四 00:45写道:
>> >>>>
>> >>>> > I am reaching out once again to seek assistance and share a
>> specific
>> >>>> issue
>> >>>> > I've encountered while integrating Doris with Cloudera-flavored
>> Hive.
>> >>>> The
>> >>>> > error message I'm facing is as follows:
>> >>>> > detailMessage = get file split failed for table: baby1, err:
>> >>>> > java.lang.Exception: Hive 2.x versioned full-acid tables need to
>> run
>> >>>> major
>> >>>> > compaction.
>> >>>> >
>> >>>> > This error message appears when I attempt to query a Hive table
>> from
>> >>>> > Doris, and it seems to be related to Hive 2.x versioned full-ACID
>> tables
>> >>>> > requiring a major compaction.
>> >>>> > I would like to ask the Doris community for guidance on how to
>> handle
>> >>>> this
>> >>>> > issue effectively. Specifically, I am interested in understanding
>> the
>> >>>> best
>> >>>> > practices and steps to follow when dealing with Cloudera-flavored
>> Hive
>> >>>> > tables that require major compaction for Doris integration.
>> >>>> >
>> >>>> > If anyone in the community has successfully addressed this issue
>> or can
>> >>>> > provide insights into how to configure and manage
>> Cloudera-flavored Hive
>> >>>> > tables for integration with Doris, your expertise would be highly
>> >>>> > appreciated.
>> >>>> >
>> >>>> > Thank you for your time and support, and I look forward to
>> receiving
>> >>>> your
>> >>>> > valuable input.
>> >>>> > Joice
>> >>>> >
>> >>>> > On Wed, Oct 4, 2023 at 9:23 PM Joice Jacob <
>> joicejacob1...@gmail.com>
>> >>>> > wrote:
>> >>>> >
>> >>>> >> Dear Doris Community,
>> >>>> >> I have an update on the issue I previously mentioned regarding the
>> >>>> >> integration of Hive on a CDP distribution with Doris 2.0.1.1.
>> >>>> >>
>> >>>> >> After further investigation, I have identified that the issue is
>> >>>> related
>> >>>> >> to Hive managed tables being transactional by default. This
>> appears to
>> >>>> be
>> >>>> >> causing the error I encountered earlier.
>> >>>> >>
>> >>>> >> To address this issue, I am seeking guidance from the community
>> on any
>> >>>> >> specific configurations or settings that need to be adjusted for
>> Hive
>> >>>> >> transactional tables when using Hive as the catalog in Doris. Are
>> >>>> there any
>> >>>> >> recommended configurations or best practices that I should follow
>> to
>> >>>> ensure
>> >>>> >> smooth integration and query execution?
>> >>>> >>
>> >>>> >> Any insights or recommendations from the Doris community would be
>> >>>> greatly
>> >>>> >> appreciated. Your expertise and guidance will be instrumental in
>> >>>> helping me
>> >>>> >> resolve this challenge.
>> >>>> >>
>> >>>> >> Thank you for your continued support, and I look forward to your
>> >>>> valuable
>> >>>> >> input.
>> >>>> >>
>> >>>> >> Best regards,
>> >>>> >> Joice
>> >>>> >>
>> >>>> >> On Wed, Oct 4, 2023 at 6:54 PM Joice Jacob <
>> joicejacob1...@gmail.com>
>> >>>> >> wrote:
>> >>>> >>
>> >>>> >>> Dear Doris Community,
>> >>>> >>>
>> >>>> >>> I am reaching out to the community to seek assistance with an
>> >>>> >>> integration issue I've encountered while trying to use Hive on a
>> CDP
>> >>>> >>> distribution with Doris 2.0.1.1.
>> >>>> >>>
>> >>>> >>> Here are the details of my setup:
>> >>>> >>>
>> >>>> >>> Doris Version: 2.0.1.1
>> >>>> >>> Hive Version: 3.1.3
>> >>>> >>> Cluster Security: Kerberized
>> >>>> >>>
>> >>>> >>> I have successfully created a Hive catalog in Doris and have
>> been able
>> >>>> >>> to set up the integration between Hive and Doris. However, when I
>> >>>> attempt
>> >>>> >>> to query a Hive table using Doris, I encounter the following
>> error:
>> >>>> >>>
>> >>>> >>> ERROR 1105 (HY000): errCode = 2, detailMessage = get file split
>> failed
>> >>>> >>> for table: baby1, err: java.lang.Exception: Hive 2.x versioned
>> >>>> full-acid
>> >>>> >>> tables need to run major compaction.
>> >>>> >>>
>> >>>> >>> I have already performed a major compaction as recommended, but
>> I am
>> >>>> >>> still encountering the same error.
>> >>>> >>>
>> >>>> >>> I would greatly appreciate any insights, guidance, or solutions
>> that
>> >>>> the
>> >>>> >>> Doris community can offer to help me resolve this issue. If
>> anyone has
>> >>>> >>> encountered a similar problem or has expertise in integrating
>> Hive
>> >>>> with
>> >>>> >>> Doris, your assistance would be invaluable.
>> >>>> >>>
>> >>>> >>> Thank you in advance for your time and support. I look forward to
>> >>>> >>> hearing from the community and working together to find a
>> solution to
>> >>>> this
>> >>>> >>> challenge.
>> >>>> >>>
>> >>>> >>> [image: baby_table.JPG]
>> >>>> >>> [image: doris_hive_catlog_result.JPG]
>> >>>> >>>
>> >>>> >>> [image: doris_fe_log.JPG]
>> >>>> >>>
>> >>>> >>> [image: hive_metastore_error.JPG]
>> >>>> >>>
>> >>>> >>> Thanks
>> >>>> >>> Joice
>> >>>> >>>
>> >>>> >>>
>> >>>> >>>
>> >>>> >>>
>> >>>> >>>
>> >>>>
>> >>>> --
>> >>>> 张家峰
>> >>>>
>> >>>
>>
>

Reply via email to