felixwluo opened a new pull request, #34589: URL: https://github.com/apache/doris/pull/34589
## Proposed changes Issue Number: close #xxx <!--Describe your changes.--> fixed an issue where the hive catalog field delimiter is an empty string, causing it to be core 1、hive build statement ``` CREATE TABLE `hive_q1`( `id` int, `name` string) ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' WITH SERDEPROPERTIES ( 'field.delim'='', 'serialization.format'='') STORED AS INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat' LOCATION 'hdfs://HDFSxxxxx/usr/hive/warehouse/hive_q1' TBLPROPERTIES ( 'transient_lastDdlTime'='1715175063') ``` 2、problem `When the field.delim attribute value is an empty string, the be core dumps because of the separator problem` 3、be stack ``` 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) in /usr/local/service/doris/lib/be/doris_be 1# os::Linux::chained_handler(int, siginfo*, void*) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 2# JVM_handle_linux_signal in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 3# signalHandler(int, siginfo*, void*) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 4# 0x00007F75D264E400 in /lib64/libc.so.6 5# __memset_sse2 in /lib64/libc.so.6 6# doris::vectorized::PlainCsvTextFieldSplitter::_split_field_multi_char(doris::Slice const&, std::vector<doris::Slice, std::allocator<doris::Slice> >*) in /usr/local/service/doris/lib/be/doris_be 7# doris::vectorized::CsvReader::_line_split_to_values(doris::Slice const&, bool*) in /usr/local/service/doris/lib/be/doris_be 8# doris::vectorized::CsvReader::_fill_dest_columns(doris::Slice const&, doris::vectorized::Block*, std::vector<COW<doris::vectorized::IColumn>::mutable_ptr<doris::vectorized::IColumn>, std::allocator<COW<doris::vectorized::IColumn>::mutable_ptr<doris::vectorized::IColumn> > >&, unsigned long*) in /usr/local/service/doris/lib/be/doris_be 9# doris::vectorized::CsvReader::get_next_block(doris::vectorized::Block*, unsigned long*, bool*) in /usr/local/service/doris/lib/be/doris_be 10# doris::vectorized::VFileScanner::_get_block_impl(doris::RuntimeState*, doris::vectorized::Block*, bool*) in /usr/local/service/doris/lib/be/doris_be 11# doris::vectorized::VScanner::get_block(doris::RuntimeState*, doris::vectorized::Block*, bool*) in /usr/local/service/doris/lib/be/doris_be 12# doris::vectorized::ScannerScheduler::_scanner_scan(doris::vectorized::ScannerScheduler*, doris::vectorized::ScannerContext*, std::shared_ptr<doris::vectorized::VScanner>) in /usr/local/service/doris/lib/be/doris_be 13# std::_Function_handler<void (), doris::vectorized::ScannerScheduler::_schedule_scanners(doris::vectorized::ScannerContext*)::$_1::operator()() const::{lambda()#4}>::_M_invoke(std::_Any_data const&) in /usr/local/service/doris/lib/be/doris_be 14# doris::WorkThreadPool<true>::work_thread(int) in /usr/local/service/doris/lib/be/doris_be 15# execute_native_thread_routine in /usr/local/service/doris/lib/be/doris_be 16# start_thread in /lib64/libpthread.so.0 17# __clone in /lib64/libc.so.6 ``` ## Further comments If this is a relatively large or complex change, kick off the discussion at [d...@doris.apache.org](mailto:d...@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc... -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org