lyh20093867 opened a new issue, #39043:
URL: https://github.com/apache/doris/issues/39043

   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/doris/issues?q=is%3Aissue) and found no 
similar issues.
   
   
   ### Version
   
   doris-1.2.5-rc01(AVX2) RELEASE (build file://VM-10-6-centos@Unknown)
   
   ### What's Wrong?
   
   The BE crashes frequently, almost once per hour.
   
   -------------------
   start time: Wed Aug  7 15:13:39 CST 2024
   SLF4J: Class path contains multiple SLF4J bindings.
   SLF4J: Found binding in 
[jar:file:/opt/module/be/lib/hadoop_hdfs/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
   SLF4J: Found binding in 
[jar:file:/opt/module/be/lib/java-udf-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
   SLF4J: Actual binding is of type [org.slf4j.impl.Reload4jLoggerFactory]
   F0807 15:13:40.352217 102351 doris_main.cpp:408] fail to open StorageEngine, 
res=[INTERNAL_ERROR]init path failed, error=[IO_ERROR]open rocksdb failed, 
path=/data1/doris/be/storage;[IO_ERROR]open rocksdb failed, 
path=/data2/doris/be/storage;
   *** Check failure stack trace: ***
       @     0x55b0a223ccad  google::LogMessage::Fail()
       @     0x55b0a223f1e9  google::LogMessage::SendToLog()
       @     0x55b0a223c816  google::LogMessage::Flush()
       @     0x55b0a223f859  google::LogMessageFatal::~LogMessageFatal()
       @     0x55b09c453a8b  main
       @     0x7f8760cfc555  __libc_start_main
       @     0x55b09c45202a  _start
       @              (nil)  (unknown)
   *** Query id: 0-0 ***
   *** Aborted at 1723014820 (unix time) try "date -d @1723014820" if you are 
using GNU date ***
   *** Current BE git commitID: Unknown ***
   *** SIGABRT unkown detail explain (@0x3e800018fcf) received by PID 102351 
(TID 0x7f87616dcb40) from PID 102351; stack trace: ***
    0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, 
siginfo_t*, void*) at /root/doris/be/src/common/signal_handler.h:420
    1# 0x00007F8760D10400 in /lib64/libc.so.6
    2# gsignal in /lib64/libc.so.6
    3# abort in /lib64/libc.so.6
    4# 0x000055B0A2247699 in /opt/module/be/lib/doris_be
    5# 0x000055B0A223CCAD in /opt/module/be/lib/doris_be
    6# google::LogMessage::SendToLog() in /opt/module/be/lib/doris_be
    7# google::LogMessage::Flush() in /opt/module/be/lib/doris_be
    8# google::LogMessageFatal::~LogMessageFatal() in 
/opt/module/be/lib/doris_be
    9# main at /root/doris/be/src/service/doris_main.cpp:428
   10# __libc_start_main in /lib64/libc.so.6
   11# _start in /opt/module/be/lib/doris_be
   
   start time: Wed Aug  7 15:14:02 CST 2024
   SLF4J: Class path contains multiple SLF4J bindings.
   SLF4J: Found binding in 
[jar:file:/opt/module/be/lib/hadoop_hdfs/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
   SLF4J: Found binding in 
[jar:file:/opt/module/be/lib/java-udf-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
   SLF4J: Actual binding is of type [org.slf4j.impl.Reload4jLoggerFactory]
   *** Query id: fb1e59d3c354575-87ffc98a44c1f8b1 ***
   *** Aborted at 1723018291 (unix time) try "date -d @1723018291" if you are 
using GNU date ***
   *** Current BE git commitID: Unknown ***
   *** SIGSEGV address not mapped to object (@0x0) received by PID 104066 (TID 
0x7efc645df700) from PID 0; stack trace: ***
    0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, 
siginfo_t*, void*) at /root/doris/be/src/common/signal_handler.h:420
    1# os::Linux::chained_handler(int, siginfo*, void*) in 
/opt/module//openjdk/jre/lib/amd64/server/libjvm.so
    2# JVM_handle_linux_signal in 
/opt/module//openjdk/jre/lib/amd64/server/libjvm.so
    3# signalHandler(int, siginfo*, void*) in 
/opt/module//openjdk/jre/lib/amd64/server/libjvm.so
    4# 0x00007EFD7FF3E400 in /lib64/libc.so.6
    5# je_arena_dalloc_promoted at ../src/arena.c:1604
    6# __GI__dl_deallocate_tls in /lib64/ld-linux-x86-64.so.2
    7# __free_stacks in /lib64/libpthread.so.0
    8# __deallocate_stack in /lib64/libpthread.so.0
    9# pthread_join in /lib64/libpthread.so.0
   10# std::thread::join() at 
../../../../../libstdc++-v3/src/c++11/thread.cc:114
   11# doris::ThreadGroup::join_all() in /opt/module/be/lib/doris_be
   12# doris::KafkaDataConsumerGroup::start_all(doris::StreamLoadContext*) at 
/root/doris/be/src/runtime/routine_load/data_consumer_group.cpp:141
   13# doris::RoutineLoadTaskExecutor::exec_task(doris::StreamLoadContext*, 
doris::DataConsumerPool*, std::function<void (doris::StreamLoadContext*)>) at 
/root/doris/be/src/runtime/routine_load/routine_load_task_executor.cpp:306
   14# std::_Function_handler<void (), std::_Bind_result<void, void 
(doris::RoutineLoadTaskExecutor::*(doris::RoutineLoadTaskExecutor*, 
doris::StreamLoadContext*, doris::DataConsumerPool*, 
doris::RoutineLoadTaskExecutor::submit_task(doris::TRoutineLoadTask 
const&)::$_0))(doris::StreamLoadContext*, doris::DataConsumerPool*, 
std::function<void (doris::StreamLoadContext*)>)> >::_M_invoke(std::_Any_data 
const&) at 
/var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
   15# doris::PriorityThreadPool::work_thread(int) at 
/root/doris/be/src/util/priority_thread_pool.hpp:146
   16# execute_native_thread_routine at 
../../../../../libstdc++-v3/src/c++11/thread.cc:84
   17# start_thread in /lib64/libpthread.so.0
   18# clone in /lib64/libc.so.6
   
   start time: Wed Aug  7 16:11:32 CST 2024
   SLF4J: Class path contains multiple SLF4J bindings.
   SLF4J: Found binding in 
[jar:file:/opt/module/be/lib/hadoop_hdfs/common/lib/slf4j-reload4j-1.7.36.jar!/org/slf4j/impl/StaticLoggerBinder.class]
   SLF4J: Found binding in 
[jar:file:/opt/module/be/lib/java-udf-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
   SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
explanation.
   SLF4J: Actual binding is of type [org.slf4j.impl.Reload4jLoggerFactory]
   F0807 16:11:35.432806 12745 doris_main.cpp:408] fail to open StorageEngine, 
res=[INTERNAL_ERROR]init path failed, error=[IO_ERROR]open rocksdb failed, 
path=/data1/doris/be/storage;
   *** Check failure stack trace: ***
       @     0x555669752cad  google::LogMessage::Fail()
       @     0x5556697551e9  google::LogMessage::SendToLog()
       @     0x555669752816  google::LogMessage::Flush()
       @     0x555669755859  google::LogMessageFatal::~LogMessageFatal()
       @     0x555663969a8b  main
       @     0x7fdaa7f5e555  __libc_start_main
       @     0x55566396802a  _start
       @              (nil)  (unknown)
   *** Query id: 0-0 ***
   *** Aborted at 1723018295 (unix time) try "date -d @1723018295" if you are 
using GNU date ***
   *** Current BE git commitID: Unknown ***
   *** SIGABRT unkown detail explain (@0x3e8000031c9) received by PID 12745 
(TID 0x7fdaa893eb40) from PID 12745; stack trace: ***
    0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, 
siginfo_t*, void*) at /root/doris/be/src/common/signal_handler.h:420
    1# 0x00007FDAA7F72400 in /lib64/libc.so.6
    2# gsignal in /lib64/libc.so.6
    3# abort in /lib64/libc.so.6
    4# 0x000055566975D699 in /opt/module/be/lib/doris_be
    5# 0x0000555669752CAD in /opt/module/be/lib/doris_be
    6# google::LogMessage::SendToLog() in /opt/module/be/lib/doris_be
    7# google::LogMessage::Flush() in /opt/module/be/lib/doris_be
    8# google::LogMessageFatal::~LogMessageFatal() in 
/opt/module/be/lib/doris_be
    9# main at /root/doris/be/src/service/doris_main.cpp:428
   10# __libc_start_main in /lib64/libc.so.6
   11# _start in /opt/module/be/lib/doris_be
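
To line the three crashes up with the "start time" lines, the "Aborted at" epoch values above can be converted to local time. This is only a small sketch: the helper name `to_cst` is made up for illustration, and it assumes "CST" in these logs means China Standard Time (UTC+8), which matches the glog timestamps in the same output.

```python
from datetime import datetime, timedelta, timezone

# Assumption: "CST" in the BE logs is China Standard Time (UTC+8).
CST = timezone(timedelta(hours=8), "CST")


def to_cst(epoch_seconds: int) -> str:
    """Format a unix timestamp as a CST wall-clock string."""
    return datetime.fromtimestamp(epoch_seconds, tz=CST).strftime("%Y-%m-%d %H:%M:%S")


if __name__ == "__main__":
    # The three "Aborted at ..." values copied from the traces above.
    for epoch in (1723014820, 1723018291, 1723018295):
        print(epoch, "->", to_cst(epoch))
```

With that conversion the SIGSEGV falls at 16:11:31, one second before the restart at 16:11:32 that then fails to open RocksDB at 16:11:35.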
   
   ---------------------------------------------
   logs in be.INFO
   
   I0807 15:13:40.349098 103013 data_dir.cpp:745] path: /data1/doris/be/storage 
total capacity: 7999426224128, available capacity: 7905475198976
   I0807 15:13:40.349145 103014 data_dir.cpp:745] path: /data2/doris/be/storage 
total capacity: 7999426224128, available capacity: 7907773960192
   I0807 15:13:40.349195 103013 data_dir.cpp:192] path: 
/data1/doris/be/storage, hash: 8911238614356524183
   I0807 15:13:40.349210 103014 data_dir.cpp:192] path: 
/data2/doris/be/storage, hash: 8825406747811533581
   W0807 15:13:40.351935 103013 olap_meta.cpp:80] rocks db open failed, 
reason:IO error: While lock file: /data1/doris/be/storage/meta/LOCK: Resource 
temporarily unavailable
   W0807 15:13:40.351981 103014 olap_meta.cpp:80] rocks db open failed, 
reason:IO error: While lock file: /data2/doris/be/storage/meta/LOCK: Resource 
temporarily unavailable
   W0807 15:13:40.351994 103013 data_dir.cpp:204] init OlapMeta failed, error: 
[IO_ERROR]open rocksdb failed, path=/data1/doris/be/storage
   W0807 15:13:40.352031 103013 data_dir.cpp:115] _init_meta failed, error: 
[IO_ERROR]open rocksdb failed, path=/data1/doris/be/storage
   W0807 15:13:40.352030 103014 data_dir.cpp:204] init OlapMeta failed, error: 
[IO_ERROR]open rocksdb failed, path=/data2/doris/be/storage
   W0807 15:13:40.352039 103013 storage_engine.cpp:208] Store load failed, 
status=[IO_ERROR]open rocksdb failed, path=/data1/doris/be/storage, 
path=/data1/doris/be/storage
   W0807 15:13:40.352041 103014 data_dir.cpp:115] _init_meta failed, error: 
[IO_ERROR]open rocksdb failed, path=/data2/doris/be/storage
   W0807 15:13:40.352072 103014 storage_engine.cpp:208] Store load failed, 
status=[IO_ERROR]open rocksdb failed, path=/data2/doris/be/storage, 
path=/data2/doris/be/storage
   W0807 15:13:40.352162 102351 storage_engine.cpp:172] _init_store_map failed, 
error: [INTERNAL_ERROR]init path failed, error=[IO_ERROR]open rocksdb failed, 
path=/data1/doris/be/storage;[IO_ERROR]open rocksdb failed, 
path=/data2/doris/be/storage;
   W0807 15:13:40.352175 102351 storage_engine.cpp:99] open engine failed, 
error: [INTERNAL_ERROR]init path failed, error=[IO_ERROR]open rocksdb failed, 
path=/data1/doris/be/storage;[IO_ERROR]open rocksdb failed, 
path=/data2/doris/be/storage;
   F0807 15:13:40.352217 102351 doris_main.cpp:408] fail to open StorageEngine, 
res=[INTERNAL_ERROR]init path failed, error=[IO_ERROR]open rocksdb failed, 
path=/data1/doris/be/storage;[IO_ERROR]open rocksdb failed, 
path=/data2/doris/be/storage;
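
The "While lock file: ... Resource temporarily unavailable" warnings usually mean that another process still holds the lock on `meta/LOCK`, for example a previous BE process that has not fully exited yet. A rough way to check this is sketched below. It assumes the lock is a POSIX advisory lock taken with `fcntl` (to my understanding that is what RocksDB does on Linux); the function name `lock_is_held` is hypothetical and not part of Doris or RocksDB. Note that the probe briefly takes the lock itself when it is free, so it should only be run by hand for diagnosis.

```python
import errno
import fcntl
import os


def lock_is_held(lock_path: str) -> bool:
    """Return True if another process holds a POSIX lock on lock_path."""
    fd = os.open(lock_path, os.O_RDWR)
    try:
        # Non-blocking attempt: fails with EAGAIN/EACCES if someone else holds it.
        fcntl.lockf(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except OSError as exc:
        if exc.errno in (errno.EAGAIN, errno.EACCES):
            return True
        raise
    else:
        # The lock was free; release it again right away.
        fcntl.lockf(fd, fcntl.LOCK_UN)
        return False
    finally:
        os.close(fd)


if __name__ == "__main__":
    # Paths taken from the be.INFO lines above.
    for path in ("/data1/doris/be/storage/meta/LOCK",
                 "/data2/doris/be/storage/meta/LOCK"):
        if os.path.exists(path):
            print(path, "held by another process:", lock_is_held(path))
```

If this reports the lock as held while no BE is supposed to be running, listing the processes that have the file open (for example with `lsof` on the LOCK path) should show which one is holding it.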
   
   
   
   
   
   ### What You Expected?
   
   The BE should not crash.
   
   ### How to Reproduce?
   
   _No response_
   
   ### Anything Else?
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


