yeqown opened a new issue, #48460:
URL: https://github.com/apache/doris/issues/48460

   ### Search before asking
   
   - [x] I had searched in the [issues](https://github.com/apache/doris/issues?q=is%3Aissue) and found no similar issues.
   
   
   ### Version
   
   3.0.3
   
   ### What's Wrong?
   
   I'm trying to deploy Doris on k8s. While starting the Doris cluster I got the 
following error:
   
   ```plain
   Defaulted container "compute" out of: compute, default-init (init)
   [Fri Feb 28 02:56:24 UTC 2025] [info] Process conf file be.conf ...
   /opt/apache-doris/be_disaggregated_entrypoint.sh: line 73: /opt/apache-doris/be/conf/: Is a directory
   [Fri Feb 28 02:56:24 UTC 2025] [info] use root no password show backends 
result 10221        
test-disaggregated-cluster-cg1-1.test-disaggregated-cluster-cg1.default.svc.cluster.local
       9050    -1      -1      -1      -1      NULL    NULL    false   false   
0       0.000   0.000   1.000 B 0.000   0.00 %  0.00 %  0.000   
{"cloud_unique_id" : "1:1751150972:t1Ws6Mrv", "compute_group_status" : 
"NORMAL", "private_endpoint" : "", "compute_group_name" : "cg1", "location" : 
"default", "public_endpoint" : "", "compute_group_id" : "ZNES_zRC"}        
java.net.UnknownHostException: 
test-disaggregated-cluster-cg1-1.test-disaggregated-cluster-cg1.default.svc.cluster.local
                
{"lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false,"isActive":true,"currentFragmentNum":0,"lastFragmentUpdateTime":0}
      287             1       0.00
   10222        
test-disaggregated-cluster-cg1-0.test-disaggregated-cluster-cg1.default.svc.cluster.local
       9050    -1      -1      -1      -1      NULL    NULL    false   false   
0       0.000   0.000   1.000 B 0.000   0.00 %  0.00 %  0.000 
{"cloud_unique_id" : "1:1751150972:YA6LXvvg", "compute_group_status" : 
"NORMAL", "private_endpoint" : "", "compute_group_name" : "cg1", "location" : 
"default", "public_endpoint" : "", "compute_group_id" : "ZNES_zRC"}  
java.net.UnknownHostException: 
test-disaggregated-cluster-cg1-0.test-disaggregated-cluster-cg1.default.svc.cluster.local
                
{"lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false,"isActive":true,"currentFragmentNum":0,"lastFragmentUpdateTime":0}
      287             1       0.00
   10223        
test-disaggregated-cluster-cg1-2.test-disaggregated-cluster-cg1.default.svc.cluster.local
       9050    -1      -1      -1      -1      NULL    NULL    false   false   
0       0.000   0.000   1.000 B 0.000   0.00 %  0.00 %  0.000 
{"cloud_unique_id" : "1:1751150972:Ox_RJuee", "compute_group_status" : 
"NORMAL", "private_endpoint" : "", "compute_group_name" : "cg1", "location" : 
"default", "public_endpoint" : "", "compute_group_id" : "ZNES_zRC"}  
java.net.UnknownHostException: 
test-disaggregated-cluster-cg1-2.test-disaggregated-cluster-cg1.default.svc.cluster.local
                
{"lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false,"isActive":true,"currentFragmentNum":0,"lastFragmentUpdateTime":0}
      287             1       0.00
   10224        
test-disaggregated-cluster-cg2-0.test-disaggregated-cluster-cg2.default.svc.cluster.local
       9050    -1      -1      -1      -1      NULL    NULL    false   false   
0       0.000   0.000   1.000 B 0.000   0.00 %  0.00 %  0.000 
{"cloud_unique_id" : "1:1751150972:E_SJoMU8", "compute_group_status" : 
"NORMAL", "private_endpoint" : "", "compute_group_name" : "cg2", "location" : 
"default", "public_endpoint" : "", "compute_group_id" : "oZ2gH5Ml"}  
java.net.UnknownHostException: 
test-disaggregated-cluster-cg2-0.test-disaggregated-cluster-cg2.default.svc.cluster.local
                
{"lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false,"isActive":true,"currentFragmentNum":0,"lastFragmentUpdateTime":0}
      287             1       0.00
   10251        
test-disaggregated-cluster-cg2-1.test-disaggregated-cluster-cg2.default.svc.cluster.local
       9050    -1      -1      -1      -1      NULL    NULL    false   false   
0       0.000   0.000   1.000 B 0.000   0.00 %  0.00 %  0.000 
{"cloud_unique_id" : "1:1751150972:B_h0m9vp", "compute_group_status" : 
"NORMAL", "private_endpoint" : "", "compute_group_name" : "cg2", "location" : 
"default", "public_endpoint" : "", "compute_group_id" : "oZ2gH5Ml"}  
java.net.UnknownHostException: 
test-disaggregated-cluster-cg2-1.test-disaggregated-cluster-cg2.default.svc.cluster.local
                
{"lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false,"isActive":true,"currentFragmentNum":0,"lastFragmentUpdateTime":0}
      287             1       0.00
   10252        
test-disaggregated-cluster-cg2-2.test-disaggregated-cluster-cg2.default.svc.cluster.local
       9050    -1      -1      -1      -1      NULL    NULL    false   false   
0       0.000   0.000   1.000 B 0.000   0.00 %  0.00 %  0.000 
{"cloud_unique_id" : "1:1751150972:nmt5aHJC", "compute_group_status" : 
"NORMAL", "private_endpoint" : "", "compute_group_name" : "cg2", "location" : 
"default", "public_endpoint" : "", "compute_group_id" : "oZ2gH5Ml"}  
java.net.UnknownHostException: 
test-disaggregated-cluster-cg2-2.test-disaggregated-cluster-cg2.default.svc.cluster.local
                
{"lastStreamLoadTime":-1,"isQueryDisabled":false,"isLoadDisabled":false,"isActive":true,"currentFragmentNum":0,"lastFragmentUpdateTime":0}
      287             1       0.00  .
   [Fri Feb 28 02:56:24 UTC 2025] [info] Check myself (test-disaggregated-cluster-cg1-0.test-disaggregated-cluster-cg1.default.svc.cluster.local:9050) exist in FE, start be directly ...
   /etc/podinfo/annotationsis not exists.
   [Fri Feb 28 02:56:24 UTC 2025] run start_be.sh
   Disable swap memory before starting be
   ```
   
   All compute group pods are in `CrashLoopBackOff`:
   
   ```plain
   test-disaggregated-cluster-cg1-0   0/1   CrashLoopBackOff   2 (34s ago)   57s
   test-disaggregated-cluster-cg1-1   0/1   CrashLoopBackOff   2 (36s ago)   57s
   test-disaggregated-cluster-cg1-2   0/1   CrashLoopBackOff   2 (33s ago)   57s
   test-disaggregated-cluster-cg2-0   0/1   CrashLoopBackOff   2 (35s ago)   57s
   test-disaggregated-cluster-cg2-1   0/1   CrashLoopBackOff   2 (34s ago)   57s
   test-disaggregated-cluster-cg2-2   0/1   CrashLoopBackOff   2 (32s ago)   57s
   ```
   
   Line 73 of `be_disaggregated_entrypoint.sh`:
   
![Image](https://github.com/user-attachments/assets/7b1e0054-2cd8-4947-823e-f4b6ffaf38cc)
   
   It looks like `$file` is undefined, judging from this log output:
   
   ```bash
   [Fri Feb 28 02:56:24 UTC 2025] [info] Process conf file be.conf ...
   /opt/apache-doris/be_disaggregated_entrypoint.sh: line 73: /opt/apache-doris/be/conf/: Is a directory
   ```
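
To illustrate the suspicion above, here is a minimal sketch of how an empty file-name variable can produce an `Is a directory` error. It is not the actual code of `be_disaggregated_entrypoint.sh`: the names `conf_dir`, `file`, `err` and `status` are hypothetical, and a temporary directory stands in for `/opt/apache-doris/be/conf/`.

```shell
#!/usr/bin/env bash
# Hypothetical reproduction, not the real entrypoint script.
conf_dir="$(mktemp -d)/"   # stands in for /opt/apache-doris/be/conf/
file=""                    # assumption: the file name ended up empty

# With an empty $file the redirection target is the directory itself,
# so the shell cannot open it for writing and the command fails.
if err=$( { echo "key = value" >> "${conf_dir}${file}"; } 2>&1 ); then
    status="written"
else
    status="failed"
fi

echo "status: $status"
echo "message: $err"

# Clean up the temporary directory.
rmdir "$conf_dir"
```

If this is indeed the cause, a check such as `[ -n "$file" ]` before the write would skip the empty name instead of failing.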
   
   ### What You Expected?
   
   The Doris cluster with compute groups starts successfully.
   
   ### How to Reproduce?
   
   I'm deploying with the guide 
https://doris.apache.org/zh-CN/docs/3.0/install/deploy-on-kubernetes/separating-storage-compute/install-doris-cluster
   
   I made no modifications to `be.conf`.
   
   ### Anything Else?
   
   The k8s cluster is started by minikube running on Docker. Here are my 
`ddc-sample.yaml` and `be-configmap`:
   
   ```yaml
   # Licensed to the Apache Software Foundation (ASF) under one
   # or more contributor license agreements.  See the NOTICE file
   # distributed with this work for additional information
   # regarding copyright ownership.  The ASF licenses this file
   # to you under the Apache License, Version 2.0 (the
   # "License"); you may not use this file except in compliance
   # with the License.  You may obtain a copy of the License at
   #
   #   http://www.apache.org/licenses/LICENSE-2.0
   #
   # Unless required by applicable law or agreed to in writing,
   # software distributed under the License is distributed on an
   # "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
   # KIND, either express or implied.  See the License for the
   # specific language governing permissions and limitations
   # under the License.
   
   apiVersion: disaggregated.cluster.doris.com/v1
   kind: DorisDisaggregatedCluster
   metadata:
     name: test-disaggregated-cluster
   spec:
     metaService:
       image: apache/doris:ms-3.0.3
       configMaps:
         - name: doris-metaservice
           mountPath: /etc/doris
       fdb:
         configMapNamespaceName:
           name: test-cluster-config
           namespace: default
     feSpec:
       replicas: 2
       image: apache/doris:fe-3.0.3
       configMaps:
         - name: fe-configmap
       persistentVolume:
         persistentVolumeClaimSpec:
           storageClassName:
           accessModes:
             - ReadWriteOnce
           resources:
             requests:
               storage: 10Gi
     computeGroups:
       - uniqueId: cg1
         replicas: 3
         image: apache/doris:be-3.0.3
         configMaps:
           - name: be-configmap
             mountPath: /etc/doris
         persistentVolume:
           annotations:
             doris.computegroup/id: cg1
             doris.deployment/mode: disaggregated
           logNotStore: true
           persistentVolumeClaimSpec:
             storageClassName:
             accessModes:
               - ReadWriteOnce
             resources:
               requests:
                 storage: 10Gi
       - uniqueId: cg2
         replicas: 3
         image: apache/doris:be-3.0.3
         configMaps:
           - name: be-configmap
             mountPath: /etc/doris
         persistentVolume:
           annotations:
             doris.computegroup/id: cg1
             doris.deployment/mode: disaggregated
           logNotStore: true
           persistentVolumeClaimSpec:
             storageClassName:
             accessModes:
               - ReadWriteOnce
             resources:
               requests:
                 storage: 10Gi
   ```
   
   ```yaml
   apiVersion: v1
   kind: ConfigMap
   metadata:
     name: be-configmap
     labels:
       app.kubernetes.io/component: be
   data:
     be.conf: |
       # For jdk 17, this JAVA_OPTS will be used as default JVM options
       JAVA_OPTS_FOR_JDK_17="-Xmx1024m -DlogPath=$LOG_DIR/jni.log -Xlog:gc*:$LOG_DIR/be.gc.log.$CUR_DATE:time,uptime:filecount=10,filesize=50M -Djavax.security.auth.useSubjectCredsOnly=false -Dsun.security.krb5.debug=true -Dsun.java.command=DorisBE -XX:-CriticalJNINatives -XX:+IgnoreUnrecognizedVMOptions --add-opens=java.base/java.lang=ALL-UNNAMED --add-opens=java.base/java.lang.invoke=ALL-UNNAMED --add-opens=java.base/java.lang.reflect=ALL-UNNAMED --add-opens=java.base/java.io=ALL-UNNAMED --add-opens=java.base/java.net=ALL-UNNAMED --add-opens=java.base/java.nio=ALL-UNNAMED --add-opens=java.base/java.util=ALL-UNNAMED --add-opens=java.base/java.util.concurrent=ALL-UNNAMED --add-opens=java.base/java.util.concurrent.atomic=ALL-UNNAMED --add-opens=java.base/sun.nio.ch=ALL-UNNAMED --add-opens=java.base/sun.nio.cs=ALL-UNNAMED --add-opens=java.base/sun.security.action=ALL-UNNAMED --add-opens=java.base/sun.util.calendar=ALL-UNNAMED --add-opens=java.security.jgss/sun.security.krb5=ALL-UNNAMED --add-opens=java.management/sun.management=ALL-UNNAMED"
       file_cache_path = [{"path":"/mnt/disk1/doris_cloud/file_cache","total_size":107374182400,"query_limit":107374182400}]
   ```
   
   ### Are you willing to submit PR?
   
   - [x] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [x] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

