This is an automated email from the ASF dual-hosted git repository. yaqian pushed a commit to branch document in repository https://gitbox.apache.org/repos/asf/kylin.git
The following commit(s) were added to refs/heads/document by this push: new 897c186 fix typos 897c186 is described below commit 897c18649f974a5156c664fe502d9a7b511ac932 Author: yaqian.zhang <598593...@qq.com> AuthorDate: Wed Sep 1 18:47:08 2021 +0800 fix typos --- website/_docs40/install/deploy_without_hadoop.cn.md | 9 ++++----- website/_docs40/install/deploy_without_hadoop.md | 11 +++++------ 2 files changed, 9 insertions(+), 11 deletions(-) diff --git a/website/_docs40/install/deploy_without_hadoop.cn.md b/website/_docs40/install/deploy_without_hadoop.cn.md index c1b9826..c7f1252 100644 --- a/website/_docs40/install/deploy_without_hadoop.cn.md +++ b/website/_docs40/install/deploy_without_hadoop.cn.md @@ -5,12 +5,12 @@ categories: install permalink: /cn/docs40/install/deploy_without_hadoop.html --- -相比于 Kylin3,Kylin4.0 实现了全新 spark 构建引擎和 parquet 存储,使 kylin 不依赖 hadoop 环境部署成为可能。与在 AWS EMR 之上部署 Kylin3 相比,直接在 AWS EC2 实例上部署 Kylin4 存在以下优势: +相比于 Kylin 3.x,Kylin 4.0 实现了全新 spark 构建引擎和 parquet 存储,使 kylin 不依赖 hadoop 环境部署成为可能。与在 AWS EMR 之上部署 Kylin 3.x 相比,直接在 AWS EC2 实例上部署 Kylin 4.0 存在以下优势: 1. 节省成本。相比 AWS EMR 节点,AWS EC2 节点的成本更低。 2. 更加灵活。在 EC2 节点上,用户可以更加自主选择自己所需的服务以及组件进行安装部署。 3. 去 Hadoop。Hadoop 生态比较重,需要投入一定的人力成本进行维护,去 Hadoop 可以更加贴近云原生。 -在实现了支持在 spark standalone 模式下进行构建和查询的功能之后,我们在 AWS 的 EC2 实例上对无 hadoop 部署 Kylin4 做了尝试,并成功构建 cube 和进行了查询。 +在实现了支持在 Spark Standalone 模式下进行构建和查询的功能之后,我们在 AWS 的 EC2 实例上对无 Hadoop 部署 Kylin 4.0 做了尝试,并成功构建 Cube 和进行了查询。 ### 环境准备 @@ -55,7 +55,7 @@ permalink: /cn/docs40/install/deploy_without_hadoop.html ```shell mkdir /usr/local/java - tar -xvf jdk-8u291-linux-x64.tar -C /usr/local/java + tar -xvf java-1.8.0-openjdk.tar -C /usr/local/java ``` #### 3 配置 Hadoop @@ -313,8 +313,7 @@ permalink: /cn/docs40/install/deploy_without_hadoop.html kylin.metadata.url=kylin_metadata@jdbc,url=jdbc:mysql://hostname:3306/kylin,username=root,password=password,maxActive=10,maxIdle=10 kylin.env.zookeeper-connect-string=hostname kylin.engine.spark-conf.spark.master=spark://hostname:7077 - # 当 spark.submit.deployMode 配置为 cluster 时,需要额外配置 kylin.engine.spark.standalone.master.httpUrl - kylin.engine.spark-conf.spark.submit.deployMode=client/cluster + kylin.engine.spark-conf.spark.submit.deployMode=client kylin.env.hdfs-working-dir=s3://bucket/kylin kylin.engine.spark-conf.spark.eventLog.dir=s3://bucket/kylin/spark-history kylin.engine.spark-conf.spark.history.fs.logDirectory=s3://bucket/kylin/spark-history diff --git a/website/_docs40/install/deploy_without_hadoop.md b/website/_docs40/install/deploy_without_hadoop.md index a2fcdee..779176d 100644 --- a/website/_docs40/install/deploy_without_hadoop.md +++ b/website/_docs40/install/deploy_without_hadoop.md @@ -5,12 +5,12 @@ categories: install permalink: /docs40/install/deploy_without_hadoop.html --- -Compared with kylin3, kylin4 implements a new spark build engine and parquet storage, making it possible for kylin to deploy without Hadoop environment. Compared with deploying kylin3 on AWS EMR, deploying kylin4 directly on AWS EC2 instances has the following advantages: +Compared with Kylin 3.x, Kylin 4.0 implements a new Spark build engine and parquet storage, making it possible for Kylin to deploy without Hadoop environment. Compared with deploying Kylin 3.x on AWS EMR, deploying kylin4 directly on AWS EC2 instances has the following advantages: 1. Cost saving. Compared with AWS EMR node, AWS EC2 node has lower cost. 2. More flexible. On the EC2 node, users can more independently select the services and components they need for installation and deployment. 3. Remove Hadoop dependency. Hadoop ecology is heavy and needs to be maintained at a certain labor cost. Remove hadoop can be closer to the cloud-native. -After realizing the feature of supporting build and query in spark standalone mode, we tried to deploy kylin4.0 without Hadoop on the EC2 instance of AWS, and successfully built the cube and query. +After realizing the feature of supporting build and query in Spark Standalone mode, we tried to deploy Kylin 4.0 without Hadoop on the EC2 instance of AWS, and successfully built the cube and query. ### Environment preparation - Apply for AWS EC2 Linux instances as required @@ -53,7 +53,7 @@ The component version information provided here is that we selected during the t ```shell mkdir /usr/local/java - tar -xvf jdk-8u291-linux-x64.tar -C /usr/local/java + tar -xvf java-1.8.0-openjdk.tar -C /usr/local/java ``` @@ -326,8 +326,7 @@ The component version information provided here is that we selected during the t kylin.metadata.url=kylin_metadata@jdbc,url=jdbc:mysql://hostname:3306/kylin,username=root,password=password,maxActive=10,maxIdle=10 kylin.env.zookeeper-connect-string=hostname kylin.engine.spark-conf.spark.master=spark://hostname:7077 - # 当 spark.submit.deployMode 配置为 cluster 时,需要额外配置 kylin.engine.spark.standalone.master.httpUrl - kylin.engine.spark-conf.spark.submit.deployMode=client/cluster + kylin.engine.spark-conf.spark.submit.deployMode=client kylin.env.hdfs-working-dir=s3://bucket/kylin kylin.engine.spark-conf.spark.eventLog.dir=s3://bucket/kylin/spark-history kylin.engine.spark-conf.spark.history.fs.logDirectory=s3://bucket/kylin/spark-history @@ -338,7 +337,7 @@ The component version information provided here is that we selected during the t - Execute `bin/kylin.sh start` -- Kylin may encounter ClassNotFound type errors during startup. Please refer to the following methods to restart kylin: +- Kylin may encounter ClassNotFound type errors during startUp. Please refer to the following methods to restart kylin: ```shell # Download commons-collections-3.2.2.jar