Author: lidong
Date: Thu Aug 27 14:10:06 2020
New Revision: 1881248

URL: http://svn.apache.org/viewvc?rev=1881248&view=rev
Log:
KYLIN-4715 Wrong function with kylin document about how to optimize cube build

Modified:
    kylin/site/cn/docs/howto/howto_optimize_build.html
    kylin/site/cn/docs/tutorial/cube_spark.html
    kylin/site/cn/docs24/howto/howto_optimize_build.html
    kylin/site/cn/docs30/howto/howto_optimize_build.html
    kylin/site/cn/docs30/tutorial/cube_spark.html
    kylin/site/docs/howto/howto_optimize_build.html
    kylin/site/docs/tutorial/cube_spark.html
    kylin/site/docs24/howto/howto_optimize_build.html
    kylin/site/docs30/howto/howto_optimize_build.html
    kylin/site/docs30/tutorial/cube_spark.html
    kylin/site/feed.xml

Modified: kylin/site/cn/docs/howto/howto_optimize_build.html
URL: 
http://svn.apache.org/viewvc/kylin/site/cn/docs/howto/howto_optimize_build.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/cn/docs/howto/howto_optimize_build.html (original)
+++ kylin/site/cn/docs/howto/howto_optimize_build.html Thu Aug 27 14:10:06 2020
@@ -293,7 +293,7 @@ INSERT OVERWRITE TABLE kylin_intermediat
 
 <p>有些cuboid可以从一个以上的父cuboid聚合得到,这种情
况下,Kylin会选择最小的一个父cuboid。举例,AB可以从ABC(id:1110)和ABD(id:1101)生成,则ABD会被选中,å›
 
为它的比ABC要小。在这基础上,如果D的基数较小,聚合运算的成本就会比较低。所以,当设计rowkey序列的时候,请记得将基数较小的维度放在末尾。这æ
 ·ä¸ä»…有利于cube构建,而且有助于cube查询,因
为预聚合也遵循相同的规则。</p>
 
-<p>通常来说,从N维到(N/2)维的构建比较慢,因
为这是cuboid数量爆炸性增长的阶段:N维有1个cuboid,(N-1)维有N个cuboid,(N-2)维有N*(N-1)个cuboid,以此类推。经过(N/2)维构建的步骤,整个构建任务会逐渐变快。</p>
+<p>通常来说,从N维到(N/2)维的构建比较慢,因
为这是cuboid数量爆炸性增长的阶段:N维有1个cuboid,(N-1)维有N个cuboid,(N-2)维有N*(N-1)/2个cuboid,以此类推。经过(N/2)维构建的步骤,整个构建任务会逐渐变快。</p>
 
 <h2 id="cube">构建cube</h2>
 

Modified: kylin/site/cn/docs/tutorial/cube_spark.html
URL: 
http://svn.apache.org/viewvc/kylin/site/cn/docs/tutorial/cube_spark.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/cn/docs/tutorial/cube_spark.html (original)
+++ kylin/site/cn/docs/tutorial/cube_spark.html Thu Aug 27 14:10:06 2020
@@ -299,7 +299,8 @@ kylin.engine.livy-conf.livy-arr.jars=hdf
 <p>现在构建步骤中的’extract fact table distinct value’ 和 
‘build dimension dictionary’ 
两个步骤也可以使用Spark进行构建了。相关的配置如下:</p>
 
 <div class="highlight"><pre><code class="language-groff" 
data-lang="groff">kylin.engine.spark-fact-distinct=true
-kylin.engine.spark-dimension-dictionary=true</code></pre></div>
+kylin.engine.spark-dimension-dictionary=true 
+kylin.engine.spark-uhc-dictionary=true</code></pre></div>
 
 <h2 id="section-2">疑难解答</h2>
 

Modified: kylin/site/cn/docs24/howto/howto_optimize_build.html
URL: 
http://svn.apache.org/viewvc/kylin/site/cn/docs24/howto/howto_optimize_build.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/cn/docs24/howto/howto_optimize_build.html (original)
+++ kylin/site/cn/docs24/howto/howto_optimize_build.html Thu Aug 27 14:10:06 
2020
@@ -293,7 +293,7 @@ INSERT OVERWRITE TABLE kylin_intermediat
 
 <p>有些cuboid可以从一个以上的父cuboid聚合得到,这种情
况下,Kylin会选择最小的一个父cuboid。举例,AB可以从ABC(id:1110)和ABD(id:1101)生成,则ABD会被选中,å›
 
为它的比ABC要小。在这基础上,如果D的基数较小,聚合运算的成本就会比较低。所以,当设计rowkey序列的时候,请记得将基数较小的维度放在末尾。这æ
 ·ä¸ä»…有利于cube构建,而且有助于cube查询,因
为预聚合也遵循相同的规则。</p>
 
-<p>通常来说,从N维到(N/2)维的构建比较慢,因
为这是cuboid数量爆炸性增长的阶段:N维有1个cuboid,(N-1)维有N个cuboid,(N-2)维有N*(N-1)个cuboid,以此类推。经过(N/2)维构建的步骤,整个构建任务会逐渐变快。</p>
+<p>通常来说,从N维到(N/2)维的构建比较慢,因
为这是cuboid数量爆炸性增长的阶段:N维有1个cuboid,(N-1)维有N个cuboid,(N-2)维有N*(N-1)/2个cuboid,以此类推。经过(N/2)维构建的步骤,整个构建任务会逐渐变快。</p>
 
 <h2 id="cube">构建cube</h2>
 

Modified: kylin/site/cn/docs30/howto/howto_optimize_build.html
URL: 
http://svn.apache.org/viewvc/kylin/site/cn/docs30/howto/howto_optimize_build.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/cn/docs30/howto/howto_optimize_build.html (original)
+++ kylin/site/cn/docs30/howto/howto_optimize_build.html Thu Aug 27 14:10:06 
2020
@@ -293,7 +293,7 @@ INSERT OVERWRITE TABLE kylin_intermediat
 
 <p>有些cuboid可以从一个以上的父cuboid聚合得到,这种情
况下,Kylin会选择最小的一个父cuboid。举例,AB可以从ABC(id:1110)和ABD(id:1101)生成,则ABD会被选中,å›
 
为它的比ABC要小。在这基础上,如果D的基数较小,聚合运算的成本就会比较低。所以,当设计rowkey序列的时候,请记得将基数较小的维度放在末尾。这æ
 ·ä¸ä»…有利于cube构建,而且有助于cube查询,因
为预聚合也遵循相同的规则。</p>
 
-<p>通常来说,从N维到(N/2)维的构建比较慢,因
为这是cuboid数量爆炸性增长的阶段:N维有1个cuboid,(N-1)维有N个cuboid,(N-2)维有N*(N-1)个cuboid,以此类推。经过(N/2)维构建的步骤,整个构建任务会逐渐变快。</p>
+<p>通常来说,从N维到(N/2)维的构建比较慢,因
为这是cuboid数量爆炸性增长的阶段:N维有1个cuboid,(N-1)维有N个cuboid,(N-2)维有N*(N-1)/2个cuboid,以此类推。经过(N/2)维构建的步骤,整个构建任务会逐渐变快。</p>
 
 <h2 id="cube">构建cube</h2>
 

Modified: kylin/site/cn/docs30/tutorial/cube_spark.html
URL: 
http://svn.apache.org/viewvc/kylin/site/cn/docs30/tutorial/cube_spark.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/cn/docs30/tutorial/cube_spark.html (original)
+++ kylin/site/cn/docs30/tutorial/cube_spark.html Thu Aug 27 14:10:06 2020
@@ -299,7 +299,8 @@ kylin.engine.livy-conf.livy-arr.jars=hdf
 <p>现在构建步骤中的’extract fact table distinct value’ 和 
‘build dimension dictionary’ 
两个步骤也可以使用Spark进行构建了。相关的配置如下:</p>
 
 <div class="highlight"><pre><code class="language-groff" 
data-lang="groff">kylin.engine.spark-fact-distinct=true
-kylin.engine.spark-dimension-dictionary=true</code></pre></div>
+kylin.engine.spark-dimension-dictionary=true
+kylin.engine.spark-uhc-dictionary=true</code></pre></div>
 
 <h2 id="section-2">疑难解答</h2>
 

Modified: kylin/site/docs/howto/howto_optimize_build.html
URL: 
http://svn.apache.org/viewvc/kylin/site/docs/howto/howto_optimize_build.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/docs/howto/howto_optimize_build.html (original)
+++ kylin/site/docs/howto/howto_optimize_build.html Thu Aug 27 14:10:06 2020
@@ -8513,7 +8513,7 @@ INSERT OVERWRITE TABLE kylin_intermediat
 
 <p>Some cuboid can be aggregated from more than 1 parent cubiods, in this 
case, Kylin will select the minimal parent cuboid. For example, AB can be 
generated from ABC (id: 1110) and ABD (id: 1101), so ABD will be used as its id 
is smaller than ABC. Based on this, if D’s cardinality is small, the 
aggregation will be cost-efficient. So, when you design the Cube rowkey 
sequence, please remember to put low cardinality dimensions to the tail 
position. This not only benefit the Cube build, but also benefit the Cube query 
as the post-aggregation follows the same rule.</p>
 
-<p>Usually from the N-D to (N/2)-D the building is slow, because it is the 
cuboid explosion process: N-D has 1 Cuboid, (N-1)-D has N cuboids, (N-2)-D has 
N*(N-1) cuboids, etc. After (N/2)-D step, the building gets faster 
gradually.</p>
+<p>Usually from the N-D to (N/2)-D the building is slow, because it is the 
cuboid explosion process: N-D has 1 Cuboid, (N-1)-D has N cuboids, (N-2)-D has 
N*(N-1)/2 cuboids, etc. After (N/2)-D step, the building gets faster 
gradually.</p>
 
 <h2 id="build-cube">Build Cube</h2>
 

Modified: kylin/site/docs/tutorial/cube_spark.html
URL: 
http://svn.apache.org/viewvc/kylin/site/docs/tutorial/cube_spark.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/docs/tutorial/cube_spark.html (original)
+++ kylin/site/docs/tutorial/cube_spark.html Thu Aug 27 14:10:06 2020
@@ -8518,8 +8518,8 @@ kylin.engine.livy-conf.livy-arr.jars=hdf
 <p>As we all know, the cubing job includes several steps and the steps 
‘extract fact table distinct value’, ‘build dimension dictionary’ and 
‘build UHC dimension dictionary’ can also be built by spark. The 
configurations are as follows.</p>
 
 <div class="highlight"><pre><code class="language-groff" 
data-lang="groff">kylin.engine.spark-fact-distinct=true
-kylin.engine.spark-dimension-dictionary=true 
-kylin.engine.spark-udc-dictionary=true</code></pre></div>
+kylin.engine.spark-dimension-dictionary=true
+kylin.engine.spark-uhc-dictionary=true</code></pre></div>
 
 <h2 id="troubleshooting">Troubleshooting</h2>
 

Modified: kylin/site/docs24/howto/howto_optimize_build.html
URL: 
http://svn.apache.org/viewvc/kylin/site/docs24/howto/howto_optimize_build.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/docs24/howto/howto_optimize_build.html (original)
+++ kylin/site/docs24/howto/howto_optimize_build.html Thu Aug 27 14:10:06 2020
@@ -8514,7 +8514,7 @@ INSERT OVERWRITE TABLE kylin_intermediat
 
 <p>Some cuboid can be aggregated from more than 1 parent cubiods, in this 
case, Kylin will select the minimal parent cuboid. For example, AB can be 
generated from ABC (id: 1110) and ABD (id: 1101), so ABD will be used as its id 
is smaller than ABC. Based on this, if D’s cardinality is small, the 
aggregation will be cost-efficient. So, when you design the Cube rowkey 
sequence, please remember to put low cardinality dimensions to the tail 
position. This not only benefit the Cube build, but also benefit the Cube query 
as the post-aggregation follows the same rule.</p>
 
-<p>Usually from the N-D to (N/2)-D the building is slow, because it is the 
cuboid explosion process: N-D has 1 Cuboid, (N-1)-D has N cuboids, (N-2)-D has 
N*(N-1) cuboids, etc. After (N/2)-D step, the building gets faster 
gradually.</p>
+<p>Usually from the N-D to (N/2)-D the building is slow, because it is the 
cuboid explosion process: N-D has 1 Cuboid, (N-1)-D has N cuboids, (N-2)-D has 
N*(N-1)/2 cuboids, etc. After (N/2)-D step, the building gets faster 
gradually.</p>
 
 <h2 id="build-cube">Build Cube</h2>
 

Modified: kylin/site/docs30/howto/howto_optimize_build.html
URL: 
http://svn.apache.org/viewvc/kylin/site/docs30/howto/howto_optimize_build.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/docs30/howto/howto_optimize_build.html (original)
+++ kylin/site/docs30/howto/howto_optimize_build.html Thu Aug 27 14:10:06 2020
@@ -8513,7 +8513,7 @@ INSERT OVERWRITE TABLE kylin_intermediat
 
 <p>Some cuboid can be aggregated from more than 1 parent cubiods, in this 
case, Kylin will select the minimal parent cuboid. For example, AB can be 
generated from ABC (id: 1110) and ABD (id: 1101), so ABD will be used as its id 
is smaller than ABC. Based on this, if D’s cardinality is small, the 
aggregation will be cost-efficient. So, when you design the Cube rowkey 
sequence, please remember to put low cardinality dimensions to the tail 
position. This not only benefit the Cube build, but also benefit the Cube query 
as the post-aggregation follows the same rule.</p>
 
-<p>Usually from the N-D to (N/2)-D the building is slow, because it is the 
cuboid explosion process: N-D has 1 Cuboid, (N-1)-D has N cuboids, (N-2)-D has 
N*(N-1) cuboids, etc. After (N/2)-D step, the building gets faster 
gradually.</p>
+<p>Usually from the N-D to (N/2)-D the building is slow, because it is the 
cuboid explosion process: N-D has 1 Cuboid, (N-1)-D has N cuboids, (N-2)-D has 
N*(N-1)/2 cuboids, etc. After (N/2)-D step, the building gets faster 
gradually.</p>
 
 <h2 id="build-cube">Build Cube</h2>
 

Modified: kylin/site/docs30/tutorial/cube_spark.html
URL: 
http://svn.apache.org/viewvc/kylin/site/docs30/tutorial/cube_spark.html?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/docs30/tutorial/cube_spark.html (original)
+++ kylin/site/docs30/tutorial/cube_spark.html Thu Aug 27 14:10:06 2020
@@ -8519,7 +8519,7 @@ kylin.engine.livy-conf.livy-arr.jars=hdf
 
 <div class="highlight"><pre><code class="language-groff" 
data-lang="groff">kylin.engine.spark-fact-distinct=true
 kylin.engine.spark-dimension-dictionary=true 
-kylin.engine.spark-udc-dictionary=true</code></pre></div>
+kylin.engine.spark-uhc-dictionary=true</code></pre></div>
 
 <h2 id="troubleshooting">Troubleshooting</h2>
 

Modified: kylin/site/feed.xml
URL: 
http://svn.apache.org/viewvc/kylin/site/feed.xml?rev=1881248&r1=1881247&r2=1881248&view=diff
==============================================================================
--- kylin/site/feed.xml (original)
+++ kylin/site/feed.xml Thu Aug 27 14:10:06 2020
@@ -19,8 +19,8 @@
     <description>Apache Kylin Home</description>
     <link>http://kylin.apache.org/</link>
     <atom:link href="http://kylin.apache.org/feed.xml"; rel="self" 
type="application/rss+xml"/>
-    <pubDate>Mon, 24 Aug 2020 06:59:11 -0700</pubDate>
-    <lastBuildDate>Mon, 24 Aug 2020 06:59:11 -0700</lastBuildDate>
+    <pubDate>Thu, 27 Aug 2020 06:59:14 -0700</pubDate>
+    <lastBuildDate>Thu, 27 Aug 2020 06:59:14 -0700</lastBuildDate>
     <generator>Jekyll v2.5.3</generator>
     
       <item>


Reply via email to