This is an automated email from the ASF dual-hosted git repository. morningman pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-doris.git
The following commit(s) were added to refs/heads/master by this push: new 421828d [Doc] Fix format in doris_storage_optimization.md (#4250) 421828d is described below commit 421828d52afa1113c00250b719f4975685dedc42 Author: Fullstop000 <fullstop1...@gmail.com> AuthorDate: Wed Aug 5 21:45:03 2020 +0800 [Doc] Fix format in doris_storage_optimization.md (#4250) --- docs/en/internal/doris_storage_optimization.md | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/docs/en/internal/doris_storage_optimization.md b/docs/en/internal/doris_storage_optimization.md index c62b6fa..529b85d 100644 --- a/docs/en/internal/doris_storage_optimization.md +++ b/docs/en/internal/doris_storage_optimization.md @@ -111,11 +111,11 @@ For each column, a sparse index of row numbers is established according to page We generate a sparse index of short key every N rows (configurable) with the contents of short key - > line number (ordinal) -### Column's other indexes### +### Column's other indexes ### The format design supports the subsequent expansion of other index information, such as bitmap index, spatial index, etc. It only needs to write the required data to the existing column data, and add the corresponding metadata fields to FileFooterPB. -### Metadata Definition### +### Metadata Definition ### FileFooterPB is defined as: ``` @@ -179,7 +179,7 @@ message FileFooterPB { ``` -## Read-write logic## +## Read-write logic ## ### Write ### @@ -203,7 +203,7 @@ Relevant issues: - rle - bshuf -### Read### +### Read ### 1. Read the magic of the file and judge the type and version of the file. 2. Read FileFooterPB and check sum @@ -220,11 +220,11 @@ Relevant issues: 2. How to achieve efficient block reading? Consider merging adjacent blocks while they are being read, one-time reading? This requires judging whether the block is continuous at the time of reading, and if it is continuous, reading it once. -## Coding## +## Coding ## In the existing Doris storage, plain encoding is adopted for string type encoding, which is inefficient. After comparison, it is found that in Baidu statistics scenario, data will expand more than twice because of string type coding. Therefore, it is planned to introduce dictionary-based coding compression. -## Compression## +## Compression ## It implements a scalable compression framework, supports a variety of compression algorithms, facilitates the subsequent addition of new compression algorithms, and plans to introduce zstd compression. --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org For additional commands, e-mail: commits-h...@doris.apache.org