sajjad-moradi commented on a change in pull request #6546:
URL: https://github.com/apache/incubator-pinot/pull/6546#discussion_r577268997



##########
File path: pinot-tools/src/main/java/org/apache/pinot/tools/data/generator/StringGenerator.java
##########
@@ -56,11 +65,18 @@ public void init() {
 
   @Override
   public Object next() {
+    if (numberOfValuesPerEntry == 1) {
+      return getNextString();
+    }
+    return MultiValueGeneratorHelper.generateMultiValueEntries(numberOfValuesPerEntry, rand, this::getNextString);
+  }
+
+  private String getNextString() {
     return vals.get(rand.nextInt(cardinality));

Review comment:
   I'm not sure why we need to generate millions of records. I assume 
something below 500K records should suffice, and I don't expect us to run 
into trouble with that number of rows. 
   BTW, I have tested it on my Mac with 10M rows and high-cardinality 
columns (cardinalities on the order of millions) and observed no issues.
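
   For context, here is a rough sketch of what the multi-value branch in the 
diff above does conceptually. `MultiValueSketch` and its `generate` method 
are hypothetical stand-ins, not the PR's actual `MultiValueGeneratorHelper`; 
the sketch assumes an integer number of values per entry and omits the `rand` 
argument that the real helper receives.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import java.util.function.Supplier;

// Hypothetical stand-in for the PR's helper, for illustration only.
class MultiValueSketch {
  // Draws numberOfValuesPerEntry values from the supplier and returns them as one entry.
  static <T> List<T> generate(int numberOfValuesPerEntry, Supplier<T> nextValue) {
    List<T> entries = new ArrayList<>(numberOfValuesPerEntry);
    for (int i = 0; i < numberOfValuesPerEntry; i++) {
      entries.add(nextValue.get());
    }
    return entries;
  }

  public static void main(String[] args) {
    // Assumed sample dictionary; the generator in the diff picks from `vals` by random index.
    List<String> vals = List.of("a", "b", "c");
    Random rand = new Random(42);
    Supplier<String> next = () -> vals.get(rand.nextInt(vals.size()));
    System.out.println(generate(4, next));
  }
}
```

   The cost per row grows linearly with the number of values per entry, which 
is why the row count matters more for multi-value columns than for 
single-value ones.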



