Re: Cluster sizing for huge dataset

2019-10-01 Thread Julien Laurenceau
Hi, Depending on the use case, you may also consider storage tiering, with fresh data on a hot tier (Cassandra) and older data on a cold tier (Spark/Parquet or Presto/Parquet). It would be a lot more complex, but it may fit the budget better, and you may reuse some tech already present in your e
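
To make the hot/cold split concrete, here is a minimal sketch of age-based routing, assuming records carry a timestamp. The 30-day window, the `ts` field, and the function names are hypothetical and do not come from the thread; the actual writes to Cassandra or Parquet are left out.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical retention window for the hot tier; the thread does not give one.
HOT_RETENTION = timedelta(days=30)


def choose_tier(record_time, now, hot_retention=HOT_RETENTION):
    """Return "hot" for records newer than the retention window, else "cold"."""
    age = now - record_time
    return "hot" if age < hot_retention else "cold"


def split_by_tier(records, now, hot_retention=HOT_RETENTION):
    """Group records (dicts with a hypothetical "ts" key) by target tier."""
    tiers = {"hot": [], "cold": []}
    for rec in records:
        tiers[choose_tier(rec["ts"], now, hot_retention)].append(rec)
    return tiers


if __name__ == "__main__":
    now = datetime(2019, 10, 1, tzinfo=timezone.utc)
    sample = [
        {"id": 1, "ts": now - timedelta(days=2)},    # recent, stays hot
        {"id": 2, "ts": now - timedelta(days=120)},  # old, goes cold
    ]
    for tier, recs in split_by_tier(sample, now).items():
        print(tier, [r["id"] for r in recs])
```

In a real setup the "hot" list would be served from Cassandra and the "cold" list exported to Parquet files, with the window tuned to the query pattern and the budget.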

Re: Cheat Sheet for Unix based OS, Performance troubleshooting

2019-07-30 Thread Julien Laurenceau
Hi, My personal favorites when Prometheus is not there and/or insufficient are: - iostat -> logs all disk metrics volume by volume - nmon -> logs all metrics, but cannot show volume-by-volume IOPS - nmonvisualizer Regards JL # Monitor ## install dstat nmon iostat ioperf sar ## install nmonvisua
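
When none of these tools is installed, per-volume IOPS can be approximated by sampling `/proc/diskstats` twice, which is roughly the idea behind the per-device view in iostat. The sketch below assumes a Linux host; the function names are hypothetical, and this is not how iostat itself is implemented.

```python
import time


def parse_diskstats(text):
    """Map device name -> (reads completed, writes completed).

    In /proc/diskstats, field 3 is the device name, field 4 the number of
    reads completed and field 8 the number of writes completed.
    """
    counters = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 8:
            continue  # skip blank or malformed lines
        counters[fields[2]] = (int(fields[3]), int(fields[7]))
    return counters


def iops_between(before, after, interval_s):
    """Return device -> (read IOPS, write IOPS) between two samples."""
    rates = {}
    for dev, (reads, writes) in after.items():
        if dev not in before:
            continue  # device appeared between samples
        prev_reads, prev_writes = before[dev]
        rates[dev] = (
            (reads - prev_reads) / interval_s,
            (writes - prev_writes) / interval_s,
        )
    return rates


def read_diskstats(path="/proc/diskstats"):
    with open(path) as f:
        return parse_diskstats(f.read())


if __name__ == "__main__":
    interval = 5.0  # seconds between samples; an arbitrary choice
    first = read_diskstats()
    time.sleep(interval)
    second = read_diskstats()
    for dev, (r, w) in sorted(iops_between(first, second, interval).items()):
        print(f"{dev}: {r:.1f} r/s, {w:.1f} w/s")
```

Running it in a loop and appending the output to a file gives a crude volume-by-volume log; iostat remains the better choice when it is available.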