spark git commit: [SPARK-20501][ML] ML 2.2 QA: New Scala APIs, docs

2017-05-15 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master d4022d495 -> dbe81633a [SPARK-20501][ML] ML 2.2 QA: New Scala APIs, docs ## What changes were proposed in this pull request? Review new Scala APIs introduced in 2.2. ## How was this patch tested? Existing tests. Author: Yanbo Liang Clos

spark git commit: [SPARK-20501][ML] ML 2.2 QA: New Scala APIs, docs

2017-05-15 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.2 a869e8bfd -> 57c87cf2d [SPARK-20501][ML] ML 2.2 QA: New Scala APIs, docs ## What changes were proposed in this pull request? Review new Scala APIs introduced in 2.2. ## How was this patch tested? Existing tests. Author: Yanbo Liang

spark git commit: [SPARK-20707][ML] ML deprecated APIs should be removed in major release.

2017-05-15 Thread yliang
Repository: spark Updated Branches: refs/heads/branch-2.2 10e599f69 -> a869e8bfd [SPARK-20707][ML] ML deprecated APIs should be removed in major release. ## What changes were proposed in this pull request? Before 2.2, MLlib keep to remove APIs deprecated in last feature/minor release. But fro

spark git commit: [SPARK-20707][ML] ML deprecated APIs should be removed in major release.

2017-05-15 Thread yliang
Repository: spark Updated Branches: refs/heads/master c8c878a41 -> d4022d495 [SPARK-20707][ML] ML deprecated APIs should be removed in major release. ## What changes were proposed in this pull request? Before 2.2, MLlib keep to remove APIs deprecated in last feature/minor release. But from Sp

spark git commit: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/master bbd163d58 -> c8c878a41 [SPARK-20588][SQL] Cache TimeZone instances. ## What changes were proposed in this pull request? Because the method `TimeZone.getTimeZone(String ID)` is synchronized on the TimeZone class, concurrent call of this me

spark git commit: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 e84e9dd54 -> 10e599f69 [SPARK-20588][SQL] Cache TimeZone instances. ## What changes were proposed in this pull request? Because the method `TimeZone.getTimeZone(String ID)` is synchronized on the TimeZone class, concurrent call of thi

spark git commit: [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQueryBenchmark

2017-05-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.1 62969e9be -> 14b6a9d34 [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQueryBenchmark ## What changes were proposed in this pull request? Since [SPARK-17298](https://issues.apache.org/jira/browse/SPARK-17298), some queries (q28, q6

spark git commit: [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQueryBenchmark

2017-05-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 a79a120a8 -> e84e9dd54 [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQueryBenchmark ## What changes were proposed in this pull request? Since [SPARK-17298](https://issues.apache.org/jira/browse/SPARK-17298), some queries (q28, q6

spark git commit: [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQueryBenchmark

2017-05-15 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 499ba2cb4 -> bbd163d58 [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQueryBenchmark ## What changes were proposed in this pull request? Since [SPARK-17298](https://issues.apache.org/jira/browse/SPARK-17298), some queries (q28, q61, q

spark git commit: [SPARK-20717][SS] Minor tweaks to the MapGroupsWithState behavior

2017-05-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master d2416925c -> 499ba2cb4 [SPARK-20717][SS] Minor tweaks to the MapGroupsWithState behavior ## What changes were proposed in this pull request? Timeout and state data are two independent entities and should be settable independently. Therefo

spark git commit: [SPARK-20717][SS] Minor tweaks to the MapGroupsWithState behavior

2017-05-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.2 82ae1f0ac -> a79a120a8 [SPARK-20717][SS] Minor tweaks to the MapGroupsWithState behavior ## What changes were proposed in this pull request? Timeout and state data are two independent entities and should be settable independently. The

spark git commit: [SPARK-17729][SQL] Enable creating hive bucketed tables

2017-05-15 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 271175e2b -> d2416925c [SPARK-17729][SQL] Enable creating hive bucketed tables ## What changes were proposed in this pull request? Hive allows inserting data to bucketed table without guaranteeing bucketed and sorted-ness based on these t

spark git commit: [SPARK-20716][SS] StateStore.abort() should not throw exceptions

2017-05-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.2 0bd918f67 -> 82ae1f0ac [SPARK-20716][SS] StateStore.abort() should not throw exceptions ## What changes were proposed in this pull request? StateStore.abort() should do a best effort attempt to clean up temporary resources. It should

spark git commit: [SPARK-20716][SS] StateStore.abort() should not throw exceptions

2017-05-15 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master e1aaab1e2 -> 271175e2b [SPARK-20716][SS] StateStore.abort() should not throw exceptions ## What changes were proposed in this pull request? StateStore.abort() should do a best effort attempt to clean up temporary resources. It should not

spark git commit: [SPARK-12837][SPARK-20666][CORE][FOLLOWUP] getting name should not fail if accumulator is garbage collected

2017-05-15 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 9970aa096 -> e1aaab1e2 [SPARK-12837][SPARK-20666][CORE][FOLLOWUP] getting name should not fail if accumulator is garbage collected ## What changes were proposed in this pull request? After https://github.com/apache/spark/pull/17596 , we d

spark git commit: [SPARK-12837][SPARK-20666][CORE][FOLLOWUP] getting name should not fail if accumulator is garbage collected

2017-05-15 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-2.2 708f68c8a -> 0bd918f67 [SPARK-12837][SPARK-20666][CORE][FOLLOWUP] getting name should not fail if accumulator is garbage collected ## What changes were proposed in this pull request? After https://github.com/apache/spark/pull/17596 ,

spark git commit: [SPARK-20669][ML] LoR.family and LDA.optimizer should be case insensitive

2017-05-15 Thread yliang
Repository: spark Updated Branches: refs/heads/branch-2.2 cb64064dc -> 708f68c8a [SPARK-20669][ML] LoR.family and LDA.optimizer should be case insensitive ## What changes were proposed in this pull request? make param `family` in LoR and `optimizer` in LDA case insensitive ## How was this pat

spark git commit: [SPARK-20669][ML] LoR.family and LDA.optimizer should be case insensitive

2017-05-15 Thread yliang
Repository: spark Updated Branches: refs/heads/master b0888d1ac -> 9970aa096 [SPARK-20669][ML] LoR.family and LDA.optimizer should be case insensitive ## What changes were proposed in this pull request? make param `family` in LoR and `optimizer` in LDA case insensitive ## How was this patch t

subscribe spark

2017-05-15 Thread 程磊
subscribe spark

spark git commit: [SPARK-20730][SQL] Add an optimizer rule to combine nested Concat

2017-05-15 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 8da6e8b1f -> b0888d1ac [SPARK-20730][SQL] Add an optimizer rule to combine nested Concat ## What changes were proposed in this pull request? This pr added a new Optimizer rule to combine nested Concat. The master supports a pipeline operat