spark git commit: [SPARK-19985][ML] Fixed copy method for some ML Models

2017-04-03 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 93dbfe705 -> 2a903a1ee [SPARK-19985][ML] Fixed copy method for some ML Models ## What changes were proposed in this pull request? Some ML Models were using `defaultCopy` which expects a default constructor, and others were not setting the

spark git commit: [SPARK-20166][SQL] Use XXX for ISO 8601 timezone instead of ZZ (FastDateFormat specific) in CSV/JSON timeformat options

2017-04-03 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2a903a1ee -> cff11fd20 [SPARK-20166][SQL] Use XXX for ISO 8601 timezone instead of ZZ (FastDateFormat specific) in CSV/JSON timeformat options ## What changes were proposed in this pull request? This PR proposes to use `XXX` format instea

spark git commit: [MINOR][DOCS] Replace non-breaking space to normal spaces that breaks rendering markdown

2017-04-03 Thread srowen
Repository: spark Updated Branches: refs/heads/master cff11fd20 -> 364b0db75 [MINOR][DOCS] Replace non-breaking space to normal spaces that breaks rendering markdown # What changes were proposed in this pull request? It seems several non-breaking spaces were inserted into several

spark git commit: [MINOR][DOCS] Replace non-breaking space to normal spaces that breaks rendering markdown

2017-04-03 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 ca144106b -> 77700ea38 [MINOR][DOCS] Replace non-breaking space to normal spaces that breaks rendering markdown # What changes were proposed in this pull request? It seems several non-breaking spaces were inserted into sever

spark git commit: [SPARK-9002][CORE] KryoSerializer initialization does not include 'Array[Int]'

2017-04-03 Thread srowen
Repository: spark Updated Branches: refs/heads/master 364b0db75 -> fb5869f2c [SPARK-9002][CORE] KryoSerializer initialization does not include 'Array[Int]' ## What changes were proposed in this pull request? Array

spark git commit: [SPARK-19969][ML] Imputer doc and example

2017-04-03 Thread mlnick
Repository: spark Updated Branches: refs/heads/master fb5869f2c -> 4d28e8430 [SPARK-19969][ML] Imputer doc and example ## What changes were proposed in this pull request? Add docs and examples for spark.ml.feature.Imputer. Currently Scala and Java examples are included. Python example will b

spark git commit: [SPARK-19641][SQL] JSON schema inference in DROPMALFORMED mode produces incorrect schema for non-array/object JSONs

2017-04-03 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 4d28e8430 -> 4fa1a43af [SPARK-19641][SQL] JSON schema inference in DROPMALFORMED mode produces incorrect schema for non-array/object JSONs ## What changes were proposed in this pull request? Currently, when we infer the types for valid JS

spark git commit: [SPARK-20194] Add support for partition pruning to in-memory catalog

2017-04-03 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 4fa1a43af -> 703c42c39 [SPARK-20194] Add support for partition pruning to in-memory catalog ## What changes were proposed in this pull request? This patch implements `listPartitionsByFilter()` for `InMemoryCatalog` and thus resolves an out

spark git commit: [SPARK-20145] Fix range case insensitive bug in SQL

2017-04-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 703c42c39 -> 58c9e6e77 [SPARK-20145] Fix range case insensitive bug in SQL ## What changes were proposed in this pull request? Range in SQL should be case insensitive ## How was this patch tested? unit test Author: samelamin Author: same

spark git commit: [SPARK-19408][SQL] filter estimation on two columns of same table

2017-04-03 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 58c9e6e77 -> e7877fd47 [SPARK-19408][SQL] filter estimation on two columns of same table ## What changes were proposed in this pull request? In SQL queries, we also see predicate expressions involving two columns such as "column-1 (op) co

spark git commit: [SPARK-10364][SQL] Support Parquet logical type TIMESTAMP_MILLIS

2017-04-03 Thread ueshin
Repository: spark Updated Branches: refs/heads/master e7877fd47 -> 3bfb639cb [SPARK-10364][SQL] Support Parquet logical type TIMESTAMP_MILLIS ## What changes were proposed in this pull request? **Description** from JIRA The TimestampType in Spark SQL is of microsecond precision. Ideally, we

spark git commit: [SPARK-20067][SQL] Unify and Clean Up Desc Commands Using Catalog Interface

2017-04-03 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 3bfb639cb -> 51d3c854c [SPARK-20067][SQL] Unify and Clean Up Desc Commands Using Catalog Interface ### What changes were proposed in this pull request? This PR is to unify and clean up the outputs of `DESC EXTENDED/FORMATTED` and `SHOW TA

spark git commit: [SPARK-19825][R][ML] spark.ml R API for FPGrowth

2017-04-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 51d3c854c -> b34f7665d [SPARK-19825][R][ML] spark.ml R API for FPGrowth ## What changes were proposed in this pull request? Adds SparkR API for FPGrowth: [SPARK-19825](https://issues.apache.org/jira/browse/SPARK-19825): - `spark.fpGrowth