Search code examples
Column type inferred as binary with typed UDAF...


scalaapache-sparkapache-spark-sqlapache-spark-datasetapache-spark-encoders

Read More
Spark Scala Dataset Type Hierarchy...


scalaapache-sparkapache-spark-datasetapache-spark-encoders

Read More
How does spark interprets type of a column in reduce...


scalaapache-sparkfoldapache-spark-dataset

Read More
Unable to find encoder for type stored in a Dataset. error in spite of providing the proper implicit...


apache-sparkapache-spark-dataset

Read More
How can I use GroupBy and than Map over Dataset?...


scalaapache-sparkapache-spark-dataset

Read More
Scala Spark RDDs, DataSet, PairRDDs and Partitoning...


scalaapache-sparkrddapache-spark-dataset

Read More
Spark Dataset: Filter if value is contained in other dataset...


javaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Spark groupBy vs repartition plus mapPartitions...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Do I have to explicitly use Dataframe's methods to take advantage of Dataset's optimization?...


javaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
In Spark dataframe udf, what is the type of function parameters which like struct(col1,col2)?...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Efficient spark dataset operations when partitioned by overlapping columns...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
What is the efficient way to replace Spark Dataset column value from a sortedMap using Scala?...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Spark SQL Column Manipulation...


apache-sparkdataframeapache-spark-sqlapache-spark-dataset

Read More
Spark dynamic DAG is a lot slower and different from hard coded DAG...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
How can I lit an Option when converting from Dataset to Dataframe...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Efficient way to do column level operation in Spark 2.0...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Reading Hive table from Spark as a Dataset...


scalaapache-sparkhiveapache-spark-sqlapache-spark-dataset

Read More
Spark Dataset aggregation similar to RDD aggregate(zero)(accum, combiner)...


scalaapache-sparkapache-spark-sqlrddapache-spark-dataset

Read More
DataFrame / Dataset groupBy behaviour/optimization...


performanceapache-sparkdataframeapache-spark-sqlapache-spark-dataset

Read More
Using stat.bloomFilter in Spark 2.0.0 to filter another dataframe...


scalaapache-sparkapache-spark-sqlapache-spark-datasetbloom-filter

Read More
Spark simpler value_counts...


apache-sparkapache-spark-sqlapache-spark-dataset

Read More
Why can't I read these dataframes...


apache-sparkapache-spark-datasetapache-spark-2.0

Read More
Custom schema with nested parent node in spark-xml...


apache-sparkapache-spark-sqlapache-spark-datasetapache-spark-xml

Read More
Why is predicate pushdown not used in typed Dataset API (vs untyped DataFrame API)?...


apache-sparkdataframeapache-spark-sqlapache-spark-dataset

Read More
Spark Dataset unique id performance - row_number vs monotonically_increasing_id...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Spark 2.0 DataSets groupByKey and divide operation and type safety...


scalaapache-sparkapache-spark-sqlapache-spark-dataset

Read More
Spark Dataset : Example : Unable to generate an encoder issue...


scalaapache-sparkapache-spark-sqlapache-spark-datasetapache-spark-encoders

Read More
Dataframe to Dataset which has type Any...


apache-sparkdataframeapache-spark-sqlapache-spark-dataset

Read More
Spark Encoders: when to use beans()...


javaapache-sparkmemory-managementapache-spark-datasetapache-spark-encoders

Read More
Question regarding kryo and java encoders in datasets...


apache-sparkapache-spark-datasetkryoapache-spark-encoders

Read More
BackNext