
Reading multiple JSON schemas with Spark

  • Kannaiyan  · Tech Community  · 7 years ago

    Software configuration:

    Hadoop distribution: Amazon 2.8.3
    Applications: Hive 2.3.2, Pig 0.17.0, Hue 4.1.0, Spark 2.3.0
    

    val df = spark.read.option("mergeSchema",

    throws the error:

    org.apache.spark.sql.AnalysisException: Unable to infer schema for JSON. It must be specified manually.;
      at org.apache.spark.sql.execution.datasources.DataSource$$anonfun$9.apply(DataSource.scala:207)
      at org.apache.spark.sql.execution.datasources.DataSource$$anonfun$9.apply(DataSource.scala:207)
      at scala.Option.getOrElse(Option.scala:121)
      at org.apache.spark.sql.execution.datasources.DataSource.getOrInferFileFormatSchema(DataSource.scala:206)
      at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:392)
      at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:239)
      at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:227)
      at org.apache.spark.sql.DataFrameReader.json(DataFrameReader.scala:397)
      at org.apache.spark.sql.DataFrameReader.json(DataFrameReader.scala:340)
    

    How can I read JSON with multiple schemas using Spark?
