
cassandra/datastax: programmatically setting the datastax package

  •  0
  •  Jake · 7 years ago

    The following spark-submit script works:

    nohup ./bin/spark-submit \
      --jars ./ikoda/extrajars/ikoda_assembled_ml_nlp.jar,./ikoda/extrajars/stanford-corenlp-3.8.0.jar,./ikoda/extrajars/stanford-parser-3.8.0.jar \
      --packages datastax:spark-cassandra-connector:2.0.1-s_2.11 \
      --class ikoda.mlserver.Application \
      --conf spark.cassandra.connection.host=192.168.0.33 \
      --master local[*] \
      ./ikoda/ikodaanalysis-mlserver-0.1.0.jar 1000 > ./logs/nohup.out &

    Programmatically, I can accomplish the same thing by configuring the SparkContext:

    val conf = new SparkConf().setMaster("local[4]").setAppName("MLPCURLModelGenerationDataStream")
    conf.set("spark.streaming.stopGracefullyOnShutdown", "true")
    conf.set("spark.cassandra.connection.host", sparkcassandraconnectionhost)
    conf.set("spark.driver.maxResultSize", sparkdrivermaxResultSize)
    conf.set("spark.network.timeout", sparknetworktimeout)
    

    Question

    Can I also add --packages datastax:spark-cassandra-connector:2.0.1-s_2.11 programmatically? If so, how?

    1 Answer  |  7 years ago
  •  1
  •  Aaron Makubuya · 7 years ago

    The corresponding option is spark.jars.packages:

    conf.set(
      "spark.jars.packages",
      "datastax:spark-cassandra-connector:2.0.1-s_2.11")
    