
Getting Started with Spark


Using a Mesos Master URL

The Master URLs for Mesos are in the form mesos://host:5050 for a single-master Mesos cluster, or mesos://zk://host:2181 for a multi-master Mesos cluster using ZooKeeper.
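
For instance, a minimal sketch of pointing a SparkConf at each kind of cluster (the host names and ports below are placeholders):

import org.apache.spark.SparkConf

// Sketch only: replace "host" with your Mesos master / ZooKeeper host.
val confSingleMaster = new SparkConf().setMaster("mesos://host:5050")
val confMultiMaster  = new SparkConf().setMaster("mesos://zk://host:2181")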

The driver also needs some configuration in spark-env.sh to interact properly with Mesos:

  1. In spark-env.sh, set some environment variables (a combined spark-env.sh sketch follows this list):
    • export MESOS_NATIVE_LIBRARY=<path to libmesos.so>. This path is typically <prefix>/lib/libmesos.so where the prefix is /usr/local by default. See Mesos installation instructions above. On Mac OS X, the library is called libmesos.dylib instead of libmesos.so.
    • export SPARK_EXECUTOR_URI=<URL of spark-1.0.1.tar.gz uploaded above>.
  2. Also set spark.executor.uri to <URL of spark-1.0.1.tar.gz>.
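
Putting the two variables together, a minimal conf/spark-env.sh for this setup might look like the sketch below; the library path and the tarball URL are placeholders for wherever Mesos is installed and the Spark package was uploaded:

# conf/spark-env.sh (sketch; adjust the path and URL to your installation)
export MESOS_NATIVE_LIBRARY=/usr/local/lib/libmesos.so
export SPARK_EXECUTOR_URI=hdfs://hadoop-master:9000/tmp/spark-1.0.1.tar.gz
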
Edit /etc/profile and append:
export MESOS_NATIVE_LIBRARY="/usr/local/lib/libmesos.so"

Now when starting a Spark application against the cluster, pass a mesos:// URL as the master when creating a SparkContext. For example:

val conf = new SparkConf()
  .setMaster("mesos://HOST:5050")
  .setAppName("My app")
  .set("spark.executor.uri", "<path to spark-1.0.1.tar.gz uploaded above>")
val sc = new SparkContext(conf)

(You can also use spark-submit and configure spark.executor.uri in the conf/spark-defaults.conf file. Note that spark-submit currently only supports deploying the Spark driver in client mode for Mesos.)
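
As a sketch of that alternative (the URL, class name, and jar name below are placeholders, not from this setup), the URI goes into conf/spark-defaults.conf and the application is then submitted against the Mesos master in client mode:

# conf/spark-defaults.conf (sketch; URL is a placeholder)
spark.executor.uri   hdfs://hadoop-master:9000/tmp/spark-1.0.1.tar.gz

# client-mode submission (class and jar names are hypothetical)
./bin/spark-submit --master mesos://host:5050 --class com.example.MyApp myapp.jar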

When running a shell, the spark.executor.uri parameter is inherited from SPARK_EXECUTOR_URI, so it does not need to be redundantly passed in as a system property.

./bin/spark-shell --master mesos://host:5050

Mesos Run Modes

Spark can run over Mesos in two modes: “fine-grained” (default) and “coarse-grained”.

In “fine-grained” mode (default), each Spark task runs as a separate Mesos task. This allows multiple instances of Spark (and other frameworks) to share machines at a very fine granularity, where each application gets more or fewer machines as it ramps up and down, but it comes with an additional overhead in launching each task. This mode may be inappropriate for low-latency requirements like interactive queries or serving web requests.

The “coarse-grained” mode will instead launch only one long-running Spark task on each Mesos machine, and dynamically schedule its own “mini-tasks” within it. The benefit is much lower startup overhead, but at the cost of reserving the Mesos resources for the complete duration of the application.

To run in coarse-grained mode, set the spark.mesos.coarse property in your SparkConf:

conf.set("spark.mesos.coarse", "true")

In addition, for coarse-grained mode, you can control the maximum number of resources Spark will acquire. By default, it will acquire all cores in the cluster (that get offered by Mesos), which only makes sense if you run just one application at a time. You can cap the maximum number of cores using conf.set("spark.cores.max", "10") (for example).
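
Putting the coarse-grained settings together, a minimal sketch of the resulting SparkConf (the master URL, executor URI, and core cap are placeholders):

import org.apache.spark.{SparkConf, SparkContext}

// Sketch only: placeholder master URL, executor URI, and core limit.
val conf = new SparkConf()
  .setMaster("mesos://host:5050")
  .setAppName("My coarse-grained app")
  .set("spark.executor.uri", "<URL of spark-1.0.1.tar.gz>")
  .set("spark.mesos.coarse", "true")   // one long-running Mesos task per machine
  .set("spark.cores.max", "10")        // cap the total cores Spark acquires
val sc = new SparkContext(conf)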

 

Start the Spark shell:

./bin/spark-shell --master mesos://127.0.1.1:5050


# Test: a simple word count on a file in HDFS
scala> val file = sc.textFile("hdfs://hadoop-master:9000/tmp/WifiScan_None_20140723.csv")
scala> val count = file.flatMap(line => line.split(" ")).map(word => (word, 1)).reduceByKey(_ + _)
scala> count.count()
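
To spot-check the result in the same session, a few of the (word, count) pairs can also be printed, for example:

scala> count.take(10).foreach(println)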
Reference: http://spark.apache.org/docs/latest/running-on-mesos.html