安装IDEA就不说了,也很简单。在这里推荐做java和scala的人使用idea,真的比eclipse好用。
1,idea安装scala和sbt插件
2,创建scala项目
3,修改build.sbt,加载spark,spark-sql
name := "myscala" version := "0.1" scalaVersion := "2.11.12" libraryDependencies ++= Seq( "org.apache.spark" % "spark-core_2.11" % "2.3.0", "org.apache.spark" % "spark-sql_2.11" % "2.3.0" )
4,创建测试文件
$ mkdir input $ echo "test my test tank test " > input/testword.txt $ pwd /Users/zhangying/Documents/www/myscala
5,创建scala object文件
import org.apache.spark.SparkContext import org.apache.spark.SparkConf object WordCount { def main(args: Array[String]) { val conf = new SparkConf().setAppName("WordCount").setMaster("local") val sc = new SparkContext(conf) val input = sc.textFile("input") val words = input.flatMap(line => line.split(" ")) val counts = words.map(word => (word, 1)).reduceByKey(_+_) counts.saveAsTextFile("output") } }
6,运行scala object程序
MacBook-Pro:output zhangying$ ll total 20 drwxr-xr-x 8 zhangying staff 272 Aug 3 16:48 ./ drwxr-xr-x 9 zhangying staff 306 Aug 3 16:48 ../ -rw-r--r-- 1 zhangying staff 8 Aug 3 16:48 ._SUCCESS.crc -rw-r--r-- 1 zhangying staff 12 Aug 3 16:48 .part-00000.crc -rw-r--r-- 1 zhangying staff 12 Aug 3 16:48 .part-00001.crc -rw-r--r-- 1 zhangying staff 0 Aug 3 16:48 _SUCCESS -rw-r--r-- 1 zhangying staff 45 Aug 3 16:48 part-00000 -rw-r--r-- 1 zhangying staff 21 Aug 3 16:48 part-00001
转载请注明
作者:海底苍鹰
地址:http://blog.51yip.com/hadoop/2160.html