By Arun Manivannan
Navigate the realm of knowledge research, visualization, and computing device studying with over a hundred hands-on Scala recipes
About This Book
- Implement Scala on your info research utilizing gains from Spark, Breeze, and Zeppelin
- Scale up your information anlytics infrastructure with functional recipes for Scala computer learning
- Recipes for each degree of the knowledge research procedure, from examining and gathering info to disbursed analytics
Who This booklet Is For
This e-book exhibits info scientists and analysts easy methods to leverage their current wisdom of Scala for caliber and scalable facts analysis.
What you'll Learn
- Familiarize and arrange the Breeze and Spark libraries and use facts structures
- Import info from a bunch of attainable resources and create dataframes from CSV
- Clean, validate and rework information utilizing Scala to pre-process numerical and string data
- Integrate critical computing device studying algorithms utilizing Scala stack
- Bundle and scale up Spark jobs by way of deploying them right into a number of cluster managers
- Run streaming and graph analytics in Spark to imagine information, allowing exploratory analysis
This booklet will introduce you to the preferred Scala instruments, libraries, and frameworks via sensible recipes round loading, manipulating, and getting ready your information. it's going to additionally assist you discover and make feel of your information utilizing wonderful and insightfulvisualizations, and desktop studying toolkits.
Starting with introductory recipes on using the Breeze and Spark libraries, become familiar withhow to import info from a number of attainable resources and the way to pre-process numerical, string, and date info. subsequent, you will get an realizing of recommendations that can assist you visualize info utilizing the Apache Zeppelin and Bokeh bindings in Scala, allowing exploratory info research. iscover tips to software indispensable desktop studying algorithms utilizing Spark ML library. paintings via steps to scale your computing device studying versions and set up them right into a standalone cluster, EC2, YARN, and Mesos. eventually dip into the robust ideas offered via Spark Streaming, and laptop studying for streaming facts, in addition to using Spark GraphX.
Style and approach
This ebook includes a wealthy set of recipes that covers the total spectrum of attention-grabbing facts research initiatives and should assist you revolutionize your facts research talents utilizing Scala and Spark.