Seattle Spark Meetup Feb. 10 - Better Together: Dato and Spark

Next Tuesday, February 10, the Seattle Spark Meetup group convenes again at the Concur Technologies office in Bellevue to welcome the great folks at Dato (formerly GraphLab) to help showcase GraphLab Create with Spark Integration. I will then discuss a machine learning classification scenario with GraphLab Create and Spark, followed by a showcase involving machine learning and predicting traffic safety. Here is a breakdown of the sessions and speakers:

  1. By our powers combined… GraphLab Create + Spark, together at last! We’ll briefly introduce Dato (formerly GraphLab, Inc.) and GraphLab Create, our platform for predictive apps, and describe how GraphLab Create integrates with Spark through RDD-to-SFrame conversion. Developers should walk away with a clear understanding of how these complementary technologies can be used together to build amazing applications.

  • Speaker: Shawn Scully, one of the many physicists turned data scientists and Director of Product at Dato, joins us for this session. Shawn helps developers and data scientists build cool experiences with data.

 

  2. Quick jump start into classification with GraphLab Create and Spark. We'll show how to work with Spark and GraphLab Create to explore, understand, and classify a small sample of receipt data. This will be a quick demo only (no slides) within a handy IPython Notebook.

  • Speaker: I will be guiding everyone through this demo. A little background on me: I am the Senior Director of Data Sciences Engineering at Concur. I’m a hands-on data architect and developer / hacker with more than 15 years of experience developing internet-scale infrastructure, data platforms, and distributed systems, both on-premises and in the cloud.

 

  3. Sparky <3 GraphLab. We’ll combine Dato's GraphLab Create with Spark to quickly move through the explore, model, deploy, evaluate, iterate loop. The demonstration uses a public data set on traffic safety in the United States.

  • Speaker: Amanda Casari, Data Scientist at ProKarma, Inc., presents in our third session. Amanda helps clear out the hype to show the value in data (big, BIG, small, and in-between) and is very excited to be talking about two of her favorite new technologies for science and engineering.

For those of you unfamiliar with these events, the meetups are gatherings of users and developers of Apache Spark (www.spark-project.org), a high-speed Scala-based cluster programming framework, held in various locations throughout the Seattle area. The goal of this group is to showcase real-world implementations of Spark within your Big Data infrastructure. Sessions are demo heavy and slide light, focusing on getting your development environment up and running, configuration issues, and SparkSQL vs. Hive, among other topics.

 

If these topics are of interest to you, go here to learn more about the Seattle Spark Meetup.

What: Better Together: Dato and Spark

When: Tuesday, February 10, 2015, 5:45 PM

Where: Concur Technologies, 601 108th Ave NE, Bellevue, WA (map)

We will meet on the 1st floor of the building in the auditorium across from Jimmy John's. The building is itself across from the Bellevue bus terminal. There is pay parking in the building.

 
