The 2nd Annual GraphLab Workshop on 7/1/13 in SF!

Posted on Updated on

The GraphLab Big Learning Workshop is a meeting place for both academia and industry to discuss upcoming challenges of large scale machine learning and solution methods. The main goal for this year’s workshop is to bring together top researchers from academia, as well as top data scientists from industry with the special focus of large scale machine learning on sparse graphs.

We have secured talks and demos about the hottest graph processing systems including: GraphLab, Pregel (Google), Giraph (Facebook) , Cassovary (Twitter), Combinatorial BLAS (LBNL/UCSB), Allegro Graph (Franz) ,Neo4j, Titan (Aurelius), DEX (Sparsity Technologies), YarcData and others!

Event Pictures

Event Details


  • 8am – 9am: Registration, Coffee & Snacks
  • 9am – Presentations begin (See agenda below)
  • 5pm – 7pm Networking with hosted bar / appetizers

For updated agenda and detailed speaker info see the GraphLab Workshop site here.

Preliminary Agenda

Confirmed Speakers

for the GraphLab 2013 Workshop include:

Prof. Carlos Guestrin, University of Washington – Graphlab 2.2 and Beyond

Dr. Avery Ching, Facebook – Graph Processing at Facebook Scale

Prof. Vahab Mirrokni, Google – Clustering and Connected Components in Mapreduce and Beyond

Dr. Pankaj Gupta, Twitter – WTF: The Who to Follow Service at Twitter

Prof. Joe Hellerstein – Professor, UC Berkeley and Co-Founder/CEO, Trifacta – Productivity for Data Analysts: Visualization, Intelligence and Scale

Dr. Lei Tang – Walmart Labs – Adaptive User Segmentation for Recommendation


Dr. Derek Murray – Incremental, iterative and interactive data analysis with Naiad

Dr. Ralf Herbrich, Facebook – TBA

Prof. Mark Oskin, University of Washington, Grappa graph engine.

Featured Projects

Google’s Pregel is their Bulk Synchronous graph framework. Prof. Vahab Mirrokni is going to give an oral talk about graph processing @ Google.
Apache Giraph is the open source equivalent system to Google’s Pregel. Dr. Avery Ching, one of Giraph contributors, will give a talk about large scale graph processing @ Facebook.
Dr. Pankaj Gupta, the creator of Cassovary Graph Processing system @ Twitter will give a talk about Who To Follow (WTF) service in Twitter.
Naiad is a parallel data flow framework from Microsoft with the focus of incremental computation. Dr. Derek Murray from Microsoft Research will present Naiad.
GraphLab is CMU+UW open source graph processing system, which supports both bulk synchronous parallel as well as asynchronous computation. Prof. Carlos Guestrin will present the latest GraphLab project.
Allegro Graph is a high performance graph database with RDF support. Jans Aasman, the CEO of Franz, will give a demo of their newest graph database.
Combinatorial BLAS is a distributed memory parallel graph library from LBNL/UCSB. Dr. Aydin Buluc will present comb-BLAS.
Grappa is a distributed graph processing framework using commodity processors, from The University of Washington. Prof. Mark Oskin will present Grappa.
Titan is a distributed graph database. Dr. Matthias Broecheler will present Titan.
Neo4j is an open source distributed graph database in Java. Alex Averbuch from neo4j will present neo4j.
Infinite Graph from Objectivity is a distributed graph database.
DEX is a high performance and scalable graph database system. Dr. Noerert Martinez will present DEX.
YarcData, a Cray spinoff is creating customized hardware solutions for ultra fast graph processing.
Systap LLC is a startup working on speeding up graph algorithms using GPUs. Bryan Thompson from Systap will present preliminary results of applying the gather apply scatter model on GPU.

Other notable talks at the GraphLab workshop:

Trifacta is the hottest bay area startup out there, started by Prof. Joe Hellerstein from Berkeley and Prof. Jefferey Heer from Stanford. Prof. Joe Hellerstein will talk about Productivity for Data Analysts: Visualization, Intelligence and Scale.
Dr. Lei Tang from Walmart Labs will talk about adaptive user segmentation for collaborative filtering.
Alpine Data Labs is a Greenplum spinoff focusing on big data analytics. Seven Hillion will describe a case study of big data analytics on top of Hadoop.

Gold Sponsors


Loogicblox logo


Pipefish logo



Alpine Data Labs

Technicolor Labs Logo

Media Sponsors


For more Machine Learning events see and follow @mlconf on twitter.


Event Producers


For more Machine Learning events see and follow @mlconf on twitter.

Eventbrite - 2013 Graphlab Workshop on Large Scale Machine Learning


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s