| Wednesday, February 15 |
| 11:15 AM–12:15 PM |
Big Data and Parallel computing are not new topics, but in the last few years, they have taken on much larger meanings with new tools and new paradigms. Come hear Jeremy Edberg, Lead Reliability Engineer for Netflix and formerly Information Cowboy for reddit, give an overview of where Big Data and Parallel computing is today with a mix of war stories and practical examples.
Speaker - Jeremy Edberg, Manager, Site Reliability, Netflix 
Jeremy is currently the Lead Site Reliability Engineer for Netflix, the largest subscription video streaming site in the world. Previously he was the Information Cowboy (aka. Head of Technology) for reddit.com, an online community for sharing and discussing interesting things on the internet that did more than a billion pageviews a month. reddit was an Alexa Top 100 website whose entire operations are run on Amazon’s EC2. Jeremy has spent 15 years in technology: 4 in operations, 3 in security, 4 in consulting, and 4 in academic computing. When working for Sendmail, he helped administer the most bleeding edge Sendmail server on the internet; at eBay he helped protect people from phishing and scams, created patent-pending fraud detection software, and evaluated hardware for the eBay platform; and at UC Berkeley, he wired up large portions of the dorms for internet, back before every college student in America had a net drop in their room. Jeremy has keynoted at conferences such as PyCon and Cloud Connect. He holds a Cognitive Science degree from UC Berkeley.
|
| 1:15 PM–2:15 PM |
In today’s tough economic climate, organizations are searching for the most effective ways to leverage the cloud. Most cloud technologies, however, are a challenge to implement and even harder for developers to use. Apache Hadoop, an open source technology, offers a mature, well-exercised software development platform that enables companies to take full advantage of the cloud in order to better manage and analyze massive amounts of data. We will explore how Apache Hadoop’s approach to distributed computing makes it easy for the average developer to learn how to scale problems the size of the Internet.
Speaker - Owen O'Malley, Co-founder and Sr. Architect, Hortonworks 
Owen has been contributing patches to Hadoop since before it became an independent Apache project. He was the first committer added and still remains one of the most active contributors to Apache Hadoop. He was also the founding chair of the Apache Hadoop Project Management Committee. Prior to co-founding Hortonworks, Owen worked on Yahoo! Search's WebMap project, which built a graph of the known web and applied many heuristics to the entire graph. Once ported to Apache Hadoop, it became the single largest known Hadoop application. He has a PhD in Software Engineering from the University of California, Irvine. Follow Owen on Twitter: @owen_omalley.
|
| 2:30 PM–3:30 PM |
Every day, we create 2.5 quintillion bytes of data. In fact, ninety percent of the data in the world today has been created in the last two years alone. Enterprises are awash with data, easily amassing terabytes and even petabytes of information. The challenge is twofold – how to deal with all of this information to find new insights, and how to integrate those insights into your existing systems and technology. This session will delve into why Big Data is more than just a challenge; it is an opportunity to find insight in new and emerging types of data and content -- to make businesses more agile, and to answer questions that, in the past, were beyond reach.
Speaker - Anjul Bhambhri, VP, Big Data, Information Management, IBM 
Anjul Bhambhri is the Vice President of Big Data. She was previously the Director of IBM Optim application and data life cycle management tools. She is a seasoned professional with over twenty-two years in the database industry. Over this time, Anjul has held various engineering and management positions at IBM, Informix and Sybase. Prior to her assignment in tools, Anjul spearheaded the development of XML capabilities in IBM's DB2 database server. She is a recipient of the YWCA of Silicon Valley's “Tribute to Women in Technology” award for 2009. Anjul holds a degree in Electrical Engineering. You may contact her at bhambhri@us.ibm.com.
|
| 3:45 PM–4:30 PM |
In this session we will get our first taste of using big data to drive one's business. Hear from an expert at one of the biggest sites on the internet tell you about how they use various big data tools and concepts to grow their business and retain their customers.
Speaker - Jerome Boulon, Cloud Architect, Netflix 
Jerome is currently the Lead Architect for the Big Data Infrastructure for Hadoop at Netflix, the world¹s leading Internet subscription service for enjoying movies and TV shows.
He built Honu, the only data collection pipeline in use at Netflix, collecting over 40 Billion events per day, providing key metrics for day-to-day operations and insight business information required by the data science team. Prior to Netflix, Jerome, worked at Yahoo! for the Hadoop team, where he open-sourced Chukwa, another data collection pipeline more suitable for regular data centers. Over the past 15 years, Jerome worked for companies like Convera or Fast Search&Transfer where he built some key components for their Semantic Web search solution, Videotron, the largest Cable Modem provider in Quebec. At Videotron he built the first Network Monitoring System for Docsis cable. He holds a master in computer science from Paul Sabatier University (France).
|
|