Big Data Mini Course: AMP Camp 4 hands-on exercises

UC Berkeley AMPLab

Hands-on Big Data Mini Course

Check out our online Big Data Mini Course! The full course will take 2-4 hours to complete, and in the process you will:

  • Start a ~5 node cluster on EC2 running Hadoop and the Berkeley Data Analytics Stack (BDAS).
  • Interactively explore a real Wikipedia dataset at the Spark and Shark shells.
  • Use Spark Streaming and the Twitter API to generate a real-time list of trending Twitter topics.
  • Write a data clustering algorithm and run it on a real Wikipedia dataset and observe interesting correlations.

Welcome

Welcome to the AMP Camp 4 hands-on exercises! These exercises are extended and enhanced from those given at previous AMP Camp Big Data Bootcamps. They were written by volunteer graduate students and postdocs in the UC Berkelay AMPLab. Many of those same graduate students are present today as teaching assistants. The exercises we cover today will have you working directly with the Spark specific components of the AMPLab’s open-source software stack, called the Berkeley Data Analytics Stack (BDAS).

Course Prerequisites

A few of the components support multiple languages. In some sections of this training material, you can choose which language you want to use as you follow along and gain experience with the tools. The following table shows which languages this mini course supports for each section. You are welcome to mix and match languages depending on your preferences and interests.

会期:
  • 自由时间安排
介绍:
  • 免费:
  • 收费:
  • 证书:
  • MOOC:
  • 视频讲座:
  • 音频讲座:
  • Email-课程:
  • 语言: 英语 Gb

反馈

目前这个课程还没有反馈。您想要留第一个反馈吗?

请注册, 为了写反馈

Show?id=n3eliycplgk&bids=695438
已经在列表:
NVIDIA
还有标题«计算机科学»:
Maxresdefault CS 282: Principles of Operating Systems II: Systems Programming for Android
Developing high quality distributed systems software is hard; developing high...
Banner_ruby Ruby on Rails Tutorial: Learn From Scratch
This post is part of our “Getting Started” series of free text tutorials on...
Logo-30-128x128 NYU Course on Deep Learning (Spring 2014)
Lectures from the NYU Course on Deep Learning (Spring 2014) This is a graduate...
Cppgm C++ Grandmaster Certification
The C++ Grandmaster Certification is an online course in which participants...
Umnchem Computational Chemistry (CHEM 4021/8021)
Modern theoretical methods used in study of molecular structure, bonding, and...

© 2013-2019