Web Intelligence and Big Data

Gautam Shroff, Indian Institute of Technology Delhi

This course is about building 'web-intelligence' applications exploiting big data sources arising social media, mobile devices and sensors, using new big-data platforms based on the 'map-reduce' parallel programming paradigm. In the past, this course has been offered at the Indian Institute of Technology Delhi as well as the Indraprastha Institute of Information Technology Delhi.

The past decade has witnessed the successful of application of many AI techniques used at `web-scale’, on what are popularly referred to as big data platforms based on the map-reduce parallel computing paradigm and associated technologies such as distributed file systems, no-SQL databases and stream computing engines. Online advertising, machine translation, natural language understanding, sentiment mining, personalized medicine, and national security are some examples of such AI-based web-intelligence applications that are already in the public eye. Others, though less apparent, impact the operations of large enterprises from sales and marketing to manufacturing and supply chains. In this course we explore some such applications, the AI/statistical techniques that make them possible, along with parallel implementations using map-reduce and related platforms.

This course was offered thrice during Fall 2012, Spring 2012 and Fall 2013; in Fall of both years it was also taken for credit at IIT Delhi and IIIT Delhi. During this period, I also wrote a book to elucidate the ideas discussed in the course at a 'popular' level:

The Intelligent Web: Search, Smart Algorithms and Big Data published by Oxford University Press, UK, in November 2013.

Now in this edition, the course is being offered in 'self-study' mode.


Introduction and Overview  Look: Search, Indexing and Memory Listen: Streams, Information and Language, Analyzing Sentiment and Intent Load: Databases and their Evolution, Big data Technology and Trends
Programming: Map-Reduce Learn: Classification, Clustering, and Mining, Information Extraction Connect: Reasoning: Logic and its Limits, Dealing with Uncertainty
Programming: Bayesian Inference for Medical Diagnostics Predict: Forecasting, Neural Models, Deep Learning, and Research Topics
Data Analysis: Regression and Feature Selection

Recommended Background

Basic programming, SQL and data structures Exposure to probability, statistics and matrices

Course Format

The course consists of lecture videos, which are between 5 and 15 minutes in length, adding up to a maximum of 1-1.5 hrs per week. There are 1-2 integrated quiz questions per lecture video. Additional short quizzes will test basic understanding. However, the current edition of the course is being offered in 'self-study' mode, so there are no homeworks, assignments or exams. Nor is there active support by the instructor or TA, but discussion forums are available for peer-learning.


  • Will I get a certificate after completing this class?

    No. In the past, statements of accomplishment were given. However,  the current edition of the course is being offered for 'self-study', without any graded homework or exams, and so no certificates.

  • Do I need any additional materials?

    Access to a computer on which Python 2.7 either is already installed or can be downloaded and installed. See http://www.python.org.

  • 2014年4月20日, 9 星期
  • 2013年8月26日, 12 星期
  • 2013年3月24日, 10 星期
  • 2012年8月27日, 10 星期
  • 免费:
  • 收费:
  • 证书:
  • MOOC:
  • 视频讲座:
  • 音频讲座:
  • Email-课程:
  • 语言: 英语 Gb



请注册, 为了写反馈

Small-icon.hover Machine Learning
Machine learning: from the basics to advanced topics. Includes statistics...
Big_data5 Big Data for Better Performance
Learn how you can predict customer demand and preferences by using the data...
Jjc55ckwloph2koysqtvwd8hc4vzfodhg-x5jxcvfkth-dkw_id8zy9ax2w8opvyr6ioyoevprvclihvmde=s0#w=1725&h=1060 Intro to Hadoop and MapReduce. How to Process Big Data
In this short course, learn the fundamentals of MapReduce and Apache Hadoop...
40684_d8c2_5 Data Organization - Learn Big Data Management - Udemy
Infrastructure, Algorithms, and Visualizations
72466_9c8d_9 Become a Hadoop Developer |Training|Tutorial
Learn Hadoop and get certified & bag one of the highest paying IT jobs in current...
102388_7f9d_9 Online Courses - Anytime, Anywhere
Learn Analytics from scratch- Ace Excel, cluster and factor analysis, linear...
72c27b2f-3419-430f-a28f-10dbc7120457-a14087e5df76.small DNA Sequences: Alignments and Analysis
Learn how to align and analyze DNA sequences using web and software based tools...
Cbc86bfc-8b76-4cb9-88d8-faa8a8abd820-50fa32daa1bc.small Software Testing Fundamentals
Learn how to locate software bugs and defects using the latest testing techniques...
7ca98c09-a207-40c7-8a84-b9c48ecdf920-f25c990d1f5f.small Cloud Computing Management
Learn methods for managing cloud computing projects and build an understanding...
91f52ef3-fa3f-4934-9d19-8d5a32635cd4-d99e27f09d19.small Data Science: R Basics
Build a foundation in R and learn how to wrangle, analyze, and visualize data...
B4072f23-f746-43a1-9819-8e3d8b066f38-76465b3bdbcc.small Data Science: Visualization
Learn basic data visualization principles and how to apply them using ggplot2...
Success-from-the-start-2 First Year Teaching (Secondary Grades) - Success from the Start
Success with your students starts on Day 1. Learn from NTC's 25 years developing...
New-york-city-78181 Understanding 9/11: Why Did al Qai’da Attack America?
This course will explore the forces that led to the 9/11 attacks and the policies...
Small-icon.hover Aboriginal Worldviews and Education
This course will explore indigenous ways of knowing and how this knowledge can...
Ac-logo Analytic Combinatorics
Analytic Combinatorics teaches a calculus that enables precise quantitative...
Talk_bubble_fin2 Accountable Talk®: Conversation that Works
Designed for teachers and learners in every setting - in school and out, in...

© 2013-2019