NOC | Bandit Algorithm (Online Machine Learning)

Course abstract

In many scenarios, one faces uncertain environments where a-priori the best action to play is unknown. How to obtain best possible reward/utility in such scenarios. One natural way is to first explore the environment and to identify the `best’ actions and exploit them. However, this give raise to an exploration vs exploitation dilemma, where on hand hand we need to do sufficient explorations to identify the best action so that we are confident about its optimality, and on the other hand, best actions need to exploited more number of times to obtain higher reward. In this course we will study many bandit algorithms that balance exploration and exploitation well in various random environment to accumulate good rewards over the duration of play. Bandit algorithms find applications in online advertising, recommendation systems, auctions, routing, e-commerce or in any filed online scenarios where information can be gather in an increment fashion.

Course Instructor

Prof. Manjesh hanawal

Prof. Manjesh hanawal received the M. S. degree in ECE from the Indian Institute of Science, Bangalore, India, in 2009, and the PhD degree from INRIA, Sophia Antipolis, France, and the University of Avignon, France, in 2013. After two years of postdoc at Boston University, he is now an Assistant Professor in Industrial Engineering and Operations Research at the IIT Bombay, India. His research interests include performance evaluation, machine learning and network economics. He is a recipient of Inspire Faculty Award from DST and Early Career Research Award from SERB.
More info

Teaching Assistant(s)

No teaching assistant data available for this course yet

Course Duration : Sep-Dec 2020

View Course

Enrollment : 20-May-2020 to 21-Sep-2020

Exam registration : 14-Sep-2020 to 02-Nov-2020

Exam Date : 19-Dec-2020

Enrolled

3591

Registered

Certificate Eligible

Certified Category Count

Gold

Silver

Elite

Successfully completed

Participation

Success

Elite

Silver

Gold

Legend

AVERAGE ASSIGNMENT SCORE >=10/25 AND EXAM SCORE >= 30/75 AND FINAL SCORE >=40
BASED ON THE FINAL SCORE, Certificate criteria will be as below:
>=90 - Elite + Gold
75-89 -Elite + Silver
>=60 - Elite
40-59 - Successfully Completed

Final Score Calculation Logic

Assignment Score = Average of best 8 out of 12 assignments.
Final Score(Score on Certificate)= 75% of Exam Score + 25% of Assignment Score

Bandit Algorithm (Online Machine Learning) - Toppers list

M AKASH KUMAR 90%

Indian Institute of Technology,Madras

Enrollment Statistics

Total Enrollment: 3591

Registration Statistics

Total Registration : 11

Assignment Statistics

Assignment

Exam score

Final score

Course Name: Bandit Algorithm (Online Machine Learning)

Course abstract

Course Instructor

Prof. Manjesh hanawal

Teaching Assistant(s)

Course Duration : Sep-Dec 2020

View Course

Enrollment : 20-May-2020 to 21-Sep-2020

Exam registration : 14-Sep-2020 to 02-Nov-2020

Exam Date : 19-Dec-2020

Enrolled

Registered

Certificate Eligible

Certified Category Count

Gold

Silver

Elite

Successfully completed

Participation

Success

Elite

Silver

Gold

Legend

Final Score Calculation Logic

Enrollment Statistics

Total Enrollment: 3591

Registration Statistics

Total Registration : 11

Assignment Statistics

Score Distribution Graph - Legend

Assignment Score: Distribution of average scores garnered by students per assignment.

Exam Score : Distribution of the final exam score of students.

Final Score : Distribution of the combined score of assignments and final exam, based on the score logic.