Master of Science in Business Analytics

Hateful Memes

Your goal is to predict whether a meme is hateful or non-hateful. This is a binary classification problem with multimodal input data consisting of the meme image itself (the image mode) and a string representing the text in the meme image (the text mode).

Given a meme id, meme image file, and a string representing the text in the meme image, your trained model should output the probability that the meme is hateful.
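
The input/output contract above can be sketched in Python as follows. This is only an illustration under stated assumptions, not a prescribed design: the `Meme` record, the `predict_hateful_proba` function, the two placeholder scorers, and the late-fusion weighting (averaging one score per mode and squashing it with a sigmoid) are all hypothetical choices made for this example. A real solution would replace the placeholder scorers with trained image and text models.

```python
import math
from dataclasses import dataclass
from typing import Callable


@dataclass
class Meme:
    """One example: meme id, path to the image file, and the text in the image."""
    meme_id: int
    image_path: str
    text: str


def sigmoid(z: float) -> float:
    """Map a real-valued score to a probability in (0, 1), avoiding overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_hateful_proba(
    meme: Meme,
    image_score: Callable[[str], float],
    text_score: Callable[[str], float],
    image_weight: float = 0.5,
) -> float:
    """Late fusion (an assumption): weighted average of one score per mode."""
    fused = (image_weight * image_score(meme.image_path)
             + (1.0 - image_weight) * text_score(meme.text))
    return sigmoid(fused)


# Placeholder scorers standing in for trained models (hypothetical).
def dummy_image_score(image_path: str) -> float:
    return 0.0  # a trained image model would return a real-valued score here


def dummy_text_score(text: str) -> float:
    return 0.0  # a trained text model would return a real-valued score here


# Hypothetical example record; the path and text are made up.
example = Meme(meme_id=1, image_path="img/00001.png", text="example meme text")
p = predict_hateful_proba(example, dummy_image_score, dummy_text_score)
print(f"meme {example.meme_id}: P(hateful) = {p:.2f}")
```

Keeping the two modes behind separate scoring functions makes it easy to test an image-only or text-only baseline first (set `image_weight` to 1.0 or 0.0) before trying a joint model.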

Background and Skills Preferred

Python programming, machine learning basics

Timeline

now – 9/20, learning basic text mining; data collection, basic data exploration and analysis, understanding project objectives
9/21 – 10/10, model selection, testing models, parameter tuning
10/11 – 10/31, finding good models, running numerical tests, reporting results (we will try to contribute to the competition by submitting our results)
11/1 – 11/20, documenting results, comparing with others, improving models
11/21 – 12/10, finalizing tests, writing project report, project presentation

Deliverables

Python code with numerical demos; Project report; Project presentation with slides.

Source

Facebook Artificial Intelligence

Notes

The competition ends 10/30. The dataset is large (~3.4 GB) and contains image and text data along with JSON files.
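
As a starting point for data exploration, the annotation files could be read along the lines below. This is a sketch under assumptions that should be checked against the downloaded data: it assumes the JSON files are in JSON Lines form (one object per line) and that each record has `id`, `img`, `text`, and `label` fields. The `load_annotations` helper and the sample rows are hypothetical and made up for illustration; they are not taken from the dataset.

```python
import json
import tempfile
from pathlib import Path


def load_annotations(path):
    """Read one JSON object per line, skipping blank lines."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


# Made-up rows for illustration only; field names are an assumption.
sample_rows = [
    {"id": 1, "img": "img/00001.png", "text": "first example text", "label": 0},
    {"id": 2, "img": "img/00002.png", "text": "second example text", "label": 1},
]

# Write the sample rows to a temporary file so the sketch runs on its own.
with tempfile.TemporaryDirectory() as tmp:
    sample_path = Path(tmp) / "sample.jsonl"
    sample_path.write_text(
        "\n".join(json.dumps(row) for row in sample_rows) + "\n",
        encoding="utf-8",
    )
    records = load_annotations(sample_path)

# A first exploration step: how many examples, and what share is labeled hateful.
hateful_share = sum(r["label"] for r in records) / len(records)
print(len(records), hateful_share)
```

Because the annotations are small compared with the images, they can be loaded fully into memory, while the image files are better read lazily by path when a model needs them.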