Posts in 'Machine learning'

Ramandeep Singh Nanda

Why you should use square root of Gini Index

In this post I will explain why you should use square root of Gini index while building decision tree classification models. In decision tress, We know that at every node we need to choose a feature that provides the best split i.e. the feature that reduces the child nodes ...

Ramandeep Singh Nanda

Opening Box Office Weekend Prediction

Introduction

We investigate whether tweets the amount and sentiment in them can predict opening weekend of box office. Specifically we target a threshold i.e. around 30 million dollars, but more specifically it is the mean of the opening weekend of the entire dataset.

About the Dataset

Labeled data for classifying tweets

This dataset was obtained from multiple sources and contains manually labeled dataset.