My intro to Kaggle
Anyone who has peaked into the worlds of data science or machine learning is likely to have heard of 'Kaggle'. They've been a resource for curated machine learning datasets for a number of years, and are becoming a hub for learning about, talking about, and practicing machine learning. I spent a short while working my way through the short but highly interactive courses on the Kaggle site, and found these to be terrifically rewarding and engaging. The next step from working on any of these courses is to try your hand at a competition hosted on the site. The titanic competition is the most famous and likely the most accessible of these endeavours. The titanic competition provides a number of features about the passengers of the titanic, and asks entrants to predict survival based upon what is known. The features are as follows: Variable Definition Key survival Survival 0 = No, 1 = Yes pclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd sex Sex...