Beginner's Guide to Feature Selection and Categorical Embeddings with a Project on Unclean Structured Data

Mar 4, 2021



Have you ever been faced with a dataset so unclean and irregular that you felt terrified? 😯 If your answer is NO, you have either been practicing deep learning for over 3 years or you rarely practice at all. Jokes aside, we deep learners frequently face rather unclean data that needs a tedious amount of processing to become usable. In this tutorial, we will tackle several of these problems by solving a machine learning problem in a beginner-friendly way. Let's get started!

In this tutorial, we cover the following:

  • Real World Structured Data
  • Example Project
  • Dataset Visualization with Seaborn and Pandas
  • Tackling NaN Values
  • Performing Categorical Embeddings
  • Data Imputation Methods
  • How to select priority features (Feature Selection)
  • Fitting Model
  • Feedback and Summary

NOTE: If at any point you feel confused, check my Colab notebook and follow along with it so you can see the full flow of data between data frames: Full Notebook Summary

Structured Data in the Real World

Structured data is data that can be tabulated or visualized in a table format. In machine learning, the term usually refers to numerical data, also known as continuous values. Structured data is logically the opposite of unstructured data, which cannot easily be tabulated or which has an irregular format; examples are images, text, and video data.

Structured data conforms to a tabular format with a relationship between the different rows and columns. Common examples of structured data are Excel files or SQL databases. — Big Data Framework1

In the real world, we gather a lot of structured data, and when it is collected, little consideration is given to how it might later be used for machine learning or gathering insights. An example: imagine a law firm kept records on all its customers in 2019, with information on, say, how long the company's representatives talked on the phone with each customer, how much money each customer invested in insurance, and so on. Records like these contain raw data that is not readily usable as-is, and employing data cleaning methods is the only viable option. Let's dive into the dataset for this tutorial.

Example Project

The dataset for this tutorial contains two files:

  • train.csv: 6500 X 20
  • test.csv: 3500 X 19

In other words, we have a training data frame with 6500 rows and 20 columns and a testing/evaluating data frame with 3500 rows and 19 columns.

The task

You work for a company that sells sculptures acquired from various artists around the world. Your task is to predict the cost required to ship these sculptures to customers based on the information provided in the dataset. The data frames contain several columns that represent the features we are working with, and we need to build a model that predicts the target column (Cost), which is present in the training set and missing from the test set. Let's visualize the dataset to see what it looks like.

Dataset Visualization with Seaborn and Pandas

To visualize the dataset, we first need to load it up and import the necessary packages and modules. I'm working on Colab for this tutorial, and the dataset is stored in my Drive. To follow along with this guide, download the dataset here and upload the zip file to your Drive.

  • The first thing we need to do is to mount Drive:
from google.colab import drive
drive.mount('/content/drive')
  • Import necessary packages which we would use throughout the project
import os
import random
import tensorflow as tf
import numpy as np
import pandas as pd
# Plotting libraries for visualization
import matplotlib.pyplot as plt
import seaborn as sns
  • Next, let's import the ZipFile class and extract the dataset. Having learned that using the alias name 'zip' shadows Python's built-in zip and causes errors later on, I decided to use a different name, i.e., grab.
from zipfile import ZipFile

file_name = '/path/to/dataset'
with ZipFile(file_name, 'r') as grab:
    grab.extractall('/path/')
print('Done')
  • Now with the dataset extracted, we can run some visualizations using pandas and seaborn. First, let's read the train.csv file with pandas, draw the head, and quickly check for NaN values.
raw_dataset = pd.read_csv('/path/to/extracted/csv')
dataset = raw_dataset.copy()
dataset.head()
# Checking for NaN values
print(dataset.isnull().sum())

From the output above, we can see the 20 columns of the training set: Customer Id, Artist Name, Artist Reputation, Width, Height, Weight, Material, Price of Sculpture, Base Shipping Price, International, Express Shipment, Installation Included, Transport, Fragile, Customer Information, Remote Location, Scheduled Date, Delivery Date, Customer Location, and Cost. We also have a lot of NaN values in our data; these will be tackled after dropping irrelevant columns with string values and over-tedious formats like dates.
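
If you'd like an actual seaborn picture of where the missing values live, a quick sketch like the one below works (it assumes the dataset copy created above):

# Count missing values per column and plot them as a horizontal bar chart
nan_counts = dataset.isnull().sum().sort_values(ascending=False)
plt.figure(figsize=(10, 6))
sns.barplot(x=nan_counts.values, y=nan_counts.index)
plt.xlabel('Number of missing values')
plt.title('Missing values per column')
plt.tight_layout()
plt.show()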

Tackling NaN Values & Irrelevant Features

Some particular columns contain string values and other formats that are irrelevant in predicting the cost of the artwork.

For this next step, we have two options to consider. Should we:

  • Start with categorical embeddings, or
  • First drop the columns we won't need?

Let's approach this logically: if we start with the first option, we are likely to run into columns that can't be embedded (or would be too tedious to embed). But if we drop them first, we can exclude the clearly irrelevant features before moving on to embedding the relevant columns.

Before dropping columns in the training and test sets, let's first pop out our target column (Cost) and save it. Later we can reference it when fitting our model.

X, y = raw_dataset.drop('Cost', axis=1), raw_dataset.iloc[:,-1:]
y.head()

In the above code block, we separated Cost from the main train data frame and stored it in a new data frame. On drawing the head, though, we can see something strange at work: the cost contains both negative and positive values. This won't do; we want only positive values. As I said earlier, real-world data never comes the way you expect. This could just be a wrong entry by a tired cashier, or perhaps a more plausible explanation is that the company takes the loss on sculptures delivered with defects. Either way, we can easily correct this using the absolute-value function in pandas, since the absolute value of any number is its positive counterpart.

train_Y = y['Cost'].abs()
train_Y.head()

With the above code, our target variable now contains only positive values. With that taken care of, we can proceed to drop the irrelevant columns from our training set (and apply the same drops to the test set).

import pandas as pd
import numpy as np

X = raw_dataset.copy()
train = X.drop(['Cost','Customer Id','Artist Name','Delivery Date','Scheduled Date','Customer Location'], axis=1)
# The test set needs the same treatment (it has no Cost column); the path is assumed from the extracted zip
test = pd.read_csv('/path/to/extracted/test.csv')
test = test.drop(['Customer Id','Artist Name','Delivery Date','Scheduled Date','Customer Location'], axis=1)

Earlier, I checked our columns for NaN values; in total it returned about 4,000. Looking at each column, though, NaN values occur more often in some of them, especially Material, Transport, Remote Location, Width, Height, and Weight. For this step, we first want to replace NaN values in the categorical columns with more than two classes. Material, Remote Location, and Transport fit the description; the reason is that if we embedded them while they still contain NaN values, we would risk losing the relative relationships in our features. To do this, we will create a simple function that fills NaN values with the most frequently occurring category in the column. For example, in the Material column we have 7 classes (Brass, Stone, Aluminium, Bronze, Clay, etc.); our function loops over the chosen columns and replaces NaN values with the most frequent class.

# Function to replace NaN values with the mode (most frequent category)
def replace_nan_most_freq(DataFrame, ColName):
    most_frequent_category = DataFrame[ColName].mode()[0]
    # Copy the column into a new "-Imputed" column and fill NaN with the most frequent category
    DataFrame[ColName + "-Imputed"] = DataFrame[ColName]
    DataFrame[ColName + "-Imputed"].fillna(most_frequent_category, inplace=True)

# Call the function to impute the most frequent category
for Columns in ['Material', 'Remote Location', 'Transport']:
    replace_nan_most_freq(train, Columns)
    replace_nan_most_freq(test, Columns)

# Display the imputed result
train[['Material','Material-Imputed','Remote Location','Remote Location-Imputed','Transport','Transport-Imputed']].head(10)
test[['Material','Material-Imputed','Remote Location','Remote Location-Imputed','Transport','Transport-Imputed']].head(10)

# Drop the original columns
train = train.drop(['Material', 'Remote Location', 'Transport'], axis=1)
test = test.drop(['Material', 'Remote Location', 'Transport'], axis=1)

With that, if we display the train and test data frames, we get the following: as you can see, Material has been replaced by Material-Imputed, and likewise for Transport and Remote Location.
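
As a quick sanity check, you can confirm that the new imputed columns no longer contain any NaN values (a small sketch; the column names carry the '-Imputed' suffix created by our function):

# Confirm the imputed columns contain no missing values
imputed_cols = ['Material-Imputed', 'Remote Location-Imputed', 'Transport-Imputed']
print(train[imputed_cols].isnull().sum())
print(test[imputed_cols].isnull().sum())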

Categorical Embeddings

For the categorical embedding step, we want to find the columns with a data type of object and label encode them using scikit-learn's LabelEncoder.

# Get list of categorical variables
s = (train.dtypes == 'object')
object_cols = list(s[s].index)

print("Categorical variables:")
print(object_cols)

from sklearn.preprocessing import LabelEncoder

# Make copy to avoid changing original data 
label_X_train = train.copy()
label_X_test = test.copy()

# Apply label encoder to each column with categorical data
label_encoder = LabelEncoder()
for col in object_cols:
    label_X_train[col] = label_encoder.fit_transform(train[col])
    label_X_test[col] = label_encoder.fit_transform(test[col])

With that, plotting the head gives us this:

All category columns have been label encoded to numerical values. The last step is to fill the remaining columns that still contain NaN values, such as Width, Height, and Weight, because, if you remember, we only replaced the categorical NaN values above. The code below uses scikit-learn's SimpleImputer.

from sklearn.impute import SimpleImputer
# Imputation
my_imputer = SimpleImputer()
imputed_X_train = pd.DataFrame(my_imputer.fit_transform(label_X_train))
imputed_X_test = pd.DataFrame(my_imputer.fit_transform(label_X_test))
# Imputation removed column names; put them back
imputed_X_train.columns = train.columns
imputed_X_test.columns = test.columns

Now if we check for Nan values, it returns zero across all columns.

Feature Selection

With that taken care of, we can now start doing some feature selection and decide which columns are of no use to us. A lot of logical thinking is needed here, since this is a real-world problem and real-world insight is required.

NOTE: Any column we decide to drop must also be dropped in the test set, and likewise for any preprocessing step we take, because the test set is what we will be making predictions on, and inconsistent columns would result in errors.
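
A cheap way to guard against this (a minimal sketch using the imputed frames from the previous step) is to assert that both frames expose exactly the same columns before fitting anything:

# Guard against train/test drift after dropping, encoding, and imputing columns
assert list(imputed_X_train.columns) == list(imputed_X_test.columns), \
    "Train and test frames have diverged; re-check the preprocessing steps"

If the assertion fails, go back and make sure every drop and every imputation was applied to both data frames.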

How to select Priority Features (Methods you can use)

When faced with a feature selection problem in deep learning and machine learning, there are several methods you can apply to arrive at better features during training and, therefore, better model accuracy.

We now have 19 columns in our training set after dropping Cost.

Description of variables in the above file

  • Customer ID: A set of unique values associated with every customer (This adds absolutely nothing of value)

  • Artist Name: The name of the Artist who created the artwork
  • Artist Reputation: A float value important for understanding how expensive an art would be
  • Weight: How heavy the artwork is. We know that heavier artwork isn't necessarily more expensive, but it is still a good judge of worth.
  • Width: Width of Artwork
  • Height: height of the artwork
  • Material: A very important feature
  • Base Shipping Price: The original cost for shipping artwork from one location to another. It varies (Important)
  • International: Geographic location of a customer. Across borders means extra customs cost (Important)
  • Express Shipment: Value depicting if a Customer requested for express shipment (Important)
  • Installation Included: Should artwork be installed along with delivery? (Important)
  • Fragile: How fragile the artwork is. Just like weight, more fragile doesn’t mean more expensive. It’s a shaky yet important feature (Stable)
  • Transport: What means of transportation is used to deliver Artwork. Airplanes generally cost more (Important)
  • Remote Location: What kind of environment the customer resides in. The less accessible the location, the more the delivery costs? This feature is a little unspecific, because it could easily be the other way round: a poorer customer is less likely to pay more. (Stable)
  • Customer Information: How financially stable is the purchaser. More means more likely to give tips, request installations, quicker delivery, etc. (Important)
  • Customer Location: Where a customer resides. A bunch of specific locations that would be too tedious to embed. Also, Remote Location gathers similar Information (Redundant)
  • Delivery Date: The date the purchaser wants the artwork delivered by. A good but redundant feature; features like Express Shipment and Installation Included are better judges of how quickly the customer wants the artwork (Redundant)
  • Scheduled Date: Same as Delivery Date (Redundant)

The above can be achieved with a little logical deduction and insight, but as you can see, there are still some uncertainties. By using statistical methods, tests, and libraries, we have better grounds to decide.
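
The code snippets for the three methods below operate on a data frame called x_final, which in the accompanying notebook is the fully preprocessed training frame. If you are following along with only the code in this post, a reasonable stand-in is simply the imputed, label-encoded training frame:

# x_final: the preprocessed training features that the selection methods below will score
x_final = imputed_X_train.copy()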

  1. Univariate Selection

    Statistical tests can be used to select the features that have the strongest relationship with the output variable. The scikit-learn library provides the SelectKBest class, which can be used with a suite of different statistical tests to select a specific number of features. The example below uses the f_classif statistical test to select the 10 best features from the Art Exhibition dataset. Any string-valued feature columns, e.g. Customer Id and Artist Name, must already be dropped before you can use this method.

#apply SelectKBest class to extract top 10 best features
from sklearn.feature_selection import SelectKBest, f_classif
bestfeatures = SelectKBest(score_func=f_classif, k=10)
fit = bestfeatures.fit(x_final,train_Y)
dfscores = pd.DataFrame(fit.scores_)
dfcolumns = pd.DataFrame(x_final.columns)
#concat two dataframes for better visualization 
featureScores = pd.concat([dfcolumns,dfscores],axis=1)
featureScores.columns = ['Specs','Score']  #naming the dataframe columns
print(featureScores.nlargest(10,'Score'))  #print 10 best features

The ten best features are displayed below, helping us get rid of uncertainties like the Fragile column.

  2. Feature Importance

    You can get the importance of each feature of your dataset by using the feature_importances_ property of the model. Feature importance gives you a score for each feature of your data; the higher the score, the more important or relevant that feature is to your output variable. Feature importances come built in with tree-based models; we will be using ExtraTreesRegressor to extract the top 10 features for the dataset.

from sklearn.ensemble import ExtraTreesRegressor
import matplotlib.pyplot as plt
model = ExtraTreesRegressor()
model.fit(x_final,train_Y)
print(model.feature_importances_) #use inbuilt class feature_importances of tree based classifiers
#plot graph of feature importances for better visualization
feat_importances = pd.Series(model.feature_importances_, index=x_final.columns)
feat_importances.nlargest(10).plot(kind='barh')
plt.show()

Again, we see that certain feature columns are among the top 10 using this selection method. We now know the most important features we should target when we limit the training features to, say, 10.

  3. Correlation Matrix with Heatmap

    Correlation states how the features are related to each other or to the target variable. Correlation can be positive (an increase in one feature's value increases the value of the target variable) or negative (an increase in one feature's value decreases the value of the target variable). A heatmap makes it easy to identify which features are most related to the target variable; we will plot a heatmap of correlated features using the seaborn library.

#get correlations of each features in dataset
corrmat = raw_dataset.corr()
top_corr_features = corrmat.index
plt.figure(figsize=(20,20))
#plot heat map
g=sns.heatmap(raw_dataset[top_corr_features].corr(),annot=True,cmap="RdYlGn")
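
If you prefer numbers to colours, you can also read the strongest relationships with the target straight off the correlation matrix (a small sketch, assuming Cost is still a column of raw_dataset):

# Rank the numeric features by the absolute strength of their correlation with Cost
cost_corr = corrmat['Cost'].drop('Cost').abs().sort_values(ascending=False)
print(cost_corr)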

Now we're talking. Let's round up the 11 features we want to use for training by selecting those that occur most often across the feature selection methods above.

Eleven Best Features to Use

  1. Price of Sculpture
  2. Weight
  3. Artist Reputation
  4. Base Shipping Price
  5. Width
  6. Height
  7. Express Shipment
  8. International
  9. Transport-Imputed
  10. Material-Imputed
  11. Customer Information

Fitting the Model

We will quickly drop the remaining columns while keeping the ones above, and finally proceed to scale our data and fit our model, using scikit-learn's StandardScaler and RandomForestRegressor respectively.

x_train = imputed_X_train.drop(['Fragile','Remote Location-Imputed','Installation Included'], axis=1)
x_test = imputed_X_test.drop(['Fragile','Remote Location-Imputed','Installation Included'], axis=1)
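
Equivalently, and perhaps more robustly, you can select the columns you want to keep by name instead of dropping the leftovers. This is just a sketch: the names must match your data frame's columns exactly, and the imputed ones carry the '-Imputed' suffix created earlier.

# Keep only the shortlisted features by selecting them by name
selected_features = ['Price of Sculpture', 'Weight', 'Artist Reputation',
                     'Base Shipping Price', 'Width', 'Height', 'Express Shipment',
                     'International', 'Transport-Imputed', 'Material-Imputed',
                     'Customer Information']
x_train = imputed_X_train[selected_features]
x_test = imputed_X_test[selected_features]
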
from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
# Fit the scaler on the training features and apply the same scaling to the test features
x_train = pd.DataFrame(scaler.fit_transform(x_train), columns=x_train.columns)
x_test = pd.DataFrame(scaler.transform(x_test), columns=x_test.columns)

# Import the Random Forest model
from sklearn.ensemble import RandomForestRegressor

# Create a Random Forest regressor
rgf = RandomForestRegressor(n_estimators=100, random_state=42)

# Train the model on the training features and target
rgf.fit(x_train, train_Y)

Last but not least, we need to run our model on the test set and arrange the predictions in a new data frame with two columns: Customer Id and Cost.
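
A minimal sketch of that last step is shown below (it assumes the extracted test.csv is still available so we can recover the original Customer Id column):

# Predict shipping costs for the test set and build the submission data frame
predictions = rgf.predict(x_test)

raw_test = pd.read_csv('/path/to/extracted/test.csv')  # the untouched file still has Customer Id
submission = pd.DataFrame({'Customer Id': raw_test['Customer Id'],
                           'Cost': predictions})
submission.to_csv('submission.csv', index=False)
submission.head()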

And this is what the final result looks like. You can download the .csv file here.

Feedback and Summary

We have had fun with this project: cleaning, visualizing, dropping columns, performing feature engineering, and learning how to use the scikit-learn library for machine learning work. It's a comprehensive guide in which I tackled problems to show you how you can solve similar ones. Naturally, you will have questions, and I'm happy to answer them; the comments box below is available for your chats. Thanks for reading this article. I hope it has achieved its purpose of guiding you through the concepts of feature selection and feature engineering, Chel.

  1. The above quote is a definition extracted from Big Data Framework's article written on January 9th, 2019.