Project 2 : Programmers Salary Prediction
Determine the salary of a developer.
We do this by exploring the following:
-
Since we want to make a salary prediction, we will try two models, Linear Regression model and a lasso regression model
-
Since our data contains 4 no numerical columns, we use OneHotEncoder function from sklean libriary
- OneHotEncoder converts categorical columns to nemerical columns by getting their dummpy values.
- I choose OneHotEncoder over get dummies because it can easily be added to my pipeline
-
Next we use a train test split to separate our data into the test set and the training set
-
Final we use a Pipline with cross validation te decide between the Lasso Regression and the linear regression model
-
Save model and build functions to make a future calculation
