Hello everyone, Your term project is now available. We have chosen a project fro
ID: 3818842 • Letter: H
Question
Hello everyone, Your term project is now available. We have chosen a project from www.kaggle.com as your term project. You can access exhaustive information from the link provided below. Your task is to classify the data which has 93 features and more than 200.000 products from Otto group. It is still possible to submit your result through website. In this project, you can use any kind of programming language(python, scala, matlab, java, R etc) and framework(scikit,mllib etc.) to build your model. Project: https://www.kaggle.eom/c/otto-group-product-classification-challenge/data (Of course you can copy the following code snippet from Kernel!) import pandas as pd train = pd.read csv("../input/train.csv")Explanation / Answer
require(xgboost) require(methods) train = read.csv('../input/train.csv',header=TRUE,stringsAsFactors = F) test = read.csv('../input/test.csv',header=TRUE,stringsAsFactors = F) train = train[,-1] test = test[,-1] y = train[,ncol(train)] y = gsub('Class_','',y) y = as.integer(y)-1 #xgboost take features in [0,numOfClass) x = rbind(train[,-ncol(train)],test) x = as.matrix(x) x = matrix(as.numeric(x),nrow(x),ncol(x)) trind = 1:length(y) teind = (nrow(train)+1):nrow(x) # Set necessary parameter paramRelated Questions
Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
drjack9650@gmail.com
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.