Interview Questions Answers.ORG
Interviewer And Interviewee Guide
Interviews
Quizzes
Home
Quizzes
Interviews Data Warehouse Job Interviews:Ab InitioActuateAnalysis ServicesBO DesignerBusiness ObjectsCognos Data WarehouseColdFusionCrystal Enterprise SuiteData IntegrationData ScientistData StageData Warehouse AnalystData Warehouse BIData Warehouse Data MartData Warehouse DeveloperData Warehouse ManagerData Warehouse SupervisorData Warehouses MicroStrategyData WarehousingData Warehousing BasicsDatabase PartitioningEnterprise Data WarehouseETL (Extract, transform, load)General DatawarehousingHyperionImpromptuInformatica Data WarehousingLead Data ScientistMicro StrategyOperational Data StoreReportnetSAS (Statistical Analysis System)Warehouse AssistantWarehouse ConceptsWarehouse DataStageWarehouse Operation ManagerWarehouse Worker
Copyright © 2018. All Rights Reserved
Lead Data Scientist Interview Question:
Tell us how would you validate a model you created to generate a predictive model of a quantitative outcome variable using multiple regression?
Submitted by: MuhammadProposed methods for model validation:
☛ If the values predicted by the model are far outside of the response variable range, this would immediately indicate poor estimation or model inaccuracy.
☛ If the values seem to be reasonable, examine the parameters; any of the following would indicate poor estimation or multi-collinearity: opposite signs of expectations, unusually large or small values, or observed inconsistency when the model is fed new data.
☛ Use the model for prediction by feeding it new data, and use the coefficient of determination (R squared) as a model validity measure.
☛ Use data splitting to form a separate dataset for estimating model parameters, and another for validating predictions.
☛ Use jackknife resampling if the dataset contains a small number of instances, and measure validity with R squared and mean squared error (MSE).
Submitted by: Muhammad
☛ If the values predicted by the model are far outside of the response variable range, this would immediately indicate poor estimation or model inaccuracy.
☛ If the values seem to be reasonable, examine the parameters; any of the following would indicate poor estimation or multi-collinearity: opposite signs of expectations, unusually large or small values, or observed inconsistency when the model is fed new data.
☛ Use the model for prediction by feeding it new data, and use the coefficient of determination (R squared) as a model validity measure.
☛ Use data splitting to form a separate dataset for estimating model parameters, and another for validating predictions.
☛ Use jackknife resampling if the dataset contains a small number of instances, and measure validity with R squared and mean squared error (MSE).
Submitted by: Muhammad
Copyright 2007-2024 by Interview Questions Answers .ORG All Rights Reserved.
https://InterviewQuestionsAnswers.ORG.
https://InterviewQuestionsAnswers.ORG.