加勒比久久综合,国产精品伦一区二区,66精品视频在线观看,一区二区电影

合肥生活安徽新聞合肥交通合肥房產(chǎn)生活服務(wù)合肥教育合肥招聘合肥旅游文化藝術(shù)合肥美食合肥地圖合肥社保合肥醫(yī)院企業(yè)服務(wù)合肥法律

代寫MS6711、代做Python語言程序
代寫MS6711、代做Python語言程序

時間:2025-03-07  來源:合肥網(wǎng)hfw.cc  作者:hfw.cc 我要糾錯



MS6711 Data Mining
Homework 2
Instruction
This homework contains both coding and non-coding questions. Please submit two files,
1. One word or pdf document of answers and plots of ALL questions without coding details.
2. One jupyter notebook of your codes.
3. Questions 1 and 2 are about concepts, 3 - 6 are about coding.
1
Problem 1 [20 points]
We perform best subset, forward stepwise and backward stepwise selection on the same dataset with p
predictors. For each approach, we obtain p + 1 models containing 0, 1, 2, · · · , p predictors. Explain your
answer.
1. Which of the three models with same number of k predictors has smallest training RSS?
2. Which of the three models with same number of k predictors has smallest testing RSS? (best
subset, forward, backward, or cannot determine?)
3. True or False: The predictors in the k-variable model identified by forward stepwise are a subset of
the predictors in the (k + 1)-variable model identified by forward stepwise selection.
4. True or False: The predictors in the k-variable model identified by best subset are a subset of the
predictors in the (k + 1)-variable model identified by best subset selection.
5. True or False: The lasso, relative to OLS, is less flexible and hence will give improved prediction
accuracy when its increase in bias is less than its decrease in variance.
2
Problem 2 [20 points]
Suppose we estimate Lasso by minimizing
||Y − Xβ||2
2 + λ||β||1
for a particular value of λ. For part 1 to 5, indicate which of (a) to (e) is correct and explain your answer.
1. As we increase λ from 0, the training RSS will
(a) Increase initially, and then eventually start decreasing in an inverted U shape.
(b) Decrease initially, and then eventually start increasing in a U shape.
(c) Steadily increase.
(d) Steadily decrease.
(e) Remain constant.
2. Repeat 1. for test RSS.
3. Repeat 1. for variance.
4. Repeat 1. for (squared) bias.
3
Problem 3 [20 points]
These data record the level of atmospheric ozone concentration from eight daily meteorological mea surements made in the Los Angeles basin in 1976. We have the 330 complete cases1. We want to find
climate/weather factors that impact ozone readings. Ozone is a hazardous byproduct of burning fossil
fuels and can harm lung function. The data set for this problem is:
Variable name Definition
ozone Long Maximum Ozone
vh Vandenberg 500 mb Height
wind Wind speed (mph)
humidity Humidity (%)
temp Sandburg AFB Temperature
ibh Inversion Base Height
dpg Daggot Pressure Gradint
ibt Inversion Base Temperature
vis Visibility (miles)
doy Day of the Year
[Note: I would recommend you use R for this question, since python does not have package for
forward / backward selection. See the code example on Canvas. Or you may use the sample python code
I provided.]
1. Report result of linear regression using all variables. Note that ozone is the response variable to
predict. What variables are significant?
2. Report the selected variables using the following model selection approaches.
(a) All subset selection.
(b) Forward stepwise
(c) Backward stepwise
3. Compare the outcome of these methods with the significant variables found in the full linear regres sion in question 1.
4. Potentially, other transformation of covariates might be important. What happens if you do all
subset selection using both the original variables and their square? That is, for all variables, include
4
both
X, X2
in the linear regression model for all subset selection.
5
Problem 4 [20 points]
In this exercise, we will predict the number of applications received using the other variables in the College
data set.
Private Public/private school indicator
Apps Number of applications received
Accept Number of applicants accepted
Enroll Number of new students enrolled
Top10perc New students from top 10% of high school class
Top25perc 1 = New students from top 25 % of high school class
F.Undergrad Number of full-time undergraduates
P.Undergrad Number of part-time undergraduates
Outstate Out-of-state tuition
Room.Board Room and board costs
Books Estimated book costs
Personal Estimated personal spending
PhD Percent of faculty with Ph.D.
Terminal Percent of faculty with terminal degree
S.F.Ratio Student faculty ratio
perc.alumni Percent of alumni who donate
Expend Instructional expenditure per student
Grad.Rate Graduation rate
1. Split the data set into a training set and a test set.
2. Fit a linear regression model using OLS on the training set, and report the test error obtained.
3. Fit a ridge regression model on the training set, with λ chosen by cross-validation. Report the test
error obtained.
4. Fit a lasso model on the training set, with λ chosen by cross-validation. Report the test error
obtained, along with the number of non-zero coefficient estimates.
5. Fit a PCR model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of M selected by cross-validation.
6. Fit a PLS model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of number of components selected by cross-validation.
6
Problem 5 [20 points]
We will now try to predict per capita crime rate in the Boston data set.
crim per capita crime rate by town.
zn proportion of residential land zoned for lots over 25,000 sq.ft.
indus proportion of non-retail business acres per town.
chas Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).
nox nitrogen oxides concentration (parts per 10 million).
rm 1 = average number of rooms per dwelling.
age proportion of owner-occupied units built prior to 1940.
dis weighted mean of distances to five Boston employment centres.
rad index of accessibility to radial highways.
tax full-value property-tax rate per $10,000.
ptratio pupil-teacher ratio by town.
black 1000(Bk − 0.63)2 where Bk is the proportion of blacks by town.
lstat lower status of the population (percent).
medv median value of owner-occupied homes in $1000s.
1. Try out some of the regression methods explored in this chapter, such as best subset selection, the
lasso, ridge regression, PCR and partial least squares. Present and discuss results for the approaches
that you consider.
2. Propose a model (or set of models) that seem to perform well on this data set, and justify your
answer. Make sure that you are evaluating model performance using validation set error, cross validation, or some other reasonable alternative, as opposed to using training error.
3. Does your chosen model involve all of the features in the data set? Why or why not?
7
Problem 6 [20 points]
In a bike sharing system the process of obtaining membership, rental, and bike return is automated
via a network of kiosk locations throughout a city. In this problem, you will try to combine historical
usage patterns with weather data to forecast bike rental demand in the Capital Bikeshare program in
Washington, D.C.
You are provided hourly rental data collected from the Capital Bikeshare system spanning two years.
The file Bike train.csv, as the training set, contains data for the first 19 days of each month, while
Bike test.csv, as the test set, contains data from the 20th to the end of the month. The dataset includes
the following information:
daylabel day number ranging from 1 to 731
year, month, day, hour hourly date
season 1=spring,2=summer,3=fall,4=winter
holiday whether the day is considered a holiday
workingday whether the day is neither a weekend nor a holiday
weather 1 = clear, few clouds, partly cloudy
2 = mist + cloudy, mist + broken clouds, mist + few clouds, mist
3 = light snow, light rain + thunderstorm + scattered clouds, light rain
4 = 4 = heavy rain + ice pallets + thunderstorm + mist, snow + fog
temp temperature in Celsius
atemp ’feels like’ temperature in Celsius
humidity relative humidity
wind speed wind speed
count number of total rentals, outcome variable to predict
Predictions will be evaluated using the root mean squared error (RMSE), calculated as
RMSE =
v
u
u t
n
1
nX
i=1
(yi − ybi)
2
where yi
is the true count, ybi
is the prediction, and n is the number of entries to be evaluated.
Build a model on train dataset to predict the bikeshare counts for the hours recorded in the test
dataset. Report your prediction RMSE on testing set.
Some tips
• This is a relatively open question, you may use any model you learnt from this class.
8
• It will be helpful to examine the data graphically to spot any seasonal pattern or temporal trend.
• There is one day in the training data with weird atemp record and another day with abnormal
humidity. Find those rows and think about what you want to do with them. Is there anything
unusual in the test data?
• It might be helpful to transform the count to log(count + 1). If you did that, do not forget to
transform your predicted values back to count.
• Think about how you would include each predictor into the model, as continuous or as categorical?
• Is there any transformation of the predictors or interactions between them that you think might be
helpful?
Try to summarize your exploration of the data, and modeling process. You may fit a few models and
chose one from them. You will receive points based on your write-up and test RMSE. This is not a
competition among the class to achieve the minimal RMSE, but your result should be in a reasonable
range.


請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp



 

掃一掃在手機打開當前頁
  • 上一篇:INT5051代做、代寫Python編程設(shè)計
  • 下一篇:代寫COMP3334、代做C/C++,Python編程
  • 無相關(guān)信息
    合肥生活資訊

    合肥圖文信息
    2025年10月份更新拼多多改銷助手小象助手多多出評軟件
    2025年10月份更新拼多多改銷助手小象助手多
    有限元分析 CAE仿真分析服務(wù)-企業(yè)/產(chǎn)品研發(fā)/客戶要求/設(shè)計優(yōu)化
    有限元分析 CAE仿真分析服務(wù)-企業(yè)/產(chǎn)品研發(fā)
    急尋熱仿真分析?代做熱仿真服務(wù)+熱設(shè)計優(yōu)化
    急尋熱仿真分析?代做熱仿真服務(wù)+熱設(shè)計優(yōu)化
    出評 開團工具
    出評 開團工具
    挖掘機濾芯提升發(fā)動機性能
    挖掘機濾芯提升發(fā)動機性能
    海信羅馬假日洗衣機亮相AWE  復(fù)古美學與現(xiàn)代科技完美結(jié)合
    海信羅馬假日洗衣機亮相AWE 復(fù)古美學與現(xiàn)代
    合肥機場巴士4號線
    合肥機場巴士4號線
    合肥機場巴士3號線
    合肥機場巴士3號線
  • 短信驗證碼 目錄網(wǎng) 排行網(wǎng)

    關(guān)于我們 | 打賞支持 | 廣告服務(wù) | 聯(lián)系我們 | 網(wǎng)站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網(wǎng) 版權(quán)所有
    ICP備06013414號-3 公安備 42010502001045

    国产精品高清一区二区| 久久久成人网| 欧美日韩免费观看视频| 亚洲国产综合在线看不卡| 你懂的成人av| 午夜激情电影在线播放| 亚州av乱码久久精品蜜桃| 日韩美女精品| 麻豆成人久久精品二区三区红| 久久国产精品亚洲77777| 欧美激情久久久久久久久久久| 日本欧美一区二区三区| 国产在线美女| 黄色另类av| 中文字幕伦av一区二区邻居| 亚洲欧美久久精品| 国产精品蜜月aⅴ在线| 亚洲最黄网站| 欧美影院三区| 97se亚洲| 精品一区二区三区亚洲| 欧美激情福利| 第四色男人最爱上成人网| 午夜综合激情| 欧美粗暴jizz性欧美20| 91成人福利| 偷拍自拍亚洲色图| 日本va欧美va瓶| 51一区二区三区| 国产白浆在线免费观看| 亚洲美女视频在线免费观看| 99久久99视频只有精品| 国产精品99久久免费观看| 亚州精品视频| 亚洲综合专区| 麻豆国产精品一区二区三区| 97精品国产综合久久久动漫日韩| 免费一区二区视频| 91蜜臀精品国产自偷在线| 伊人天天综合| 国产麻豆精品| 麻豆一区二区三| 国产日韩亚洲| 精品福利在线| 青草综合视频| av成人亚洲| 最新日韩一区| 黄色成人在线视频| 欧洲午夜精品| 欧美一区=区三区| 中文在线8资源库| 日av在线不卡| 蜜桃一区二区三区四区| 免费欧美日韩国产三级电影| 免费在线看成人av| 欧美国产一级| a日韩av网址| 欧洲精品一区二区三区| 日日av拍夜夜添久久免费| 电影亚洲精品噜噜在线观看| 视频一区在线免费看| 亚洲精品66| 日韩专区中文字幕一区二区| 欧美一级网站| 综合激情网站| 久久99久久人婷婷精品综合| 偷拍自拍一区| 国产精品流白浆在线观看| 久久精品在线| 亚洲成人三区| 午夜在线播放视频欧美| 久久久久久一区二区| 色一区二区三区| 欧美亚洲黄色| 麻豆国产精品官网| 欧美日本成人| 国产香蕉精品| 亚洲精品一区二区在线看| 午夜在线a亚洲v天堂网2018| 国产精品久久久久蜜臀| 四虎在线精品| 欧美精品大片| 视频精品二区| 国模吧视频一区| 麻豆精品网站| 不卡亚洲精品| 国产精品一区二区美女视频免费看| 欧美激情在线精品一区二区三区| 亚洲不卡视频| 激情久久中文字幕| 日韩中文字幕麻豆| 欧美天堂在线| 国产精品片aa在线观看| 精品国产美女| 先锋亚洲精品| 福利精品在线| 国产精品一国产精品| 精品视频黄色| 久久av一区| 婷婷精品久久久久久久久久不卡| 欧美精品二区| 成午夜精品一区二区三区软件| 激情五月***国产精品| 亚洲第一区色| 久久久免费毛片| 免费看一区二区三区| 香蕉视频一区| zzzwww在线看片免费| 日本不卡视频在线| 日韩影片在线观看| 好吊一区二区三区| 亚洲mmav| 五月综合久久| 在线国产一区二区| 欧美日韩视频免费观看| 国产精品探花在线观看| 欧美一区二区麻豆红桃视频| 亚洲美女久久精品| av日韩久久| 亚洲性图久久| 亚洲精品.com| 日韩超碰人人爽人人做人人添| 欧洲杯半决赛直播| 国产精品成人国产| 蜜桃在线一区| 免费av网站大全久久| 欧美日韩专区| 99久久久久国产精品| 日韩在线不卡| 日韩精品一区国产| 成人激情视频| 国产欧美69| 亚洲少妇在线| 超碰在线99| 日韩一区二区中文| 亚洲国产欧美日韩在线观看第一区 | 麻豆国产一区| 免费精品99久久国产综合精品| 免费在线成人| 天天躁日日躁狠狠躁欧美| 天天综合网天天| 日韩极品在线| 97欧美在线视频| 亚洲另类av| 日韩中文字幕麻豆| 久久av中文| 爽成人777777婷婷| 警花av一区二区三区| 久久久久免费| 日韩成人免费在线| 狠狠躁少妇一区二区三区| 日韩成人18| 校园春色亚洲| 好吊妞国产欧美日韩免费观看网站| 日韩深夜视频| 超碰成人在线观看| 日韩精品第一| 亚洲网站啪啪| 欧美日韩一区二区国产| 午夜日韩电影| 国产精品免费精品自在线观看| 香蕉久久久久久久av网站| 欧美猛男同性videos| 欧美激情国产在线| 91免费精品国偷自产在线在线| 欧美天堂视频| 亚洲二区精品| 亚洲在线久久| 日韩不卡一区| 国产精品毛片视频| 久久精品久久精品| 亚洲激情婷婷| 日韩最新在线| 欧美国产大片| 欧美69视频| 国产亚洲电影| 色综合天天色| 婷婷综合网站| 亚洲婷婷丁香| 偷拍视频一区二区三区| 欧美先锋资源| 国产中文欧美日韩在线| 91av亚洲| 天堂网在线观看国产精品| 国产乱码精品一区二区三区亚洲人| 日本一区二区三区视频| 精品少妇av| 99综合久久| 日韩精品不卡一区二区| 亚洲精品在线观看91| 国产精品中文字幕亚洲欧美| 日韩一区二区中文| 伊人情人综合网| 警花av一区二区三区| 99精品热6080yy久久| 久热综合在线亚洲精品| 精品免费视频| 欧美禁忌电影| 久久国产乱子精品免费女| 色呦哟—国产精品|