Target:
Identify, analyze, and evaluate quantitative and qualitative healthcare data and information for effective decision making in various healthcare settings
Explain data to a health care audience in a clear, concise and persuasive manner, consistent with best practices in the field of health care management to inform or advocate change.
Assess the ethical challenges in designing a research study in the use of human subjects to ensure compliance with national and local standards.
The main idea of the final exam is to evaluate students analytical skills. For the final exam, students will need to use RStudio (2 % BONUS), Excel or any other statistical packages (approved by the faculty).
Download file from here
Data for RStudio users:
Data for RStudio Users
Data for Excel users:
Data for Excel Users
Codes for RStudio users:
Dplyr Codes
No Dyplr Codes
:
Final Exam Questions Updated: 3/14/2021
University of Maryland Global Campus (UMGC)
HMGT 400 Research and Data Analysis in Health- FINAL EXAM
Dataset: HMGTFINALEXAM.csv (Please download dataset from the class)
Required program: RStudio or R Programming.
Author, Hossein Zare, PhD
Citation: Zare, H. (2019). HMGT 400 Research and Data Analysis in Health Care. FINAL EXAM. UMGC.EDU
Question #1 (15 credits): [RStudio Users and Excel Users]
The FINAL EXAM dataset, provides some information about hospitals in 2011 and 2012, download the FINAL
EXAM data and then complete the descriptive table. Please answer the following questions.
A) In term of hospital characters what are the significant difference between 2011 and 2012?
B) In term of socio-economic variables what are the significant difference between 2011 and 2012?
To report the “Per Capita Hospital Beds to Population”, you need to divide “total_hospital_beds/
tot_population)
C) Based on your findings in which years hospitals had better performance? How hospital performance related
to hospital characteristics and socio-economic characteristics? Please write at least three main different
movements between 2011 and 2012.
Table 1. Descriptive statistics between hospitals in 2011 & 2012
N
Hospital Characteristics
1. Hospital beds
2. Number of paid Employee
3. Number of non-paid Employee
4. Internes and Residents
5. System Membership
2011
Mean
100
St. Dev
N
2012
Mean
150
St. Dev
p-value
0.004
6. Total hospital cost
7. Total hospital revenues
8. Hospital net benefit
9. Available Medicare days
10. Available Medicaid days
11. Total Hospital Discharge
12. Medicare discharge
13. Medicaid discharge
Socio-Economic Variables
14. Per Capita Hospital Beds to
Population
15. Percent of population under
poverty
16. Percent of Female population
under poverty
17. Percent of Male population
under poverty
18. Median Household Income
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 1
Question #2 (15 credits): [RStudio Users and Excel Users]
Use the final exam dataset and then answer the following questions:
1) Compare the following information between for-profit and non-for-profit hospitals.
2) What are the main significant differences between for-profit and non-for-profit hospitals? Which test is
the best fit test? Why?
3) Use a box-plot and compare Hospital net benefit between for-profit and non-for-profit hospitals.
4) Show another scatter plot and compare hospital cost (x-axes) and revenue (y-axes) and discuss your
findings?
5) Comparing hospital net-benefit which hospitals has better performance? To answer this question first
compute the hospital net benefits with subtracting hospital costs and revenues and then use ttest to
compare the significant differences between FP and NFP hospitals.
6) Overall, what are the main significant differences between for-profit and non-for-profit hospitals?
Table 2. Descriptive statistics between FP & NFP
N
For Profit
Mean
St. Dev
N
Non-For-Profit
Mean
St. Dev
p-value
Hospital Characteristics
1. Hospital beds
2. Number of paid Employee
3. Number of non-paid Employee
4. Internes and Residents
5. System Membership
6. Total hospital cost
7. Total hospital revenues
8. Hospital net benefit
9. Available Medicare days
10. Available Medicaid days
11. Total Hospital Discharge
12. Medicare discharge
13. Medicaid discharge
Socio-Economic Variables
14. Per Capita Hospital Beds to
Population
15. Percent of population under
poverty
16. Percent of Female population
under poverty
17. Percent of Male population
under poverty
18. Median Household Income
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 2
Question #3 (15 credits): [RStudio Users and Excel Users]
The dataset provides Herfindahl–Hirschman Index for health insurance market, please use the herf_ins variable and
answer the following questions:
For this exercise you do not need to compute the HHI, but if you have any questions, please do not hesitate to ask
me, but try to learn more about this you will need that to report your findings.
Please remember for the class exercise you used the herf_cat as a hospital Herfindahl index. For this question make
sure to use herf_ins as Herfindahl index for insurance market.
Use the final exam dataset and then answer the following questions:
1) In a short paragraph explain the Herfindahl index you can use the reference provided in the class
exercise or any other citation.
2) Compare the following information between hospitals located in high, moderate and low competitive
health insurance markets?
o 2.1. What are the main significant differences between hospitals in different markets? (use
Anova test)
o 2.2. What is the impact of being in high-competitive health insurance market on hospital
revenues and cost?
o 2.3. Do you think being in high-competitive market has positive impact on net hospital
benefits?
o 2.4. What about the number of Medicare and Medicaid discharge? Do you think hospitals in
higher completive market more likely to accept more Medicare and Medicaid patients?
o 2.5. What is the impact of other variables?
(Note: to answer to the last question, please compute the ratio-Medicare-discharge and ratio-Medicaiddischarge first and then run 2 ttest) high vs. moderate and high vs. low competitive market), please
support your findings with box-plot).
Table 3. Comparing hospital characteristics and market
Hospital Characteristics
1. Hospital beds
2. Number of paid Employee
3. Number of non-paid Employee
4. Internes and Residents
5. System Membership
High Competitive
Market
N
Mean
STD
Moderate Competitive
Market
N
Mean
STD
Low Competitive
Market
N
Mean
STD
ANOVA/Chi
-Sq
(results)
6. Total hospital cost
7. Total hospital revenues
8. Hospital net benefit
9. Available Medicare days
10. Available Medicaid days
11. Total Hospital Discharge
12. Medicare discharge-ratio
13. Medicaid discharge-ratio
Socio-Economic Variables
14. Per Capita Hospital Beds to
Population
15. Median Household Income
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 3
Question #4 (Credits 20)- [RStudio Users]
Linear Regression Model
If you have chosen to work with RStudio, please run the following model and complete the following tables.
1st Model:
Run a linear model and predict the difference between hospital beds (use the bed-tot) and hospital’s ownership on hospital net-be
nefit? Discuss your finding, do you think having higher beds has positive impact on the hospital net benefit? What about the own
ership?
Benefit=F(b0+B1bed+b2Own2+B3ow2+b4own3+e)
Hospital Characteristics
Hospital beds
Ownership
For Profit
Non-for profit
Other
N
R-Squared
Model 1a
Coef.
St. Err
p-value
Ref.
Ref.
Ref.
Df+K+1
2nd Model:
Now, estimate the impact of being a member of a system on hospital net benefit? And discuss your finding (not more than 2
lines)? Is it significant?
Hospital Characteristics
Hospital beds
Ownership
For Profit
Non-for profit
Other
Membership
System Membership
N
R-Squared
Model 2
Coef.
St. Err
p-value
Ref.
Ref.
Ref.
3nd Model:
Now, include the ratio of ratio-Medicare-discharge and ratio-Medicaid-discharge in your model? How do you evaluate
the impact of having higher Medicare and Medicaid patients on hospital net benefit?
Hospital Characteristics
Hospital beds
Ownership
For Profit
Non-for profit
Other
Membership
System Membership
Socio-Economic Characteristics
Medicare discharge ratio
Medicaid discharge ratio
N
R-Squared
Model 3
Coef.
Ref.
St.
Err
pvalue
Ref.
Ref.
Based on your finding please recommend 3 policies to improve hospital performance, please make sure to use the
final model for your recommendation.
Discuss your findings.
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 4
Question #4 (Credits 20)- [Excel Users]
Linear Regression Model
If you have chosen to work with Excel, please run the models and complete the following tables.
Model 1:
Run a linear model and predict the difference between hospital beds (use the bed-tot) and hospital net-benefit in teac
hing hospitals?
Note: hospital-net-benefit= total_hosp_revenue – total_hosp_cost
Y(benefit), B0+B1(beds)
Hospital Characteristics
Model-1
Hospital beds
N
R Square
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Model 2:
Run a linear model and predict the difference between hospital beds (use the bed-tot) and hospital net-benefit in non
-teaching hospitals?
Use the results from model 1 and model 2 and compare the results between teaching and non-teaching hospitals.
Hospital Characteristics
Model-2
Hospital beds
N
R Square
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Model 3:
Now, include the ratio of ratio-Medicare-discharge and ratio-Medicaid-discharge in first model? How do you
evaluate the impact of having higher Medicare and Medicaid patients on hospital net-benefit in teaching hospitals?
Hospital Characteristics
Model-3
Hospital beds
ratio-Medicare-discharge
ratio-Medicaid-discharge
N
R Square
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Model 4:
Now, include the ratio of ratio-Medicare-discharge and ratio-Medicaid-discharge in first model? How do you
evaluate the impact of having higher Medicare and Medicaid patients on hospital net-benefit in non-teaching
hospitals?
Hospital Characteristics
Model-4
Hospital beds
ratio-Medicare-discharge
ratio-Medicaid-discharge
N
R Square
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Based on your finding please recommend 3 policies to improve hospital performance, please make sure to use the
final model for your recommendation.
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 5
Question #5 (Credits 20) [RStudio Users]
If you have chosen to work with RStudio, please run three models and complete the following tables.
Model 1: Run a logit model and use being a member of network and find out its impact on hospital beds and
hospital ownership? (Model 1)
Hospital Characteristics
Hospital beds
Ownership
For Profit
Non-for profit
Other
N
AIC
Coef.
St. Err
p-value
Ref.
Ref.
Ref.
Model 2: Now, include hospital revenue and report the Coeff.? (Model 2)
Hospital Characteristics
Hospital beds
Ownership
For Profit
Non-for profit
Other
Hospital revenue
N
AIC
Coef.
St. Err
p-value
Ref.
Ref.
Ref.
Model 3: Now, include the ratio of ratio-Medicare-discharge and ratio-Medicaid-discharge in your model? And
keep all variables you used for models 1, 2 & 3 and discuss your findings? Do you recommend keeping membership
for a hospital? Why or why not? (Model 3)
Hospital Characteristics
Hospital beds
Ownership
For Profit
Non-for profit
Other
Hospital revenue
Medicare discharge ratio
Medicaid discharge ratio
N
AIC
Coef.
St. Err
p-value
Ref.
Ref.
Ref.
Based on your finding please recommend 3 policies to improve hospital performance in hospitals, please make sure
to use the final model for your recommendation.
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 6
Question #5 (Credits 20)- [Excel users]
If you have chosen to work with Excel, please run above three models and complete the following tables.
Model 1: Run a regression model and use being a member of network and find out its impact on hospital cost?
(Model 1)
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Model-1
Hospital cost
N
R Square
Model 2: For the 2nd model run a regression model and use being a member of network and find out its impact on
hospital cost and hospital revenue? (Model 2)
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Model-2
Hospital cost
Hospital Revenue
N
R Square
Model 3: For the 3rd model run a regression model and use being a member of network and find out its impact on
ratio-Medicare-discharge and ratio-Medicaid-discharge.
Coef.
ST. ERR
T Stat
P-values
Lower 95%
Upper 95%
Model-3
Hospital cost
Hospital Revenue
Medicare discharge ratio
Medicaid discharge ratio
N
R Square
Based on your finding please recommend 3 policies and discuss the impact of being on a network on hospital cost,
hospital revenue and find out its impact on ratio-Medicare-discharge and ratio-Medicaid-discharge. Do you
recommend keeping membership for a hospital? Why or why not?
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 7
Question 6: (15 credits) [RStudio Users and Excel Users]
Note: Please limit your answer for each question to maximum 2 paragraphs and make sure to
support your findings with at least one citation – following APA 6.
1. Please offer a research question for the study using human subject research.
2. Explain the difference between the research process involving human subjects and the research
process not involving human subjects.
3. Discuss ethical implications surrounding human subject research studies.
4. Explain the governance of the human subject research studies over the data and the process.
5. Provide examples of the consequences for not meeting IRB (Institutional Review Board) protocol
requirements.
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 8
Useful formula and guideline.
The computation of the p-value is illustrated in following figures.
HMGT 400 Research and Data Analysis in Health—FINAL EXAM, 9
sink(“C:/UMUC/FinalExam.txt”)
##################
##################
# Q#1
##################
##################
# Step 1: Install package dplyr & read it
# install.packages(‘dplyr’)
library(dplyr)
# Step 2: Read your data
# Pl change the location of file, please see the following video to learn about the location of file in your computer.
hosp