机构:
Univ South Australia, Sch Informat Technol & Math Sci, GPO Box 2471, Adelaide, SA 5001, AustraliaUniv South Australia, Sch Informat Technol & Math Sci, GPO Box 2471, Adelaide, SA 5001, Australia
Kolyshkina, Inna
[1
]
Brownlow, Marcus
论文数: 0引用数: 0
h-index: 0
机构:
Brownlow Consulting, Adelaide, SA 5006, AustraliaUniv South Australia, Sch Informat Technol & Math Sci, GPO Box 2471, Adelaide, SA 5001, Australia
Brownlow, Marcus
[2
]
Taylor, Jarrad
论文数: 0引用数: 0
h-index: 0
机构:
Business Process Visualisat Australia, Adelaide, SA 5000, AustraliaUniv South Australia, Sch Informat Technol & Math Sci, GPO Box 2471, Adelaide, SA 5001, Australia
Taylor, Jarrad
[3
]
机构:
[1] Univ South Australia, Sch Informat Technol & Math Sci, GPO Box 2471, Adelaide, SA 5001, Australia
[2] Brownlow Consulting, Adelaide, SA 5006, Australia
[3] Business Process Visualisat Australia, Adelaide, SA 5000, Australia
来源:
2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW)
|
2013年
关键词:
random forests;
generalised boosted models;
multivariate adaptive regression splines;
R;
Tableau;
early childhood development;
AEDI;
open data;
GovHack;
social health atlas;
D O I:
10.1109/ICDMW.2013.61
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
This article describes the results of a data mining project designed to explore the key drivers of the Australian Early Development Index (AEDI), a numerical indicator of early childhood development vulnerability. The work was conducted during GovHack 2013, a 48-hour Australian Open Data competition where participants were required to use published open data sets provided by various Australian government and other agencies. We applied advanced machine learning techniques (random forests, generalised boosted regression models and multivariate adaptive regression splines) to the South Australian state and national data to gain insights into the key drivers of AEDI and to quantify the levers that the state government, community and individuals could apply to improve the situation. We found that after accounting for the population specifics and socioeconomic conditions, for example unemployment level and Index of Relative Socioeconomic Disadvantage, the most important factors impacting early childhood development were lack of motor vehicle in the household, inability to afford buying medication and maternal smoking during pregnancy. We quantified the impact of each of these factors and suggested relevant potential Government interventions. We then visualised our findings and created a Web app that allowed various intervention strategies to be interactively explored, based on the derived relationship between early child development index and its key drivers.