TY - JOUR
T1 - Recent Contributions of Data Mining to Language Learning Research
AU - Warschauer, Mark
AU - Yim, Soobin
AU - Lee, Hansol
AU - Zheng, Binbin
N1 - Publisher Copyright:
© Cambridge University Press 2019.
PY - 2019
Y1 - 2019
N2 - This paper will review the role of data mining in research on second language learning. Following a general introduction to the topic, three areas of data mining research will be summarized - clustering techniques, text-mining, and social network analysis - with examples from both the broader field and studies conducted by the authors. The application of data mining in second language learning research is relatively new, and more theoretical and empirical support is needed in the appropriate collection, use, and interpretation of data for specific research and pedagogical objectives. The three examples that we introduce illustrate how new data sources accessible in online environments can be analyzed to better understand the optimal instructional context for corpus-based vocabulary learning (clustering technique), characteristics and patterns of collaborative written interaction using Google Docs (text mining and visualizations), and issues of access and community in computer-mediated discussion (social network analysis). Implications of these new techniques for L2 research will be discussed.
AB - This paper will review the role of data mining in research on second language learning. Following a general introduction to the topic, three areas of data mining research will be summarized - clustering techniques, text-mining, and social network analysis - with examples from both the broader field and studies conducted by the authors. The application of data mining in second language learning research is relatively new, and more theoretical and empirical support is needed in the appropriate collection, use, and interpretation of data for specific research and pedagogical objectives. The three examples that we introduce illustrate how new data sources accessible in online environments can be analyzed to better understand the optimal instructional context for corpus-based vocabulary learning (clustering technique), characteristics and patterns of collaborative written interaction using Google Docs (text mining and visualizations), and issues of access and community in computer-mediated discussion (social network analysis). Implications of these new techniques for L2 research will be discussed.
UR - http://www.scopus.com/inward/record.url?scp=85069784452&partnerID=8YFLogxK
U2 - 10.1017/S0267190519000023
DO - 10.1017/S0267190519000023
M3 - Review article
AN - SCOPUS:85069784452
SN - 0267-1905
VL - 39
SP - 93
EP - 112
JO - Annual Review of Applied Linguistics
JF - Annual Review of Applied Linguistics
ER -