Deep Learning for Discussion-Based Cross-Domain Performance Prediction of MOOC Learners Grouped by Language on FutureLearn

Ismail Duru, Ayse Saliha Sunar, Su White, Banu Diri

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

Analysing learners’ behaviours in MOOCs has been used to identify predictive features associated with positive outcomes in engagement and learning success. Early methods predominantly analysed numerical features of behaviours such as the page views, video views, and assessment grades. Analysing extracted numeric features using baseline machine learning algorithms performed well to predict the learners’ future performance in MOOCs. We propose categorising learners by likely English language proficiency and extending the range of data to include the content of comment texts. We compare results to a model trained with a combined set of extracted features. Not all platforms provide this rich variety of data. We analysed a series of a FutureLearn language focused MOOCs. Our data were from discussions embedded into each lesson’s content. Analysing whether we gained any additional insights, over 420,000 comments were used to train the algorithm. We created a method for identifying one’s possible first language from their country. We found that using comments alone is a weaker predictive approach than using a combination including extracted features from learners’ activities. Our study contributes to research on generalisability of learning algorithms. We replicated the method across different MOOCs—the performance varies on the model though it always remained over 50%. One of the deep learning architecture, Bidirectional LSTM, trained with discussions on the language learning 73% successfully predicted learners’ performance on a different MOOC.

Original languageEnglish
Pages (from-to)3613-3629
Number of pages17
JournalArabian Journal for Science and Engineering
Volume46
Issue number4
DOIs
Publication statusPublished - Apr 2021
Externally publishedYes

Keywords

  • Deep learning
  • English as a second language
  • FutureLearn
  • MOOCs
  • Natural language processing
  • Predictive models

Fingerprint

Dive into the research topics of 'Deep Learning for Discussion-Based Cross-Domain Performance Prediction of MOOC Learners Grouped by Language on FutureLearn'. Together they form a unique fingerprint.

Cite this