Corpus-Assisted Research
After watching the recorded presentations, join these authors for a live panel discussion on December 4, 2020 at 8:30 am – 9:00 am (CST). Moderator: Kimberly Becker
Presenter | Abstract |
---|---|
Hong Ma and Jinglei Wang Assistant Professor | Assigning Students' Writing Samples to CEFR Levels Automatically: A Machin-learning Approach This project intends to propose a method of assigning students' writing samples to CEFR levels (Common European Framework of Reference for Languages) automatically. We believe that the method we proposed, relying on big data and machine-learning algorithm, will facilitate future endeavors in alignment and writing evaluation. |
Yongkook Won Visiting Researcher | Topic Modeling Analysis of Research Trends of English Language Teaching in Korea The goal of this study is to understand the research trends of English language teaching (ELT) in Korea for the last 20 years from 2000 to 2019. To this end, 11 major academic journals in Korea related to ELT were selected, and abstracts of 7,035 articles published in the journals were collected and analyzed. The number of articles published in the journals continued to increase from the first half of the 2000s to the first half of the 2010s, but decreased somewhat in the late 2010s. Text data in the abstracts were preprocessed using NLTK tokenizer (Bird, Loper, & Klein, 2009) and spaCy POS tagger (Honnibal & Montani, 2017), and only the nouns in the data were used for further analysis. Based on the previous studies on ELT research trends (Kim & Kim, 2015), 25 topics were extracted from abstracts of the articles by applying latent Dirichlet allocation (LDA) topic modeling with the R package topicmodels (Grün & Hornik, 2011). Teacher, tertiary education, listening, language testing, and curriculum appeared as topics that were frequently studied in the field of ELT. A result of time series regression analysis shows that rising topics include task-based learning, tertiary education, vocabulary, affective factors, and peer feedback, while falling topics include speaking, culture, and computer-assisted language learning (CALL) (at α = .001). |