Removing Duplicates Method of English Tests

Shi-jiao ZHANG, Yuan SUN, Zhen ZHU

Abstract


This paper is an exploration to find a way to remove duplicates of English tests. Considering those duplicates of English tests, and it is very difficult to remove duplicates by manual work. So we use a method to remove duplicates of English tests. The English test was composed of three parts--question stem, options and answer analysis. Because different parts have different characters, we use different strategies in different parts. Firstly, we use word2vec method in answer analyses, synonyms’ distance calculation in options, and coincidences ratio calculation in question stem. Then we regard the answers of above as the variables in the multiple regression model. Finally, we achieve removing duplicates in English tests.

Keywords


Removing duplicates, Word2vec, Synonyms’ distance, Contact ratio


DOI
10.12783/dtcse/aics2016/8226

Refbacks

  • There are currently no refbacks.