Removing Duplicates Method of English Tests
Abstract
This paper is an exploration to find a way to remove duplicates of English tests. Considering those duplicates of English tests, and it is very difficult to remove duplicates by manual work. So we use a method to remove duplicates of English tests. The English test was composed of three parts--question stem, options and answer analysis. Because different parts have different characters, we use different strategies in different parts. Firstly, we use word2vec method in answer analyses, synonyms’ distance calculation in options, and coincidences ratio calculation in question stem. Then we regard the answers of above as the variables in the multiple regression model. Finally, we achieve removing duplicates in English tests.
Keywords
Removing duplicates, Word2vec, Synonyms’ distance, Contact ratio
DOI
10.12783/dtcse/aics2016/8226
10.12783/dtcse/aics2016/8226
Refbacks
- There are currently no refbacks.