
Abstract:

This study uses text mining to measure the similarity between documents.

Here, the documents are syllabuses from the Department of Electrical and Computer Engineering at Gifu National College of Technology. The results are intended to help motivate students to learn.

The procedure is as follows.

  1. Extract keywords from each syllabus and calculate the importance of each word.
  2. Using the values obtained in step 1, calculate the similarity between two syllabuses in a vector space.
  3. Exclude words common to all the syllabuses from the extracted words to emphasize the similarities.
  4. Recalculate the similarities between the syllabuses, using the same method as in step 2.
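
The four steps above can be illustrated with a minimal sketch. It is not the implementation used in the study: the abstract does not say how importance is weighted or which similarity measure is used, so this sketch assumes plain term frequency as the importance value and cosine similarity as the vector-space measure. The function names and the three toy syllabuses are hypothetical.

```python
import math
from collections import Counter


def keyword_weights(text):
    """Step 1 (assumed weighting): term frequency stands in for importance."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Steps 2 and 4 (assumed measure): cosine of the angle between two word vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


def drop_common_words(vectors):
    """Step 3: remove every word that appears in all of the syllabuses."""
    common = set.intersection(*(set(v) for v in vectors.values()))
    return {
        name: Counter({w: c for w, c in v.items() if w not in common})
        for name, v in vectors.items()
    }


# Hypothetical toy data; real syllabuses would need proper keyword extraction.
syllabuses = {
    "circuits": "lecture exam circuit voltage current resistor",
    "electronics": "lecture exam circuit transistor diode amplifier",
    "programming": "lecture exam algorithm pointer array function",
}

vectors = {name: keyword_weights(text) for name, text in syllabuses.items()}
filtered = drop_common_words(vectors)

for x, y in [("circuits", "electronics"), ("circuits", "programming")]:
    before = cosine_similarity(vectors[x], vectors[y])
    after = cosine_similarity(filtered[x], filtered[y])
    print(f"{x} vs {y}: before={before:.3f} after={after:.3f}")
```

In this toy data, "circuits" and "programming" share only the words found in every syllabus, so their similarity falls to zero once those words are excluded, while "circuits" and "electronics" still share a subject-specific keyword.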

After the common words are excluded, the keywords reflect the characteristics of each syllabus more clearly than before. As a result, the calculated similarities become more reliable.





Deguchi Lab. March 5, 2010