next up previous contents
Next: 目次   目次

概要:

This study examines the similarities of documents by text mining.

Here, the documents are syllabuses from the department of electrical and computer engineering in Gifu National College of Technology.

The results will help motivating students to learn.

To calculate the degree of similarity, the following methods are adopted to our school's syllabuses.

The procedure is as follows.

  1. Use direct term matching method.
  2. Use Latent Semantic Analysis.

The ideal pretreatment for a syllabus was carried out by hand work.

Processed documents led to better results than not-processed ones did.

Next, the degree of similarity was calculated with and without compound words.

As a result, there is an advantage using compound words.





Deguchi Lab. 2012年3月9日