dc.contributor.author: Giang, Nguyen Hoang
dc.date.accessioned: 2018-04-11T03:15:53Z
dc.date.accessioned: 2018-05-17T04:15:45Z
dc.date.available: 2018-04-11T03:15:53Z
dc.date.available: 2018-05-17T04:15:45Z
dc.date.issued: 2016
dc.identifier.other: 022003045
dc.identifier.uri: http://10.8.20.7:8080/xmlui/handle/123456789/2471
dc.description.abstract: Web document modelling plays an important role in many fields, including data mining and information retrieval, with applications in search engines, web rating systems, and web recommendation systems. Documents vary in content and representation, so a common model is needed to extract information effectively and efficiently. This study aims to create an efficient web document modelling scheme from which many systems can benefit, such as ranking systems, text classification systems, and web recommendation systems. The scheme is based on the vector space model (VSM) with TF-IDF weighting. Cosine similarity is used to measure the similarity between different documents, and WordNet is used to help interpret queries. Two modelling schemes are tested: one that does not use WordNet and one that uses WordNet for query expansion. The results are evaluated using precision computed over the top-20 search results [1]. Precision is around 80% for simple queries and lower for more complex ones. The WordNet-based method works better with long queries. Search time ranges from 0.2 to 0.5 seconds. (en_US)
dc.description.sponsorship: Dr. Nguyen Thi Thanh Sang (en_US)
dc.language.iso: en_US (en_US)
dc.publisher: International University - HCMC (en_US)
dc.subject: Web document modeling; Vector space model (en_US)
dc.title: Building a web document modeling tool (en_US)
dc.type: Thesis (en_US)
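
The pipeline named in the abstract (TF-IDF weighting in a vector space model, cosine-similarity ranking, and precision over the top-k results) can be illustrated with a minimal sketch. This is not the thesis implementation: the whitespace tokenizer, the smoothed IDF formula, and the function names `build_idf`, `tfidf_vector`, `cosine`, `search`, and `precision_at_k` are all assumptions made for illustration, and the WordNet query-expansion step is left out.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the thesis may use
    # a different preprocessing pipeline (stemming, stop words, etc.).
    return text.lower().split()


def build_idf(docs_tokens):
    # Smoothed inverse document frequency (one common variant, assumed here).
    n = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(tokens, idf):
    # Sparse TF-IDF vector as a dict; terms unseen in the corpus are dropped.
    tf = Counter(tokens)
    return {t: count * idf[t] for t, count in tf.items() if t in idf}


def cosine(u, v):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, docs, k=20):
    # Rank documents by cosine similarity to the query; return the indices
    # of the top-k documents with a non-zero score.
    docs_tokens = [tokenize(d) for d in docs]
    idf = build_idf(docs_tokens)
    doc_vecs = [tfidf_vector(t, idf) for t in docs_tokens]
    q_vec = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(q_vec, dv), i) for i, dv in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for score, i in scored[:k] if score > 0.0]


def precision_at_k(ranked, relevant, k=20):
    # Fraction of the top-k slots filled by relevant documents
    # (conventional definition: divide by k, not by the number retrieved).
    hits = sum(1 for i in ranked[:k] if i in relevant)
    return hits / k
```

A call such as `search("web search", docs, k=20)` followed by `precision_at_k(ranked, relevant, k=20)` mirrors the top-20 evaluation described in the abstract, with `relevant` being a hand-labelled set of document indices.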

