研究生: |
吳建良 Chien-Liang Wu |
---|---|
論文名稱: |
依使用者查詢及回饋探勘同類型喜好之網頁推薦系統 A Web Page Recommendation System Based on Clusters of Query Interest |
指導教授: | 柯佳伶 |
學位類別: |
碩士 Master |
系所名稱: |
資訊教育研究所 Graduate Institute of Information and Computer Education |
論文出版年: | 2002 |
畢業學年度: | 90 |
語文別: | 中文 |
論文頁數: | 57 |
中文關鍵詞: | 網頁探勘 、群集分析 、關聯規則 、合作式過濾 、推薦系統 |
英文關鍵詞: | Web Mining, Clustering Analysis, Association Rule, Collaborative Filtering, Recommendation System |
論文種類: | 學術論文 |
相關次數: | 點閱:188 下載:13 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出建構一個網頁查詢推薦系統之相關技術。此系統從使用者查詢關鍵字及瀏覽網頁的記錄中過濾出有用的資訊,以使用者查詢關鍵字、瀏覽網頁、以及其回饋值構成使用者查詢交易記錄單位,並記錄在使用者側寫之中。本論文提出瀏覽關聯二分圖群集演算法,以二元位元向量來加快查詢關鍵字相似度的計算,找出查詢關鍵字和瀏覽網頁的聚落(查詢喜好聚落)。具有多種不同類型喜好的使用者可分屬於多個查詢喜好聚落中,使得以合作式過濾做推薦時,具有部份喜好相同的使用者瀏覽行為也能提供重要的推薦資訊。接下來,系統根據查詢喜好聚落將使用者查詢交易記錄做分割,針對每個分割探勘出查詢關鍵字與網頁文章之間的關聯規則,並以使用者瀏覽網頁回饋值計算出關聯規則的支持度與確信度。最後根據這些探勘所得的資訊,對於會員使用者,系統以合作式過濾方式,推薦使用者側寫所包含的查詢喜好聚落中相關性高的網頁文章給使用者。對於匿名使用者,系統則提供如搜尋引擎以關鍵字搜尋的方式,依據查詢關鍵字所屬聚落中推薦相關性高的網頁,使查詢結果更精簡並切合使用者需求。
Most previous works on recommendation systems of web pages were designed based on collaborative filtering according to the clusters of user browsing behavior. In these approaches, a user only belongs to certain one cluster. If most users have multiple kinds of browsing interests, the number of users in the same cluster will be small and the information used for recommendation is limited. In addition, the information of users who have partially similar behavior is not considered. In this thesis, the strategies for constructing a query and recommendation system of web pages are proposed. First, the query keywords, browsed web pages, and user feedback values are extracted from web logs to be query transactions. A clustering algorithm is proposed to find the clusters of queries and related web pages, called the clusters of query interest , from the query transactions. A user who has multiple kinds of query interests can belong to more than one cluster. Then user query transactions are partitioned based on the clusters of query interest. In each partition, the association rules of queries and web pages are mined, where the support and confidence of rules are computed based on feedback values of users. According to the mined information, two main functions are provided in the system. A member user can ask a recommendation request. Based on clusters of query interest contained in the user profile, the highly associated web pages are recommended. On the other hand, an anonymous user can ask a query recommendation request to the system by giving query keywords. According to the cluster of query interest that the query keywords belong to, the highly associated web pages are returned as query results. Therefore, the query results will be more simplified and meet the requirements of most users.
[1] R. Agarwal, and R. Srikant, “Fast Algorithm for Mining Association Rule in Large Databases,” in Proceeding of The 20th International Conference on Very Large DataBases, 1994.
[2] D. Barbara, and P. Chen, “Using the Fractal Dimension to Cluster Datasets”, in Proceeding of the 6th International Conference on Knowledge Discovery and Data mining, ACM SIGKDD, 2000.
[3] D. Beeferman, and A. Berger, “Agglomerative Clustering of A Search Engine Query Log,“ in Proceedings of the 6th International Conference on Knowledge Discovery and Data Mmining, ACM SIGKDD, 2000.
[4] C.H. Cheng , A.W. Fu, and Y. Zhang, “Entropy-based Subspace Clustering for Mining Numerical Data,” in Proceeding of International Conference on Knowledge Discovery and Data Mining, ACM SIGKDD, 1999.
[5] R. Cooley , J. Srivastava, M. Deshpande, and P.N. Tan, “Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data,” To appear in SIGKDD Explorations, Vol. 1, Issue 2, 2000.
[6] R. Cooley, B. Mobasher, and J. Srivastava, “Web Mining:Information and Pattern Discovery on the World Wide Web,” in Proceeding of the 9th International Conference on Tools with Artificial Intelligence, IEEE, 1999.
[7] I.S. Dhillon, “Co-Clustering Documents and Words Using Bipartite Spectral Graph Partitioning,” in Proceeding of the 7th International Conference on Knowledge Discovery and Data Mining, ACM SIGKDD, 2001.
[8] X. Fu, J. Budzik, and K.J. Hammond, “Mining Navigation History for Recommendation,” in Proceeding of International Conference on Intelligent User Interfaces, ACM, 2000.
[9] S. Guha, R. Rastogi, and K. Shim, “ROCK:A Robust Clustering Algorithm for Categorical Attributes,” in Proceeding of the 15th International Conference on Data Engineering, IEEE, 1998.
[10] H.J. Kim, and S.G. Lee, “A Semi-Supervised Document Clustering Technique for Information Organization,” in Proceeding of the 9th International Conference on Information and Knowledge Management, 2000.
[11] M. Kwak, and D.S. Cho, “Collaborative Filtering with Automatic Rating for Recommendation,” in Proceedings of International Symposium on Industrial Electronics, Volume: 1 , 2001.
[12] B. Mobasher, H. Dai, T. Luo, and M. Nakagawa, “Improving the Effectiveness of Collaborative Filtering on Anonymous Web Usage Data,” Technical Reports, March 2001.
[13] B. Mobasher, R. Cooley, and J. Srivastava, “Creating Adaptive Web Sites Through Usage-Based Clustering of URLs,” in Proceedings of the Workshop on Knowledge and Data Engineering Exchange, 1999.
[14] D.S. Modha, and W.S. Spangler, “Clustering Hypertext with Applications to Web Searching,” in Proceeding of the 11th on Hypertext and hypermedia, ACM, 2000.
[15] Z. Su, Q. Yang, H. Zhang, X. Xu, and Y. Hu, “Correlation-based Document Clustering using Web Logs,” in Proceeding of the 34th Annual Hawaii International Conference on System Sciences, 2001.
[16] K. Wang, C. Xu, and B. Liu, “Clustering Transactions Using Large Items,” in Proceeding of the 8th International Conference on Information and Knowledge Management, ACM, 1999.
[17] J. Wen, J.Y. Nie, and H.J. Zhang, “Clustering User Queries of A Search Engine,” in Proceedings of the 10th International World Wide Web Conference, 2001.
[18] Y.H. Wu, Y.C. Chen, and A.L.P. Chen, “Enabling Personalized Recommendation on the Web based on User Interests and Behaviors,“ in Proceeding of International Workshop on Research Issues in Data Engineering , RIDE IEEE, 2001.
[19] K.L. Wu, C.C. Aggarwal, and P.S. Yu, “Personalization with Dynamic Profiler,” Advanced Issues of E-Commerce and Web-Based Information Systems, in Proc. of the 3th International Workshop on Computer Networks and Mobile Computing(WECWIS), 2001.
[20] J. Xiao, and Y. Zhang, “Clustering of Web Users Using Session-Based Similarity Measures,” in Proceedings of International Conference on Computer Networks and Mobile Computing, 2001.