
FOLLOWUS
College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Wei-ming LU, E-mail: luwm@zju.edu.cn
收稿:2018-06-08,
修回:2019-;8-14,
网络出版:2019-09-05,
纸质出版:2020-03
Scan QR Code
鲁伟明, 刘佳卉, 徐玮, 等. EncyCatalogRec:针对百科文章补全的目录推荐[J]. 信息与电子工程前沿(英文), 2020,21(3):436-447.
LU Wei-ming, LIU Jia-hui, XU Wei, et al. EncyCatalogRec: catalog recommendation for encyclopedia article completion[J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21(3): 436-447.
鲁伟明, 刘佳卉, 徐玮, 等. EncyCatalogRec:针对百科文章补全的目录推荐[J]. 信息与电子工程前沿(英文), 2020,21(3):436-447. DOI: 10.1631/FITEE.1800363.
LU Wei-ming, LIU Jia-hui, XU Wei, et al. EncyCatalogRec: catalog recommendation for encyclopedia article completion[J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21(3): 436-447. DOI: 10.1631/FITEE.1800363.
目前,在线百科(如维基百科等)已提供海量且主题多样的文章。然而,部分文章内容仍不够完善。本文提出EncyCatalogRec,一种能为百科文章推荐相关目录,从而帮助用户更好完善百科内容的系统。首先,将百科文章和目录项表达为内嵌向量,基于局部敏感哈希方法检索得到相关文章,并以这些文章的目录项为候选项;然后,基于检索得到的文章及其目录项构建关系图,进一步转为乘积图;在乘积图上,将目录推荐问题转为直推式学习问题;最后,基于学习排序算法对推荐得到的目录项排序。热启动和冷启动场景实验均证实,本文所提方法性能优于已有方法。最后通过示例验证了所提方法性能。
Online encyclopedias such as Wikipedia provide a large and growing number of articles on many topics. However
the content of many articles is still far from complete. In this paper
we propose EncyCatalogRec
a system to help generate a more comprehensive article by recommending catalogs. First
we represent articles and catalog items as embedding vectors
and obtain similar articles via the locality sensitive hashing technology
where the items of these articles are considered as the candidate items. Then a relation graph is built from the articles and the candidate items. This is further transformed into a product graph. So
the recommendation problem is changed to a transductive learning problem in the product graph. Finally
the recommended items are sorted by the learning-to-rank technology. Experimental results demonstrate that our approach achieves state-of-the-art performance on catalog recommendation in both warm- and cold-start scenarios. We have validated our approach by a case study.
S Banerjee , , , P Mitra . . Filling the gaps: improving Wikipedia stubs . . Proc ACM Symp on Document Engineering , , 2015a . . p.117 - - 120 . . DOI: 10.1145/2682571.2797073 http://doi.org/10.1145/2682571.2797073 . .
S Banerjee , , , P Mitra . . WikiKreator: improving Wikipedia stubs automatically . . Proc 53 rd Annual Meeting of the Association for Computational Linguistics and the 7 th Int Joint Conf on Natural Language Processing , , 2015b . . p.867 - - 877 . . DOI: 10.3115/v1/P15-1084 http://doi.org/10.3115/v1/P15-1084 . .
S Banerjee , , , P Mitra . . WikiWrite: generating Wikipedia articles automatically . . Proc 25 th Int Joint Conf on Artificial Intelligence , , 2016 . . p.2740 - - 2746 . . . .
C Bizer , , , J Lehmann , , , G Kobilarov , , , 等 . . DBpedia—a crystallization point for the web of data . . J Web Semant , , 2009 . . 7 ( ( 3 ): ): 154 - - 165 . . DOI: 10.1016/j.websem.2009.07.002 http://doi.org/10.1016/j.websem.2009.07.002 . .
M Datar , , , N Immorlica , , , P Indyk , , , 等 . . Locality-sensitive hashing scheme based on p -stable distributions . . Proc 20 th Annual Symp on Computational Geometry , , 2004 . . p.253 - - 262 . . DOI: 10.1145/997817.997857 http://doi.org/10.1145/997817.997857 . .
B Fetahu , , , K Markert , , , A Anand . . Automated news suggestions for populating Wikipedia entity pages . . Proc 24 th ACM Int Conf on Information and Knowledge Management , , 2015 . . p.323 - - 332 . . DOI: 10.1145/2806416.2806531 http://doi.org/10.1145/2806416.2806531 . .
M Gambhir , , , V Gupta . . Recent automatic text summarization techniques: a survey . . Artif Intell Rev , , 2017 . . 47 ( ( 1 ): ): 1 - - 66 . . DOI: 10.1007/s10462-016-9475-9 http://doi.org/10.1007/s10462-016-9475-9 . .
TH Haveliwala . . Topic-sensitive PageRank . . Proc 11 th Int Conf on World Wide Web , , 2002 . . p.517 - - 526 . . DOI: 10.1145/511446.511513 http://doi.org/10.1145/511446.511513 . .
XN He , , , LZ Liao , , , HW Zhang , , , 等 . . Neural collaborative filtering . . Proc 26 th Int Conf on World Wide Web , , 2017 . . p.173 - - 182 . . DOI: 10.1145/3038912.3052569 http://doi.org/10.1145/3038912.3052569 . .
J Hoffart , , , FM Suchanek , , , K Berberich , , , 等 . . YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia . . Artif Intell , , 2013 . . 194 28 - - 61 . . DOI: 10.1016/j.artint.2012.06.001 http://doi.org/10.1016/j.artint.2012.06.001 . .
T Joachims . . Optimizing search engines using clickthrough data . . Proc 8 th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining , , 2002 . . p.133 - - 142 . . DOI: 10.1145/775047.775067 http://doi.org/10.1145/775047.775067 . .
T Joachims . . Training linear SVMs in linear time . . Proc 12 th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining , , 2006 . . p.217 - - 226 . . DOI: 10.1145/1150402.1150429 http://doi.org/10.1145/1150402.1150429 . .
Y Koren , , , R Bell , , , C Volinsky . . Matrix factorization techniques for recommender systems . . Computer , , 2009 . . 42 ( ( 8 ): ): 30 - - 37 . . DOI: 10.1109/MC.2009.263 http://doi.org/10.1109/MC.2009.263 . .
QV Le , , , T Mikolov . . Distributed representations of sentences and documents . . Proc 31 st Int Conf on Machine Learning , , 2014 . . p.1188 - - 1196 . . . .
HX Liu , , , YM Yang . . Bipartite edge prediction via transductive learning over product graphs . . Proc 32 nd Int Conf on Machine Learning , , 2015 . . p.1880 - - 1888 . . . .
X Luo , , , MC Zhou , , , YN Xia , , , 等 . . An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems . . IEEE Trans Ind Inform , , 2014 . . 10 ( ( 2 ): ): 1273 - - 1284 . . DOI: 10.1109/TII.2014.2308433 http://doi.org/10.1109/TII.2014.2308433 . .
T Mikolov , , , I Sutskever , , , K Chen , , , 等 . . Distributed representations of words and phrases and their compositionality . . Proc 26 th Int Conf on Neural Information Processing Systems , , 2013a . . p.3111 - - 3119 . . . .
T Mikolov , , , K Chen , , , G Corrado , , , 等 . . Efficient estimation of word representations in vector space . . 2013b . . https://arxiv.org/abs/1301.3781 https://arxiv.org/abs/1301.3781 , , . .
R Reinanda , , , E Meij , , , M de Rijke . . Mining, ranking and recommending entity aspects . . Proc 38 th Int ACM SIGIR Conf on Research and Development in Information Retrieval , , 2015 . . p.263 - - 272 . . DOI: 10.1145/2766462.2767724 http://doi.org/10.1145/2766462.2767724 . .
C Sauper , , , R Barzilay . . Automatically generating Wikipedia articles: a structure-aware approach . . Proc 47 th Annual Meeting of the ACL and the 4 th Int Joint Conf on Natural Language Processing of the AFNLP , , 2009 . . p.208 - - 216 . . . .
M Strube , , , SP Ponzetto . . WikiRelate . . Computing semantic relatedness using Wikipedia. Proc 21 st National Conf on Artificial Intelligence , , 2006 . . p.1419 - - 1424 . . . .
FM Suchanek , , , G Kasneci , , , G Weikum . . YAGO: a core of semantic knowledge . . Proc 16 th Int Conf on World Wide Web , , 2007 . . p.697 - - 706 . . DOI: 10.1145/1242572.1242667 http://doi.org/10.1145/1242572.1242667 . .
S Tanaka , , , N Okazaki , , , M Ishizuka . . Learning web query patterns for imitating Wikipedia articles . . Proc 23 rd Int Conf on Computational Linguistics , , 2010 . . p.1229 - - 1237 . . . .
KL Wagstaff , , , E Riloff , , , NL Lanza , , , 等 . . Creating a Mars target encyclopedia by extracting information from the planetary science literature . . AAAI Workshop on Knowledge Extraction from Text , , 2016 . . p.532 - - 536 . . . .
E Wulczyn , , , R West , , , L Zia , , , 等 . . Growing Wikipedia across languages via recommendation . . Proc 25 th Int Conf on World Wide Web , , 2016 . . p.975 - - 985 . . DOI: 10.1145/2872427.2883077 http://doi.org/10.1145/2872427.2883077 . .
Y Zhao , , , G Karypis . . Evaluation of hierarchical clustering algorithms for document datasets . . Proc 11 th Int Conf on Information and Knowledge Management , , 2002 . . p.515 - - 524 . . DOI: 10.1145/584792.584877 http://doi.org/10.1145/584792.584877 . .
Y Zhao , , , G Karypis , , , U Fayyad . . Hierarchical clustering algorithms for document datasets . . Data Min Knowl Discov , , 2005 . . 10 ( ( 2 ): ): 141 - - 168 . . DOI: 10.1007/s10618-005-0361-3 http://doi.org/10.1007/s10618-005-0361-3 . .
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621