Automatic annotation of single documents

Abstract

Sandul Mikhail. Automatic Annotation of Single Documents. Scientific supervisor: Elena Mikhailova. Department of Mathematics and Mechanics, Sub-Department of Analytical Information Systems.

This work examines several approaches to automatic keyword extraction. For two of them, the effectiveness on Russian-language texts was analyzed in detail. As a result, a new algorithm was proposed.

Number of references: 10

1) Barsegyan A.A., Kupriyanov M.S., Kholod I.I., et al. Analysis of Data and Processes (in Russian). BHV-Petersburg, 2009. 510 pp.
2) Michael Berry, Jacob Kogan. Text Mining: Applications and Theory. John Wiley and Sons, Ltd., 2010. 205 pp.
3) Rada Mihalcea, Paul Tarau. TextRank: Bringing Order into Texts // Proceedings of EMNLP 2004 (ed. Lin D. and Wu D.), pp. 404-411.
4) Siddiqi S., Sharan A. Keyword and keyphrase extraction from single Hindi document using statistical approach // 2015 2nd International Conference on Signal Processing and Integrated Networks (SPIN), 2015, pp. 713-718.
5) Juan P. Herrera, Pedro A. Pury. Statistical Keyword Detection in Literary Corpora // The European Physical Journal B, 2008, pp. 135-146.
6) Xinghua Hu, Bin Wu. Automatic Keyword Extraction Using Linguistic Features // Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006, pp. 19-23.
7) Zhengyang Liu, Jianyi Liu, Wenbin Yao, Cong Wang. Keyword Extraction Using PageRank on Synonym Networks // 2010 International Conference on E-Product E-Service and E-Entertainment, 2010, pp. 1-4.
8) Jones S., Paynter G. Automatic extraction of document keyphrases for use in digital libraries: evaluation and applications // Journal of the American Society for Information Science and Technology, 2002.
9) Gutwin C., Paynter G., Witten I., Nevill-Manning C., Frank E. Improving browsing in digital libraries with keyphrase indexes // Decision Support Systems 27(1-2), 1999, pp. 81-104.
10) Sharov S.A. Frequency Dictionary (in Russian) [Electronic resource]. URL: http://www.artint.ru/projects/frqlist.php (accessed 20.05.2017).
