Abstract
Background: As a known keyphrase extraction algorithm, TextRank is an analog of the PageRank algorithm, which relies heavily on the statistics of term frequency in the manner of cooccurrence analysis.
Objective: The frequency-based characteristic made it a bottleneck for performance enhancement, and various improved TextRank algorithms were proposed in recent years. Most of the improvements incorporated semantic information into the keyphrase extraction algorithm and achieved improvement.
Method: In this research, taking both syntactic and semantic information into consideration, we integrated syntactic tree algorithm and word embedding and put forward an algorithm of Word Embedding and Syntactic Information Algorithm (WESIA), which improved the accuracy of the TextRank algorithm.
Results: By applying our method on a self-made test set and a public test set, the result implied that the proposed unsupervised keyphrase extraction algorithm outperformed the other algorithms to some extent.
Keywords: Key phrases, Syntactic distance, Word embedding, Algorithm, TextRank, Word Embedding and Syntactic Information Algorithm (WESIA).
Recent Advances in Computer Science and Communications
Title:Keyphrase Extraction by Improving TextRank with an Integration of Word Embedding and Syntactic Information
Volume: 14 Issue: 9
Author(s): Sheng Zhang, Qi Luo, Yukun Feng, Ke Ding, Daniela Gifu, Silan Zhang*, Xiaohang Ma and Jingbo Xia
Affiliation:
- College of Science, Huazhong Agricultural University, 430070, Wuhan, Hubei, P.R. China
Keywords: Key phrases, Syntactic distance, Word embedding, Algorithm, TextRank, Word Embedding and Syntactic Information Algorithm (WESIA).
Abstract: Background: As a known keyphrase extraction algorithm, TextRank is an analog of the PageRank algorithm, which relies heavily on the statistics of term frequency in the manner of cooccurrence analysis.
Objective: The frequency-based characteristic made it a bottleneck for performance enhancement, and various improved TextRank algorithms were proposed in recent years. Most of the improvements incorporated semantic information into the keyphrase extraction algorithm and achieved improvement.
Method: In this research, taking both syntactic and semantic information into consideration, we integrated syntactic tree algorithm and word embedding and put forward an algorithm of Word Embedding and Syntactic Information Algorithm (WESIA), which improved the accuracy of the TextRank algorithm.
Results: By applying our method on a self-made test set and a public test set, the result implied that the proposed unsupervised keyphrase extraction algorithm outperformed the other algorithms to some extent.
Export Options
About this article
Cite this article as:
Zhang Sheng , Luo Qi, Feng Yukun , Ding Ke , Gifu Daniela , Zhang Silan *, Ma Xiaohang and Xia Jingbo , Keyphrase Extraction by Improving TextRank with an Integration of Word Embedding and Syntactic Information, Recent Advances in Computer Science and Communications 2021; 14 (9) . https://dx.doi.org/10.2174/2666255813999200820155846
DOI https://dx.doi.org/10.2174/2666255813999200820155846 |
Print ISSN 2666-2558 |
Publisher Name Bentham Science Publisher |
Online ISSN 2666-2566 |
Call for Papers in Thematic Issues
Advanced Applications of Artificial Intelligence in Manufacturing Technologies
As one of the most advanced fields of study and technology in existence today, artificial intelligence (AI) is finding more and more applications in production and daily life, especially in the industrial sector. This showcases the many applications of AI in mechanical production, including but not limited to: improving worker ...read more
Advanced integration of computer vision and AI algorithms for automated applications in vehicles
Automation is a key component of present automobile industry to enhance the innovation through artificial intelligence. Intelligent automation in the autonomous vehicles can replace humans and provide better safety movements of vehicles. Global shift towards human to automation needs high risk mitigation technologies. The vast challenges of autonomous vehicles are ...read more
Advances in Biomechanics and Biomedical Engineering
Advances in biomechanics and biomedical engineering have revolutionized the way we understand and treat various medical conditions. Biomechanics is the study of the mechanical aspects of living organisms, while biomedical engineering is the application of engineering principles to the field of medicine. Together, these disciplines have led to groundbreaking innovations ...read more
Advancing Computer Vision and Multimedia Communication for Seamless Human-Machine Interaction
The rapid advancements in computer vision and multimedia communication technologies are revolutionizing the way humans interact with machines. These technologies have the potential to enable seamless and natural human-machine interaction, creating new possibilities for communication, collaboration, and entertainment. The findings will have a significant impact on the development of new ...read more
Related Journals
- Author Guidelines
- Graphical Abstracts
- Fabricating and Stating False Information
- Research Misconduct
- Post Publication Discussions and Corrections
- Publishing Ethics and Rectitude
- Increase Visibility of Your Article
- Archiving Policies
- Peer Review Workflow
- Order Your Article Before Print
- Promote Your Article
- Manuscript Transfer Facility
- Editorial Policies
- Allegations from Whistleblowers