Extraction Tools for Hypertension Biomedical Literature Classification
Textual information is conveyed through words and characters, which makes it easy for humans to understand. Text mining emerged as a way to extract this kind of information. Today, extraction can be carried out with a tool rather than manually. Keyword extraction tools can surface the relevant keywords, helping readers focus on the documents that matter and set the rest aside. According to the Centers for Disease Control and Prevention, hypertension is one of the most common diseases in the world today and one of the leading causes of death for Americans, since it increases the risk of heart disease and stroke. It is known as the “silent killer” because it commonly gives no warning signs or symptoms. Therefore, the purpose of this research paper is to perform and compare keyword extraction using statistical and linguistic extraction tools for classifying biomedical literature on hypertension. RStudio, a statistical-based tool, and TerMine, a linguistic-based tool, were used to demonstrate the process of extracting the specified keywords from the biomedical literature. A classification evaluation using a Naïve Bayes classifier was then carried out to evaluate and compare the performance of both tools. The experimental results show the differences between the two tools in performing keyword extraction. Finally, a comparison of both tools in extracting keywords for the treatment of hypertension was obtained.
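The extract-then-classify pipeline described above can be illustrated with a minimal sketch. It is not the authors' RStudio or TerMine workflow: TF-IDF is swapped in here as one common statistical keyword-scoring scheme, the linguistic extraction step is not reproduced, and the toy corpus, labels, stopword list, and all function names are assumptions made up for illustration. The extracted keywords are then used as features for a multinomial Naïve Bayes classifier with add-one smoothing.

```python
import math
from collections import Counter

# Toy corpus invented for illustration; NOT data from the study.
DOCS = [
    ("ace inhibitors lower blood pressure in hypertension treatment", "treatment"),
    ("diuretics and beta blockers are used for hypertension treatment", "treatment"),
    ("calcium channel blockers reduce blood pressure", "treatment"),
    ("obesity and salt intake are risk factors for hypertension", "other"),
    ("hypertension prevalence increases with age", "other"),
    ("smoking and stress raise cardiovascular risk", "other"),
]

# Hypothetical, deliberately tiny stopword list.
STOPWORDS = {"and", "are", "for", "in", "with", "the", "of", "used"}


def tokenize(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return [w for w in text.lower().split() if w not in STOPWORDS]


def tfidf_keywords(texts, top_k=3):
    """Return the top_k highest-scoring TF-IDF terms for each text."""
    tokenized = [tokenize(t) for t in texts]
    n_docs = len(tokenized)
    # Document frequency: in how many texts each term occurs.
    df = Counter(term for toks in tokenized for term in set(toks))
    keywords = []
    for toks in tokenized:
        tf = Counter(toks)
        scores = {t: (c / len(toks)) * math.log(n_docs / df[t])
                  for t, c in tf.items()}
        # Sort by descending score; break ties alphabetically.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        keywords.append(ranked[:top_k])
    return keywords


def train_nb(samples):
    """Collect the counts a multinomial Naive Bayes model needs.

    samples is a list of (tokens, label) pairs.
    """
    class_counts = Counter(label for _, label in samples)
    word_counts = {label: Counter() for label in class_counts}
    vocab = set()
    for toks, label in samples:
        word_counts[label].update(toks)
        vocab.update(toks)
    return class_counts, word_counts, vocab


def predict_nb(model, tokens):
    """Return the label with the highest log-posterior (add-one smoothing)."""
    class_counts, word_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for t in tokens:
            if t in vocab:  # terms never seen in training are ignored
                score += math.log((word_counts[label][t] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Extract keywords per document, then train the classifier on keywords only.
keywords = tfidf_keywords([text for text, _ in DOCS])
model = train_nb([(kw, label) for kw, (_, label) in zip(keywords, DOCS)])
prediction = predict_nb(model, tokenize("beta blockers for blood pressure"))
```

Comparing two extraction tools in this setup would amount to replacing `tfidf_keywords` with each tool's output in turn and measuring the classifier's performance on held-out documents.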