[-] Show simple item record

dc.contributor.advisorLee, Yugyung, 1960-eng
dc.contributor.authorBavirisetty, Venkata Pramod Guptaeng
dc.date.issued2014-07-30eng
dc.date.submitted2014 Springeng
dc.descriptionTitle from PDF of title page, viewed on July 30, 2014eng
dc.descriptionThesis advisor: Yugyung Leeeng
dc.descriptionVitaeng
dc.descriptionIncludes bibliographical references (pages 62-65)eng
dc.descriptionThesis (M. S.)--School of Computing and Engineering. University of Missouri--Kansas City, 2014eng
dc.description.abstractAs huge amounts of data are created rapidly, the demand for the integration and analysis of such data has been growing steadily. It is especially essential to retrieve relevant and accurate evidence in healthcare and biomedical research. Even though query systems based on Ontology, Medical Subject Headings (MeSH), or keyword searches are available, query systems based on evidence and effective retrieval of data from large collections of clinical data are not sufficiently available. This thesis proposes a novel approach to analyze big data sets collected from Clinical trials research and discover significant evidence and association patterns with respect to conditions, treatment, and medication side effects. Our approach makes use of machine learning techniques in the Apache Hadoop framework with support from MetaMap and RxNorm. In this thesis, a heuristic measure of empirical evidence was newly designed considering the association degree of conditions, treatment, and medication side effects and the percentage of people affected. The Apriori algorithm was used to discover strong positive association rules with various measures including support, and confidence. We have examined a large and complex data set (12,327 study results) from clinicaltrials.gov and identified 8,291 strong association rules and 59,228 combinations with 432,841 subjects, 1761 conditions, 2836 drugs, and 27 side effects. The significance of these association patterns was evaluated in terms of the impact factor representing the percentage of the population with a high rate of side effects. Using these association rules and combination strengths, an evidence based query system was implemented to answer some integral questions. This query system also provided an interface to retrieve relevant publications from PubMed. The searching outcomes from this query system are compared with those from the PubMed search based on medical subject headings.eng
dc.description.tableofcontentsAbstract -- Illustrations -- Tables -- Introductions -- Related work -- Evidence based medical query model -- Implementation -- Results & Evaluation -- Conclusion and future work -- Referenceseng
dc.format.extentviii, 66 pageseng
dc.identifier.urihttp://hdl.handle.net/10355/43577eng
dc.subject.lcshEvidence-based medicine -- Data processingeng
dc.subject.lcshManagement information systemseng
dc.subject.otherThesis -- University of Missouri--Kansas City -- Computer scienceeng
dc.titleEvidence based medical query system on large scale dataeng
dc.typeThesiseng
thesis.degree.disciplineComputer Science (UMKC)eng
thesis.degree.grantorUniversity of Missouri--Kansas Cityeng
thesis.degree.levelMasterseng
thesis.degree.nameM. S.eng


Files in this item

[PDF]

This item appears in the following Collection(s)

[-] Show simple item record