[-] Show simple item record

dc.contributor.advisorLee, Yugyung, 1960-eng
dc.contributor.authorWalunj, Vijayeng
dc.date.issued2014-03-27eng
dc.date.submitted2013 Falleng
dc.descriptionTitle from PDF of title page, viewed on March 27, 2014eng
dc.descriptionThesis advisor: Yugyung Leeeng
dc.descriptionVitaeng
dc.descriptionIncludes bibliographical references (pages 53-56)eng
dc.descriptionThesis (M. S.)--School of Computing and Engineering. University of Missouri--Kansas City, 2013eng
dc.description.abstractDue to rapid exponential growth in data, a couple of challenges we face today are how to handle big data and analyze large data sets. An IBM study showed the amount of data created in the last two years alone is 90% of the data in the world today. We have especially seen the exponential growth of images on the Web, e.g., more than 6 billion in Flickr, 1.5 billion in Google image engine, and more than 1 billon images in Instagram [1]. Since big data are not only a matter of a size, but are also heterogeneous types and sources of data, image searching with big data may not be scalable in practical settings. We envision Cloud computing as a new way to transform the big data challenge into a great opportunity. In this thesis, we intend to perform an efficient and accurate classification of a large collection of images using Cloud computing, which in turn supports semantic image searching. A novel approach with enhanced accuracy has been proposed to utilize semantic technology to classify images by analyzing both metadata and image data types. A two-level classification model was designed (i) semantic classification was performed on a metadata of images using TF-IDF, and (ii) image classification was performed using a hybrid image processing model combined with Euclidean distance and SURF FLANN measurements. A Cloud-based Semantic Image Search Engine (CSISE) is also developed to search an image using the proposed semantic model with the dynamic image repository by connecting online image search engines that include Google Image Search, Flickr, and Picasa. A series of experiments have been performed in a large-scale Hadoop environment using IBM's cloud on over half a million logo images of 76 types. The experimental results show that the performance of the CSISE engine (based on the proposed method) is comparable to the popular online image search engines as well as accurate with a higher rate (average precision of 71%) than existing approacheseng
dc.description.tableofcontentsAbstract -- Contents -- Illustrations -- Tables -- Acknowledgements - Introduction -- Related work -- Cloud-based semantic image search engine model -- Cloud-based semantic image search engine (CSISE) implementation -- Experimental results and evaluation -- Conclusion and future work - Referenceseng
dc.format.extentxii, 57 pageseng
dc.identifier.urihttp://hdl.handle.net/10355/41489eng
dc.subject.lcshComputer scienceeng
dc.subject.lcshSearch engineseng
dc.subject.otherThesis -- University of Missouri--Kansas City -- Computer scienceeng
dc.titleCSISE: cloud-based semantic image search engineeng
dc.typeThesiseng
thesis.degree.disciplineComputer Science (UMKC)eng
thesis.degree.grantorUniversity of Missouri--Kansas Cityeng
thesis.degree.levelMasterseng
thesis.degree.nameM. S.eng


Files in this item

[PDF]

This item appears in the following Collection(s)

[-] Show simple item record