
dc.contributor.advisor    Scott, Grant (Grant J.)    eng
dc.contributor.author    Yang, Alex (M.S. in computer science)    eng
dc.date.issued    2017    eng
dc.date.submitted    2017 Fall    eng
dc.description    Field of study: Computer science.    eng
dc.description    Dr. Grant Scott, Thesis Supervisor.    eng
dc.description    "December 2017."    eng
dc.description.abstract    Depth estimation from single monocular images is a theoretical challenge in computer vision as well as a computational challenge in practice. This thesis addresses the problem of depth estimation from single monocular images using a deep convolutional neural fields framework, which consists of convolutional feature extraction, superpixel dimensionality reduction, and depth inference. Data were collected using a stereo vision camera, which generated, through triangulation, depth maps paired with visual images. The visual image (input) and computed depth map (desired output) are used to train the model, which achieved 83 percent test accuracy at the standard 25 percent tolerance. The problem is formulated as depth regression over superpixels, and our technique surpasses existing state-of-the-art approaches in its demonstrated generalization ability, high prediction accuracy, and real-time processing capability. We utilize the VGG-16 deep convolutional network as the feature extractor and conditional random fields for depth inference. A multi-phase training protocol that includes transfer learning and network fine-tuning leads to high accuracy. Our framework is modular: each component can be replaced with a different implementation for maximum extensibility. Additionally, our GPU-accelerated implementation of superpixel pooling further facilitates this extensibility by allowing incorporation of feature tensors with flexible shapes, and it provides both space and time optimization. Based on these contributions and high-performance computing methodologies, the model achieves a minimal, optimized design. It is capable of operating at 30 fps, a critical step toward empowering real-world applications such as autonomous vehicles with passive, single-camera relative depth perception for obstacle avoidance, environment mapping, and related tasks.    eng
dc.description.bibref    Includes bibliographical references (pages 61-65).    eng
dc.format.extent    1 online resource (viii, 65 pages) : illustrations (chiefly color)    eng
dc.identifier.merlin    b129592006    eng
dc.identifier.oclc    1101434765    eng
dc.identifier.uri    https://hdl.handle.net/10355/63579
dc.identifier.uri    https://doi.org/10.32469/10355/63579    eng
dc.language    English    eng
dc.publisher    University of Missouri--Columbia    eng
dc.relation.ispartofcommunity    University of Missouri--Columbia. Graduate School. Theses and Dissertations    eng
dc.rights    OpenAccess.    eng
dc.rights.license    This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License.
dc.source    Submitted to University of Missouri--Columbia Graduate School.    eng
dc.title    Relative depth estimation from single monocular images with deep convolutional network    eng
dc.type    Thesis    eng
thesis.degree.discipline    Computer science (MU)    eng
thesis.degree.grantor    University of Missouri--Columbia    eng
thesis.degree.level    Masters    eng
thesis.degree.name    M.S.    eng
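
The superpixel pooling step described in the abstract (reducing a per-pixel CNN feature map to one feature vector per superpixel) can be sketched as follows. This is a minimal NumPy sketch of average pooling over superpixel regions, not the thesis's GPU-accelerated implementation; the function and variable names are assumptions for illustration.

```python
import numpy as np

def superpixel_pool(features, labels):
    """Average-pool a per-pixel feature map over superpixel regions.

    features: (H, W, C) array of per-pixel CNN features.
    labels:   (H, W) integer array assigning each pixel to a superpixel (0..S-1).
    Returns:  (S, C) array, one pooled feature vector per superpixel.
    """
    flat_feat = features.reshape(-1, features.shape[-1])   # (H*W, C)
    flat_lab = labels.reshape(-1)                          # (H*W,)
    n_sp = int(flat_lab.max()) + 1

    # Pixel count per superpixel; guard against empty superpixels.
    counts = np.maximum(np.bincount(flat_lab, minlength=n_sp), 1)

    # Scatter-add each pixel's feature vector into its superpixel's row,
    # then divide by the pixel count to get the mean.
    pooled = np.zeros((n_sp, flat_feat.shape[-1]), dtype=flat_feat.dtype)
    np.add.at(pooled, flat_lab, flat_feat)
    return pooled / counts[:, None]
```

Because the pooling only depends on the label map and the feature tensor's last axis, feature maps of varying spatial shape (e.g. from different VGG-16 layers) can be pooled with the same routine, which reflects the flexible-shape extensibility the abstract describes.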

